May 11, Wednesday
14:00 – 15:20
Learning Structured Prediction Models for Hebrew Syntactic Parsing
Graduate seminar
Lecturer : Mr. Yoav Goldberg
Affiliation : CS, BGU
Location : 202/37
Host : CS, BGU
I discuss the syntactic-parsig problem: the automatic assignment of
syntactic structure to sentences in natural language text, with a
focus on parsing Hebrew, a language with rich morphology.
Syntactic-parsing is an instance of a structured-prediction, a
subfield of machine-learning concerned with learning to map complex
inputs (like sentences) to complex outputs (like trees). I present
the common framework for structured prediction, based on
problem-decomposition and dynamic-programming, and show some cases in
natural language where independence assumptions of that framework
fail. I then present Easy-First parsing, a novel algorithm for
syntactic parsing which is based on a different set of assumptions,
and which makes it easy to incorporate a much richer feature sets than
the dynamic-programming approach. The algorithm works by modeling the
set of decisions made by constructing the output structure instead of
directly modeling the structure itself. As a consequence, at each
step the algorithm can observe partial structures and exploit rich
contextual information to guide further decisions. The algorithm
works in a greedy fashion by delegating the search to the training
phase, making it particularly fast at inference time. I present
results of experiments on Hebrew data sets indicating a parsing
accuracy of up to 81.5%, a substantial improvement over other
approaches. The method also works well for English, leading to
state-of-the-art parsing results.
Structured prediction techniques have applications in other domains, I
briefly report on application to computational biology (RNA structured
prediction) where we also reach state-of-the-art results (joint work
with Shay Zakov and Michal Ziv-Ukelson).