Full Forest Treebanking

Loading...
Thumbnail Image

Date

Authors

Packard, Woodley

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

In this thesis, I present a new method of producing treebanks using constraint-based grammars. Rather than requiring an explicitly enumerated set of candidate analyses per utterance, my method works from an implicit representation, allowing the annotator to efficiently select the correct analysis from trillions of possibilities, without requiring the user or the computer to store or iterate over all of them. I explain the advantages and disadvantages of this method, and show the details and motivation for the algorithms that make it possible. Relative to comparable prior art (i.e. top-N treebanking), my solution enables higher coverage treebanks without a significant reduction in annotation speed, and reduces storage and computational resource consumption.

Description

Thesis (Master's)--University of Washington, 2015

Keywords

annotation; grammar; HPSG; treebank

Citation

DOI

Collections