UR Research > Computer Science Department > CS Ph.D. Theses >

Dealing with quantifi er scope ambiguity in natural language understanding

URL to cite or link to: http://hdl.handle.net/1802/28308

HafeziManshadi_rochester_0188E_10672.pdf   2.51 MB (No. of downloads : 1086)
PDF of thesis.
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2014.
Quanti fier scope disambiguation (QSD) is one of the most challenging problems in deep natural language understanding (NLU) systems. The most popular approach for dealing with QSD is to simply leave the semantic representation (scope-) underspecifi ed and to incrementally add constraints to filter out unwanted readings. Scope underspeci fication has to solve an algorithmic problem: whether a representation with a given set of constraints has any scoping (i.e. whether it is satisfiable), and if so, how to efficiently enumerate possible scopings. The problem is NP-complete in general. It had been an open question whether there exists a tractable set within the context of the most popular constraint-based underspecification frameworks, covering all structures occurring in practice.

I show that the answer to this question is "yes". With no increase in time/space complexity, I extend the previously found tractable subset to include a family of sentences known not to be covered. Moreover, I defi ne a mathematically formalized, yet linguistically justi fied, notion of "coherence" and prove that all coherent natural language sentences belong to this subset, and hence prove that the underspecifi ed representation of all coherent sentences can be solved in polynomial-time.

The other way to deal with QSD is to actually resolve the ambiguity using rule-based or statistical methods. There has been a lack of work on statistical QSD, as a result of the lack of extensive annotated corpora. Previous corpora, and hence previous statistical QSD systems, only scope two explicitly quanti fied (i.e. no defi nite, indefi nite, bare) noun phrases (NPs) per sentence, mainly because even the hand-annotation of full QSD is very challenging.

I propose the fi rst annotation scheme for QSD, addressing many of the challenges that need to be dealt with in hand-annotation. Using this scheme, we have developed the fi rst corpus of English text, annotated with "comprehensive" QSD. In this corpus, a) all NPs in a sentence, regardless of the type of the article, have been scoped; b) the scope of operators such as frequency adverbials and negations have been labeled; and c) distributivity vs. collectivity of plurals has been addressed. Finally, I propose the fi rst comprehensive automatic QSD system, by defi ning a probabilistic framework for learning to build partial orders. The model has been trained and tested on our corpus. The performance is quite encouraging and could motivate further work in this area.
Contributor(s):
Mohammad Mehdi Hafezi Manshadi (1977 - ) - Author

James F. Allen - Thesis Advisor

Primary Item Type:
Thesis
Identifiers:
Local Call No. AS38.661
Language:
English
Subject Keywords:
Computational semantics; Natural language processing; Natural language understanding; Quantifier scope disambiguation; Quantifier scoping; Underspecification
Sponsor - Description:
Office of Naval Research (ONR) - N000141110417
National Science Foundation (NSF) - IIS-1012205
First presented to the public:
3/14/2016
Originally created:
2014
Date will be made available to public:
2016-03-14   
Original Publication Date:
2014
Previously Published By:
University of Rochester
Place Of Publication:
Rochester, N.Y.
Citation:
Extents:
Number of Pages - xv, 174 p.
Illustrations - ill.
License Grantor / Date Granted:
John Dickson / 2014-03-26 08:44:28.837 ( View License )
Date Deposited
2014-03-26 08:44:28.837
Date Last Updated
2016-03-14 11:30:20.661
Submitter:
John Dickson

Copyright © This item is protected by copyright, with all rights reserved.

All Versions

Thumbnail Name Version Created Date
Dealing with quantifi er scope ambiguity in natural language understanding1 2014-03-26 08:44:28.837