Word Sense Disambiguation (WSD) is a historical NLP task aimed at linking words in contexts to discrete sense inventories and it is usually cast as a multi-label classification task. Recently, several neural approaches have employed sense definitions to better represent word meanings. Yet, these approaches do not observe the input sentence and the sense definition candidates all at once, thus potentially reducing the model performance and generalization power. We cope with this issue by reframing WSD as a span extraction problem — which we called Extractive Sense Comprehension (ESC) — and propose ESCHER, a transformer-based neural architecture for this new formulation. By means of an extensive array of experiments, we show that ESC unleashes the full potential of our model, leading it to outdo all of its competitors and to set a new state of the art on the English WSD task. In the few-shot scenario, ESCHER proves to exploit training data efficiently, attaining the same performance as its closest competitor while relying on almost three times fewer annotations. Furthermore, ESCHER can nimbly combine data annotated with senses from different lexical resources, achieving performances that were previously out of everyone’s reach. The model along with data is available at https://github.com/SapienzaNLP/esc.

ESC: Redesigning WSD with Extractive Sense Comprehension / Barba, Edoardo; Pasini, Tommaso; Navigli, Roberto. - (2021), pp. 4661-4672. (Intervento presentato al convegno North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2021 tenutosi a Online) [10.18653/v1/2021.naacl-main.371].

ESC: Redesigning WSD with Extractive Sense Comprehension

Edoardo Barba
Primo
;
Tommaso Pasini
Secondo
;
Roberto Navigli
Ultimo
2021

Abstract

Word Sense Disambiguation (WSD) is a historical NLP task aimed at linking words in contexts to discrete sense inventories and it is usually cast as a multi-label classification task. Recently, several neural approaches have employed sense definitions to better represent word meanings. Yet, these approaches do not observe the input sentence and the sense definition candidates all at once, thus potentially reducing the model performance and generalization power. We cope with this issue by reframing WSD as a span extraction problem — which we called Extractive Sense Comprehension (ESC) — and propose ESCHER, a transformer-based neural architecture for this new formulation. By means of an extensive array of experiments, we show that ESC unleashes the full potential of our model, leading it to outdo all of its competitors and to set a new state of the art on the English WSD task. In the few-shot scenario, ESCHER proves to exploit training data efficiently, attaining the same performance as its closest competitor while relying on almost three times fewer annotations. Furthermore, ESCHER can nimbly combine data annotated with senses from different lexical resources, achieving performances that were previously out of everyone’s reach. The model along with data is available at https://github.com/SapienzaNLP/esc.
2021
North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2021
word sense disambiguation; transformers; deep learning;lLexical semantics
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
ESC: Redesigning WSD with Extractive Sense Comprehension / Barba, Edoardo; Pasini, Tommaso; Navigli, Roberto. - (2021), pp. 4661-4672. (Intervento presentato al convegno North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2021 tenutosi a Online) [10.18653/v1/2021.naacl-main.371].
File allegati a questo prodotto
File Dimensione Formato  
Barba_ESC_2021.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 781.38 kB
Formato Adobe PDF
781.38 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1585586
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 48
  • ???jsp.display-item.citation.isi??? 26
social impact