UniMC - Pubblicazioni Aperte Digitali

Extracting MWEs from Italian corpora: a case study for refining the POS-pattern methodology

Malvina Nissim; Sara Castagnoli; Francesca Masini

2014-01-01

Abstract

An established method for multiword expression (MWE) extraction is the combined use of previously identified POS-patterns and association measures. However, the selection of such POS-patterns is rarely debated. Focusing on Italian MWEs containing at least one adjective, we set out to explore how candidate POS-patterns listed in relevant literature and lexicographic sources compare with POS sequences exhibited by statistically significant n-grams including an adjective position extracted from a large corpus of Italian. All literature-derived patterns are found, and new meaningful candidate patterns emerge, among the top-ranking trigrams for three association measures. We conclude that a final solid set to be used for MWE extraction will have to be further refined through a combination of association measures as well as manual inspection.
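The general idea described in the abstract, ranking adjective-containing trigrams by an association measure and then reading off their POS sequences, might be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name the three association measures, so pointwise mutual information (PMI) is used here as a stand-in, and the function name `rank_adjective_trigrams`, the `ADJ` tag label, and the `min_freq` threshold are hypothetical choices, not taken from the paper.

```python
import math
from collections import Counter


def rank_adjective_trigrams(tagged, adj_tag="ADJ", min_freq=2):
    """Rank trigrams containing at least one adjective by PMI.

    `tagged` is a list of (word, pos_tag) pairs. The tag label "ADJ" and
    the use of PMI are assumptions made for this sketch only.
    Returns (pmi, pos_pattern, words) tuples, highest PMI first.
    """
    words = [w for w, _ in tagged]
    n = len(words)
    unigram = Counter(words)

    # Count only trigrams with an adjective in at least one position.
    trigram = Counter()
    for i in range(n - 2):
        window = tuple(tagged[i:i + 3])
        if any(tag == adj_tag for _, tag in window):
            trigram[window] += 1

    n_tri = max(n - 2, 1)  # total number of trigram positions in the corpus
    scored = []
    for window, freq in trigram.items():
        if freq < min_freq:
            continue
        p_joint = freq / n_tri
        # Probability of the three words co-occurring under independence.
        p_indep = math.prod(unigram[w] / n for w, _ in window)
        pmi = math.log2(p_joint / p_indep)
        pattern = " ".join(tag for _, tag in window)
        scored.append((pmi, pattern, " ".join(w for w, _ in window)))
    return sorted(scored, reverse=True)
```

Collecting the `pos_pattern` field of the top-ranking results would give a corpus-derived list of candidate POS sequences that could then be compared with literature-derived patterns and inspected manually.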

Year of publication

2014

ISBN

9781937284879

Files in this record:

File	Size	Format
Nissim_Extracting-MWEs-from_2014.pdf (authorized users only; publisher's version, published with the publisher's layout; license: DRM not defined)	533.25 kB	Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/11393/241617
