© Nakladatelství
KAROLINUM 2018


TITLE DETAILS:

Lexical Association Measures. Collocation Extraction

Institute of Formal and Applied Linguistics, MFF UK, 2009

hardcover, 132 pp.
ISBN 9788090417557

Price: 185,- (-10%: 167,-)
1-2 pcs

This publication is an empirical study of lexical association measures and their application to collocation extraction. It presents a comprehensive inventory of lexical association measures and evaluates them on four reference data sets of collocation candidates: Czech dependency bigrams from the Prague Dependency Treebank, surface bigrams from the same source, instances of the latter from the Czech National Corpus, and Swedish distance verb-noun combinations obtained from the PAROLE corpus. The collocation candidates in the reference data sets were manually annotated by expert linguists as collocations or non-collocations. The evaluation scheme measures how well each method ranks collocation candidates by their likelihood of forming collocations. The methods are compared using precision-recall curves, mean average precision scores, and appropriate tests of statistical significance. The study then examines the possibility of combining lexical association measures and discusses empirical results for several combination methods that significantly improve on the state of the art in collocation extraction. The work concludes with a description of a model reduction algorithm that significantly reduces the number of combined measures without any statistically significant difference in performance.
This publication has been awarded the title "The best book of the Faculty of Mathematics and Physics for 2011" by Charles University in Prague.
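
To make the evaluation idea in the description concrete, the following is a minimal sketch of ranking collocation candidates by one association measure and scoring the ranking. It assumes pointwise mutual information as the measure and the standard information-retrieval definition of average precision; the book's exact measures and scoring procedure may differ. The candidate bigrams, their counts, the labels, and the corpus size `N` are made up for illustration.

```python
import math


def pmi(f_xy, f_x, f_y, n):
    """Pointwise mutual information of a bigram from raw counts.

    f_xy: bigram frequency, f_x / f_y: component word frequencies,
    n: total number of bigrams in the (hypothetical) corpus.
    """
    return math.log2((f_xy * n) / (f_x * f_y))


def average_precision(ranked_labels):
    """Average precision of a ranked list of 0/1 labels (1 = collocation)."""
    hits = 0
    precisions = []
    for rank, label in enumerate(ranked_labels, start=1):
        if label:
            hits += 1
            precisions.append(hits / rank)  # precision at each true positive
    return sum(precisions) / len(precisions) if precisions else 0.0


# Hypothetical candidates: (bigram, f_xy, f_x, f_y, is_collocation)
candidates = [
    ("red tape", 30, 100, 60, 1),
    ("the house", 50, 5000, 400, 0),
    ("heavy rain", 20, 80, 90, 1),
    ("of a", 40, 4000, 3000, 0),
]
N = 100_000  # assumed corpus size

# Rank candidates from the strongest to the weakest association.
ranked = sorted(candidates, key=lambda c: pmi(c[1], c[2], c[3], N), reverse=True)
labels = [c[4] for c in ranked]
ap = average_precision(labels)
print([c[0] for c in ranked], ap)
```

Averaging such scores over several data sets or folds gives a mean average precision, which allows different measures to be compared on the same candidates.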