Araujo, Lourdes and Pérez-Agüera, José R. Enriching thesauri with hierarchical relationships by pattern matching in dictionaries. FinTAL - 5th International Conference on Natural Language Processing, 2006, pp. 268-279. [Journal article (Paginated)]
English abstract
This paper proposes a pattern matching method applied to dictionaries to identify hierarchical relationships between terms. We focus on this type of relationship because we use it in the automatic generation of thesauri, which are used to improve information retrieval tasks. However, the method can also be applied to identify other semantic relationships. We distinguish two kinds of patterns: structural patterns, composed of a sequence of part-of-speech tags, and key patterns, typical of dictionary entries, composed of some key terms along with some part-of-speech tags. These patterns are automatically extracted from the dictionary entries by means of stochastic techniques. The thesaurus, which has been partially constructed previously, is then extended with the new relationships obtained by applying the patterns to a dictionary. We have based the system evaluation on the results obtained with and without the thesaurus in an information retrieval task proposed by the Cross-Language Evaluation Forum (CLEF). The results of these experiments reveal a clear improvement in performance.
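As an illustration only, the idea of a key pattern (key terms mixed with part-of-speech tags) applied to a dictionary definition might be sketched as follows. This is not the authors' implementation: the pattern is hand-written here, whereas the abstract says patterns are extracted automatically by stochastic techniques. The tag names, the `KEY_PATTERN` format, the function names, and the sample definition are all assumptions made for this example.

```python
from typing import List, Optional, Tuple

Token = Tuple[str, str]           # (word, part-of-speech tag), assumed pre-tagged
PatternItem = Tuple[str, str]     # (kind, value); hypothetical pattern format

# Hypothetical key pattern for definitions such as "a kind of dog ...":
#   ("tag", X)     matches any word tagged X
#   ("word", X)    matches the literal key term X
#   ("capture", X) matches any word tagged X and returns it as the broader term
KEY_PATTERN: List[PatternItem] = [
    ("tag", "DET"),
    ("word", "kind"),
    ("word", "of"),
    ("capture", "NOUN"),
]


def match_at(tokens: List[Token], start: int,
             pattern: List[PatternItem]) -> Optional[str]:
    """Return the captured word if the pattern matches at `start`, else None."""
    if start + len(pattern) > len(tokens):
        return None
    captured = None
    for (kind, value), (word, tag) in zip(pattern, tokens[start:]):
        if kind == "word":
            if word.lower() != value:
                return None
        else:
            if tag != value:
                return None
            if kind == "capture":
                captured = word.lower()
    return captured


def find_broader_term(tokens: List[Token],
                      pattern: List[PatternItem]) -> Optional[str]:
    """Scan the definition left to right and return the first captured term."""
    for start in range(len(tokens)):
        hit = match_at(tokens, start, pattern)
        if hit is not None:
            return hit
    return None


# Assumed, hand-tagged definition for the hypothetical entry "spaniel".
definition: List[Token] = [
    ("a", "DET"), ("kind", "NOUN"), ("of", "ADP"), ("dog", "NOUN"),
    ("with", "ADP"), ("long", "ADJ"), ("ears", "NOUN"),
]

broader = find_broader_term(definition, KEY_PATTERN)
print(("spaniel", "broader term", broader))
```

Under these assumptions, the entry term and the captured noun form a candidate hierarchical (narrower term, broader term) pair that could be added to a thesaurus. A structural pattern in the abstract's sense would use only `"tag"` and `"capture"` items, with no literal key terms.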
Item type: | Journal article (Paginated) |
---|---|
Keywords: | automatic thesaurus extraction, information retrieval, query expansion, pattern matching, dictionary |
Subjects: | L. Information technology and library technology > LL. Automated language processing. L. Information technology and library technology > LM. Automatic text retrieval. |
Depositing user: | José Ramón Pérez Agüera |
Date deposited: | 09 Nov 2006 |
Last modified: | 02 Oct 2014 12:05 |
URI: | http://hdl.handle.net/10760/8351 |