Ibekwe-SanJuan, Fidelia and SanJuan, Eric Mining textual data through term variant clustering : the TermWatch system., 2004 . In RIAO 2004 Coupling approaches, coupling media and coupling languages for information retrieval, Avignon (France), 26-28 April 2004. [Conference paper]
Preview |
PDF
riao-04-ibesan.pdf Download (620kB) | Preview |
English abstract
We present a system for mapping the structure of research topics in a corpus. TermWatch portrays the "aboutness" of a corpus of scientific and technical publications by bridging the gap between pure statistical approaches and symbolic techniques. In the present paper, an experiment on unsupervised textmining is performed on a corpus of scientific titles and abstracts from 16 prominent IR journals. The preliminary results showed that TermWatch was able to capture low occurring phenomena which the usual clustering methods based on co-occurrence may not highlight. The results also reflect the expressive power of terminological variations as a means to capture the structure of research topics contained in a corpus.
Item type: | Conference paper |
---|---|
Keywords: | Thematic mapping, term clustering, information visualization, domain maps, knowledge representation |
Subjects: | I. Information treatment for information services > IB. Content analysis (A and I, class.) I. Information treatment for information services > ID. Knowledge representation. B. Information use and sociology of information > BB. Bibliometric methods |
Depositing user: | Fidelia Ibekwe-SanJuan |
Date deposited: | 10 Aug 2005 |
Last modified: | 02 Oct 2014 12:01 |
URI: | http://hdl.handle.net/10760/6642 |
References
Downloads
Downloads per month over past year
Actions (login required)
View Item |