Mining textual data through term variant clustering: the TermWatch system
(2004) Mining textual data through term variant clustering: the TermWatch system. In Proceedings RIAO 2004 Coupling approaches, coupling media and coupling languages for information retrieval, pp. 487-503, Avignon (France).
Abstract
We present a system for mapping the structure of research topics in a corpus. TermWatch portrays the "aboutness" of a corpus of scientific and technical publications by bridging the gap between purely statistical approaches and symbolic techniques. In this paper, we report an unsupervised text mining experiment on a corpus of scientific titles and abstracts from 16 prominent IR journals. The preliminary results show that TermWatch captures low-frequency phenomena that the usual clustering methods based on co-occurrence may not highlight. The results also reflect the expressive power of terminological variations as a means of capturing the structure of research topics contained in a corpus.
| Keywords: | Thematic mapping, term clustering, information visualization, domain maps, knowledge representation |
|---|---|
| Subjects: | I. Information treatment for information services > IB. Content analysis (A and I, class.); I. Information treatment for information services > ID. Knowledge representation; B. Information use and sociology of information > BB. Bibliometric methods |
| ID Code: | 4488 |
| Deposited By: | Ibekwe-SanJuan, Fidelia |
| Deposited On: | 10 August 2005 |

