Ontology-based text summarization. The case of Texminer

Hípola, Pedro, A. Senso, José, Leiva-Mederos, Amed and Domínguez-Velasco, Sandor Ontology-based text summarization. The case of Texminer. Library Hi Tech, 2014, vol. 32, n. 2, pp. 229-248. [Journal article (Paginated)]

Text
Ontology_based_text_summarization_The_case_of_Texminer.pdf - Published version
Download (444kB)

Official URL: http://dx.doi.org/10.1108/LHT-01-2014-0005

Alternative locations: http://www.ugr.es/~phipola/Ontology_based_text_summarization_The_case_of_Texminer.pdf, http://www.emeraldinsight.com/doi/full/10.1108/LHT-01-2014-0005

English abstract

Purpose – The purpose of this paper is to look into the latest advances in ontology-based text summarization systems, with emphasis on the methodologies of a socio-cognitive approach, the structural discourse models and the ontology-based text summarization systems. Design/methodology/approach – The paper analyzes the main literature in this field and presents the structure and features of Texminer, a software that facilitates summarization of texts on Port and Coastal Engineering. Texminer entails a combination of several techniques, including: socio-cognitive user models, Natural Language Processing, disambiguation and ontologies. After processing a corpus, the system was evaluated using as a reference various clustering evaluation experiments conducted by Arco (2008) and Hennig et al. (2008). The results were checked with a support vector machine, Rouge metrics, the F-measure and calculation of precision and recall. Findings – The experiment illustrates the superiority of abstracts obtained through the assistance of ontology-based techniques. Originality/value – The authors were able to corroborate that the summaries obtained using Texminer are more efficient than those derived through other systems whose summarization models do not use ontologies to summarize texts. Thanks to ontologies, main sentences can be selected with a broad rhetorical structure, especially for a specific knowledge domain.

Item type:	Journal article (Paginated)
Keywords:	Information retrieval, Software evaluation, Ontologies, Indexing, Programming, Automatic summarization systems, Texminer
Subjects:	L. Information technology and library technology > LL. Automated language processing.
Depositing user:	Pedro Hipola
Date deposited:	10 Aug 2015 05:30
Last modified:	10 Aug 2015 05:30
URI:	http://hdl.handle.net/10760/25540

Check full metadata for this record

References

Downloads

Downloads per month over past year

Actions (login required)

View Item

Facebook

Twitter

RSS