REINA at WebCLEF 2007: Selecting Usefull Snippets

G.-Figuerola, Carlos, Alonso-Berrocal, José-Luis, Zazo, Ángel F. and Rodríguez-Vázquez-de-Aldana, Emilio REINA at WebCLEF 2007: Selecting Usefull Snippets., 2007 . In CLEF 2007 Workshop, Budapest (Hungary), 19-21 September. [Conference paper]

[thumbnail of figuerola2007reina.pdf]
Preview
PDF
figuerola2007reina.pdf

Download (70kB) | Preview

English abstract

The task for this year consist in retrieve snippets or pieces of text from web documents about several topics. The extraction of such snippets can be approached in several ways, as well as the selection of most usefull of them. We describe the segementation process adopted, and the selection of snippets carried out.

Item type: Conference paper
Keywords: Web retrieval, Text segmentation, Recuperación de la información, Segmentación del texto
Subjects: L. Information technology and library technology > LM. Automatic text retrieval.
L. Information technology and library technology > LL. Automated language processing.
Depositing user: R. Gómez-Díaz
Date deposited: 17 Nov 2009
Last modified: 02 Oct 2014 12:15
URI: http://hdl.handle.net/10760/13629

References

[1] Carlos G. Figuerola, José Luis Alonso Berrocal, Ángel F. Zazo Rodríguez, and Emilio Rodríguez. REINA at WebCLEF 2006: Mixing fields to improve retrieval. In A. Nardi, C. Peters, and J.L. Vicedo, editors, ABSTRACTS CLEF 2006 Workshop, 20-22 September, Alicante, Spain. Results of the CLEF 2006 Cross-Language System Evaluation Campaign, 2006.

[2] Mark Pilgrim. Universal Encoding Detector. http://chardet.freeparser.org

[3] Amit Singhal, Chris Buckley, and Mandar Mitra. Pivoted document length normalization. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 18–22, 1996, Zurich, Switzerland (Special Issue of the SIGIR Forum), pages 21–29. ACM, 1996.

[4] Angel F. Zazo, Carlos G. Figuerola, José Luis Alonso Berrocal, and Emilio Rodríguez. Reformulation of queries using similarity thesauri. Information Processing & Management, 41(5):1163– 1173, 2005.


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item