A simple approach to the Spanish-English bilingual retrieval task. (2001)

G.-Figuerola, Carlos and Alonso-Berrocal, José-Luis and Zazo, Ángel F. and Gómez-Díaz, Raquel. A simple approach to the Spanish-English bilingual retrieval task, 2001. In: Cross-Language Information Retrieval and Evaluation, Workshop of the Cross-Language Evaluation Forum. Springer, pp. 224-229. [Book chapter]

English abstract

This paper describes our participation in the CLEF bilingual retrieval task (formulating queries in Spanish to retrieve documents in English), using an information retrieval (IR) system based on the vector model. Our aim was to use a simple approach to solve the problem, without expecting to obtain great results, especially owing to the short time available. The queries formulated in Spanish were translated to English by a commercial machine translation system. The translations were filtered to eliminate stop words, and then the remaining terms were stemmed using a standard stemmer. Results were poorer than those obtained through monolingual retrieval with original English queries, the difference being slightly over 15%.
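The steps named in the abstract (stop-word filtering, stemming, and matching in a vector model) can be illustrated with a minimal sketch. It is not the authors' system: the function names, the tiny stop list, the crude suffix stripper standing in for a standard stemmer, and the tf-idf weighting with cosine similarity are all assumptions chosen for illustration, and the machine translation step is not reproduced (the query is assumed to be already translated into English).

```python
import math
import re
from collections import Counter

# Tiny illustrative stop list (assumption); a real system would use a longer one.
STOP_WORDS = {"a", "an", "and", "are", "for", "in", "is", "of", "on", "the", "to", "with"}


def naive_stem(word):
    """Crude suffix stripper (assumption), standing in for a standard stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def index_terms(text):
    """Lowercase and tokenize, drop stop words, then stem the remaining terms."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [naive_stem(t) for t in tokens if t not in STOP_WORDS]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(translated_query, documents):
    """Return (score, document index) pairs, best match first."""
    doc_terms = [index_terms(d) for d in documents]
    n = len(documents)
    # Document frequency: number of documents containing each term.
    df = Counter(t for terms in doc_terms for t in set(terms))

    def weigh(terms):
        # tf-idf weighting (assumption); terms unseen in the collection are ignored.
        tf = Counter(terms)
        return {t: c * math.log(1 + n / df[t]) for t, c in tf.items() if t in df}

    doc_vectors = [weigh(terms) for terms in doc_terms]
    query_vector = weigh(index_terms(translated_query))
    scores = [(cosine(query_vector, dv), i) for i, dv in enumerate(doc_vectors)]
    return sorted(scores, reverse=True)


# Hypothetical English collection and an already-translated query.
documents = [
    "Farmers protest against falling wheat prices.",
    "The football team won the European cup.",
    "New trade rules for wheat exports.",
]
ranking = rank("protests of the farmers", documents)
for score, doc_id in ranking:
    print(f"{score:.3f}  {documents[doc_id]}")
```

In this sketch the query and the documents pass through the same filtering and stemming, so "protests" in the query and "protest" in a document reduce to the same index term before the vectors are compared.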

Item type: Book chapter
Keywords: Information Retrieval, Recuperación de la información
Subjects: L. Information technology and library technology > LM. Automatic text retrieval.
L. Information technology and library technology > LL. Automated language processing.
Depositing user: R. Gómez-Díaz
Date deposited: 17 Nov 2009
Last modified: 02 Oct 2014 12:15
URI: http://hdl.handle.net/10760/13663
