Analysis of automatic translation of questions for question-answering systems

García Santiago, María Dolores and Olvera-Lobo, María Dolores Analysis of automatic translation of questions for question-answering systems. Information Research, 2010, vol. 15, n. 4. [Journal article (Unpaginated)]

[thumbnail of Analysis of automatic translation of questions for.pdf]
Preview
Text
Analysis of automatic translation of questions for.pdf

Download (433kB) | Preview

English abstract

Multilingual question-answering systems can provide users with specific data in response to queries by searching for a minimal fragment of text that applies to the query, regardless of the language in which the question is formulated and the answer is found. The aim of this paper is to analyse the automatic translation of questions (intended as queries input to a cross-language, question-answering system) from German and French into the Spanish language. Method. The methodology used for evaluation, based on automatic and subjective measures, appraises whether the translation will serve as input to a system. That is, does the question retain its validity and fulfil its function, allowing a proper response to be found? Analysis. The main features of multilingual question-answering systems are described and then we analyse the effectiveness of the translations achieved through three popular online translating tools: Google Translator, Promt and Worldlingo. Results. Our findings serve to identify which is the most reliable translator for both pairs of languages overall. However, an even more reliable option would be to use two different translators, depending on which of the two source languages is being dealt with. Conclusions. The results contribute to the realm of innovative search systems by enhancing our understanding of online translators and their potential in the context of multilingual information retrieval.

Item type: Journal article (Unpaginated)
Keywords: question-answering systems, translation, multilingual information retrieval
Subjects: L. Information technology and library technology
Depositing user: Maria Dolores/ M.D. Olvera Lobo
Date deposited: 23 Jul 2018 11:37
Last modified: 23 Jul 2018 11:37
URI: http://hdl.handle.net/10760/32868

References

Abusalah, M.,Tait, J. & Oakes, M. (2005). Literature review of cross language information retrieval. Proceedings of World Academy of Science, Engineering and Technology, 4, 175-177.

Belkin, N.J. & Croft, W. B. (1987). Retrieval techniques. Annual Review of Information Science and Technology. 22 109-146.

Bloom, B.S. & Krathwohl, D.R. (1956). Taxonomy of educational objectives: the classification of educational goals, by a committee of college and university examiners. Handbook 1: Cognitive domain. New York, NY: Longmans.

Bos, J. & Nissim, M. (2006). Cross-lingual question answering by answer translation. In C. Peters (Ed.), Working Notes for the CLEF 2006 Workshop, 20-22 September, Alicante, Spain. Retrieved 17 November, 2010 from http://www.clef-campaign.org/2006/working_notes/workingnotes2006/bosCLEF2006.pdf (Archived by WebCite® at http://www.webcitation.org/5uJHOlGWR)

Chang Y., Xu H.B. & Bai, S. (2003). TREC 2003 question answering track at CAS-ICT. In . M. Voorhees and Lori P. Buckland (Eds.). The Twelfth Text Retrieval Conference (TREC 2003), Gaithersburg, Maryland. (pp. 147-151). Retrieved 17 November, 2010 from http://trec.nist.gov/pubs/trec12/papers/chinese-acad-sci.qa.final.pdf (Archived by WebCite® at http://www.webcitation.org/5uJIJ2aEx)

Clough, P., Müller, H., Deselaers, T., Grubinger, M., Lehmann, T., Jensen, J. & Hersh, W. (2006). The CLEF 2005 cross-language image retrieval track. In C. Peters, F.C. Gey, J. Gonzalo, H. Müller, G.J.F. Jones, M. Kluck, et al. (Eds.), Accessing Multilingual Information Repositories. 6th Workshop of the Cross-Language Evaluation Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005, , (pp. 535-557). Berlin, Heidelberg: Springer-Verlag. (Lecture Notes in Computer Science, 4022)

Clough, P., Müller, H. & Sanderson, M. (2005), (2004). The CLEF cross-language image retrieval track (ImageCLEF) 2004. In P. Enser, Y. Kompatsiaris, N. E. O'Connor, A. F. Smeaton & A.W.M. Smeulders, (Eds.). Proceedings of the third International Conference Image and Video Retrieval (CIVR) 2004, Dublin, Ireland, July 21-23, 2004. (p. 2066.) Berlin, Heidelberg: Springer-Verlag. (Lecture Notes in Computer Science, 3115)

Cross-Language Evaluation Forum. (2008). Guidelines for the participants in QA@CLEF 2008. Retrieved 19 November 2010 from http://clef-qa.fbk.eu/2008/download/QA@CLEF08_Guidelines-for-Participants_new.pdf (Archived by WebCite® at http://www.webcitation.org/5utnRpnH0)

Dabbadie, M., Hartley A., King M., Miller K.J., Mustafa El Hadi W., Popescu-Belis A. et al. (2002). A hands-on study of the reliability and coherence of evaluation metrics. In M. King, (Ed.). Workbook of the LREC 2002 Workshop on Machine Translation Evaluation: Human Evaluators Meet Automated Metrics. Las Palmas de Gran Canaria, Spain, 27 May 2002, (pp.8-16). Retrieved 18 november 2010 http://www.mt-archive.info/LREC-2002-WS-MTEval.pdf (Archived by WebCite® at http://www.webcitation.org/5utoE1adF)

Forehand, M. . (2005). Bloom's taxonomy: original and revised. In M. Orey (Ed.), Emerging perspectives on learning, teaching and technology. Athens, GA: University of Georgia. Retrieved 8 March, 2010 from http://www.coe.uga.edu/epltt/bloom.htm. (Archived by WebCite® at http://www.webcitation.org/5utoUtcPJ)

Garcí,a Cumbreras, M. ., Ureña López, L.A., Martínez Santiago, F., Montejo Raez, A. (2005). Bú,squeda de respuestas multilingü,e: clasificació,n de preguntas en españ,ol basadas en aprendizaje [Multilingual question-answering: classification learning-based of questions in Spanish] . Procesamiento del lenguaje natural. No. 34, 31-40. Retrieved 18 November, 2010 from http://www.sepln.org/revistaSEPLN/revista/34/03.pdf (Archived by Webcite® at http://www.webcitation.org/5uvGZxOCU)

Goutte, C. (2006). Automatic evaluation of machine translation quality. Presentation at the European Community, Xerox Research Centre Europe, on January 27 2006, Meylan, France. Retrieved 8 March 2010 from http://www.xrce.xerox.com/content/download/6980/52379/file/MTeval.pdf (Archived by Webcite® at http://www.webcitation.org/5uvH3utpE)

Grunwald, D. (2009). Worldlingo fights to stay on top of Internet translation market niche .GTS Blog. http://blog.gts-translation.com/2009/08/17/worldlingo-fights-to-stay-on-top-of-internet-translation-market-niche/ (Archived by WebCite® at http://www.webcitation.org/5v4nmRnD7)

Grunwald, D. (2009). Worldlingo fights to stay on top of Internet translation market niche .GTS Blog. http://blog.gts-translation.com/2009/08/17/worldlingo-fights-to-stay-on-top-of-internet-translation-market-niche/ (Archived by Webcite® at http://www.webcitation.org/5uL9r3xIg )

Hallet, C., Scott, D &Power, R. (2007). Composing questions through conceptual authoring. Computational Linguistics, 33(1) 105-133.

Hermjakob, U. (2001). Parsing and question classification for question answering. In Proceedings of the Workshop on Open-Domain Question Answering at Annual Meeting of the Association for Computational Linguistics (ACL) 2001, Volume 12, (pp. 1-6). Stroudsburg, PA: Association for Computational Linguistics. Retrieved 18 November 2010. http://www.isi.edu/~ulf/papers/acl01-qa-parsing.pdf (Archived by Webcite® at http://www.webcitation.org/5uL9vd9zX).

Kishida, K., Chen, K., Lee, S., Kuriyama, K., Kando, N. & Chen, H. (2007). Overview of cross-lingual information retrieval (CLIR) task at the Sixth NTCIR Workshop. In Proceedings of NII Test Collection for IR Systems (NTCIR) 6 Workshop Meeting, June 15–18. Tokyo: National Institute for Informatics. Retrieved 18 November 2010 from http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings6/NTCIR/79.pdf (Archived by Webcite® at http://www.webcitation.org/5uLBOzqkn)

Lin, C.Y. & Hovy, E. (2003). Automatic evaluation of summaries using N-gram co-occurrence statistics. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1 (NAACL '03) Edmonton, May-June 2003. Volume 1 (NAACL '03), (pp. 71-78). Stroudsburg, PA: Association for Computational Linguistics. Retrieved 8 March 2010 from http://acl.ldc.upenn.edu/N/N03/N03-1020.pdf (Archived by Webcite® at http://www.webcitation.org/5uvHEaPBz)

López-Ostenero, F.,Gonzalo, J. &Verdejo, F. (2004). Búsqueda de información multilingüe: estado del arte. [Multilingual information searching:state of art] Inteligencia Artificial, Revista Iberoamericana de Inteligencia Artificial. 8(22), 11-35.

Nguyen, D., Overwijk, A., Hauff, C., Trieschnigg, D. R.B., Hiemstra, D. & de Jong, F.M.G. (2009). WikiTranslate: query translation for cross-lingual information retrieval using only Wikipedia. In C. Peters, Th. Deselaers, N. Ferro, J. Gonzalo, G.J.F. Jones, M. Kurimo, et al. (Eds.). Evaluating Systems for Multilingual and Multimodal Information Access. Proceedings of the 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008, Aarhus, Denmark, September 17-19, 2008. (pp. 58-65). Berlin, Heidelberg: Springer-Verlag. (Lecture Notes in Computer Science, 5706)

Niessen, S., Och, F.J., Leusch, G. & Ney, H. (2000). An evaluation tool for machine translation: fast evaluation for MT research. InProceedings of the 2nd International Conference on Language Resources and Evaluation (LREC-2000), Athens, Greece, May-June 2000. (pp. 39-45). Paris: European Language Resources Association. Retrieved 5 May 2008 from http://www.mt-archive.info/LREC-2000-Niessen.pdf (Archived by Webcite® at http://www.webcitation.org/5uMd0IGCE)

Oard, D. W. & Diekema, A.R. (1998). Cross-language information retrieval. Annual Review of Information Science and Technology, 33, 223-256.

Oard, D. W., He, D. & Wang, J. (2008). User-assisted query translation for interactive cross-language information retrieval. Information Processing & Management, 44(1) 181-211.

Pérez, A., González, J., Casacuberta, F. & Torres, I. (2004). Traducció,n automá,tica mediante transductores estocá,sticos de estados finitos basados en gramá,ticas k-explorables . [Automatic translation by finite state stochastic transductors based on k-testable grammars]. In E. Sanchis Arnal, (Ed.). Actas de las III Jornadas en Tecnologías del Habla, Techabla, Valencia, Noviembre 2004. (pp. 207-213). Retrieved 19 November 2010 from http://lorien.die.upm.es/~lapiz/rtth/JORNADAS/III/actas3JTH.pdf (Archived by WebCite® at http://www.webcitation.org/5uvHQwbSL)

Sloman, A. (2005). Towards an ontology for factual questions. Birmingham, UK: Birmingham University. [Incomplete draft notes (Part of CoSy deliverable DR.02.01: Requirements study for representation)]. Retrieved 11 December, 2010 from http://www.cs.bham.ac.uk/~axs/misc/question-ontology.pdf (Archived by Webcite® at )

Sokolova, S. (2009). How the computer translates. St. Petersburg, Russia: PROMT Ltd. Retrieved 17 November, 2010 from http://www.promt.com/company/technology/pdf/e_how_computer_translates_sokolova.pdf (Archived by WebCite® at http://www.webcitation.org/5uJS8YAho)

Tillman, C., Vogel, S., Ney, H. & Zubiaga, A. (1997). A DP based search using monotone alignments in statistical translation. InProceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, Madrid, Spain, July 1997. (pp. 289-296). Stroudsburg, PA Association for Computational Linguistics.

Tomá,s, J., Mas, J.A. & Casacuberta, F. (2003). A quantitative method for machine translation evaluation. In Proceedings of the Euoropean Association of Computational Linguistics Workshop on Evaluation initiatives in NLP: are evaluation methods, metrics and resources reusable?, Budapest, Hungary. (pp. 27-34). Stroudsburg, PA: Association for Computational Linguistics. Retrieved 20 November 2010 from http://www.aclweb.org/anthology-new/W/W03/W03-2804.pdf (Archived by Webcite® at http://www.webcitation.org/5uO4C88Ai)

Turpin, A.H. & Hersh, W. (2001) Why batch and user evaluations do not give the same results In Proceedings of the 24th Annual International ACM SIGIR Conference on Research & Development on Information Retrieval, September 9-12, 2001, New Orleans, Louisiana, USA. (pp. 225-231). New York, NY: ACM Press. Retrieved 20 November 2010 from http://www.seg.rmit.edu.au/research/download.php?manuscript=266 (Archived by Webcite® at http://www.webcitation.org/5uvGqzSJB)

Uebersax, J.S. (2006). Likert scales: dispelling the confusion. Retrieved 10 November, 2010 from http://www.john-uebersax.com/stat/likert.htm (Archived by Webcite® at http://www.webcitation.org/5uO7Qr2vU)

Ueffing, N. & Ney, H. (2007). Word-level confidence estimation for machine translation. Computational Linguistics, 33(1), 9-40.

Vidal, E. (1997). Finite-states speech-to-speech translation. In Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97), Munich, Germany.. Volume 1 (p. 111). Washington, DC: IEEE Computer Society.

Voorhees, E.M. & Harman, D. (2000). Overview of the Sixth Text REtrieval Conference (TREC-6). Information Processing & Management, 36(1), 3-35.


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item