Failed Queries: a Morpho-Syntactic Analysis Based on Transaction Log Files

Mastora, Anna and Kapidakis, Sarantos and Monopoli, Maria. Failed Queries: a Morpho-Syntactic Analysis Based on Transaction Log Files, 2011. In First Workshop on Digital Information Management, Corfu (Greece), 30-31 March 2011. [Conference paper]

English abstract

This study describes the procedure needed for the morpho-syntactic analysis of queries containing typing errors that were submitted in Greek during the search process. In the context of our analysis, a failed query is a query that returned no hits. The analysis showed that failed queries represent 36% of the submitted queries, and that 19.6% of those failed queries were due to typing errors. We found that PoS tools need a rich tag set in order to analyze a Greek text corpus adequately. The Open Xerox tokenizer performed well, but only after significant pre-processing of the queries, and the analyzer seems to require additional tools to improve its performance. MS Word, which was used for spelling correction, seems to perform satisfactorily. All tools were challenged by named entity recognition.
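
As a rough illustration of the figures above, the following Python sketch applies the abstract's definition of a failed query (no hits) and combines the two reported percentages into the share of all submitted queries that failed because of typing errors. It assumes that the 19.6% figure is measured against failed queries only, as the abstract words it. The function names and the list-of-hit-counts input are hypothetical and are not taken from the paper.

```python
from fractions import Fraction

# Figures reported in the abstract.
FAILED_SHARE = Fraction(36, 100)            # failed queries / all submitted queries
TYPO_SHARE_OF_FAILED = Fraction(196, 1000)  # typing-error failures / failed queries


def is_failed(hits: int) -> bool:
    """A query counts as failed when it returned no hits (the abstract's definition)."""
    return hits == 0


def failed_share(hit_counts: list[int]) -> Fraction:
    """Fraction of logged queries that failed.

    hit_counts is a hypothetical input: one hit count per logged query.
    """
    if not hit_counts:
        raise ValueError("empty log")
    return Fraction(sum(is_failed(h) for h in hit_counts), len(hit_counts))


# Typing-error failures as a share of all submitted queries (assumed reading).
typo_share_of_all = FAILED_SHARE * TYPO_SHARE_OF_FAILED

print(f"{float(typo_share_of_all):.2%}")
```

Under that reading, roughly 7% of all submitted queries failed because of a typing error.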

Item type: Conference paper
Keywords: Failed queries, Morpho-syntactic analysis, PoS tagging, Typing errors
Subjects: C. Users, literacy and reading. > CB. User studies.
I. Information treatment for information services > IC. Index languages, processes and schemes.
Depositing user: Giannis Tsakonas
Date deposited: 27 Jun 2011
Last modified: 02 Oct 2014 12:19
URI: http://hdl.handle.net/10760/15845
