Web Page Retrieval by Combining Evidence

G.-Figuerola, Carlos and Alonso-Berrocal, José-Luis and Zazo, Ángel F. and Rodríguez-Vázquez-de-Aldana, Emilio. Web Page Retrieval by Combining Evidence. 2006. In: Accessing Multilingual Information Repositories: 6th Workshop of the Cross-Language Evaluation Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005, Revised Selected Papers, pp. 880-887. [Book chapter]

English abstract

The participation of the REINA Research Group in WebCLEF 2005 focused on the monolingual mixed task. Queries, or topics, are of two types: named pages and home pages. For both types, we first perform a search by thematic content; for the same query, we also search several information elements of every page (title, some meta tags, anchor text) and then combine the results. For queries about home pages, we attempt detection using a method based on certain keywords and their patterns of use. Afterwards, the results of the thematic content retrieval are re-ranked based on PageRank and centrality coefficients.
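
The combination and re-ranking steps described in the abstract can be illustrated with a minimal sketch. The abstract does not state which fusion formula or weights were used, so everything here is an assumption made for illustration: the min-max normalization, the weighted sum of per-field scores, the blending parameter `alpha`, and the names `min_max`, `combine`, and `rerank` are hypothetical and not taken from the chapter.

```python
from collections import defaultdict


def min_max(scores):
    """Rescale a {doc_id: score} map to [0, 1]; a constant map becomes all 1.0."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def combine(runs, weights):
    """Weighted sum of normalized scores across the per-field runs (assumed fusion rule)."""
    fused = defaultdict(float)
    for name, scores in runs.items():
        for doc, s in min_max(scores).items():
            fused[doc] += weights.get(name, 1.0) * s
    return dict(fused)


def rerank(fused, link_prior, alpha=0.8):
    """Blend the fused content score with a link-based score; return doc ids, best first."""
    prior = min_max(link_prior)
    final = {
        doc: alpha * s + (1 - alpha) * prior.get(doc, 0.0)
        for doc, s in fused.items()
    }
    # Ties are broken by doc id so the ordering is deterministic.
    return sorted(final, key=lambda d: (-final[d], d))


# Hypothetical scores for three pages from three separate searches.
runs = {
    "body": {"a": 2.0, "b": 1.0, "c": 0.5},
    "title": {"b": 3.0, "c": 1.0},
    "anchor": {"b": 1.0, "a": 0.5},
}
# Hypothetical link-based scores (for example, a PageRank-like value per page).
link_prior = {"a": 0.9, "b": 0.1, "c": 0.5}

print(rerank(combine(runs, {}), link_prior))  # ['b', 'a', 'c']
```

In this toy data, page `b` ranks first because it scores well in the title and anchor-text searches, even though its link-based score is the lowest; a smaller `alpha` would give the link evidence more influence.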

Item type: Book chapter
Keywords: World Wide Web, Information retrieval (IR), Web Page
Subjects: L. Information technology and library technology > LC. Internet, including WWW.
Depositing user: R. Gómez-Díaz
Date deposited: 13 Dec 2009
Last modified: 02 Oct 2014 12:16
URI: http://hdl.handle.net/10760/14005


