La recuperación de información en el Web : retos y ¿soluciones?

Berrocal, José Luis and Zazo, Ángel F. and Figuerola, Carlos G. and Rodríguez, Emilio. La recuperación de información en el Web : retos y ¿soluciones?, 2004. In I Congreso Internacional sobre Tecnología Documental y del Conocimiento, Madrid (Spain), 28-30 January 2004. [Conference paper]



English abstract

Classic information retrieval systems have run into problems when applied to information on the Web. The particular characteristics of this information are forcing the design of new mechanisms that achieve much higher levels of precision and enable users to obtain what they really need. In response to these new challenges, our research group REINA is working on possible solutions. Some theories for improving information retrieval will be analyzed, and the Sacarino bot tool will be presented as possible software to facilitate this task.

Spanish abstract

Los sistemas de recuperación de información clásicos se han encontrado con problemas a la hora de ser implementados en la información del web. Las particularidades de esta información están obligando a diseñar nuevos mecanismos que permitan unos niveles de precisión mucho más elevados y que posibiliten que el usuario obtenga lo que realmente necesita. Ante los nuevos retos aparecidos, nuestro grupo de investigación REINA está trabajando en las posibles soluciones. Se analizarán algunas de las teorías de mejora de la recuperación de información y se presentará la herramienta Sacarino bot como posible software que facilite esta tarea.

Item type: Conference paper
Keywords: Recuperación de información, Internet, Information Retrieval
Subjects: L. Information technology and library technology
H. Information sources, supports, channels.
Depositing user: Javier Martinez
Date deposited: 24 Aug 2004
Last modified: 02 Oct 2014 11:59


