Evaluación de sistemas españoles de recuperación de información distribuída en Internet

Amat, Carlos B. Evaluación de sistemas españoles de recuperación de información distribuída en Internet., 2005 PhD thesis thesis, Universidad de Valencia (Spain). [Thesis]

[thumbnail of TesisCBAmat.pdf]
Preview
PDF
TesisCBAmat.pdf

Download (3MB) | Preview

English abstract

The set of information spaces collectively referred as Internet poses serious problems to information retrieval tasks. Content evolution of Internet spaces and documents is reviewed and distinctive features of web documents are empathized. Web search engines are classified according to their scope, functionalities and retrieval philosophy. A chapter is devoted to the characterization of Spanish web though the study of a random set of web sites, their quantitative composition and their qualitative features. The analysis of search engines of the Spanish web begins with a study of coverage, methods of crawling, data schema and indexing mechanisms. Finally, eight search engines (AltaVista, EnlaWeb, Lycos, Olé/Terra, Ozú, Sol, Ya and Yahoo) were evaluated in retrieving information from Spanish web space. Indicators chosen were their relative coverage, specific offering, proportion of dead links and accessibility of Spanish websites. Performance was determined by relative recall and precision in retrieval during the first quarter in 2003. Search topics and relevance of results were determined by the end users. 12,4% of the searches led to dead links and 76% of the pages were returned by only a single system. System performance, expressed in terms of recall ranged from 7% (AltaVista) to 14% (Ozú) and precision between 9% (Sol) and 30% (Ozú). Only Yahoo displayed typical inverse relationship between recall and precision figures. The rest of the systems invariably showed an increase in precision figures starting with the second or third search result, suggesting problems with the sorting algorithm.

Spanish abstract

El conjunto de espacios informativos que, colectivamente, se denomina Internet, plantea serios desafíos desde el punto de vista de la documentación y la recuperación de información. Parece conveniente introducir este conjunto de problemas con una revisión de la evolución de Internet que, más que centrarse en los desarrollos técnicos, atienda a la progresiva configuración de su contenido informativo. Desde este punto de vista, Internet parece haber evolucionado en sentido centrífugo desde un estado de homogeneidad temática hasta un universo de gran heterogeneidad. Este acercamiento permite caracterizar de forma conveniente el universo documental que alberga y sus propiedades, que lo diferencian mucho del universo documental tradicional, alrededor de documentos y fuentes de información estructurados. Tras esta revisión, se examinan los sistemas para la recuperación de la información distribuida desarrollados en cada uno de los espacios que han venido integrándose en Internet y, especialmente, los del espacio Web. Más que disponerlos en orden cronológico, se propone una clasificación funcional de estos sistemas y se atiende a las ventajas e inconvenientes de cada modelo. Por último, se revisan los trabajos que han intentado evaluar los sistemas de recuperación de información distribuida como paso previo a establecer un plan de trabajo que permita evaluar los sistemas españoles de recuperación de información en Internet. El examen de la evolución de Internet, el análisis de las características de la información y los documentos que contiene, el establecimiento de una taxonomía de sistemas para su recuperación y los métodos de evaluación de estos mismos sistemas se basan en una revisión de la literatura amplia, pero especialmente centrada en las aportaciones más recientes y procedentes con frecuencia de campos no estrictamente relacionados con la documentación tradicional.

Item type: Thesis (UNSPECIFIED)
Keywords: Search Engines; Web search; Retrieval evaluation
Subjects: L. Information technology and library technology > LS. Search engines.
Depositing user: Carlos Benito
Date deposited: 12 Dec 2007
Last modified: 02 Oct 2014 12:10
URI: http://hdl.handle.net/10760/10799

References

Diameter of the World-Wide Web (1999). Nature, 401 (6749): 130-131.

20 Year Usenet Timeline (2003). Google, Inc [Online]. Accesible en: http://www.google.com/googlegroups/archive_announce_20.html (3 de Julio, 2003)

The Open Directory Project (2003). Wikipedia [Online]. Accesible en: http://www.wikipedia.org/wiki/Open_Directory_Project (6 de Agosto, 2003)

Abad García, M. (1997). Evaluación de los componentes de los sistemas de recuperación de la información. En: Investigación Evaluativa en Documentación (pp. 125-163). Valencia: Universitat de València.

Abad García, M. (1997). Evaluación de la eficacia de los SRI. En: Investigación evaluativa en Documentación: Aplicación a la Documentación Médica (pp. 85-122). Valencia: Universitat de València.

Abiteboul, S., Preda, M., Cobena, G(2003): Adaptive On-Line Page Importance Computation. Twelfth International World Wide Web Conference. 20 de mayoo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Adamic, L. Huberman, B. (2001). The Web's Hidden Order. Communications of the ACM, 44 (9): 55-59.

Adell, J : WWW and gopher statistics? (Respuesta) [Online]. Accesible en: http://groups.google.com/groups?hl=es&lr=&ie=UTF-8&oe=UTF-8&selm=jordi-150394110424%40bembo.edu.uji.es. (15 de Marzo, 1994)

Adell, J. (2002). Arqueología digital: Los primeros servidores web de España. Universitat Jaume I, Departament de Noves Tecnologies en Educació [Online]. Accesible en: http://nti.uji.es/~jordi/historia_spain_web/html/index.html (13 de Febrero, 2003)

Aguilló, I.(2000): Internet invisible o Infranet: definición, clasificación y evaluación. Séptimas Jornadas Españolas de Documentación.19 de octubre de 2000. Bilbao, FESABID.

Tsoi, A.S., Morini, G., Scarselli, F., Hagenbuchner, M., Maggini, M. (2003): Adaptive Ranking of Web Pages. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Aldana Montes, J., Gómez Lora, A., Moreno Vergara, N., Roldán García, MM (2002). Querying the Semantic Web: Feasibility Issues. UPGrade, 3 (4).

Alonso Berrocal, J. (2000). Cibermetría. Análisis de los dominios Web españoles: recuperación en internet. Tesis doctoral. Universidad de Salamanca.

Amat, C. B. (1998). Sistemas de recuperación de información distribuida en Internet. Una revisión de su evolución, sus características y sus perpectivas. Primera parte. Revista Española de Documentación Científica, 21 (4): 463-474.

Amat, C. B. (1999). Recuperación en Internet: Cuatro modelos complementarios y una agenda para su integración. Boletín de RedIRIS,(48).

Amat, C. B. (2003). Caracterización de una muestra de sedes Web españolas bajo dominio .es. Boletín de RedIRIS,(64): 33-40.

Andreesen, M : NCSA Nosaic for X 0.10 available [Online]. Accesible en: http://groups.google.com/groups?selm=MARCA.93Mar14225600%40wintermute.ncsa.uiuc.edu. (14 de Marzo, 1993)

Arasu, A., Cho, J., García-Molina, H., Paepcke, A., Raghavan, S. (2001). Searching the Web. ACM Transactions on Internet Technology, 1 (1): 2-43.

AT&T (1995). AT&T to include FrontPage in Easy World Wide Web Service. AT&T [Online]. Accesible en: http://www.att.com/news/1195/951121.bsa.html

Baeza-Yates, R. Ribeiro-Neto, B. (1999). Searching the Web. In Modern Information Retrieval (pp. 367-396). Harlow: Pearson Education.

Baeza-Yates, R. (2002). The Web of Spain . UPGrade [Online]. Accesible en: http://www.upgrade-cepis.org/issues/2002/3/upgrade-vIII-3.html (21 de Octubre, 2003)

Baeza-Yates, R. Saint-Jean, F. (2003). Análisis de consultas a un buscador y su aplicación a la jerarquización de páginas web. BiD [Online]. Accesible en: http://www2.ub.es/bid/consulta_articulos.php?fichero=10baeza.htm (23 de Septiembre, 2003)

Baeza-Yates, R. (2004). Excavando la Web. El Profesional de la Información, 13 (1): 4-10.

Baeza-Yates, R. (2003). Information retrieval in the Web: beyond current search engines. International Journal of Approximate Reasoning, 34 (2-3): 97-104.

Bailey, P., Craswell, N., Hawking, D. (2003). Engineering a multi-purpose test collection for Web retrieval experiments. Information Processing and Management, 39 (6): 853-871.

Bar-Ilan, J. (1998). On the overlap, the precision and estimated recall of search engines. A case study of the query 'Erdos'. Scientometrics, 42 (2): 207-228.

Bar-Ilan, J. (1999). Search Engine Results over Time: A Case Study on Search Engine Stability. Cybermetrics, 2-3 (1): 1.

Bar-Ilan, J. (2003). How much information do search engines disclose on the links to a web page? A longitudinal case study of the 'cybermetrics' home page. Journal of Information Science, 28 (6): 455-466.

Baró i Queralt, J.(1997): Cerca i recuperació d'informació al World Wide Web: una aproximació a les eines disponibles. Sisenes Jornades Catalanes de Documentació. 23 de Octubre de 1997. Barcelona: FESABID; SOCADI.

Bates, M. (2002). After the Dot-Bomb: Getting Web Information Retrieval Right This Time. First Monday [Online]. Accesible en www.firstmonday.dk/issues/issue7_7/bates/ (20 de septiembre, 2002)

Beaver, A. (1998). Evaluating Search Engine Models for Scholarly Purposes: A report from the Internet Applications Laboratory. D-Lib Magazine [Online]. Accesible en: http://www.dlib.org/dlib/diciembre98/12beavers.html (20 de septiembre, 2002)

Beckett, D.(1997): 30% Accessible - A Survey of The UK Wide Web. 6th World Wide Web Conference. Santa Clara (California), International World Wide Web Consortium.

Behlendorf, B : MCC's EINet(TM) Introduces Galaxy, an Internet Directory Service [Online]. Accesible en: http://groups.google.com/groups?q=einet+galaxy&hl=es&lr=&ie=UTF-8&oe=UTF-8&selm=2i2l2f%24goc%40agate.berkeley.edu&rnum=1. (20 de enero, 1994)

Bellardo Hahn, T. (1998). Text Retrieval Online: Historical Perspective on Web Search Engines. Bulletin of the American Society for Information Science, 24 (4): 7-10.

Bergman, M. (2001). The Deep Web: Surfacing Hidden Value. Journal of Electronic Publishing [Online]. Accesible en: http://www.press.umich.edu/jep/07-01/bergman.html (11 de julio, 2003)

Bergonneau, M. (2002). The French Connection: Minitel meets the Web. Onlie Journalism Review [Online]. Accesible en: http://www.ojr.org/ojr/business/1017968245.php (11 de enero, 2004)

Berners-Lee, T. (1989). Information Management: A Proposal. W3 Archive [Online]. Accesible en: http://www.w3.org/History/1989/proposal.html (11 de julio, 2003)

Berners-Lee, T (1991): WorldWideWeb: Summary [Online]. Accesible en: http://groups.google.com/groups?selm=6487@cernvax.cern.ch. (6 de agosto, 2003)

Berners-Lee, T., Caillou, R., Groff, J., Pollermann, B. (1992). World-Wide Web: The Information Universe . Electronic Networking: Research, Applications and Policy, 1 (2): 78-84.

Berners-Lee, T. (1996). The World Wide Web: Past, Present and Future. W3 Archive [Online]. Accesible en: http://www.w3.org/People/Berners-Lee/1996/ppf.html (15 de julio, 2003)

Berners-Lee, T. (1998). Semantic Web Road map. World Wide Web Consortium [Online]. Accesible en: http://www.w3.org/DesignIssues/Semantic.html (17 de septiembre, 2003)

Berners-Lee, T., Hendler, J., Lassila, O. (2001). The Semantic Web. Scientific American (mayo, 2001).

Berrocal, J., Figuerola, C., Zazo, A., Rodríguez, E.(2002): La Cibermetría en la recuperación de información en el Web. Primeras Jornadas de Tratamiento y Recuperación de la Información. 4 y 5 de julio de 2002, Valencia.

Berrocal, J., Figuerola, C., Zazo, A., Rodríguez, E. (2003). Agentes inteligentes: recuperación autónoma de la información en la Web. Revista Española de Documentación Científica, 26 (1): 11-20.

Bharat, K. Broder, A.(1998): A technique for measuring the relative size and overlap of public Web search engines. 7th International WWW Conference. 14 de abril de 1998. Brisbane.

Bharat, K (2001): Ranking search results by reranking the results based on local inter-connectivity. United States Patent 6,526,440

Borlund, P. (2000). Experimental components for the evaluation of interactive information retrieval systems. Journal of Documentation, 56 (1): 71-90.

Borlund, P. (2003). The IIR evaluation model: a framework for evaluation of interactive information retrieval systems. Information Research [Online]. Accesible en: http://informationr.net/ir/8-3/paper152.html (15 de enero, 2004).

Bowman, C., Danzig, P., Hardy, D., Manber, U., Schwartz, M.(1995): The Harvest Information Discovery and Access System.1 de Octubre de 1994. Chicago: National Center for Supercomputing Applications.

Bray, T. (1996). Measuring the Web. Computer Networks and ISDN Systems, 28 (7-11): 993-1005.

Brewington, B. Cybenko, G. (2000). How dynamic is the Web ? Computer Networks, 33 (1-6): 257-276.

Brin, S. Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30 (1-7): 107-117.

Broder, A. (2000). Graph structure in the Web. Computer Networks, 33 (1-6).

Broder, A. (2002). A taxonomy of web search. SIGIR Forum, 36 (2).

Broder, A., Najork, M., Wiener, J.(2003): Efficient URL Caching for World Wide Web Crawling. Twelfth International World Wide Web Conference. 20 de mayoo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Brooks, T. (2003). Web search: how the Web has changed information retrieval. Information Research [Online]. Accesible en: http://informationr.net/ir/8-3/paper154.html (15 de enero, 2004).

Bruce, H. (1998). User satisfaction with information seeking on the Internet. Journal of the American Society for Information Science, 49 (6): 541-556.

Bumgarner, J. (2002). The Great Renaming: 1985 - 1988. James Madison University [Online]. Accesible en: http://www.vrx.net/usenet/history/rename/ (3 de julio, 2003)

Burrows, M (1998): Method for statistically projecting the ranking of information. Unites States Patent 5,765,150

Bush, R. (1993). FidoNet: technology, tools, and history. Communications of the ACM, 36 (8): 31-35.

Butler, D. (1999). The writing is on the Web for Science journals in print. Nature, 397 (6716): 195-200.

Caillou, R. (2002). A Little History of the World Wide Web: from 1945 to 1995 Rev 1.39. Web Consortium [Online]. Accesible en: http://www.w3.org/History.html (14 de julio, 2003)

Calanag, M. L. (2003). Public libraries in the information society: what do information policies say. World Library and Information Congress: 69th IFLA General Conference and Council . 1 de agosto, 2003. Berlin, IFLA. [Online]. Accesible en http://www.ifla.org/IV/ifla69/papers/112e-Calanag.pdf (5 de febrero, 2004)

Can, F., Nuray, R., Sevdik, A. B. (2004). Automatic performance evaluation of Web search engines. Information Processing Management, 40 (3): 495-514.

Castells, M. (2001). Internet y la sociedad red: Lección inaugural del programa de doctorado sobre la sociedad de la información y el conocimiento. Universitat Oberta de Catalunya [Online]. Accesible en: http://www.uoc.edu/web/esp/articles/castells/print.html (7 de abril, 2004)

Castillo Blasco, L., Martínez de Pablos, M., Server, G. (1999). Evaluación de la información contenida en seis sedes web de las Escuelas Universitarias y Facultades de Biblioteconomía y Documentación españolas. Revista Española de Documentación Científica, 22 (3): 325-332.

Castillo Sobrino, M. d., Serrano Moreno, J., Sesmero Llorente, M.(2003): Arquitectura multiagente para la asignación de categorías a textos. Segundas Jornadas de Tratramiento y Recuperación de la Información. 8 de Septiembre de 2003. Leganés: Universidad Carlos III.

Cerf, V., Dalal, Y., Sunshine, C. (1974). RFC 675: Specification of Internet transmission control program. Network Information Center Network Working Group [Online]. Accesible en: http://www.cis.ohio-state.edu/cgi-bin/rfc/rfc0675.html (29 de enero, 2003)

Chankhunthod, A., Danzig, P., Neerdaels, C., Schwartz, M., Worrel, K., c (1996). A Hierarchical Internet Object Cache. Proceedings of the 1996 Usenix Technical Conference [Online]. Accesible en: http://www.usenix.org/publications/library/proceedings/sd96/full_papers/danzig-html/cache.html (22 de enero, 1996)

Cho, J. García-Molina, H.(2000): The Evolution of the Web and Implications for an Incremental Crawler. VLDB Conference.1 de Septiembre de 2000. El Cairo, Very Large Data Base Endowment Inc. [Online]. Accesible en: http://www.vldb.org/dblp/db/conf/vldb/ChoG00.html (15 de agosto, 2003)

Claffy, K. (2000). Measuring the Internet. IEEE Internet Computing, 4 (1): 73-75.

Clarke, S. Willett, P. (1997). Estimating the recall perfomance of Web search engines. ASLIB Proceedings, 49 (7): 184-189.

Clever Project (1999). Hypersearching the Web. Scientific American,(junio, 1999).

Codina, L. (2003). La Web semántica: una visión crítica. El Profesional de la Información, 12 (2): 149-152.

Comisión del Mercado de las Telecomunicaciones (2001). Estudio sobre la presencia de las entidades españolas (.es) en Internet. Novatica,(152): 42-44.

Computer Museum History Center (2002). Timeline of Computer History. Computer Museum History Center [Online]. Accesible en: http://www.computerhistory.org/timeline/ (5 de febrero, 2003)

Corbalán, L. M. Amat, C. B. (2003). Vocabulario de información y documentación automatizada. Valencia: Universitat de València.

Corchuelo, R., Arjona, J., Toro, M. (2002). Automatic Extraction of Semantically-Meaningful Information from the Web. UPGrade, 3 (3).

Corporation for Research and Educational Networking (1997). CREN History and Future. Corporation for Research and Educational Networking [Online]. Accesible en: http://www.cren.net/cren/cren-hist-fut.html (7 de febrero, 2003)

Courtois, M. Berry, M. (1999). Results ranking in Web search engines. Online Magazine, 23 (3): 39.

Craven, T. C. (2004). Variations in use of meta tag descriptions by Web pages in different languages. Information Processing Management, 40 (3): 479-493.

Crimmins, F., Smeaton, A., Dkaki, T., Mothe, J. (1999). TétraFusion: Information Discovery on the Internet. IEEE Intelligent Systems, 14 (4): 55-62.

Croft, W. Turtle, H.(1989): A Retrieval Model Incorporating Hypertext Links. Proceedings of the second annual ACM conference on Hypertext. 1 de Noviembre de 1989. Pittsburgh, ACM.

Culliss, G (1999): Method for organizing information. United States Patent 6,006,222

Danzig, P., Obraczka, K., Li, S. (1993). Internet Resource Discovery Services. IEEE Computer, 26 (9): 8-22.

Dasen, M. Wilde, E.(2001): Keeping Web indices up-to-date. Tenth International World Wide Web Conference.1 de Mayo de 2001. Hong Kong, International World Wide Web Consortium.

Davila, R. (2000). History and Development of the Internet. San Antonio Public Library: Government Documents [Online]. Accesible en: http://www.sat.lib.tx.us/Displays/itintro.htm (31 de Enero, 2003)

Dekkers, M. Weibel, S. (2003). State of the Dublin Core Metadata Initiative, Abril 2003. D-Lib Magazine [Online]. Accesible en: http://www.dlib.org/dlib/april03/weibel/04weibel.html (15 de enero, 2004).

Deutsch, P. Emtage, A.(1992): Archie: An Electronic Directory Service for the Internet. Proceedings of Usenix.1 de Enero de 1992. San Francisco, USENIX.

Dhyani, D, Keong Ng, W, Bhowmick, SS (2002). A survey of Web Metrics. ACM Computing Surveys, 34 (4): 469-503.

Digital Equipment Corporation : Digital develops Internet's first "Super Spider" [Online]. Accesible en: http://groups.google.com/groups?selm=9512151806.AA02246%40raptor.pa.dec.com. (15 de diciembre, 2003)

Dill, S., Kumar, R., Mccurley, K., Rajagopalan, S., Sivakumar, D., Tomkins, A. (2002). Self-similarity in the web. ACM Transactions on Internet Technology, 2 (3): 205-223.

Douglis, F., Feldmann, A., Krishnamurthy, B., Mogul, J. (1997). Rate of Change and other Metrics: a Live Study of the World Wide Web. USENIX Symposium on Internet Technologies and System. 8 de diciembre, 1997. Monterrey, USENIX.

Dublin Core Metadata Initiative (2003). Dublin Core Metadata Element Set, Version 1.1: Reference Description. OCLC DCMI [Online]. Accesible en: http://www.dublincore.org/documents/dces/ (16 de septiembre, 2003)

Eckmann, J. Moses, E. (2002). Curvature of co-links uncovers hidden thematic layers in the World Wide Web. Proceedings of the National Academy of Sciences USA, 99 (9): 5825-5829.

Eiron, N. Mccurley, K.(2003): Analysis of anchor text for web search. Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval. 28 de Julio de 2003. Toronto, ACM.

Emtage, A : Announcing "Archie 1.0": The Archive Server Server [Online]. Accesible en: http://groups.google.com/groups?q=archie+emtage&hl=es&lr=&ie=UTF-8&oe=UTF-8&selm=1990Nov15.045448.2861%40ox.com&rnum=1. (14 de noviembre, 1990)

Enos, L. (2001). Excite@Home is raising funds to improve its bottom line while at the same time taking steps to cut costs. E-Commerce Times [Online]. Accesible en: http://www.ecommercetimes.com/perl/story/11148.html (20 de agosto, 2002)

Escalona, M., Mejías, M., Torres, J. (2002). Methodologies to develop Web Information Systems and Comparative Analysis. UPGrade, 3 (3).

ESNIC (2003). Estadísticas del ES-NIC: Dominios registrados en los últimos años. ESNIC [Online]. Accesible en: https://www.nic.es/documentacion/estadisticas.html (30 de julio, 2003)

Faloutsos, M., Faloutsos, P., Faloutsos, C. (1999). On Power-Law Relationships of the Internet Topology . ACM SIGCOMM Computer Communication Review , Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication, 29 (4): 251-262.

Federal Networking Council (1995). FNC Resolution: Definition of "Internet" Federal Networking Council. [Online]. Accesible en http://www.hpcc.gov/fnc/Internet_res.html (11 de septiembre, 2002).

Fernández Beobide, C. González Obiol, A. (1992). Videotex e Ibertex: Experiencias y realizaciones. Telos,(29).

Fetterly, D., Manasse, M., Najork, M., Wiener, J.(2003): A Large-Scale Study of the Evolution of Web Pages. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Fichter, D. (2003). Exploiting intranet search engines for data discovery. Online, 27 (6): 47.

Fidel, R., Davies, R., Douglas, M., Holder, J., Hopkins, C., Kushner, E. et al. (1999). A visit to the information mall: Web searching behavior of high school students. Journal of the American Society for Information Science, 50 (1): 24-37.

Ford, G. (2001). Theory and Practice in the Networked Environment: A European Perspective. In C.McClure J. Bertot (Eds.), Evaluating Networked Information Services. Techniques, Policy and Issues (pp. 1-22). Melford: Information Today.

Ford, N. Miller, D. M. N. (2001). The role of individual differences in Internet searching: An empirical study. Journal of the American Society for Information Science and Technology, 52 (12): 1049-1066.

Foster, S : Veronica: an Archie for Gopher [Online]. Accesible en: http://groups.google.com/groups?q=veronica+nevada+university+group:comp.infosystems.gopher&start=20&hl=es&lr=&ie=UTF-8&oe=UTF-8&scoring=d&selm=9211180514.AA01778%40pyramid&rnum=25. (17 de noviembre, 2003)

Fox, E. Urs, S. (2002). Digital Libraries. Annual Review of Information Science and Technology, 36: 503-589.

Fragoudis, D. Likothanassis, S.(1999): Retriever: an agent for intelligent information recovery. Proceedings of the 20th International Conference on Information Systems.12 de Diciembre de 1999. Charlotte (NC).

García Barriocanal, H., Sicilia Urbán, M., Aedo Cuevas, I. (2003). Ontology-Based Annotation of Usability Evaluation-Related Resources: Design and Retrieval Mechanisms . UPGrade, 4 (1): 12-17.

García Santiago, M. (2000). Topología de la información en la World Wide Web: Modelo metodológico de visualización en una red hipertextual nacional. Tesis doctoral. Universidad de Granada.

García, J. (1998). IRIS-NEWS: la aventura de la Usenet en RedIRIS. Boletín de RedIRIS,(44).

Garratt, A., Jakson, M., Burden, P., Wallis, J. (2001). A survey of alternative designs for a search engine storage structure. Information and Storage Technology, 43 (11): 661-677.

Glover, E., Tsioutsiouliklis, K., Lawrence, S., Pennock, D., Flake, G.(2002): Using Web Structure for Classifying and Describing Web Pages. Eleventh International World Wide Web Conference. 7 de Mayo de 2002. Honolulu. International World Wide Web Consortium.

Google Groups Team (2001). Google Groups Archive Information. google.public.support.general [Online]. Accesible en: http://groups.google.com/groups?selm=90cbefb1.0112211728.4cfe9bb%40posting.google.com (8 de julio, 2003)

Gorbunov, A. (2002). Relevance of Web documents: Ghosts consensus method. Journal of the American Society for Information Science and Technology, 53 (10): 783-788.

Gordon, M. Pathak, P. (1999). Finding information on the world wide web: the retrieval efectiveness of search engines. Information Processing and Management, 35 (2): 144-180.

Gómez Díaz, R. (2003). La evaluación en recuperación de la información. Hipertext.net [Online]. Accesible en: http://www.hipertext.net/web/pag188.htm (5 de noviembre, 2003)

Gravano, L., Chang, K., García Molina, H., Lagoze, C., Paepcke, A. (1997). STARTS: Stanford Protocol Proposal for Internet Retrieval and Search. Digital Library Project Stanford University [Online]. Accesible en: http://www-db.stanford.edu/~gravano/starts.html (15 de septiembre, 2003)

Greco, G., Greco, S., Zumpano, E. (2001). A Probabilistic Approach for Distillation and Ranking of Web Pages. World Wide Web, 4: 189-207.

Griffiths, R. (2002). History of Internet, Internet for Historians (and just about everyone else). Leiden University [Online]. Accesible en: http://www.let.leidenuniv.nl/history/ivh/frame_theorie.html (2 de julio, 2003)

Guha, R., McCool, R., Miller, E.(2003): Semantic Search. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Gurrin, C. Smeaton, A. (2004). Replicating Web Structure in Small-Scale Test Collections. Information Retrieval, 7 (3-4): 239-263.

GVU's WWW Surveying Team (1998). GVU's Tenth WWW User Survey (Conducted Octubre 1998). Georgia Institute of Technology [Online]. Accesible en: http://www.gvu.gatech.edu/user_surveys/survey-1998-10/ (30 de septiembre, 2003)

Haas, S. Grams, E. (2000). Readers, Authors, and Page Structure: A Discussion of Four Questions Arising from a Content Analysis of Web Pages. Journal of the American Society for Information Science, 51 (2): 181-192.

Hald, A. (1952). Statistical Tables and Formulas. (s.l.): Wiley.

Han, Y., Loke, S., Sterling, L. (1996). Agents for Citation Finding on the World Wide Web . Technical Report 96/40. Parkville, University of Melbourne.

Hardy, D., Schwartz, M., Wessels, D. (1996). Harvest User's Manual Version 1.4 patchlevel 2. Internet Research Task Force Research Group on Resource Discovery [Online]. Accesible en: http://harvest.sourceforge.net/harvest-1.4.pl2-docs/user-manual.html (15 de septiembre, 2003)

Hardy, H. (1993). The History of the Net v8.5. Master Thesis. School of Communications, Grand Valley State University.

Harter, S. Hert, C. (1997). Evaluation of Information Retrieval Systems: Approaches, Issues and Methods. Annual Review of Information Science and Technology, 32: 3-94.

Hauben, M. Hauben, R. (1996). Netizens: On the History and Impact of the Net. Columbia University [Online]. Accesible en: http://www.columbia.edu/~rh120/ (2 de julio, 2003)

Hausherr, T. (2001). Xenu's Link Sleuth (Version 1.1c) [Programa informático]. Berlin.

Hawking, D., Craswell, N., Thistlewaite, P., Harman, D. (1999). Results and challenges in Web search evaluation. Computer Networks, 31 11-16.

Hawking, D., Craswell, N., Bailey, P., Griffiths, K. (2001). Measuring Search Engine Quality. Information Retrieval, 4 (1): 33-59.

Hawking, D. Robertson, S. (2003). On Collection Size and Retrieval Effectiveness. Information Retrieval, 6 (1): 99-105.

Heery, R. (1996). Review of Metadata Formats. Program, 30 (4): 345-373.

Hendler, J. (1999). Web Matters: Is there an Intelligent Agent in Your Future ? Nature [Online]. Accesible en: http://www.nature.com/nature/webmatters/agents/agents.html (10 de diciembre, 2003)

Hendler, J. (2001). Agents and the Semantic Web. IEEE Intelligent Systems, 16 (2): 30-37.

Henzinger, M., Heydon, A., MIzenmacher, M., Najork, M. (1999). Measuring index quality using random walks on the Web. Computer Networks, 31 1291-1303.

Henzinger, M., Bay-Wei Chang, Brian Milch, Sergey Brin(2003): Query-Free News Search. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Hermans, B. (1996). Intelligent Software Agents on the Internet: an inventory of currently offered functionality in the information society and a prediction of (near-)future developments. Doctoral Dissertation Tilburg University [Online]. Accesible en: http://www.broadcatch.com/agent_thesis/ (24 de septiembre, 2003)

Hermans, B. (1997). Intelligent Software Agents on the Internet. First Monday [Online]. Accesible en: http://www.firstmonday.dk/issues/issue2_3/index.html (24 de septiembre, 2003)

Hermans, B. (1998). Desperately Seeking: Helping Hands and Human Touch. First Monday [Online]. Accesible en: http://www.firstmonday.dk/issues/issue3_11/index.html (24 de septiembre, 2003)

Herring, S. (2002). Computer-Mediated Communication on the Internet. Annual Review of Information Science and Technology, 36: 109-168.

Hípola, P. Vargas Quesada, B. (1999). Agentes inteligentes, definición y tipología. Los agentes de información. El Profesional de la Información, 8 (4): 13-21.

Hölscher, C. Strube, G.(2000): Web Search Behavior of Internet Experts and Newbies. Ninth International World Wide Web Conference. 15 de Mayo de 2000. Amsterdam: Centre for Mathematics and Computer Science; International World Wide Web Consortium.

Hsieh-Yee, I. (1998). The retrieval power of selected search engines: how well do they address general reference questions and subject questions? Reference Librarian, 60 27-47.

Huberman, B., Pirolli, P., Pitkow, J., Lukose, R. (1998). Strong Regularities in World Wide Web Surfing. Science, 280 (5630): 95-97.

Huberman, B. Adamic, L. (1999). Growth dynamics of the World-Wide Web. Nature, 401 (6749): 131.

Huberman, B. (2002). Patterns in the World Wide Web. Libray of Economics and Liberty [Online]. Accesible en: http://www.econlib.org/library/Columns/Hubermanpatterns.html (5 de febrero, 2004)

Internet Society (2002). What is the Internet ? Internet Society [Online]. Accesible en: http://www.isoc.org/internet/index.shtml (3 de marzo, 2003)

Jansen, B. (1997). Using an intelligent agent to enhace search engine perfomance. First Monday [Online]. Accesible en: http://www.firstmonday.dk/issues/issue2_3/jansen/index.html (24 de septiembre, 2003)

Jansen, B. Pooch, U. (2001). A Review of Web Searching Studies and a Framework for Future Research. Journal of the American Society for Information Science, 52 (3): 235-246.

Jansen, B., Spink, A., Saracevic, T. (2002). Real life, real users, and real needs: a study and analysis of user queries on the web. Information Processing and Management, 36 (2): 207-227.

Jansen, B. Spink, A. An analysis of Web searching by European AlltheWeb.com users. Information Processing and Management, (en prensa).

Delort, J.Y., Bouchon-Meunier, B., Rifqi, M. (2003): Web Document Summarization by Context. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Jenkins, C., Jackson, M., Burden, P., Wallis, J. (1998). Searching the World Wide Web: an evaluation of available tools and methodologies. Information and Storage Technology, 39 (14-15): 985-994.

Johnson, F., Griffiths, J., Hartley, R. (2001). DEVISE. A framework for the evaluation of Internet search engines (Rep. No. 100). London: British Library.

Johnstone, B. Carlson, D. (2002). History of Electronic Publishing: Teletext and Videotext. Applied Interactive Newspapers Syllabus, Univ of Florida [Online]. Accesible en: http://iml.jou.ufl.edu/carlson/professional/new_media/history/ehistory.htm (17 de julio, 2003).

Kahle, B. (1989). Wide Area Information Server Concepts v4 Draft. Thinking Machines Corporation [Online]. Accesible en: http://nti.uji.es/software/Simple/docs/wais-concepts.txt (20 de julio, 2003).

Kahle, B. Medlar, A. (1991). An Information System for Corporate Users: Wide Area Information Servers v3. Universidad de Heidelberg [Online]. Accesible en: http://www.urz.uni-heidelberg.de/Netzdienste/internet/tools/info/wais/corporate.html (23 de julio, 2003).

Kannan, N : Qualifiers on Hypertext links... [Online]. Accesible en: http://groups.google.com/groups?selm=1991Aug2.115241@ardor.enet.dec.com. (2 de agosto, 2003).

Kantor, B. Lapsley, P. (1986). RFC 977: Network News Transfer Protocol: A Proposed Standard for the Stream-Based Transmission of News. Network Working Group [Online]. Accesible en: ftp://ftp.isi.edu/in-notes/rfc977.txt (8 de julio, 2003).

Kessler, J. (1995). The French Minitel: Is There Digital Life Outside of the "US ASCII" Internet? A Challenge or Convergence? D-Lib Magazine [Online]. Accesible en: http://www.dlib.org/dlib/diciembre95/12kessler.html (5 de septiembre, 2002).

Khan, M. Khor, S. (2004). Enhanced Web document retrieval using automatic query expansion. Journal of the American Society for Information Science and Technology, 55 (1): 29-40.

Khare, R. Rifkin, A.(1998): The origin of (document) species. 7th International World Wide Web Conference. 14 de abril de 1998. Brisbane.

Kirsch, ST (1997): Document retrieval over networks wherein ranking and relevance scores are computed at the client for multiple database documents. United States Patent 5,659,732.

Kleinberg, J. (1999). Authoritative sources in a hyperlinked environment. Journal of the ACM, 46 (5): 604-632.

Kleinberg, J (2000): Method and system for identifying authoritative information resources in an environment with content-based links between information resources. United States Patent 6,112,202.

Kleinberg, J. Lawrence, S. (2001). The Structure of the Web. Science, 294 1849-1850.

Kobayashi, M. Takeda, K. (2000). Information Retrieval on the Web. ACM Computing Surveys, 32 (2): 144-173.

Koch, T., Ardo, A., Brümer, A., Lundberg, S. (1996). The building and maintenance of robot based internet search services: A review of current indexing and data collection methods. NetLab Lund University Library [Online]. Accesible en: http://www.lub.lu.se/desire/radar/reports/D3.11/ (1 de septiembre, 2001).

Koehler, W. (1999). An Analysis of Web Page and Web Site Constancy and Perfomance. Journal of the American Society for Information Science, 50 (2): 162-180.

Koehler, W. (2002). Web page change and persistence: A four-year longitudinal study. Journal of the American Society for Information Science and Technology, 53 (2): 162-171.

Koehler, W. (2004). A longitudinal study of Web pages continued: a consideration of document persistence. Information Research [Online]. Accesible en: http://informationr.net/ir/9-2/paper174.html (5 de febrero, 2004)

Koster, M : ALIWEB (Archie-Like Indexing for the Web) [Online]. Accesible en: http://groups.google.com/groups?q=koster+aliweb+group:comp.infosystems.www+author:koster&hl=es&lr=&ie=UTF-8&oe=UTF-8&selm=1993Nov30.093536.28554%40cs.nott.ac.uk&rnum=1. (30 de noviembre, 2003).

Koster, M.(1994): ALIWEB - Archie-Like Indexing in the WEB. First International Conference on the World-Wide Web. 25 de mayoo, 1994. Geneva: CERN.

Kwong, L. Ng, Y. (2003). Performing Binary-Categorization on Multiple-Record Web Documents Using Information Retrieval Models and Application Ontologies. World Wide Web, 6 (3): 281-303.

Lamas, C. (2002). La investigación de Internet. Telos,(52).

Lancaster, F. Warner, A. (1993). Evaluation Criteria and Evaluation Procedures. In Information Retrieval Today (pp. 159-202). Arlington: Information Resources Press.

Lancaster, F. Warner, A. (1993). Subject Access: Problems and Perfomance Criteria. In Information Retrieval Today (pp. 43-63). Arlington: Information Resources Press.

Landoni, M. Bell, S. (2000). Information retrieval techniques for evaluating search engines: a critical overview. ASLIB Proceedings, 52 (3): 124-129.

Lavoie, B. Frystyk Nielsen, H. (2003). Web Characterization Terminology Definitions Sheet. World Wide Web Consortium [Online]. Accesible en: http://www.w3.org/1999/05/WCA-terms/ (3 de mayo, 2002)

Lawrence, S. Giles, C. (1998). Searching the World Wide Web. Science, 280 (5630): 98-100.

Lawrence, S. Giles, C. (1999). Accesibility of Information on the Web. Nature, 400 (6740): 107-109.

Lawrence, S. Giles, C. (1999). Searching the Web: General and Scientific Information Access. IEEE Communications, 37 (1): 116-122.

Leighton, V. Srivastava, J. (1997). Precision among World Wide Web Search Services. Winona State University [Online]. Accesible en: http://www.winona.msus.edu/library/webind2/webind2.htm (24 de septiembre, 2003)

Leighton, V. Srivastava, J. (1999). First 20 Precision among World Wide Web search services (Search Engines). Journal of the American Society for Information Science, 50 (10): 870-881.

Leiner, B., Cerf, V., Clark, D., Kahn, R., Kleinrock, L., Lynch, D. et al. (1997). The past and future history of the Internet. Communications of the ACM, 40 (2): 102-108.

Leiner, B., Cerf, V., Clark, D., Kahn, D., Kleinrock, L., Lynch, D. et al. (2000). A Brief History of the Internet Internet Society.

Li, L. Shang, Y. (2000). A new method for automatic performance comparison of search engines. World Wide Web, 3 241-247.

Li, L., Shang, Y., Zhang, W.(2002): Improvement of HITS-based Algorithms on Web Documents. International World Wide Web Conference. 7 de Mayo de 2002. Honolulu, International World Wide Web Consortium.

Liaw, S. S. Huang, H. M. (2003). An investigation of user attitudes toward search engines as an information retrieval tool. Computers in Human Behavior, 19 (6): 751-765.

Licklider, J. Clark, W. (1962). On-line Man Computer Interactions. MIT. Publicado también como Man-Computer Symbiosis. IRE Transactions on Human Factors in Electronics, HFE-1: 4-11, 1960 [Online]. Accesible en: http://medg.lcs.mit.edu/people/psz/Licklider.html (28 de abril, 2004).

Lieberman, H., Fry, C., Weitzman, L. (2003). Exploring the Web with Reconnaissance Agents. Communications of the ACM, 44 (8): 69-75.

Lindner, P (1991) : Internet Gopher v0.2 Curses Client and Server is available. [Online]. Accesible en: http://groups.google.com/groups?selm=1991Sep10.020238.4751%40cs.umn.edu. (10 de septiembre, 2002)

Lim, L., Wang, M., Padmanabha, S., Vitte, J.S., Agarwa, R (2003): Dynamic Maintenance of Web Indexes Using Landmarks. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Loeber, S. Cristea, A. (2003). A WWW Information Seeking Process Model. Educational Technology Society, 6 (3): 43-52.

López Alonso, M. Mares Marín, J.(1996): El futuro de la identificación de la información en Internet. Quintas Jornadas Españolas de Documentación Automatizada. 17 de Octubre de 1996. Caceres: FESABID.

López, D. Massa, J. (1998). Dando forma al envase y, con ello, al contenido: Webber. Boletín de RedIRIS,(45).

Lyman, P. Varian, H. (2000). How Much Information? Journal of Electronic Publishing [Online]. Accesible en: http://www.press.umich.edu/jep/06-02/lyman.html (3 de Marzo, 2002).

MacMurdo, G. (1995). How the Internet was indexed. Journal of Information Science, 21 (6): 479-489.

Maes, P. (2003). Agents that reduce work and information overload. Communications of the ACM, 37 (7): 30-40.

Maldonado Martínez, A. Fernández Sánchez, E.(1998): Evaluación de los principales "buscadores" desde un punto de vista documental: recogida, análisis y recuperación de recursos de información. Sextas Jornadas Españolas de Documentación. 29 de Octubre de 1998. Valencia: FESABID.

Mañas, J. (1994). Búsqueda y recuperación de información en Internet. Novatica,(110): 75-81.

Marable, L. (2003). False Oracles: Consumer Reaction to Learning the Truth About How Search Engines Work: Results of an Ethnographic Study. Baltimore, Consumer WebWacht Research [Online]. Accesible en : http://www.consumerwebwatch.org/news/searchengines/ (28 de abril, 2004)

Marzoiori, M.(1998): The limits of Web metadata, and beyond. Seventh International World Wide Web Conference. 14 de Abrilde 1998. Brisbane. International World Wide Web Consortium.

Marcos Mora, M. (1998). Motores de recuperación de información: un análisis comparativo (parte 1). El Profesional de la Información, 7 (1-2): 18-22.

Martínez de Lejarza Esparducer, I. (1999). Una aproximación al análisis regional del mercado de la información digital: Distribución, concentración y difusión regional de Internet en las comunidades autónomas españolas. Departamento de Economía Aplicada.Universidad de Valencia [Online]. Accesible en: http://www.uv.es/~econinfo/presentacion/mercinter/mercinter.html (4 de agosto, 2003)

Martínez Méndez, F. (2001). Propuesta y desarrollo de una metodología para la evaluación de la recuperación de información en Internet. Tesis doctoral. Universidad de Murcia.

Martínez Méndez, F. Rodríguez Múñoz, J. (2003). Síntesis y crítica de las evaluaciones de la efectividad de los motores de búsqueda en la Web. Information Research [Online]. Accesible en: http://informationr.net/ir/8-2/paper148.html (15 de enero, 2004).

Masashi Toyoda Masaru Kitsuregawa(2003): Analyzing Global Behavior of Web Community Evolution. Twelfth International World Wide Web Conference. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Massa, J. (2003). Metainformación Dublin Core: Elementos del conjunto de metadatas de Dublin Core: Descripción de Referencia. CSIC Red IRIS [Online]. Accesible en: http://www.rediris.es/metadata/dublin_core_elements.es.html (16 de septiembre, 2003)

Mauldin, M. Leavitt, J.(1994): Web-agent related research at the Center for Machine Traslation. Proceedings of the ACM Special Interest Group on Networked Information Discovery and Retrieval. 1 de Agosto de 1994. McLean, ACM.

Mauldin, M. (1995). Measuring the Web with Lycos. Third International WWW Conference [Online]. Accesible en: http://www.lazytoad.com/lti/pub/lycos-websize.html (31 de enero, 2001)

Mauldin, M. (1997). Lycos: Design choices in an Internet search service. IEEE Expert,(Janjuary February): 8-11.

Mauldin, ML (1998): Method for searching a queued and ranked constructed catalog of files stored on a network. United States Patent 5,748,954

McCahill, M. Anklesaria, F. (1995). Evolution of Internet Gopher. Journal of Universal Computer Science, 1 (4): 235-246.

Meghabghab, G. (2001). Google's Web Page Ranking Applied to Different Topological Web Graph Structures. Journal of the American Society for Information Science and Technology, 52 (9): 736-747.

Menczer, F. (2003). Complementing search engines with online web mining agents. Decision Support Systems, 35 (2): 195-212.

Menczer.F (2002). Growing and navigating the small world Web by local content. Proceedings of the National Academy of Sciences USA, 99 (22): 14014-14019.

Méndez Rodríguez, E. (2002). Metadatos y recuperación de información . Estándares , problemas y aplicabilidad en bibliotecas digitales. Gijón, Trea.

Microsoft (1996). Microsoft Acquires Vermeer Technologies Inc.: Critically Acclaimed Visual Client-Server Web Publishing Tool to Complement Internet Offerings From Microsoft Desktop Applications Division. [Online]. Accesible en: http://www.microsoft.com/presspass/press/1996/jan96/vrmeerpr.asp (21 de julio, 2003)

Milne, J (1995). Vermeer Technologies Gives Birth to FrontPage. Network Computing, 6.

Ministerio de Ciencia y Tecnología (2003). ORDEN CTE/662/2003, de 18 de marzo, por la que se aprueba el Plan Nacional de nombres de dominio de Internet bajo el código de país correspondiente a España («.es»). Boletín Oficial del Estado,(73): 11917-11924.

Mladenic, D. (1999). Text-Learning and related Intelligent Agents: A Survey. IEEE Intelligent Systems, 14 (4): 44-54.

Moise, G., Sander, J., Rafiei, D.(2003): Focused Co-citation: Improving the Retrieval of Related Pages on the Web. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Monk, T. Claffy, K. (2002). A survey of Internet Statistics/Metrics Activities. Technical Report. National Laboratory for Applied Network Research [Online]. Accesible en: http://www.caida.org/outreach/papers/1996/metricsurvey/metricsurvey.html (11 de abril, 2002)

Montaner, M., López, B., Rosa, J. d. l. (2003). A Taxonomy of Recommender Agents on the Internet. Artificial Intelligence Review, 19 (4): 285-330.

Montebello, M.(1998): Optimizing recall/precision scores in IR over the WWW. Proceedings of the 21st Annual International ACN SIGIR Conference on Research and Development in Information Retrieval. 1 de Agosto de 1998. Melbourne: ACM.

Mowshowitz, A. Kawaguchi, A. (2002). Assessing bias in search engines. Information Processing and Management, 38 (1): 141-156.

Muylle, S., Moenaert, R., Despontin, M. (2004). The conceptualization and empirical validation of web site user satisfaction. Information Management, 41 (5): 543-560.

Najork, M. Wiener, J.(2001): Breadth-First Search Crawling Yields High-Quality Pages. Tenth International World Wide Web Conference.1 de Mayo de 2001. Hong Kong, International World Wide Web Consortium.

Netscape Communications Corporation (1997). NetScape works with W3C and leading content providers to drive new specification for organizing, describing and navigating information on internet, intranets and desktops. NetScape [Online]. Accesible en: http://wp.netscape.com/flash1/newsref/pr/newsrelease488.html (16 de septiembre, 2003)

Newby GB (2002). The necessity for information space mapping for information retrieval on the semantic web. Information Research [Online]. Accesible en: http://informationr.net/ir/7-4/paper137.html (15 de enero, 2003).

Nogales Flores, J. (1999). Los usos básicos de Internet. Servicios y aplicaciones. En Caridad Sebastián, M (Ed.): La Sociedad de la Información. Política, Tecnología e Industria de los contenidos (pp. 143-173). Madrid: Centro de Estudios Ramón Areces.

Noh, Y.-H. (2003). A study on the estimation of performance of the concept-based information retrieval model for searching the Web. Journal of Information Science, 28 (5): 407-415.

O'Neill, E., McClain, P., Lavoie, B. (1998). A Methodology for Samplig the World Wide Web. Annual Review of OCLC Research [Online]. Accesible en: http://digitalarchive.oclc.org/da/ViewObject.jsp?objid=0000003447 (20 de octubre, 2003)

O'Neill, E., Lavoie, B., McClain, P. (1999). Web Characterization Project: An Analysis of Metadata Usage on the Web. Annual Review of OCLC Research [Online]. Accesible en: http://digitalarchive.oclc.org/da/ViewObject.jsp?objid=0000003486 (21 de octubre, 2003)

O'Neill, E., Lavoie, B., Bennett, R. (2003). Trends in the Evolution of the Public Web: 1998-2002. D-Lib Magazine [Online]. Accesible en: http://www.dlib.org/dlib/april03/lavoie/04lavoie.html (21 de octubre, 2003).

Olvera Lobo, M. (1999). Métodos y técnicas para la indización y la recuperación de los recursos de la World Wide Web. Boletín de la Asociación Andaluza de Bibliotecarios, 14 (57): 11-22.

Olvera Lobo, M. (1999). Evaluación de la recuperación de información en internet : un modelo experimental. Tesis doctoral. Universidad de Granada.

Olvera Lobo, M. (2000). Rendimiento de los sistemas de recuperación de información en la world wide web: revisión metodológica. Revista Española de Documentación Científica, 23 (1): 63-78.

Olvera Lobo, M. (2000). Rendimiento de los sistemas de recuperación de información en la web: evaluación de servicios de búsqueda (search engines). Revista Española de Documentación Científica, 23 (3): 302-316.

Oppenheim, C., Morris, A., Mcknight, C., Lowley, S. (2000). The evaluation of WWW search engines. Journal of Documentation, 52 (2): 190-211.

Page, L (2001): Method for node ranking in a linked database. United States Patent 6,285,999

Peiró, C. (1996). El fenómeno Internet en España: ayer, hoy y mañana. Exposición Universal Internet'96 [Online]. Accesible en: http://personales.mundivia.es/astruc/doctxt53.htm (17 de julio, 2003)

Peterson, R. (1997). Eight Internet Search Engines Compared. First Monday [Online]. Accesible en: http://www.firstmonday.dk/issues/issue2_2/peterson/index.html (1 de Marzo, 2000).

Pettigrew, K., Durrance, J., Unruh, K. (2002). Facilitating community information seeking using the Internet: Findings from three public library-community network systems. Journal of the American Society for Information Science and Technology, 53 (11): 894-903.

Picard, J. Savoy, J. (2003). Enhancing retrieval with hyperlinks: A general model based on propositional argumentation systems. Journal of the American Society for Information Science and Technology, 54 (4): 347-355.

Pinkerton, B (1994). The WebCrawler Index: A content-based Web index [Online]. Accesible en: http://groups.google.com/groups?selm=2r0rnm%24ftj%40news.u.washington.edu. (11 de June, 1994)

Pinkerton, B.(1994): Finding What People Want: Experiences with the WebCrawler.1 de Octubre de 1994. Chicago: National Center for Supercomputing Applications.

Pinkerton, B. (2000). WebCrawler: Finding What People Want. Doctor in Philosophy Doctoral Dissertation, Computer Science Department, University of Washington.

Pinkerton, B. (2002). WebCrawler Timeline. WebCrawler [Online]. Accesible en: http://www.thinkpink.com/bp/WebCrawler/History.html (4 de septiembre, 2003)

Pollock, A. Hockley, A. (1997). What's Wrong with Internet Searching. D-Lib Magazine [Online]. Accesible en: http://mirrored.ukoln.ac.uk/lis-journals/dlib/dlib/dlib/marzo97/bt/03pollock.html (15 de abril, 2001).

PricewaterhouseCoopers (2004). Estudio de la industria de contenidos digitales en España. Price Waterhouse Coopers España [Online]. Accesible en: http://www.pwc.com/es/esp/ins-sol/spec-int/ind_contenidos.html (7 de abril, 2004)

Quaterman, J. Hoskins, J. (1986). Notable Computer Networks. Communications of the ACM, 29 (10): 932-971.

Quaterman, J. (1996). User Growth of the Internet and of the Matrix. Matrix News, 6 (5).

Raghavan, P. (2002). Information Retrieval for Enterprise Content . UPGrade, 3 (3).

Rasmussen, E. (2003). Indexing and retrieval for the Web. Annual Review of Information Science and Technology, 37: 91-124.

Rhind-Tutt, S. (2003). Semantic indexing: a case study. Library Collections, Adquisitions and Technical Services, 27 (2): 243-248.

Rieh, S. Y. (2004). On the Web at home: Information seeking and Web searching in the home environment. Journal of the American Society for Information Science and Technology, 55 (8): 743-753.

Risvik, K. Michelsen, R. (2002). Search Engines and Web Dynamics. Computer Networks, 39 (3): 289-302.

Rouse, M. E. (2004). Whatis.com. Tech Target [Online]. Accesible en: http://whatis.techtarget.com/ (25 de febrero, 2004)

Ruthfield, S. (2002). The Internet's History and Development: From Wartime Tool to the Fish-Cam. ACM Crossroads [Online]. Accesible en: http://www.acm.org/crossroads/xrds2-1/inet-history.html (5 de marzo, 2003).

White, R.W., Jose, J.M., Ruthven, I. (2003): Using Top-Ranking Sentences for Web Search Result Presentation. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Salazar García, I.(2002): La Red profunda. Lo que los buscadores convencionales no encuentran. 1er Congreso ONLINE del Observatorio para la CiberSociedad. 9 de Septiembre de 2002. Barcelona.

Salton, G. McGill, M. (1983). Text Analysis and Automatic Indexing. In Introduction to Modern Information Retrieval (pp. 52-117). New York: McGraw Hill.

Salton, G. McGill, M. (1983). Retrieval Evaluation. In Introduction to Modern Information Retrieval (pp. 157-197). New York: McGraw-Hill.

Sanchez, J., Sandra, N., Fernández, L., Chevalier, G. (2002). Distributed Information Retrieval from Web-Accessible Digital Libraries using Mobile Agents. UPGrade, 3 (3).

Sandeep Pandey Krithi Ramamritham(2003): Monitoring the Dynamic Web to respond to Continuous Queries. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Sanz, M. (1998). Fundamentos históricos de la Internet en Europa y en España. Boletín de RedIRIS, 45 22-36.

Savoy, J. (2002). Information Retrieval on the Web: A New Paradigm. UPGrade, 3 (3).

Sánchez Montero, J. (1997). Hacia una optimización de los recursos de Internet en la empresa. Revista Española de Documentación Científica, 20 (1): 52-60.

Schwartz, M., Emtage, A., Kahle, B., Neumann, B. (1992). A Comparison of Internet Resource Discovery Approaches. Computing Systems, 5 (4).

Schwartz, MF (1994). Harvest Software Available [Online]. Accesible en: http://groups.google.com/groups?q=harvest&start=20&hl=es&lr=&ie=UTF-8&oe=UTF-8&as_drrb=b&as_mind=12&as_minm=5&as_miny=1990&as_maxd=8&as_maxm=8&as_maxy=1999&selm=Pine.3.89.9411080806.N21666-0100000%40plains&rnum=30. (5 de noviembre, 2003)

Selberg, E (1995). MetaCrawler, a parallel meta-search engine [Online]. Accesible en: http://www.google.com/groups?q=metacrawler&hl=es&lr=&ie=UTF-8&oe=UTF-8&as_drrb=b&as_mind=12&as_minm=5&as_miny=1994&as_maxd=12&as_maxm=8&as_maxy=1997&selm=3u0qo7%24u0k%40big.aa.net&rnum=3. (12 de septiembre, 2003).

Selberg, E. Etzioni, O.(1995): Multi-Service Search and Comparison Using the MetaCrawler. Fourth International World Wide Web Conference. 11 de Diciembre de 1995. Boston, International World Wide Web Consortium.

Senso, J. (1998). Herramientas para realizar búsquedas en Internet: una revisión. El Profesional de la Información, 7 (1-2): 24-25.

Kamvar, S.D., Haveliwala, T.H., Manning, C.D., Golub, G.H. (2003): Extrapolation Methods for Accelerating PageRank Computations. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Shaw, R (1995). Crawlers, Spider s and Worms. Web Week (July, 1)

Shipeng Yu, Deng Cai, Ji-Rong Wen, Wei-Ying Ma(2003): Improving Pseudo-Relevance Feedback in Web Information Retrieval Using Web Page Segmentation. Twelfth International World Wide Web Conference. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Shiu, J. K. H., Chan, S. C. F., Chung, K. F. L. (2003). Accessing hidden web documents by metasearching a directory of specialty search engines. Databases in Networked Information Systems, Proceedings, 2822: 27-41.

Silverstein, C., Henzinger, M., Marais, H., Moricz, M (1999). Analysis of a Very Large Web Search Engine Query Log. SIGIR Forum, 33 (1).

Simon Lok Min-Yen Kan(2003): Employing Natural Language Summarization and Automated Layout for Effective Presentation and Navigation of Information Retrieval Result. Twelfth International World Wide Web Conference. 20 de Mayo de 2003. Budapest: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Slone, D. (2002). The influence of mental models and goals on search patterns during Web interaction. Journal of the American Society for Information Science and Technology, 53 (13): 1152-1169.

Smith, A. (2003). Testing the Surf: Criteria for Evaluating Internet Information Resources. Public Access Computer Systems Review, 8 (3).

Spicer, D., Bell, G., Zimmerman, J., Boas, J., Boas, B. (2002). Internet History and Microprocessor Timeline. Computer History Museum [Online]. Accesible en: http://www.computerhistory.org/exhibits/internet_history/ (31 de enero, 2003)

Spink, A., Wolfram, D., Jansen, B., Saracevic, T. (2001). Searching the web: The public and their queries. Journal of the American Society for Information Science and Technology, 52 (3): 226-234.

Spink, A., Jansen, B., Wolfram, D., Saracevic, T. (2002). From E-sex to E-commerce: Web search changes. IEEE Computer, 35 (3): 107-109.

Spink, A., Ozmutlu, S., Ozmutlu, H., Jansen, B. (2002). US versus European Web searching trends. SIGIR Forum, 36 (2).

Su, L. Chen, H.(1999): User evaluation of Web search engines. 3rd Conceptions of Library and Information Science Conference. 23 de Mayo de 1999. Dubrovnik.

Su, L. (2003). A comprehensive and systematic model of user evaluation of Web search engines: I. Theory and background. Journal of the American Society for Information Science and Technology, 54 (13): 1175-1192.

Su, L. (2003). A comprehensive and systematic model of user evaluation of Web search engines: II. An evaluation by undergraduates. Journal of the American Society for Information Science and Technology, 54 (13): 1193-1223.

Sullivan, D (1998). Open Text Repositions Its Web Index. Search Engine Report. (March, 31).

Sullivan, D (2002). Death Of A Meta Tag. Search Engine Report. (October, 1)

Térmens Graells, R., Ribera Turró, M., Sulé Duesa, A. (2003). Nivel de accesibilidad de las sedes Web de las universidades españolas. Revista Española de Documentación Científica, 26 (1): 21-39.

Thelwall, M. (2001). The Responsiveness of Search Engine Indexes. Cybermetrics, 5 (1).

Thelwall, M. (2003). Can Google's PageRank be used to find the most important academic Web pages? Journal of Documentation, 59 (2): 205-217.

Thomas, C. Griffin, L. (1999). Who will create the metadata for the Internet? First Monday [Online]. Accesible en: http://www.firstmonday.dk/issues/issue3_12/thomas/index.html (18 de abril, 2004)

Tomlin, J.(2003): A New Paradigm for Ranking Pages on the World Wide Web. Twelfth International World Wide Web Conference. 24 de Mayo de 2003. Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium: Computer and Automation Research Institute of the Hungarian Academy of Sciences ; International World Wide Web Consortium.

Tramullas Saz, J. Olvera Lobo, M. (2001). Recuperación de la Información en Internet. Madrid: Ra-Ma.

Travis, I. (1998). From "Storage and Retrieval Systems" to "Search Engines": Text Retrieval in Evolution. Bulletin of the American Society for Information Science, 24 (4): 1.

Vaughan, L. New measurements for search engine evaluation proposed and tested. Information Processing and Management, (in press).

Vellucci, S. (1998). Metadata. Annual Review of Information Science and Technology, 33: 187-222.

Voorbij, H. (1999). Searching scientific information on the Internet: A Dutch academic user survey. Journal of the American Society for Information Science, 50 (7): 598-615.

Wales, J. F. (2004). Wikipedia: The Free Encyclopedia. Wikipedia [Online]. Accesible en: http://en.wikipedia.org/wiki/Main_Page (25 de febrero, 2004)

Walton, B (1994). WWW and Gopher Statistics ? (Respuesta) [Online]. Accesible en: http://groups.google.com/groups?hl=es&lr=&ie=UTF-8&oe=UTF-8&selm=%25brucew.42.0%40sas-aux.byu.edu. (11 de marzo, 2003)

Wang, P., Wawk, W., Tenopir, C. (2000). Users' interaction with World Wide Web resources: An exploratory study using a holistic approach. Information Processing and Management, 36 (2): 229

251.

Web Characterization Project (2003). Web Sites: Concepts, Issues and Definitions (First Draft). OCLC Research [Online]. Accesible en: http://wcp.oclc.org/pubs/rn1-websites.html (6 de mayo, 2003)

Webopedia (2004). Webopedia: Online Dictionary for Computer and Internet Terms. Jupitermedia Corporation [Online]. Accesible en: http://www.pcwebopedia.com/ (23 de febrero, 2004)

Weibel, S. (1995). Metadata: The Foundations of Resource Description. D-Lib Magazine [Online]. Accesible en : http://www.dlib.org/dlib/July95/07weibel.html. (2 de marzo, 2000).

Weibel, S., Ianella, R., Cathro, W. (1997). The 4th Dublin Core Metadata Workshop Report. D-Lib Magazine [Online]. Accesible en: http://www.dlib.org/dlib/june97/metadata/06weibel.html (3 de marzo, 2000).

White, R., Joemon, M., Ruthven, I. (2003). A task-oriented study on the influencing effects of query-biased summarisation in web searching. Information Processing and Management, 39 (5): 707-733.

Wiederhold, G. (1992). Mediation in the architecture of future information systems. IEEE Computer, 26 (3): 38-49.

Wiederhold, G. Genesereth, M. (1997). The Conceptual Basis for Mediation Services. IEEE Expert, 12 (5): 38-47.

Wiggins, RW (1994). Statistics on growth in WAIS databases ? [Online]. Accesible en: http://groups.google.com/groups?q=wais+statistics&hl=es&lr=&ie=UTF-8&oe=UTF-8&selm=2p4vql%24e8d%40msuinfo.cl.msu.edu&rnum=4. (20 de octubre, 2003)

Winer, B. (1962). Design and Analysis of Single-factor Experiments. In Statistical Principles in Experimental Design (2nd ed ed., pp. 149-260). New York: McGraw-Hill.

Wolfram, D., Spink, A., Jansen, B., Saracevic, T. (2001). Vox populi: The public searching of the web. Journal of the American Society for Information Science and Technology, 59 (12): 1073-1074.

Woodruff, A., Aoki, P., Brewer, E., Gauthier, P., Rowe, L. (1996). An Investigation of Documents from the World Wide Web. Computer Networks and ISDN Systems, 28 (7-11): 963-980.

Wooldridge, M. Jennings, N. (1995). Intelligent Agents: Theory and Practice. Knowledge Engineering Review, 10 (2): 115-152.

World Wide Web Consortium (1997). World Wide Web Consortium Publishes Public Draft of Resource Description Framework (RDF). World Wide Web Consortium [Online]. Accesible en: http://www.w3.org/Press/RDF (16 de septiembre, 2003)

World Wide Web Consortium (2001). Metadata at W3C. World Wide Web Consortium [Online]. Accesible en: http://www.w3.org/Metadata/ (15 de septiembre, 2003)

Yok, S.-H., Hawoong J, Barabasi, A. (2002). Modeling the Internet's large-scale topology. Proceedings of the National Academy of Sciences USA, 99 (21): 13382-13386.

Zadeh, L. A. (2003). From search engines to question-answering systems - the need for new tools. Advances in Web Intelligence, 2663 15-17.

Zakon, R. (2003). Hobbes' Internet Timeline v6.0. Zakon Group [Online]. Accesible en: http://www.zakon.org/robert/internet/timeline/ (8 de febrero, 2003)

Zien, J., Meyer, J., Tomlin, J., Liu, J.(2001): Web Query Characteristics and their Implications on Search Engines.Tenth International World Wide Web Conference. 1 de Mayo de 2001. Hong Kong, International World Wide Web Consortium.

Zook, M. (2000). Internet Metrics: Using Host and Domain Counts to Map the Internet Globally. Telecommunications Policy Online [Online]. Accesible en: http://www.tpeditor.com/contents/2000/zook.htm (7 de abril, 2004)


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item