Uso de ontologías para la mejora de resultados de motores de búsqueda web

Aguilar-López, Dulce and López-Arévalo , Iván and Sosa-Sosa, Víctor Uso de ontologías para la mejora de resultados de motores de búsqueda web. El profesional de la informacion, 2009, vol. 18, n. 1, pp. 34-40. [Journal article (Paginated)]

[img] Text
dulce.pdf - Published version
Available under License Creative Commons Attribution.

Download (1MB)

English abstract

With the increasing number of web sites, the time spent by users reviewing the results also increases. In addition, the nature of web content is semantically heterogeneous and oriented to people who will be able to understand it. Frequently the results from search engines do not correspond to the expected topic. One approach to improve the results is to match the content of the web pages with a formal vocabulary on the topic (ontology) and with the informal vocabulary (common terms of the topic but not in the ontology). This paper describes a web search method that takes advantage of ontologies to reduce the search area of certain topics. With this approach the relevance of search engine results is enhanced by filtering the content through the integration of domain ontologies, the WordNet thesaurus, and a hierarchical similarity measure. Thus, the improvement on the relevance of results reduces the time required to review such results.

Spanish abstract

Con el aumento del número de webs, el tiempo que un usuario invierte en la revisión de los resultados ofrecidos por los motores de búsqueda se incrementa de manera considerable. La naturaleza del contenido de estas páginas es semánticamente heterogénea y orientada al humano que sabe interpretarla correctamente. Es importante que el resultado de la búsqueda realmente corresponda a la información deseada. Una propuesta para lograrlo es comparar el contenido de la página web con el vocabulario formal del tema (ontología) y con el vocabulario informal (términos comunes del tema pero ajenos a la ontología). Se describe un tipo de búsqueda web que aprovecha las ontologías para reducir el espacio de búsqueda de ciertos temas. Con esta propuesta se mejora la relevancia de los resultados de los buscadores utilizando ontologías de dominio, el tesauro WordNet y una medida de similitud jerárquica. El aumento en la relevancia de los resultados se traduce en la disminución en el tiempo de revisión de los mismos.

Item type: Journal article (Paginated)
Keywords: Ontologías, Búsqueda semántica, WordNet, Ontologies, Semantic search
Subjects: I. Information treatment for information services
I. Information treatment for information services > IZ. None of these, but in this section.
K. Housing technologies.
L. Information technology and library technology
L. Information technology and library technology > LC. Internet, including WWW.
L. Information technology and library technology > LS. Search engines.
Depositing user: Esther Rafael
Date deposited: 30 Oct 2014 23:12
Last modified: 30 Oct 2014 23:12
URI: http://hdl.handle.net/10760/24018

References

Gruber, Thomas. "Toward principles for the design of ontologies used for knowledge sharing". Intl. journal human-computer studies, 1993, v. 43, pp. 907-929.

Fernández-López, Mariano; Gómez-Pérez, Asunción; Juristo, Natalia. "Methontology: From ontological art towards ontological engineering". En:Proceedings of the AAAI97 spring symposium series on ontological engineering, 1997, pp. 33-40.

Morato, Jorge; Marzal, Miguel; Lloréns, Juan; Moreiro, José. "Word-Netapplications". En: Proceedings of the Second global WordNet conference, 2007, pp. 270-278.

Van-Haren, Mark; McIntyre, Ryan; Lutch, Ben; Kraus, Joe; Spencer, Graham; Reinfried, Martin. Excite.

http://www.excite.com

Brin, Sergey; Page, Lawrence. "The anatomy of a large-scale hypertextual web search engine". Computer networks and ISDN systems, 2008, v. 30, pp. 107-117.

Brewer, Eric; Gauthier, Paul. HotBot.

http://www.hotbot.com

Selberg, Erik; Etzioni, Oren. "The MetaCrawler architecture for resource aggregation on the Web". IEEE expert, 1997, pp. 11-14.

Microsoft Corporation.MSN. http://www.msn.com

Ganesan, Prasanna; Garcia-Molina, Hector; Widom, Jennifer. "Exploiting hierarchical domain structure to compute similarity". ACM transactions on information systems, 2003, v. 21, pp. 64-93.

Aguilar-López, Dulce; López-Arévalo, Iván; Sosa, Víctor. "Usage of domain ontologies for web search". En: Proceedings of DCAI, 2008, pp. 319-328.

Aguilar-López, Dulce; López-Arévalo, Iván; Sosa, Víctor. "Web search based on domain ontologies". En: 15th Intl. multi-conference on advanced computer systems & computer information systems and industrial management applications (ACS), 2008.

Bocio, Jaime; Isern, David; Moreno, Antonio; Riaño, David. "Semantically grounded information search on the WWW". En: Recent advances in artificial intelligence, research and development (Proceedings of Seté congrés català d'intel.ligència artificial (CCIA'04)), 2004, pp. 349-356.

Gao, Mingxia; Liu, Chunnian; Chen, Furong. "An ontology search engine based on semantic analysis". En: Icita 05: Proceedings of the Third intl. conf. on information technology and applications (Icita05), 2005, pp. 256-259.

Ramachandran, Rahul; Movva, Sunil; Graves, Sara; Tanner, Steve. "Ontology-based semantic search tool for atmospheric science". En: 22nd Intl. conf. on interactive information processing systems (IIPS), 86th American Meteorological Society annual meeting, 2006.

Droegemeier, Kevin; Gannon, Dennis; Reed, Daniel et al. "Service-oriented environments in research and education for dynamically interacting with mesoscale weather". IEEE computing in science & engineering, 2005, v. 7, pp. 24-32.

Sánchez-Ruenes, David. Domain ontology learning from the Web. PhD tesis,Universitat Politècnica de Catalunya, Departamento de lenguajes y sistemas informáticos, 2007.

Salton, Gerard; Wong, Anita; Yang, Chung-Shu. "A vector space model for automatic indexing". Commun. ACM, 1975, v. 18, pp. 613-620. [CrossRef]

Wurst, Michael. The word vector tool user guide.

http://nemoz.org/joomla/mining/wvtool/wvtool.pdf

McCallum, Andrew Kachites. Bow: a toolkit for statistical language modeling, text retrieval, classification and clustering.

http://www.cs.cmu.edu/mccallum/bow

Lesk, Michael. "Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone". En: Proceedings of the 5th annual intl. conf. on systems documentation (Sigdoc), pages 24-26, New York, NY, USA, 1986. ACM.

Patwardhan, Siddharth; Pedersen, Ted. "Using WordNet based context vectors to estimate the semantic relatedness of concepts". En: Proceedings of the EACL 2006 workshop making sense of sense-bringing computational linguistics and psycholinguistics together, 2006, pp. 1-8.

Chignell, Mark; Gwizdka, Jacek; Bodner, Richard. "Discriminating meta-search: a framework for evaluation". Information processing and management, 1999, v. 35, n. 3, pp. 337-362.

Noy, Natalya; Ferguerson, Ray; Musen, Mark. "The knowledge model of Protégé-2000: combining interoperability and flexibility". En: 12th Intl. conf. in knowledge engineering and knowledge management (EKAW00). Lecture notes in artificial intelligence, 2000, v. 1937, pp. 17-32.

Dumontier, Michel; Villanueva-Rosales, Natalia. "Modeling life science knowledge with OWL 1.1. En: Fourth intl. workshop OWL experiences and design, Owled 2008, Washington, DC.

Lindesay, Victor. SchemaWeb directory. http://www.schemaweb.info

Wackerly, Dennis; Mendenhall, William; Scheaffer, Richard. Estadística matemática con aplicaciones. Cengage Learning Editores, 2002.

Silverstein, Craig; Marais, Hannes; Henzinger, Monika; Moricz, Michael. "Analysis of a very large web search engine query log". Sigir forum, 1999, v. 33, n. 1, pp. 6-12.

Jansen, Bernard; Spink, Amanda; Saracevic, Tefko. "Real life, real users, and real needs: A study and analysis of user queries on the web". Information processing and management, 2000, v. 36, n. 2, pp. 207-227.


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item