Formal Definitions of Web Information Search

Yan, Su and Giles, C. Lee and Jansen, Bernard J. Formal Definitions of Web Information Search., 2006 . In 69th Annual Meeting of the American Society for Information Science and Technology (ASIST), Austin (US), 3-8 November 2006. [Conference paper]

[img]
Preview
PDF
Yan_formal.pdf

Download (218kB) | Preview

English abstract

Research in Web search engines has been criticized for lacking underlying theories and models. Theories adopted from traditional information retrieval research have been found in many ways lacking and inefficient in dealing with information retrieval in the Web context, primarily because of the amount of information and its dynamic nature, the hyperlinked structure, and multimedia sources. Appropriate Web models and theories for search engines will make web search and information retrieval problems easier to formulate and comprehend. This in turn helps to highlight holes in current Web search engine techniques. We analyze and categorize previous Web and information retrieval models. Grounded on previous work, we then propose a new Web information retrieval model based on both objective and subjective criteria. The performance of the new model is systematically compared with other IR models, and contributions of this work are highlighted.

Item type: Conference paper
Keywords: search engine effectiveness ; information retrieval models
Subjects: B. Information use and sociology of information > BG. Information dissemination and diffusion.
I. Information treatment for information services > IK. Design, development, implementation and maintenance
L. Information technology and library technology > LS. Search engines.
Depositing user: Norm Medeiros
Date deposited: 05 Dec 2006
Last modified: 02 Oct 2014 12:05
URI: http://hdl.handle.net/10760/8553

References

Nielsen netratings for search engines. (2005). Retrieved November 2005, from http://searchenginewatch.com/reports/article.php.

Arasu, A., Cho, J., Garcia-Molina, H., Paepcke, A., Raghavan, S. (2001). Searching the web. ACM Trans. Internet Techn, 1 (1), 2-43.

Baeza-Yates, R. A., Ribeiro-Neto, B. A. (1999). Modern Information Retrieval. ACM Press / Addison-Wesley.

Beg, M. M. S. (2005). User feedback based enhancement in web search quality. Inf. Sci, 170 (2-4), 153-172.

Bianchini, M., Gori, M., Scarselli, F. (2005). Inside pagerank. ACM Trans. Inter. Tech, 5 (1), 92-128.

Brin, S., Page, L. (1998). The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30 (1-7), 107-117.

Brooks, T. A., 2003. Web search: how the web has changed information retrieval. Inf. Res. 8 (3).

Bruza, P., Song, D., Wong, K.-F. (2000). Aboutness from a commonsense perspective. JASIS, 51 (12), 1090-1105.

Cho, J., Roy, S., Adams, R. E. (2005). Page quality: in search of an unbiased web ranking. SIGMOD '05, 551-562.

Crestani, F., Lalmas, M. (2001). Logic and uncertainty in information retrieval, Lectures on information retrieval, 179-206.

Cutler, M., Shih, Y., Meng, W. (1997). Using the structure of html documents to improve retrieval. USENIX Symposium on Internet Technologies and Systems.

Dominich, S. (2001). On applying formal grammar and languages, and deduction to information. Proceedings of the ACM SIGIR MF/IR, 37-41.

Egghe, L., Rousseau, R. (1998). A theoretical study of recall and precision using a topological approach to information retrieval. Inf. Process. Manage. 34 (2-3), 191-218.

Everett, D. M., Cater, S. C. (1992). Topology of document retrieval systems. JASIS 43 (10), 658-673.

Faloutsos, C., Barber, R., Flickner, M., Hafner, J., Niblack, W., Petkovic, D., Equitz, W. (1994). Efficient and effective querying by image content. J. Intell. Inf. Syst. 3 (3-4), 231-262.

Fuhr, N. (1992). Probabilistic models in information retrieval. Comput. J. 35 (3), 243-255.

Goncalves, M. A., Fox, E. A., Watson, L. T., Kipp, N. A. (2004). Streams, structures, spaces, scenarios, societies (5s): A formal model for digital libraries. ACM Trans. Inf. Syst. 22 (2), 270-312.

Grossman, D. A., Frieder, O. (2004). Information Retrieval: Algorithms and Heuristics. Springer.

Jain, R. (1995). World-wide maze. IEEE MultiMedia 2 (2), 3.

Kherfi, M. L., Ziou, D., Bernardi, A. (2004). Image retrieval from the world wide web: Issues, techniques, and systems. ACM Comput. Surv. 36 (1), 35-67.

Kleinberg, J. M. (1999). Authoritative sources in a hyperlinked environment. J. ACM 46 (5), 604-632.

Lawrence, S., Giles, C. L. (2000). Accessibility of information on the web. Intelligence, 11 (1), 32-39.

Meng, W., Yu, C., Liu, K.-L. (2002). Building efficient and effective metasearch engines. ACM Comput. Surv. 34 (1), 48-89.

Rolleke, T., Tsikrika, T., Kazai, G. (2003). A general matrix framework for modeling information retrieval. Proceedings of the ACM SIGIR MF/IR, 1-11.

Sheridan, P., Braschler, M., Schauble, P. (1997). Cross-language information retrieval in a multilingual legal domain. ECDL '97, 253-268.

Silverstein, C., Marais, H., Henzinger, M., Moricz, M. (1999). Analysis of a very large web search engine query log. SIGIR Forum, 33 (1), 6-12.

Tague, J., Salminen, A., McClellan, C. (1991). Complete formal model for information retrieval systems. SIGIR '91, 14-20.

Tsikrika, T., Lalmas, M. (2002). Combining web document representations in a bayesian inference network model using link and content-based evidence. Proceedings of the 24th BCSIRSG European Colloquium on IR Research, 53-72.

van Rijsbergen, C. J. (2004). The Geometry of Information Retrieval. Cambridge University Press.

van Rijsbergen, C. J. (2005). A probabilistic logic for information retrieval. ECIR. pp. 1-6.


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item