Link Analysis and Site Structure in Information Retrieval

Mandl, Thomas Link Analysis and Site Structure in Information Retrieval., 2003 . In Informatik 2003: Innovative Informatikanwendungen. Beiträge der 33. Jahrestagung der Gesellschaft für Informatik, Frankfurt am Main, 29.September – 2.Oktober. [Conference paper]

[thumbnail of Mandl_Workshop_IR_Informatik_2003.doc+.pdf]
Preview
PDF
Mandl_Workshop_IR_Informatik_2003.doc+.pdf

Download (28kB) | Preview

English abstract

Link analysis is the most important application of web structure mining and serves as a new knowledge source in web information retrieval. However, the mono-dimensional analysis of links neglects many other structural aspects. The results presented in this article show that the structure of a site affects the in-links for its pages. Link analysis algorithms need to be refined in order to account for that fact.

Item type: Conference paper
Keywords: PageRank, Web Design, Web Mining
Subjects: L. Information technology and library technology > LS. Search engines.
Depositing user: Thomas Mandl
Date deposited: 29 Aug 2006
Last modified: 02 Oct 2014 12:04
URI: http://hdl.handle.net/10760/8046

References

[Ba02] Barabási, A.-L.: Linked: The New Science of Networks, Perseus, 2002.

[CJ02] Chakrabarti, S.; Joshi, M.; Punera, K.; Pennock, D.: The Structure of Broad Topics on the Web. In: Proc. Eleventh Intl World Wide Web Conf (WWW 2002). Honolulu, Hawaii. May 7.-11. http://www2002.org/CDROMrefereed/338/

[DK01] Dill, S.; Kumar, R.; McCurley, K.; Rajagopalan, S.; Sivakumar, D.; Tomkins, A.: Self-Similarity in the web. In: Proc 27th Intl Conf on Very Large Databases (VLDB). 2001.

[FG01] Fuhr, N.; Großjohann, K.: XIRQL: A Query Language for Information Retrieval in XML Documents. In: (Croft, W.; Harper, D.;Kraft, D.; Zobel , J. eds.) Proc 24th Annual Intl Conf on Research and Development in Information Retrieval 2001. pp. 172-180.

[GS01] Gurrin, C.; Smeaton, A. (2001): Dublin City University Experiments in Connectivity Analysis for TREC-9. In (Voorhees, E.; Harman, D. eds.): The Ninth Text REtrieval Conf (TREC 9). 2001. http://trec.nist.gov/pubs/trec9/t9_proceedings.html

[Ha02] Haveliwala, T.: Topic-Sensitive PageRank. In: Proc. of the Eleventh Intl World Wide Web Conf 2002 (WWW) Hawaii May 7-11. http://www2002.org/CDROMrefereed/127/

[Ha01] Hawking, D.: Overview of the TREC-9 Web Track. In (Voorhees, E.; Harman, D. eds.): The Ninth Text REtrieval Conf (TREC 9). 2001. http://trec.nist.gov/pubs/trec9/t9_proceedings.html

[He00] Henzinger, M.: Link Analysis in Web Information Retrieval. In: IEEE Data Engineering Bulletin, 23(3):3-8, 2000.

[HM02] Henzinger, M.; Motwani, R.; Silverstein, C.: Challenges in Web Search Engines. SIGIR Forum, 2002.

[JW03] Jeh, G; Widom, J.: Scaling Personalized Web Search. In: Proc of the Twelfth Intl World Wide Web Conf (WWW 2003) Budapest. May 20-24. pp. 271-279.

[Ma02] Mandl, T.: Evaluierung von Internet-Verzeichnisdiensten mit Methoden des Web-Mining. In (Hammwöhner, R.; Wolff, C; Womser-Hacker, C. eds.): Proc 8. Intl.

Symposium für Informationswissenschaft. (ISI 2002). Regensburg. pp. 239-257.

[Ma03] Mandl, T.: Neuere Entwicklungen bei der Evaluierung von Information Retrieval Systemen: Web- und Multimedia-Dokumente. In: nfd Information – Wissenschaft und Praxis vol. 54 (4). 2003. pp. 203-210.

[PF02] Pennock, D.; Flake, G.; Lawrence, S.; Glover, E.; Giles, L.: Winners don’t take all: Characterizing the competition for links on the web. In: Proc. National Academy of Sciences 99 (8) 2002. pp. 5207–5211

[Ya01] Yang, K.: Combining text- and link-based retrieval methods for Web IR. In (Voorhees, E.; Harman, D. eds.): The Ninth Text REtrieval Conf (TREC 9). 2001

http://trec.nist.gov/pubs/trec9/t9_proceedings.html


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item