Fundamental methodologies and tools for the employment of webometric analyses : a discussion and proposal for improving the foundation of webometrics

Fugl, Liv Danman Fundamental methodologies and tools for the employment of webometric analyses : a discussion and proposal for improving the foundation of webometrics., 2001 Master thesis thesis, Royal School of Library and Information Science (Denmark). [Thesis]

[thumbnail of Master-Thesis.pdf]
Preview
PDF
Master-Thesis.pdf

Download (600kB) | Preview

English abstract

The paper Fundamental methodologies and tools for the employment of webometric analyses defines the most important rules to keep in mind before performing webometric analyses. The paper deals with the two basic elements, that constitutes the foundation for webometric analyses: the documents being analysed, and the tools that are applied for the data collection. The concepts of a citation theory and a link theory are discussed through a study of the current litterature. Different methodologies for uncovering motivations for making references in scientific articles are reviewed and discussed. A methodology for uncovering motivations for making links on webpages is proposed and applied on six researchers' websites at the Royal School of Library and Information Science in Denmark, and on all the institutes at the same institution and at selected institutes at The Technical University of Denmark. The paper further contains a review on the linktopology of the Internet and the current status for the tools available for data collection. Finally, alternative possible tools for applying webometric analyses are proposed. The alternative tools are the Researchindex invented by Lawrence and Giles (Lawrence, Bollacker & Giles, 1999b; Giles, Bollacker & Lawrence, 1998), Kleinberg's HITS algorithm employed in the Clever search engine (The Clever Project, n.d.; Kleinberg, 1998), Proposals for possible extensions to the HTTP protocol to facilitate the collection and navigation of backlink information in the world wide web made by Chakrabarti, Gibson and McCurley (Chakrabarti, Gibson & McCurley, 1999c) and finally Link Agent, a program we have developed for this paper. The program makes it possible to uncover the reciprocal linking webpages, that exist in relation to the outgoing links from a chosen webpage.

Item type: Thesis (UNSPECIFIED)
Keywords: Informetrics, Webometrics, Citation theory, Link theory, Motivations for links, Motivations for references, Search engines, Webometric tools
Subjects: B. Information use and sociology of information > BB. Bibliometric methods
L. Information technology and library technology > LC. Internet, including WWW.
Depositing user: Liv Danman Fugl
Date deposited: 09 Nov 2005
Last modified: 02 Oct 2014 12:02
URI: http://hdl.handle.net/10760/6836

References

Agosti, M; Melucci, M. (2000). Information retrieval on the web. In: ESSIR 2000. European Summer School in Information Retrieval, September 11-15, 2000 - Villa Monastero, Varenna, Italy.

Albert, R.; Jeong, H.; Barabási, A-L. (1999). Diameter of the World-Wide Web. In: Nature. Vol. 401, p. 130-131. http://www.nd.edu/~networks/Papers/401130A0.pdf (04/03/01)

Allan, J. (1996). Automatic hypertext link typing. In: Proceedings for the Hypertext '96 conference. p. 42-52, March. Washington, D.C.: ACM.

Almind, T.C. (1997). Lænker på World Wide Web : - lænker set som citationer på WWW. Copenhagen: Royal School of Library and Information Science. (Master thesis).

Almind, T.C.; Ingwersen, P. (1997). Informetric analyses on the world wide web : methodological approaches to ’Webometrics’. In: Journal of Documentation. Vol. 53, no. 4, p. 404-426.

Andersen, I. (1998). Den skinbarlige virkelighed : -om valg af samfundsvidenskabelige metoder. Frederiksberg C: Samfundslitteratur.

Baccala, B. (1997). Connected: An Internet Encyclopedia. 3rd ed. http://www.freesoft.org/CIE/index.htm and http://www.freesoft.org/CIE/Topics/102.htm (04/05/01)

Balslev, A; Fugl, L.D. (1999). Cocitationer & Bibliografisk Kobling : en sammenlignende analyse af metodernes anvendelighed til afdækning af fagområder, belyst med et eksempel i faget Information Science & Library Science. Copenhagen: Royal School of Library and Information Science. (Hovedopgave).

Banks, P. (2000). Give and take : our E-commerce marketing expert shows you how to increase traffic with reciprocal links. In: Entrepreneur.com. May 5th. http://www.entrepreneur.com/Your_Business/YB_PrintArticle/0,2361,274248-----,00.html (03/19/01)

Bar-Ilan, J. (1999). Search Engine Results over Time : a case study on search engine stability. In: Cybermetrics. Vol. 2/3, no. 1, paper 1. http://www.cindoc.csic.es/cybermetrics/articles/v2i1p1.html (01/20/01)

Bar-Ilaan, J. (2001). Data collection methods on the web for informetric purposes - A review and analysis. In: Scientometrics. Vol. 50, no. 1, p. 7-32.

Bergman, M. (2000). The Deep Web : Surfacing Hidden Value. USA: BrightPlanet, the Internet content company. (White Paper). http://128.121.227.57/download/deepwebwhitepaper.pdf (09/28/00)

Bharat, K.; Broder, A. (1998). A technique for measuring the relative size and overlap of public Web search engines. In: Computer Networks and ISDN Systems. Vol. 30, p. 379-388.

Björneborn, L.; Ingwersen, P. (2001). Perspectives of Webometrics. In: Scientometrics. Vol. 50, no. 1, p. 65-82.

Bollacker, K.D.; Lawrence, S.; Giles, S.L. (1998). CiteSeer : an autonomous web agent for automatic retrieval and identification of interesting publications. In: K.P. Sycara, M. Woolridge (eds). Proceedings of the 2nd international Conference on Autonomous Agents. p. 116-123. New York: ACM Press. http://www.neci.nj.nec.com/~lawrence/papers/cs-aa98/cs-aa98.pdf (10/20/00)

Broder, A. et al. (2000). Graph structure in the web. In: Proceedings of the 9th International World Wide Web Conference. Amsterdam, Netherlands. http://www9.org or http://www.almaden.ibm.com/cs/k53/www9.final/ (03/14/01)

Brooks, T.A. (1985). Private acts and public objects : an investigation of citer motivations. In: Journal of the American Society for Information Science. Vol. 36, no. 4, p. 223-229.

Brooks, T.A. (1986). Evidence of complex citer motivations. In: Journal of the American Society for Information Science. Vol. 37, no. 1, p. 34-36.

Cano, V. (1989). Citation behavior : classification, utility, and location. In: Journal of the American Society for Information Science. Vol. 40, no. 4, p. 284-290.

Carr, L.; Hall, W.; Miles-Board, T. (2000). Writing and reading hypermedia on the web. United Kingdom: Univ. of South Hampton. (Technical Report No. ECSTR-IAM00-1). http://www.bib.ecs.soton.ac.uk/data/3368/html/WRWH.html (04/04/01)

Chakrabarti, S. et al. (1999a). Hypersearching the Web. In: Scientific American. no. 6, June. http://www.sciam.com/1999/0699issue/0699raghavan.html (10/20/00)

Chakrabarti, S. et al. (1999b). Mining the Web's Link Structure. In: Computer. Vol. 32, no. 8, p. 60-67.

Chakrabarti, S.; Gibson, D.A.; McCurley, K.S. (1999c). Surfing the Web Backwards. In: Proceedings of The Eigth International World Wide Web Conference. Toronto, Canada. http://www8.org/w8-papers/5b-hypertext-media/surfing/surfing.html (01/30/01)

Clever Project, The; Project overview. [not dated]. http://www.almaden.ibm.com/cs/k53/clever.html (04/04/01)

Cozzens, S.E. (1989). What do citations count? the rethorical-first model. In: Scientometrics. Vol. 15, no. 5-6, p. 437-447.

Cybermetrics. available: http://www.cindoc.csic.es/cybermetrics (04/04/01)

Egghe, L.; Rousseau, R. (1990). Introduction to informetrics : quantitative methods in library, documentation and information science. Amsterdam: Elsevier Science Publishers.

Ellegaard, M. (2000). Klumme : strategisk linksamarbejde. In: SOL ComON. May 14th. http://www.comon.dk/20/printview.asp?ID=5765 (12/28/00)

Fano, R.M. (1956). Information theory and the retrieval of recorded information. In: Documentation in action. New York: Reinhold Publ. Corp., p. 238-244.

Frankfort-Nachmias, C.; Nachmias, D. (1996). Research Methods in the Social Sciences. 5th ed. London: Arnold.

Garfield, E. (1989). Citation behaviour : -an aid or a hindrance to information retrieval?. In: Essays of an information scientist: creativity, delayed recognition, and other essays. Vol.12, p. 123-128. (Current contents. Vol. 18, p. 3-8)

Garfield, E. (1998a). From citation indexes to informetrics : is the tail now wagging the dog? In: Libri. Vol. 48, p. 67-80.

Garfield, E. (1998b). Random thoughts on citationology : its theory and practice. In: Scientometrics. Vol. 43, no. 1, p. 69-76.

Gibson, D.; Kleinberg, J.; Raghavan, P. (1998). Inferring Web Communities from Link Topology. In: Proceedings 9th ACM Conference on Hypertext and Hypermedia. http://www.cs.cornell.edu/home/kleinber/ht98.pdf (02/03/01)

Giles, C.L.; Bollacker, K.D.; Lawrence, S. (1998). CiteSeer : An automatic citation indexing system. In: I. Witten, R. Akscyn and F. Shipmann III (eds.). Digital Libraries 98 : Third ACM Conference on Digital Libraries. p. 89-98. http://www.it-uni.sdu.dk/mmp/Library/BollackerEtAlCiteSeer99.pdf (20/10/00)

Haas, S.W.; Grams, E.S. (1998). A link taxonomy for web pages. In: Proceedings of the 61st ASIS annual meeting. Vol. 35, p. 485-495. Medford, NJ: Info. Today.

Hellevik, O. (1997). Forskningsmetode i sosiologi og statsvitenskap. 5th ed. Oslo: Universitetsforlaget.

Hjortgaard Christensen, F.; Ingwersen, P. (1997). Online determination of the journal impact factor and its international properties. In: Scientometrics. Vol. 40, no. 3, p. 529-540.

Ingwersen, P. (1995). Information and Information Science. In: Encyclopedia of Library and Information Science. Vol. 56, suppl. 19, p. 136-174. Allen Kent & Carolyn M. Hall (editors). New York: Marcel Dekker, Inc.

Ingwersen, P. (1998). The calculation of web impact factors. In: Journal of documentation. Vol. 54, no. 2, p. 236-243.

Inktomi Corporation, NEC Research Institute (2000). Web Surpasses One Billion Documents. Press release issued January 18th. http://www.inktomi.com/new/press/2000/billion.html and http://www.inktomi.com/webmap/ (01/20/01)

Kaplan, N. (1965). The norms of citation behavior: Prolegomena to the footnote. In: American Documentation. Vol. 16, no. 3, p. 179-184.

Kessler, M.M. (1963). An experimental study of bibliographic coupling between technical papers. In: IEEE transactions on information theory. PTGIT IT-9, p. 49-51.

Kim, H.J. (2000). Motivations for hyperlinking in scholarly electronic articles : a qualitative study. In: Journal of the American Society for Information Science. Vol. 51, no. 10, p. 887-899.

Kleinberg, J. M. (1998). Authoritative sources in a hyperlinked environment. In: 9th proceedings of the annual ACM-SIAM symposium on discrete algorithms. p. 668-677. Full version available at: http://www.cs.cornell.edu/home/kleinber/ (02/03/01)

Larson, R. (1996). Bibliometrics of the World Wide Web: an exploratory analysis of the intellectual structure of cyberspace. ASIS96. Available: http://sherlock.berkeley.edu/asis96/asis96.html (04/04/01)

Latour, B. (1987). Science in action : how to follow scientists and engineers through society. Open University Press, Milton Keynes.

Lawrence, S.; Giles, C.L. (1998). Searching the World Wide Web. In: Science. Vol. 280, p. 98-100.

Lawrence, S.; Giles, C.L. (1999a). Accessibility of information on the web. In: Nature. Vol. 400, p. 107-109.

Lawrence, S.; Bollacker, K.; Giles, C.L. (1999b). Indexing and Retrieval of Scientific Literature. In: Eighth International Conference on Information and Knowledge Management, CIKM 99. Kansas City, Missouri. p. 139-146. http://www.neci.nec.com/~lawrence/papers/cs-cikm99/cs-cikm99.pdf (10/20/00)

Leydesdorff, L. (1998). Theories of citation?. In: Scientometrics. Vol. 43, no. 1, p. 5-25.

Luukkonen, T. (1997). Why has Latour's theory of citations been ignored by the bibliometric community? : discussion of sociological interpretations of citation analysis. In: Scientometrics. Vol. 38, no. 1, p. 27-37.

MacRoberts, M.H.; MacRoberts, B.R. (1986). Quantitative measures of communication in science : a study of the formal level. In: Social Studies of Science. Vol. 16, p. 151-172.

MacRoberts, M.H.; MacRoberts, B.R. (1989). Problems of citation analysis : a critical review. In: Journal of the American Society for Information Science. Vol. 40, no. 5, p. 342-349.

MacRoberts, M.H.; MacRoberts, B.R. (1996). Problems of citation analysis. In: Scientometrics. Vol. 36, no. 3, p. 435-444.

McKiernan, G. (1996). CitedSites(sm): Citation Indexing of Web Resources. http://www.public.iastate.edu/~CYBERSTACKS/Cited.htm (03/21/01)

Marshakova, I.V. (1973). A system of document links constructed on the basis of citations (according to the "Science Citation Index"). In: Nauchno-Tekhnicheskaya Informatisiya: Scientific and Technical Information Processing. Series 2, no. 6, p. 49-57 (p. 3-8 in the Russian edition.)

Merton, R.K. (1979). Foreword, p. vii-xi. In: E. Garfield. Citation indexing : -Its theory and application in science, technology, and humanities. New York: John Wiley & Sons. (Information Sciences Series).

Moravcsik, M.J.; Murugesan, P. (1975). Some results on the function and quality of citations. In: Social studies of Science. Vol. 5, p. 86-92.

NEC Research Institute. (n.d.). Terms of Service. http://citeseer.nj.nec.com/terms.html (05/03/01)

Notess, G. (2000a). Search Engine Inconsistencies. In: Online. Vol. 24, no. 2. http://www.onlineinc.com/onlinemag/OL2000/net3.html (06/13/00)

Notess, G. (2000b). Search Engine Statistics : Database overlap. In: Search Engine Showdown : The user's guide to web searching. February 21st. http://www.searchengineshowdown.com/stats/overlap.shtml (02/02/01)

Notess, G. (2001a). Search Engine Statistics : Database total size estimates. In: Search Engine Showdown : The user's guide to web searching. April 7th. http://www.searchengineshowdown.com/stats/sizeest.shtml (02/02/01)

Notess, G. (2001b). Search Engine Statistics : Relative size showdown. In: Search Engine Showdown : The user's guide to web searching. April 7th. http://www.searchengineshowdown.com/stats/size.shtml (02/02/01)

Rousseau, R. (1997). Sitations : an exploratory study. In: Cybermetrics. Vol. 1, no. 1, paper 1. Available: http://www.cindoc.csic.es/cybermetrics/articles/v1i1p1.html (04/04/01)

Skovmark, H. (2001). Cyberkrig på boligmarkedet. In: Bitconomy, April 2nd. http://www.bitconomy.dk/nyheder.asp?id=1952 (04/04/01)

Small, H. (1973). Co-citation in the scientific literature : a new measure of the relationship between two documents. In: Journal of the American Society for Information Science. Vol. 24, no. 4, p. 265-269.

Small, H. (1998). Letter to the editor : Citations and consilience in science. In: Scientometrics. Vol. 43, no. 1, p. 143-148.

Snyder, H.; Rosenbaum, H. (1999). Can search engines be used as tools for web-link analysis? : a critical review. In: Journal of Documentation. Vol. 55, no. 4, p. 375-384.

Tague-Sutcliffe, J. (1992). An introduction to informetrics. In: Information Processing & Management. Vol. 28, no. 1, p. 1-3.

Thomas, O.; Willet, P. (2000). Webometric analysis of departments of librarianship and information science. In: Journal of Information Science. Vol. 26, no. 6, p. 421-428.

Van Raan, A.F.J. (1998). In matters of quantitative studies of science : The fault of theorists is offering too little and asking too much. In: Scientometrics. Vol. 43, no. 1, p. 129-139.

Vinkler, P. (1987). A quasi-quantitative citation model. In: Scientometrics. Vol. 12, no. 1-2, p. 47-72.

White, H.D.; McCain, K.W. (1998). Visualizing a discipline : an author co-citation analysis of information science, 1972-1995. In: Journal of the American Society for Information Science. Vol. 49, no. 4, p. 327-355.

White, M.D.; Wang, P. (1997). A qualitative study of citing behavior : contributions, criteria, and metalevel documentation concerns. In: Library Quarterly. Vol. 67, no. 2, p. 122-154.

Zuckerman, H. (1987). Citation analysis and the complex problem of intellectual influence. In: Scientometrics. Vol. 12, no. 5-6, p. 329-338.


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item