Longitudinal study of contents and elements in the scientific Web environment

Ortega-Priego, José-Luis, Aguillo, Isidro F. and Prieto-Valverde, José Antonio Longitudinal study of contents and elements in the scientific Web environment. Journal of Information Science, 2006, vol. 32, n. 4, pp. 344-351. [Journal article (Paginated)]

[thumbnail of Longitudinal_preprint.pdf]
Preview
PDF
Longitudinal_preprint.pdf

Download (162kB) | Preview

English abstract

The aim of this work is the longitudinal study of the evolution and the state of 738 web sites in two different points in time (1997 and 2004). It tries to establish the rate of growth and decay of the Web and all the web elements. To this end, the structure and the contents of these web sites are extracted through a crawler and compared at the two different moments in time. The main results confirm a growth of web contents and elements in the web, although there is also a high degree of web content decay. The results suggest that in the seven year period covered by this study the web is characterized by both strong dynamism and instability.

Item type: Journal article (Paginated)
Keywords: Webometrics; Web persistence; Web growth; Web decay; Linkrot
Subjects: L. Information technology and library technology > LC. Internet, including WWW.
Depositing user: José Luis Ortega Priego
Date deposited: 31 Jan 2007
Last modified: 02 Oct 2014 12:05
URI: http://hdl.handle.net/10760/8333

References

D. Pennock, G.W. Flake, S. Lawrence, E.J. Glover, C.L. Giles, Winners don't take all: Characterizing the competition for links on the web, Proc. Natl. Acad. Sci. USA 99 (8) (2002) 5207-5211 Available at: http://www.pnas.org/cgi/reprint/99/8/5207 (accessed 28 October 2005).

Internet Systems Cosortium, Inc, Redwood, CA. (2004). Available at: http://www.isc.org/index.pl?/ops/ds/ (accessed 28 October 2005).

E.T. O’Neill, B.F. Lavoie, R. Bennet, Trends in the Evolution of the Public Web 1998-2002, D-Lib Magazine 9 (4) (2003).

S. Harter, H. Kim, Electronic journals and scholarly communication: a citation and reference study, Information Research 2 (1) (1996) paper 9. Available at: http://informationr.net/ir/2-1/paper9a.html (accessed 28 October 2005)

S. Lawrence, F. Coetzee, E. Glover, D. Pennock, G. Flake, F. Nielsen, B. Krovetz, A. Kruger, L. Giles, Persistence of Web References in Scientific Research, IEEE Computer 34(2) (2003) 26-31

W. Koehler, An Analysis of Web page and Web site constancy and permanence, Journal of the American Society for Information Science, 50 (2) (1999) 162-180

W. Koehler, Web page change and persistence – a four-year longitudinal study, Journal of the American Society for Information Science and Technology, 53 (2) (2002) 162-171

W. Koehler, A longitudinal study of Web pages continued: a report after six years, Information Research, 9 (2) (2004) paper 174. Available at: http://informationr.net/ir/9-2/paper174.html (accessed 28 October 2005)

M. Nelson, B. Allen, Object persistence and availability in digital libraries, D-Lib Magazine 8 (1) (2002). Available at: http://www.dlib.org/dlib/january02/nelson/01nelson.html (accessed 28 October 2005)

D. Fetterly, M. Manasse, M. Najork, J.L. Wiener, A Large-Scale Study of the Evolution of Web pages, Software Practice and Experience 1 (1) (2003) 1-27

J. Cho, H. García-Molina, The evolution of the web and implications for an incremental crawler, Proceeding of the 26th International Conference on Very Large Databases, (2000)

J. Bar-Ilan, B.C. Peritz, Evolution, Continuity, and Disappearance of Documents on a Specific Topic on the Web: A Longitudinal Study of ‘Informetrics, Journal of the American Society for Information Science and Technology, 55 (11) (2004) 980-990

P. Wouters, I. Hellsten, L. Leydesdorff, Internet Time and the reliability of Search Engines, First Monday, 9 (10) (2004) Available at: http://www.firstmonday.org/issues/issue9_10/wouters/ (accessed 28 October 2005)

J.L. Ortega, J. A. Prieto, N. Arroyo, V.M. Pareja, I.F. Aguillo, Análisis de la persistencia y del estado de páginas web en los resultados de Google, 9ª Jornadas Españolas de Documentación FESABID 2005, Madrid, 14 y 15 de Abril (2005). Available at: http://internetlab.cindoc.csic.es/cv/11/Ortega2005.pdf (accessed 28 October 2005)

NetCarta.com (1997) NetCarta WebMap Library. Available at: http://www.netcarta.com/ (accessed 16 April 1997)

N. Arroyo, V. Pareja, I. Aguillo, Description of Web Data in D3.1. Deliverable. IST-1999-20350 (2003) Available at: http://www.eicstes.org/EICSTES_PDF/Deliverables/Web Data description.pdf (accessed 28 October 2005)

Xenu's Link Sleuth. Ver. 1.2f [s. l.]: Tilman Hausherr, c1997-2004. Software. Available at: http://home.snafu.de/tilman/xenulink.html (accessed 28 October 2005)


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item