Caracterización del Espacio Web de Argentina

Tolosa, Gabriel H., Bordignon, Fernando R. A., Baeza Yates, Ricardo and Castillo, Carlos Caracterización del Espacio Web de Argentina., 2007 . In CLEI 2007 - XXIII Conferencia Latinoamericana de Informática (en evaluación por parte del comité académico). (Unpublished) [Conference paper]

[thumbnail of La_web_de_Argentina-Tolosa-Bordignon-Baeza-Castillo.pdf]
Preview
PDF
La_web_de_Argentina-Tolosa-Bordignon-Baeza-Castillo.pdf

Download (269kB) | Preview

English abstract

Spanish Abstract: En este trabajo de investigación se caracteriza el espacio web argentino a partir del análisis de una muestra, tomada a principios del año 2006, cercana a los 10 millones de páginas extraídas de 150.000 sitios. En particular, se realizó análisis de contenidos, de enlaces y de tecnologías utilizadas para construir sitios. Los resultados obtenidos son consistentes con los de otros espacios webs nacionales. Del estudio surgen las siguientes observaciones: Existe una importante proporción de dominios “com.ar” (97.6%). En lo referente al contenido, predominan términos relacionados con la actividad comercial, mientras que en los nombres de los sitios, aparecen mayormente términos relacionados con el turismo. El 72% de las páginas han sido creadas o modificadas en el último año, lo cual indica que el espacio web argentino está creciendo aceleradamente. Con referencia a tecnologías, el 48% de las páginas de la muestra son estáticas y el 52%, dinámicas, las cuales se encuentran construidas en gran parte utilizando herramientas libres. El 76% de los sitios se hallan alojados en servidores que residen en Argentina. De los indicadores anteriores se desprende que existe un importante desarrollo tecnológico y de la infraestructura de comunicaciones de Argentina relacionada con la web. English Abstract: This article presents the results of research on the characterization of the Argentinian web domain over a sample of almost 10 million web pages from 150.000 sites collected in the early 2006. Particularly, we have studied page contents, link structure and technologies used in the construction of the sites. The results are consistent with earlier research on other national web domains. This study reveals a number of interesting facts: To begin with, there is a significant proportion (97.6%) of “.com.ar” domains. As regards page contents, we have found a predominance of terms related to commercial activity. However, terms found in site names, extracted from their URLs, are mostly related to tourism. 72% of the pages have been created or modified in the last year, which indicates that the Argentinian web space is growing quickly. As for technologies, 48% of the pages from the sample are static and 52% dynamic, the latter being mostly built using free tools. Besides, 76% of the sites are hosted in servers geographically located in Argentina. These two facts show there is an important web-related technological development and communication infrastructure in Argentina.

Item type: Conference paper
Keywords: Caracterización Web, Dominio Nacional Argentino, Webmetría, Análisis de links, characterization, Argentinian National Domain, Web Measurement, Link Analysis
Subjects: H. Information sources, supports, channels. > HQ. Web pages.
Depositing user: Fernando Bordignon
Date deposited: 28 Apr 2007
Last modified: 02 Oct 2014 12:07
URI: http://hdl.handle.net/10760/9417

References

[1] L.A. Adamic and B.A. Huberman. Zipf's law and the Internet. Glottometrics 3, pp. 143-150, 2002.

[2] R. Albert R. and A.-L. Barabasi. Statistical mechanics of complex networks. Review of Modern Physics 74, pp. 47-94, 2002.

[3] R. Baeza-Yates and C. Castillo. Relating Web characteristics with link based Web page ranking. In Proceedings of String Processing and Information Retrieval (SPIRE), IEEE Cs. Press, pp. 21-32. Laguna San Rafael, Chile, 2001.

[4] R. Baeza-Yates and F. Lalanne. Characteristics of the Korean Web. Technical Report, Korea-Chile IT Cooperation Center, ITCC, 2004.

[5] R. Baeza-Yates and C. Castillo. Características de la Web Chilena 2004. Technical Report, Center for Web Research, University of Chile, 2005.

[6] R. Baeza-Yates, C. Castillo and V. Lopez. Characteristics of the Web of Spain. Cybermetrics, Vol. 9, Nro. 1, 2005.

[7] R. Baeza-Yates, and C. Castillo. Link Analysis in National Web Domains. Workshop on Open Source Web Information Retrieval (OSWIR), pp. 15-18. Compiegne, France, 2005.

[8] R. Baeza-Yates, C. Castillo, and E. Efthimiadis. Characterization of national Web domains. Technical report, Universitat Pompeu Fabra, July 2005.

[9] A. L. Barabasi and A. Albert. Emergence of Scaling in Random Networks. Science, (286): pp. 509-512, 1999.

[8] K.Bharat, B-W. Chang, M. Herzinger and M. Rhul. Who Links to Whom: Mining Linkage between Web Sites. In Proceedings of the IEEE International Conference on Data Mining, 2001.

[11] A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, J. Wiener, Graph Structure in the Web. In Proceedings of the WWW9 Conference pp. 309-320, 2000.

[12] C. Castillo and R. Baeza-Yates. WIRE: an Open Source Web Information Retrieval Environment. Workshop on Open Source Web Information Retrieval (OSWIR), 2005.

[13] S. Chakrabarti, B.E. Dom, D. Gibson, D., and J. Kleinberg. Mining the Link Structure of the World Wide Web. IEEE Computer, Vol. 32, No. 8, pp: 60-67, 1999.

[14] S. Dill, R. Kumar, K.S. Mccurley, S. Rajagopalan, D. Sivakumar, and A. Tomkins. Self-similarity in the web. ACM Transactions on Internet Technology, Vol. 2, Nro.3, pp. 205-223, 2002.

[15] E. Efthimiadis and C. Castillo. Charting the Greek Web. In Proceedings of the Conference of the American Society for Information Science and Technology (ASIST), Providence, Rhode Island, USA, November, 2004.

[16] J. Kleinberg, R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. The Web as a Graph: Measurements, Models and Methods. In Proceedings of the International Conference on Combinatorics and Computing, 1999.

[17] J. Kleinberg. Authoritative Sources in a Hyperlinked Environment. Association for Computing Machinery - Journal

of the Association for Computing Machinery, Vol. 46, Nro. 5, pp. 604-632, 1999.

[18] M. Modesto, A. Pereira, N. Ziviani, C. Castillo and R. Baeza-Yates. Un Novo Retrato da Werb Brasileira. In Proceedings of SEMISH, São Leopoldo, Brazil, 2005.


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item