Edición de contenidos en un entorno colaborativo: el caso de la Wikipedia en español

Zazo-Rodríguez, Ángel-F., G. Figuerola, Carlos and Alonso-Berrocal, José-Luis. Edición de contenidos en un entorno colaborativo: el caso de la Wikipedia en español. Scire, 2015, vol. 21, n. 2, pp. 57-67. [Journal article (Paginated)]

4243 - Published version
Available under License Creative Commons Attribution.


English abstract

This work uses the database backup dumps that contain the content and revision history of the encyclopaedic articles of the Spanish Wikipedia since its creation, in order to characterize and understand the underlying activity of editors in content creation. Several quantitative characteristics of the articles are analyzed: length, assigned categories, and in-links and out-links to other articles. Some of these characteristics follow patterns similar to those found in webometric studies. The category system, although functionally well built, is not used properly by editors, which undermines access to knowledge. We have also obtained patterns of editor activity related to article creation, content reviewing, days of activity, reversions, vandalism, and editors' countries of origin. We have found that an important part of Wikipedia's operation rests on a small number of users who oversee new content, aided by robots that facilitate the process. In general, content is created by two different kinds of users: a large legion of users who each make small individual contributions, and a small group of extremely active users who make a large number of contributions. For many users we have obtained the country of origin, which has allowed us to count the contributions made from each country.

Spanish abstract

Se analizan las características y la actividad que los usuarios editores de la Wikipedia en español realizan en el proceso de creación de contenidos. Tras volcar los datos de los artículos enciclopédicos, se han analizado aspectos cuantitativos de los artículos, como su longitud, enlaces entrantes y salientes entre ellos, y categorías a las que pueden ser asignados. En algunos casos, esas características siguen patrones similares a los encontrados en estudios webmétricos. El sistema de categorías, pese a que funcionalmente está bien constituido, no se utiliza de manera adecuada por los usuarios editores, lo cual menoscaba una buena forma de acceso al conocimiento. En cuanto a las ediciones que realizan los usuarios, se han obtenido patrones de actividad relacionados con la creación de artículos, la revisión de contenidos, días de actividad, reversiones, vandalismo y país de origen. Una parte importante del funcionamiento de Wikipedia recae en unos pocos usuarios que supervisan los nuevos contenidos, ayudados por robots que facilitan el proceso. En general, la creación de contenidos se lleva a cabo por dos grupos diferentes de usuarios: pequeñas contribuciones individuales de una gran legión de usuarios, y un gran número de contribuciones que realizan un reducido grupo de usuarios muy activos. Se ha obtenido el país de origen de muchos usuarios, lo cual ha permitido contabilizar las contribuciones realizadas desde cada uno de ellos.

Item type: Journal article (Paginated)
Keywords: Wikipedia; edición colaborativa; estudios de usuarios; Organización del conocimiento
Subjects: B. Information use and sociology of information > BG. Information dissemination and diffusion.
C. Users, literacy and reading. > CB. User studies.
Depositing user: Carlos G. Figuerola
Date deposited: 18 May 2016 21:24
Last modified: 18 May 2016 21:24
URI: http://hdl.handle.net/10760/29276



