Proximidad documental en repositorios académicos : Exploración intelectual de colecciones mediante análisis léxico

Moreno, Chiris Proximidad documental en repositorios académicos : Exploración intelectual de colecciones mediante análisis léxico. Información, cultura y sociedad, 2026, n. 54, pp. 31-45. [Journal article (Paginated)]

[thumbnail of n54a03Moreno.pdf]
Preview
Text
n54a03Moreno.pdf - Published version
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (719kB) | Preview

English abstract

A lexicometric analysis was conducted on 121 doctoral thesis abstracts in Psychology retrieved from repositories of Argentine public universities. The aim was to examine whether vocabulary distribution can relate documents without relying on thematic descriptors or citation links. The texts were organized into a term-document matrix. The association between documents and vocabulary was significant and of moderate-to-high magnitude (χ² = 530,239.11; df = 353,760; Cramér’s V = 0.435). Correspondence Analysis enabled the construction of a geometric space in which documents were located according to χ² distance. This made it possible to identify both close and distant works based on lexical distribution profiles. Nearby documents tended to share terminological repertoires associated with research problems, theoretical approaches or methodological procedures. These findings address an operational difficulty of academic repositories: how to classify, group and explore collections beyond individual record retrieval. The collection thus functions not only as an archive, but also as an internal network of relations that may support semantic navigation and the identification of prior work.

Spanish abstract

Se presenta un análisis lexicométrico de 121 resúmenes de tesis doctorales en psicología obtenidos de repositorios de universidades nacionales de Argentina. El objetivo fue observar si la distribución del vocabulario permite relacionar documentos sin usar descriptores temáticos ni citaciones. Los documentos se organizaron en una matriz término-documento. La asociación entre documentos y vocabulario resultó significativa y de intensidad moderada-alta (χ² = 530.239,11; gl = 353.760; V de Cramér = 0,435). El Análisis Factorial de Correspondencias permitió construir un espacio geométrico para localizar documentos próximos y distantes mediante distancia chi-cuadrado. Así se identificaron trabajos relacionados a partir de perfiles de distribución léxica. Los documentos cercanos tienden a compartir repertorios terminológicos asociados a problemas, enfoques teóricos o procedimientos metodológicos. Este hallazgo aborda una dificultad operativa de los repositorios académicos: cómo clasificar, agrupar y recorrer colecciones más allá del acceso individual a los registros. La colección funciona, además de archivo, como una red interna de relaciones que puede orientar la navegación semántica y la identificación de antecedentes.

Item type: Journal article (Paginated)
Keywords: Institutional repositories, Knowledge organization, Lexicometrics, Semantic navigation, Correspondence Factor Analysis, Organización del conocimiento, Repositorios institucionales, Análisis Factorial de Correspondencias, Lexicometría, Navegación semántica
Subjects: H. Information sources, supports, channels. > HS. Repositories.
Depositing user: Graciela Giunti
Date deposited: 01 Jul 2026 21:33
Last modified: 01 Jul 2026 21:33
URI: http://hdl.handle.net/10760/47843

References

Ahlgren, Per y Cristian Colliander. 2009. Document-document similarity approaches and science mapping: Experimental comparison of five approaches. En Journal of Informetrics. Vol. 3, no. 1, 49–63. <https://doi.org/10.1016/j.joi.2008.11.003>

Argentina. 2013. Ley 26.899. Repositorios digitales institucionales de acceso abierto, propios o compartidos. En Boletín Oficial de la República Argentina, 13 de noviembre de 2013. <https://www.argentina.gob.ar/normativa/nacional/ley-26899-222648> [Consulta: 10 mayo 2025].

Baeza-Yates, Ricardo y Berthier Ribeiro-Neto. 1999. Modern information retrieval. Reading: Addison-Wesley.

Bates, Marcia J. 1989. The design of browsing and berrypicking techniques for the online search interface. En Online Review. Vol. 13, no. 5, 407–424. <https://doi.org/10.1108/eb024320>

Benzécri, Jean-Paul. 1973. L’analyse des données. Tome 2: L’analyse des correspondances. Paris: Dunod.

Börner, Katy, Chaomei Chen y Kevin W. Boyack. 2003. Visualizing knowledge domains. En Annual Review of Information Science and Technology. Vol. 37, no. 1, 179–255. <https://doi.org/10.1002/aris.1440370106>

Callon, Michel, Jean-Pierre Courtial y Françoise Laville. 1991. Co-word analysis as a tool for describing the network of interactions between basic and technological research: The case of polymer chemistry. En Scientometrics. Vol. 22, no. 1, 155–205. <https://doi.org/10.1007/BF02019280>

Greenacre, Michael. 1984. Theory and applications of correspondence analysis. London: Academic Press.

Greenacre, Michael. 2017. Correspondence analysis in practice. 3rd ed. Boca Raton: Chapman & Hall/CRC <https://doi.org/10.1201/9781315369984>

Hjørland, Birger. 2016. Knowledge organization (KO). En Knowledge Organization. Vol. 43, no. 6, 475–484. <https://doi.org/10.5771/0943-7444-2016-6-475>

Husson, François, Sébastien Lê y Jérôme Pagès. 2017. Exploratory multivariate analysis by example using R. 2nd ed. Boca Raton: Chapman & Hall/CRC. <https://doi.org/10.1201/b21874>

Hyland, Ken. 2000. Disciplinary discourses: Social interactions in academic writing. London: Longman.

Lebart, Ludovic y André Salem. 1994. Statistique textuelle. Paris: Dunod.

Lebart, Ludovic, André Salem y Lisette Berry. 1998. Exploring textual data. Dordrecht: Kluwer Academic Publishers.

Leydesdorff, Loet. 2001. The challenge of scientometrics: The development, measurement, and self-organization of scientific communications. Boca Raton: Universal Publishers.

Manning, Christopher D., Prabhakar Raghavan y Hinrich Schütze. 2008. Introduction to information retrieval. Cambridge: Cambridge University Press. <https://doi.org/10.1017/CBO9780511809071>

Marchionini, Gary. 1995. Information seeking in electronic environments. Cambridge: Cambridge University Press. <https://doi.org/10.1017/CBO9780511626388>

Marchionini, Gary. 2006. Exploratory search: From finding to understanding. En Communications of the ACM. Vol. 49, no. 4, 41–46. <https://doi.org/10.1145/1121949.1121979>

Morin, Annie. 2006. Intensive use of Factorial Correspondence Analysis for text mining: application with statistical education publications. En Proceedings of the Seventh International Conference on Teaching Statistics (ICOTS-7). Estados Unidos: International Association for Statistical Education.

Murtagh, Fionn. 2005. Correspondence analysis and data coding with Java and R. Boca Raton: Chapman & Hall/CRC. <https://doi.org/10.1201/9781420034943>

Noyons, Ed C. M. 2012. Using bibliometric maps of science in a science policy context. En Em Questão. Vol. 18, edición especial, 15–27.

Petrović, Saša, Bojana Dalbelo Bašić, Annie Morin, Blaž Zupan y Jean-Hugues Chauchat. 2009. Textual features for corpus visualization using correspondence analysis. En Intelligent Data Analysis. Vol. 13, no. 5, 795–813. <https://doi.org/10.3233/IDA-2009-0393>

Price, Derek J. de Solla. 1965. Networks of scientific papers. En Science. Vol. 149, no. 3683, 510–515. <https://doi.org/10.1126/science.149.3683.510>

Salton, Gerard y Michael J. McGill. 1983. Introduction to modern information retrieval. New York: McGraw-Hill.

Small, Henry. 1973. Co-citation in the scientific literature: A new measure of the relationship between two documents. En Journal of the American Society for Information Science. Vol. 24, no. 4, 265–269. <https://doi.org/10.1002/asi.4630240406>

Swales, John M. 1990. Genre analysis: English in academic and research settings. Cambridge: Cambridge University Press.

Van Raan, Anthony F. J. 2005. For your citations only? Hot topics in bibliometric analysis. En Measurement: Interdisciplinary Research and Perspectives. Vol. 3, no. 1, 50–62. <https://doi.org/10.1207/s15366359mea0301_7>


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item