Álvarez-Llorente, Jesús M., Guerrero-Bote, Vicente P. and De-Moya-Anegón, Félix Algorithms for Scientific Documents: Past and Present. Infonomy, 2025, vol. 3, n. 4. [Journal article (Unpaginated)]
Preview |
Text (Research article)
EN_Alvarez-Guerrero-De-Moya-Algorithms-for-Scientific-Documents.pdf - Published version Available under License Creative Commons Attribution. Download (1MB) | Preview |
English abstract
This study offers a comprehensive overview of document-level classification algorithms in scientific research, proposed as an alternative to the journal-based categorizations employed by major bibliographic databases such as Web of Science and Scopus. These journal-driven schemes often introduce significant inaccuracies in both information retrieval and research evaluation, as they fail to categorize articles in accordance with their actual content. First, we provide a historical review of the main approaches developed since the emergence of scientific databases, highlighting their contributions as well as their limitations. Automatic clustering techniques and community detection algorithms have represented important advances in the organization of scientific knowledge, yet they cannot serve as a practical substitute for journal-based classifications. Other approaches, such as those relying on neural networks or text mining, face scalability issues that prevent their application at the global level of science. The most recent and promising strategies are built upon simple algorithms that, starting from existing journal categorizations, reclassify articles into the same thematic hierarchies used by bibliographic databases, relying primarily on the analysis of straightforward citation and reference patterns.
| Item type: | Journal article (Unpaginated) |
|---|---|
| Keywords: | Classification algorithms; Document-level classifications; Classifications; Science classification; Scientific databases; Scientometrics; Citation; Classification schemes; ASJC; Scopus; Web of Science. |
| Subjects: | H. Information sources, supports, channels. > HN. e-journals. H. Information sources, supports, channels. > HP. e-resources. I. Information treatment for information services > IC. Index languages, processes and schemes. |
| Depositing user: | Tomàs Baiget |
| Date deposited: | 14 Sep 2025 16:32 |
| Last modified: | 14 Sep 2025 16:32 |
| URI: | http://hdl.handle.net/10760/47134 |
References
Downloads
Downloads per month over past year
Actions (login required)
![]() |
View Item |
