Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies

Jimenez-Contreras, Evaristo, Ruiz-Pérez, Rafael and Delgado-Lopez-Cozar, Emilio Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies. Journal Medical Library Association, 2002, vol. 90, n. 4. [Journal article (Unpaginated)]

[thumbnail of Spanish_personal_name_variations_in_national_and_international_biomedical_databases_implications_for_information_retrieval_and_bibliometric_studies.pdf]
Preview
PDF
Spanish_personal_name_variations_in_national_and_international_biomedical_databases_implications_for_information_retrieval_and_bibliometric_studies.pdf

Download (290kB) | Preview

English abstract

Objectives: The study sought to investigate how Spanish names are handled by national and international databases and to identify mistakes that can undermine the usefulness of these databases for locating and retrieving works by Spanish authors. Methods: The authors sampled 172 articles published by authors from the University of Granada Medical School between 1987 and 1996 and analyzed the variations in how each of their names was indexed in Science Citation Index (SCI), MEDLINE, and I´ndice Me´dico Español (IME). The number and types of variants that appeared for each author’s name were recorded and compared across databases to identify inconsistencies in indexing practices. We analyzed the relationship between variability (number of variants of an author’s name) and productivity (number of items the name was associated with as an author), the consequences for retrieval of information, and the most frequent indexing structures used for Spanish names. Results: The proportion of authors who appeared under more then one name was 48.1% in SCI, 50.7% in MEDLINE, and 69.0% in IME. Productivity correlated directly with variability: more than 50% of the authors listed on five to ten items appeared under more than one name in any given database, and close to 100% of the authors listed on more than ten items appeared under two or more variants. Productivity correlated inversely with retrievability: as the number of variants for a name increased, the number of items retrieved under each variant decreased. For the most highly productive authors, the number of items retrieved under each variant tended toward one. The most frequent indexing methods varied between databases. In MEDLINE and IME, names were indexed correctly as ‘‘first surname second surname, first name initial middle name initial’’ (if present) in 41.7% and 49.5% of the records, respectively. However, in SCI, the most frequent method was ‘‘first surname, first name initial second name initial’’ (48.0% of the records) and first surname and second surname run together, first name initial (18.3%). Conclusions: Retrievability on the basis of author’s name was poor in all three databases. Each database uses accurate indexing methods, but these methods fail to result in consistency or coherence for specific entries. The likely causes of inconsistency are: (1) use by authors of variants of their names during their publication careers, (2) lack of authority control in all three databases, (3) the use of an inappropriate indexing method for Spanish names in SCI, (4) authors’ inconsistent behaviors, and (5) possible editorial interventions by some journals. We offer some suggestions as to how to avert the proliferation of author name variants in the databases.

Item type: Journal article (Unpaginated)
Keywords: Information retrieval
Subjects: B. Information use and sociology of information > BB. Bibliometric methods
Depositing user: Daniel Torres-Salinas
Date deposited: 30 Mar 2009
Last modified: 02 Oct 2014 12:13
URI: http://hdl.handle.net/10760/12869

References

WILLIAMS ME, LANNOM L. Lack of standardization of the journal title data element in data bases. J Am Soc Inf Sci 1981 May;32(3):229–33.

HAWKINS DT. Unconventional uses of on-line information retrieval systems: on-line bibliometric studies. J Am Soc Inf Sci 1977 Jan;28(1):13–8.

SMITH LC. Citation analysis. Libr Trends 1981 Summer; 30(1):83–106.

GALBÁ N C, VÁ ZQUEZ M. Las bases de datos como fuentes de información para estudios bibliométricos. Bol Anabad 1988;38(1–2):369–81.

MACROBERTS MH, MACROBERTS B. Problems of citation analysis: a critical review. J Am Soc Inf Sci 1989 Sep;40(5): 342–9.

MOED HF, VIRIENS M. Possible inaccuracies occurring in citation analysis. J Inf Sci 1989;15(2):95–117.

RICE RE, BORGMAN CL, BEDNARSKI D, HART PJ. Journalto-journal citation data: issues of validity and reliability. Scientometrics 1989 Mar;15(3–4):257–82.

LARDY JP, HERZHAFT L. Bibliometric treatments according to bibliographic errors and data heterogeneity: the end-user point of view. In: Online Information 92, Proceedings of the 16th International Online Information Meeting; London, U.K.; 8–10 Dec 1992. Oxford, NJ: Learned Information, 1992: 547–56.

SANCHO R. Indicadores bibliométricos utilizados en la evaluacio ´n de la ciencia. Rev Esp Doc Cient 1990;13(3–4):842–65.

LÓ PEZ PIN˜ ERO JM, TERRADA ML. Los indicadores bibliome ´tricos y la evaluación de la actividad médico-científica: (III) los indicadores de producción, circulación y dispersión, consumo de la información y repercusión. Med Clı´n (Barc) 1992 Feb 1;98(4):142–8.

JEANNIN PH. L’evaluation quantitative de la recherche en sciences sociales et humaines. In: Revue de sciences sociales et humaines. Actes du séminaire ‘‘La communication et l’information scientifiques entre spécialistes’’ (1991–1992). Toulouse: IUT, Université de Toulouse III, 1992:42.

PULIDO M, GONZÁ LEZ JC, SANZ F. Errores en las referencias bibliográficas: un estudio retrospectivo en Medicina Clínica (1962–1992). Med Clı´n (Barc) 1995 Feb 11;104(5):170–4.

VÁ ZQUEZ M, GALBÁ N C. Lack of standardisation in the corporate source field of different databases. In: Proceedings of the 10th International Online Information Meeting; London, U.K.; 2–4 Dec 1986:335–52.

RICE, op. cit.

HUDNUT SK. Should journal references be standardized? In: Proceedings of the 12th National Online Meeting 1991. Medford NJ: Learned Information, 1991:149–55.

RITTBERGER M, RITTBERGER W. Measuring quality in the production of databases. J Inf Sci 1997;23(1):25–37.

SPINAK E. Errores ortográficos en el ingreso en bases de datos. Rev Esp Doc Cient 1995;18(3):307–19.

BELL J, SPEER S. Bibliographic verification for interlibrary loan: is it necessary? Coll Res Libr 1988 Nov;49(6):494–500.

FULLER EE. Variation in personal names in works represented in the catalog. Cat Class Quart 1989;9(3):75–95.

WEINTRAUB TS. Personal name variations: implications for authority control in computerized catalogs. Libr Resour Tech Serv 1991 Apr;35(2):217–28.

JONES EA. Consistency in choice and form of main entry: a comparison of Library Congress and British Library monograph cataloging. Libr Resour Tech Serv 1992 Apr;36(2):209–23.

PITERNICK AB. What’s in a name? use of names and titles in subject searching. Database 1985 Dec;8(4):22–8.

PITERNICK AB. Name of an author! Indexer 1992 Oct; 18(2):95–100.

KOTIAHO JS, TOMKINS JL, SIMMONS LW. Unfamiliar citations breed mistakes. Nature 1999 Jul 22;400(6742):307.

CORROCHANO LM. Spanish practice. Nature 1996 Nov 14;384(6605):106.

SELLICK JTC. Multiple author. Nature 1996 Oct 17; 383(6601):569.

PILACHOWSKI DM, EVERETT D. What’s in a name? looking for people online-social sciences. Database 1985;8(3):47–65.

SNOW B. Caduceus: people in medicine names online. Online 1986 Sep;10(5):122–7.

MENEGHINI R. Systematization of academic and scientific affiliation, or how to prevent data on your publications from being lost in the national and international data base. Braz J Med Biol Res 1995 Jun;28(6):617–9.

D’AURIA D. Six characters in search of an author [editorial]. Occup Med (Oxford) 1997 May;47(4):195.

SHORE ML. Variation between personal name headings and title page usage. Cat Class Quart 1984 Summer;4(4):1– 11.

SWEETLAND JH. Errors in bibliographic citations: a continuing problem. The Libr Quart 1989 Oct;59(4):291–304.

BORGMAN CL, SIEGFRIED SL. Getty’s synoname and its cousins: a survey of applications of personal name-matching algorithms. J Am Soc Infor Sc 1992 Aug;43(7):459–76.

AACR2 1998. Anglo-American cataloguing rules. 2d ed., 1998 revision. Ottawa, ON, Canada: Canadian Library Association; London, U.K.: Library Association Publishing; Chicago,IL: American Library Association.

RCE 1995. Reglas de Catalogación Espan˜olas. Madrid, Spain: Dirección General del Libro, Archivos y Bibliotecas, 1995:431–454.

INTERNATIONAL FEDERATION OF LIBRARY ASSOCIATIONS AND INSTITUTIONS. Names of persons: national usages for entries in catalogues. London, U.K.: IFLA International Office for UBC, 1977:39–41.

GÓ MEZ I, CCMA L, MORILLO F, CAMı´ J. Medicina Clínica (1992–1993) vista a través del Science Citation Index. Med Clı´n (Barc) 1997 Oct 18;109(13):497–505.

LÓ PEZ-CÓ ZAR E. Incidencia de la normalización de las revistas científicas en la transferencia y evaluación de la informacio ´n científica. Rev Neurol 1997 Dec;25(148):1942–6.

SALVADÓ PÉ REZ L, MOLINA TROYA J. ¿MEDLINE e Índice Médico Espan˜ol son mutuamente excluyentes? Med Clı´n (Barc) 1997 Jan 18;108(2):79.

BROWN CM. Complementary use of the SciSearch database for improved biomedical information searching. Bull Med Libr Assoc 1998 Jan;86(1):63–7.

TERRADA ML, CUEVA A, MOTA A, OSCA MJ, ALEIXANDRE R, CEBRIAN M, GIMENO E, ALMERO A, CUSSAC N. La base de datos IME y el repertorio Índice Médico Espan˜ol (1965–1992). In: Congreso y Conferencia FID XLVI 199; Madrid, Spain, 1992;5:210–6.

CUEVA A, TERRADA ML. La documentación médica espan ˜ola. el Índice Médico Espan˜ol y el estudio de la actividad científica. Cuad Salud 1991;13:121–6.

PULIDO M. Index Medicus: cobertura y manejo. Med Clín (Barc) 1987 Mar 28;88(12):500–4.

JORDA M. Documentación biomédica: estructura y funcionamiento de las bases de datos bibliográficas. Med Clín (Barc) 1991 Sep 7;97(7):265–71.

PESTAN˜ A A. El MEDLINE como fuente de información bibliométrica de la producción Espan˜ola en biomedicina y ciencias médicas. comparación con el Science Citation Index. Med Clı´n (Barc) 1997;109(13):506–11.

RCE 1995, op. cit., 431–54.

INTERNATIONAL FEDERATION OF LIBRARY ASSOCIATIONS AND INSTITUTION, op. cit., 39–41.

O’NEILL ET, VIZINE-GOETZ D. Quality control in online database. Annu Rev Inform Sci Tech 1988;23:125–47.

ODDY P. Authority control in the local, national, and international environment. In: Standards for the international exchange of bibliographic information. London, U.K.: Library Association Publishing, 1991:66–72.

JACSO P. Content evaluation of databases. Annu Rev Inform Sci Tech 1997;32(Chap 5):231–67.

NLM CATALOGING SECTION. Cataloging system. [Web document]. [rev. 16 Jan 2001; cited 18 Jun 2002]. ,http://www.nlm.nih.gov/tsd/cataloging/topics.html#CatSys..

PROGRAM FOR COOPERATIVE CATALOGING. NACO. [Web document]. [rev. 16 Jan 2001; cited 18 Jun 2002]. ,http://www.loc.gov/catdir/pcc/naco.html..

MARC 21 CONCISE BIBLIOGRAPHIC. Main entry personal name. [Web document]. [rev. 16 Jan 2001; cited 18 Jun 2002].http://lcweb.loc.gov/marc/bibliographic/ecbdmain.html#mrcb100

WILLIAMS RM (ISI Europe, England). Indexing rules. Email (rwilliams@smtpgwy.isinet.com) sent to: rruiz@ugr.es. 19 May 1998.

SILVA GA. Nombres de pila completos: las iniciales no bastan. Med Clin (Barc.) 1992 Oct 10;99(11):435.


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item