The citation from patents to scientific output revisited: A new approach to Patstat / Scopus matching

Guerrero-Bote, Vicente-P., Sánchez-Jiménez, Rodrigo and De-Moya-Anegón, Félix The citation from patents to scientific output revisited: A new approach to Patstat / Scopus matching. El profesional de la información, 2019, vol. 28, n. 4. [Journal article (Unpaginated)]

Text (Research article)
280401_Guerrero_Sanchez_De-Moya_ingles.pdf - Published version
Available under License Creative Commons Attribution.

English abstract

Patents cite both other patents and non-patent literature (NPL), which includes articles published in scientific journals. The technological impact of scientific works can therefore be studied through the citations they receive from patents, just as their scientific impact is analyzed through the citations they receive from other scientific works. NPL references in patents are far from standardized, so determining which scientific article a reference points to is not a trivial task. This paper presents a procedure for linking the NPL references of the patents collected in the Patstat database to the scientific works indexed in the Scopus bibliographic database. The procedure consists of two phases: a broad generation of candidate pairs, followed by validation of those pairs. It has been implemented with reasonably good results at a low cost.
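
The two-phase idea (broad candidate generation, then validation) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the matching criteria, so the rules below (publication year plus first-author surname for candidates, title-word overlap for validation), the record fields, the function names, and the 0.8 threshold are all assumptions chosen for illustration.

```python
import re

def tokens(text):
    """Return the set of lowercase alphanumeric tokens in a string."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def generate_candidates(npl_refs, articles):
    """Phase 1 (broad): pair an NPL string with every article whose
    publication year and first-author surname both occur in it.
    These two criteria are illustrative assumptions."""
    pairs = []
    for ref_id, ref_text in npl_refs.items():
        ref_tokens = tokens(ref_text)
        for art in articles:
            year_found = str(art["year"]) in ref_tokens
            surname_found = tokens(art["surname"]) <= ref_tokens
            if year_found and surname_found:
                pairs.append((ref_id, art["id"]))
    return pairs

def validate(pairs, npl_refs, articles, threshold=0.8):
    """Phase 2 (strict): keep a pair only if at least `threshold` of the
    article's title words occur in the NPL string (assumed criterion)."""
    by_id = {art["id"]: art for art in articles}
    kept = []
    for ref_id, art_id in pairs:
        ref_tokens = tokens(npl_refs[ref_id])
        title_tokens = tokens(by_id[art_id]["title"])
        overlap = len(title_tokens & ref_tokens) / len(title_tokens)
        if overlap >= threshold:
            kept.append((ref_id, art_id))
    return kept

# Hypothetical toy data, not taken from Patstat or Scopus.
npl_refs = {
    1: "SMITH J. ET AL: 'Fast matching of patent references', "
       "J. INFORMETR., vol. 5, 2011, pages 10-20",
}
articles = [
    {"id": "A1", "year": 2011, "surname": "Smith",
     "title": "Fast matching of patent references"},
    {"id": "A2", "year": 2011, "surname": "Smith",
     "title": "Thermal properties of copper alloys"},
]

candidates = generate_candidates(npl_refs, articles)
validated = validate(candidates, npl_refs, articles)
print(candidates)  # both articles share the year and surname
print(validated)   # only the article whose title appears in the reference
```

In this toy case the first phase deliberately over-generates (two candidates for one reference) and the second phase discards the pair whose title does not appear in the reference string, which is the trade-off the abstract describes between broad generation and validation.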

Spanish abstract

Las patentes incluyen citas, tanto a otras patentes como a documentos que no son patentes (NPL, Non-patent literature). Entre estas últimas se incluyen citas a artículos publicados en revistas científicas. Igual que se estudia el impacto científico a través de la citación de artículos y otros trabajos científicos, también se puede estudiar el impacto tecnológico de los trabajos científicos a través de la citación que reciben de las patentes. Las referencias NPL incluidas en las patentes están lejos de estar normalizadas, por lo que determinar a qué artículo científico se refieren no es trivial. En este trabajo se presenta un procedimiento de enlazado de las referencias NPL de las patentes recogidas en la base de datos Patstat y los trabajos científicos indexados en la base de datos bibliográfica Scopus. Dicho procedimiento se compone de dos fases: una generación amplia de parejas candidatas y otra fase de validación de las parejas. Ha sido implementado con resultados razonables y costes asumibles.

Item type: Journal article (Unpaginated)
Keywords: Citation; Quotes; Bibliographic references; Patents; Articles; Scientific production; Pairing; Databases; Patstat; Scopus; Methods; Methodology; Bibliometrics; Informetrics; Statistics; Analysis; Journals; Impact; Mapping; Name game; Citación; Citas; Referencias bibliográficas; Patentes; Artículos; Producción científica; Emparejamiento; Bases de datos; Patstat; Scopus; Métodos; Metodología; Bibliometría; Informetría; Estadísticas; Análisis; Revistas; Impacto; Mapeado; Name game.
Subjects: B. Information use and sociology of information > BB. Bibliometric methods
H. Information sources, supports, channels. > HB. Gray literature.
H. Information sources, supports, channels. > HN. e-journals.
Depositing user: Tomàs Baiget
Date deposited: 27 Feb 2026 15:28
Last modified: 27 Feb 2026 15:28
URI: http://hdl.handle.net/10760/47713
