Classification of keywords selected from research articles on physics and development of a quantitative subject access tool

Dutta, Bidyarthi and Majumder, Krishnapada and Sen, B. K. Classification of keywords selected from research articles on physics and development of a quantitative subject access tool., 2013 . In IFLA World Library & Information Congress (WLIC) 2013 Singapore, 17/08/2013-22/08/2013. [Conference paper]


Download (113kB) | Preview
112-dutta-en.pdf - Published version

Download (113kB) | Preview

English abstract

All research articles begin with a title. Most include an abstract. Several include keywords. All three of these features describe an article’s content in details. The title sends an instant reflection of the central theme of the research topic. The abstract summarizes the content. The keywords indicate the core and allied fields of concern. The researchers and indexers quickly and easily locate particular articles within their areas of interest with the aid of keywords. Keywords hold prime importance in abstracting and indexing services. Keywords play major role in information retrieval function. This paper is based on analysis of 14,221 keywords collected from 2,526 research articles published in three journals, viz. Chaos, Physics of plasmas and Low temperature physics since 2006 to 2012. Out of all these author-assigned keywords, the number of distinct bits obtained was 2571. After collection, the lexically close keywords are identified that form clusters. Several such clusters are found and the composition of keywords in nearly all clusters varies over the said time span. Four indicators have been defined on the basis of fluctuating keyword composition within clusters. The name given to these four indicators are stability index, integrated visibility index, momentary visibility index and potency index respectively. These indicators hold different values for different clusters. The value ranges of them are categorized in five groups, viz. very high, high, medium, low and very low. A new quantitative subject access tool has been proposed on the basis of these indicators, which can predict the probable new and obsolete keywords in any subject domain. The name given to this new tool is keysaurus, i.e. keyword-based-thesaurus.

English abstract

Item type: Conference paper
Keywords: Keyword cluster analysis; information retrieval; information retrieval thesaurus; keysaurus, knowledge classification; subject access tool; keywords of physics; knowledge indicators
Subjects: A. Theoretical and general aspects of libraries and information. > AA. Library and information science as a field.
A. Theoretical and general aspects of libraries and information. > AB. Information theory and library theory.
I. Information treatment for information services > IB. Content analysis (A and I, class.)
I. Information treatment for information services > ID. Knowledge representation.
Depositing user: Bidyarthi Dutta
Date deposited: 31 Jul 2013 05:35
Last modified: 02 Oct 2014 12:27


"SEEK" links will first look for possible matches inside E-LIS and query Google Scholar if no results are found.


1) Bertrand A, Cellier J M, Psychological approach to indexing: effects of the operator’s expertise upon indexing behaviour, Journal of Information Science, 21 (6) (1995) 459-472.

2) Suraud M G et al, On the significance of databases keywords for a large-scale bibliometric investigation in fundamental physics, Scientometrics, 33 (1) (1995) 41-63.

3) Voorbij H J, Title Keywords and Subject Descriptors: A Comparison of Subject Search Entries of Books in the Humanities and Social Sciences, Journal of Documentation, 54 (1998) 466-476.

4) Solomon P, Use-based methods for classification development. Proceedings of the 2nd ASIS SIG/CR Classification Research Workshop. Washington DC (1991).

5) Hurt C D, Classification and subject analysis: looking to the future at a distance, Cataloguing and Classification Quarterly, 24 (1-2) (1997) 97-112.

6) Soergel D et al, Re-engineering thesauri for new applications: the AGROVOC example, Journal of Digital Information, 4 (4) (2004).

7) Bates M J, Wilde D N and Siegfried S, An analysis of search terminology used by humanities scholars: the Getty Online Searching Project Report, No.1, Library Quarterly, 63 (1) (1993) 1-39.

8) Willett P, Recent trends in hierarchical document clustering: A critical review, Information Processing & Management, 24 (5) (1988) 577-597.

9) Sparck Jones, K. (1970). Some thoughts on classification for retrieval. Journal of Documentation 26 (2): 89-101.


Downloads per month over past year

Actions (login required)

View Item View Item