Kanič, Ivan Nekaj pogledov na strokovni jezik v bibliotekarskih objavah: Korpusna analiza strokovnih in znanstvenih člankov, objavljenih v reviji Knjižnica v letihod 1997 do 2016. Knjižnica, 2017, vol. 61, n. 3, pp. 9-58. [Journal article (Paginated)]
Preview |
Text
6606-16531-1-SM.pdf - Published version Download (2MB) | Preview |
English abstract
Purpose: Slovenian library and information science is involved in most advanced international trends and development in the field, consequently rather dynamic changes are imposed on Slovenian LIS terminology. Knjižnica is the top-ranking Slovenian LIS journal so it proves to be the best source for terminology studies and tracking recent advances. Construction of a specialised text corpus which will include all professional and scientific articles, published in Knjižnica in the period between 1997 and 2016, will foster quantitative and semantic analyses and support studies related to the usage of individual terms and collocations, including comparative examination of the national reference corpus. Methodology/approach: In compliance with the preset criteria 553 journal articles were selected and downloaded from the portal where they are published by Knjižnica in public access, they were subject to technical pre-processing and uploaded to the webbased Sketch Engine corpus tool. A tagged corpus comprising 2.4 million words was compiled and segmented into two subcorpora following a temporal scheme of publication. Comparative analyses involved other corpora as well, mostly the national reference corpus FidaPLUS comprising 621 million words. Results: The study was focused on quantitative analyses regarding the frequency of words and semantic relations in concordances, collocations and word sketches in the frame of the special corpus and the reference corpus comparatively. The hypothesis has been confirmed proving that journal articles published in Knjižnica provide a rich selection of traditional as well as most recent LIS related terms, but occuring in rather different collocations than it is the case with the same words in the reference corpus of the general language. A short analysis dealt with author keywords and their frequency in five-year periods of publication to show the subject focus point shift. Research limitation: The study was restricted by the moderate size of the corpus and the number of texts thus limiting the diversity and possibilities to extract less known and low frequency terms. The construction of a larger corpus containing all articles published by Knjižnica would be a feasible practical solution. Practical implications: The recently constructed specialised corpus, practical results achieved and know-how resulting from the study are a solid start point and support for further terminology studies providing useful sources for the updating of the dictionaries, automatic term extraction routine being one of them.
Slovenian abstract
Namen in zasnova: Slovensko bibliotekarstvo je stroka, ki uspešno sledi najnaprednejšim razvojnim tokovom v svetu, to pa se dinamično odraža tudi v njeni terminologiji. Revija Knjižnica je osrednja slovenska znanstvena revija za področje bibliotekarstva, informacijske znanosti in knjigarstva, zato je najprimernejši vir za proučevanje sodobnega strokovnega izrazja. Namen raziskave je vzpostavitev korpusa strokovnih in znanstvenih člankov, ki so bili objavljeni v reviji Knjižnica v obdobju 1997–2016, kvantitativna in pomenska analiza dobljenega gradiva ter ugotavljanje rabe nekaterih izbranih terminov tudi primerjalno z nacionalnim referenčnim korpusom. Metodologija/pristop: Po vnaprej določenih kriterijih je bilo izbranih 553 člankov, ki so prosto dostopni na portalu revije Knjižnica. Po tehnični pripravi in obdelavi besedil je bil s spletnim orodjem Sketch Engine vzpostavljen označeni korpus, ki obsega 2,4 milijona besed. Zaradi časovnih primerjav je razdeljen v dva podkorpusa, primerjalne analize pa so potekale tudi z drugimi korpusi, predvsem z nacionalnim referenčnim korpusom FidaPLUS, ki vsebuje 621 milijonov besed. Analiza/rezultati: Raziskava je obsegala kvantitativne analize pogostnosti besed ter pomenske analize konkordanc, kolokacij in besednih skic v korpusu člankov in tudi primerjalno z referenčnim korpusom. Potrjena je bila hipoteza, da vsebujejo članki v reviji Knjižnica bogat nabor tradicionalnih, pa tudi najnovejših terminov, katerih raba se v strokovnem jeziku po kolokacijah zelo razlikuje od rabe v splošnem jeziku. Kratka analiza je zajela tudi avtorske ključne besede z enostavno primerjavo po štirih petletnih kvartalih. Omejitve raziskave: Količina zajetih besedil omejuje pestrost nabora predvsem redkeje uporabljanih terminov, kar bi lahko brez velikih zadržkov rešila vzpostavitev mnogo večjega korpusa večine v reviji Knjižnica objavljenih člankov. Uporabnost raziskave: Specializirani korpus, rezultati prvih analiz in v raziskavi pridobljena znanja so dobro izhodišče za podporo nadaljnjemu terminološkemu delu in dopolnjevanju slovarjev, veliko obeta tudi avtomatska ekstrakcija enobesednih in večbesednih terminov.
Item type: | Journal article (Paginated) |
---|---|
Keywords: | library science, LIS, terminology, corpus linguistics, text corpus, journal Library |
Subjects: | A. Theoretical and general aspects of libraries and information. > AA. Library and information science as a field. C. Users, literacy and reading. > CZ. None of these, but in this section. L. Information technology and library technology > LL. Automated language processing. |
Depositing user: | Mr Ivan Kanič |
Date deposited: | 26 Jul 2018 07:30 |
Last modified: | 26 Jul 2018 07:30 |
URI: | http://hdl.handle.net/10760/32230 |
References
Downloads
Downloads per month over past year
Actions (login required)
View Item |