Üstverinin Tam-Metin Bilgi Erişim Performansı Üzerindeki Etkisi: Küçük Ölçekli Türkçe Külliyat Üzerinde Deneysel Bir Araştırma / Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus

Çapkın, Çağdaş Üstverinin Tam-Metin Bilgi Erişim Performansı Üzerindeki Etkisi: Küçük Ölçekli Türkçe Külliyat Üzerinde Deneysel Bir Araştırma / Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus. Türk Kütüphaneciliği, 2016, vol. 30, n. 4, pp. 678-701. [Journal article (Paginated)]

Alternative locations: http://tk.org.tr/index.php/TK/article/view/2731/2691

English abstract

Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) content. The aim of this research was to evaluate the impact of these content types on information retrieval performance. For this purpose, metadata (MIR), full-text (FIR) and hybrid (HIR) content information retrieval systems were developed with the default Lucene information retrieval model for a small-scale Turkish corpus. To evaluate the performance of these three systems, “precision-recall” and “normalized recall” tests were conducted. Experimental findings showed that there was no significant difference between MIR and FIR in mean average precision (MAP) performance. On the other hand, the MAP performance of HIR was significantly higher than that of MIR and FIR. When information retrieval performance was evaluated from a user-centered perspective, the “normalized recall” performances of MIR and HIR were significantly higher than that of FIR. Additionally, there was no significant difference between the systems in the mean number of relevant documents retrieved. Processing different types of content, such as metadata and full text, had advantages and disadvantages for information retrieval systems in terms of term management. Hybrid content processing (HIR) brought these advantages together and improved information retrieval performance. [There is an extended English summary at the end of the article.]
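
For readers unfamiliar with the two measures named in the abstract, the following is a minimal sketch of how mean average precision and normalized recall are commonly computed. It is not taken from the article: the function names and the toy document IDs are hypothetical, and the normalized recall formula used here is the classic rank-based textbook definition, which is an assumption and may differ from the exact variant applied in the study.

```python
from typing import List, Sequence, Set, Tuple


def average_precision(ranking: Sequence[str], relevant: Set[str]) -> float:
    """Average of the precision values at each rank where a relevant document appears."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    # Relevant documents that were never retrieved contribute zero.
    return precision_sum / len(relevant)


def mean_average_precision(runs: List[Tuple[Sequence[str], Set[str]]]) -> float:
    """Mean of average precision over a set of (ranking, relevant set) pairs, one per query."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


def normalized_recall(ranking: Sequence[str], relevant: Set[str]) -> float:
    """Classic rank-based normalized recall (assumed definition, see lead-in).

    Assumes `ranking` orders the whole collection, so every relevant
    document has a rank. Returns 1.0 for an ideal ranking and 0.0 for
    the worst possible one.
    """
    collection_size = len(ranking)
    ranks = [r for r, doc_id in enumerate(ranking, start=1) if doc_id in relevant]
    n = len(relevant)
    if len(ranks) != n:
        raise ValueError("every relevant document must appear in the ranking")
    if n == 0 or n == collection_size:
        return 1.0  # nothing to order, so any ranking is ideal
    ideal_rank_sum = n * (n + 1) // 2  # relevant documents at ranks 1..n
    return 1.0 - (sum(ranks) - ideal_rank_sum) / (n * (collection_size - n))


# Toy example with hypothetical document IDs.
toy_ranking = ["d1", "d2", "d3", "d4"]
toy_relevant = {"d1", "d3"}
toy_ap = average_precision(toy_ranking, toy_relevant)
toy_rnorm = normalized_recall(toy_ranking, toy_relevant)
```

In this toy case the relevant documents sit at ranks 1 and 3, so average precision averages 1/1 and 2/3, while normalized recall penalizes the single rank position by which the second relevant document misses the ideal ordering.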

Turkish abstract

Bilgi kurumları üstveri, tam-metin veya hem üstveri hem de tam-metin (melez) içerikleri depolamak, dizinlemek ve eriştirmek için metin tabanlı bilgi erişim sistemleri kullanmaktadır. Araştırmanın amacı, bu içeriklerin bilgi erişim performansı üzerindeki etkisini değerlendirmektir. Bu amaçla, küçük ölçekli bir Türkçe külliyat için varsayılan Lucene bilgi erişim modelini kullanan üstveri (ÜBES), tam-metin (TBES) ve melez (MBES) içerik bilgi erişim sistemleri geliştirilmiştir. Bu üç sistemin performansını değerlendirmek için "duyarlılık - anma" ve "normalize sıralama" testleri yapılmıştır. Deneysel bulgular, ÜBES ve TBES arasında ortalama duyarlılık performansında anlamlı bir fark olmadığını göstermiştir. Diğer taraftan, MBES’in ortalama duyarlılık performansı ÜBES ve TBES’ten anlamlı olarak yüksektir. Bilgi erişim performansı kullanıcı-merkezli olarak değerlendirildiğinde, ÜBES ve MBES’in normalize sıralama performansları TBES’e göre anlamlı olarak yüksektir. Ayrıca, üç bilgi erişim sisteminin eriştiği ilgili doküman ortalamaları arasında anlamlı bir farka ulaşılamamıştır. Bilgi erişim sistemlerinde üstveri ve tam-metin gibi farklı türlerdeki içeriklerin işlenmesinde terim yönetimi bakımından bazı avantajlar ve dezavantajlar bulunmaktadır. Melez içerik işleme (MBES), avantajları bir araya getirmiş ve bilgi erişim performansını artırmıştır.

Item type:	Journal article (Paginated)
Keywords:	Bilgi erişim; dizinleme; otomatik dizinleme; üstveri; performans değerlendirme; Türk Kütüphaneciliği; Apache Lucene; information retrieval; indexing; automatic indexing; metadata; performance evaluation; Turkish Librarianship
Subjects:	I. Information treatment for information services > IC. Index languages, processes and schemes. L. Information technology and library technology > LM. Automatic text retrieval. L. Information technology and library technology > LR. OPAC systems. L. Information technology and library technology > LS. Search engines.
Depositing user:	Dr. Çağdaş ÇAPKIN
Date deposited:	18 Jan 2017 20:19
Last modified:	18 Jan 2017 20:19
URI:	http://hdl.handle.net/10760/30523
