Automatic vs. manual categorization of documents in Spanish

G.-Figuerola, Carlos and Zazo, Ángel F. and Alonso-Berrocal, José-Luis Automatic vs. manual categorization of documents in Spanish. Journal of Documentation, 2001, vol. 57, n. 6, pp. 763-773. [Journal article (Paginated)]

[img]
Preview
PDF
figuerola2001automatic.pdf

Download (45kB) | Preview

English abstract

Automatic categorisation can be understood as a learning process during which a programme recognises the characteristics that distinguish each category or class from others, i.e. those characteristics which the documents should have in order to belong to that category. As yet few experiments have been carried out with documents in Spanish. Here we show the possibilities of elaborating pattern vectors that include the characteristics of different classes or categories of documents, using techniques based on those applied to the expansion of queries by relevance; likewise, the results of applying these techniques to a collection of documents in Spanish are given. The same collection of documents was classified manually and the results of both procedures were compared.

Item type: Journal article (Paginated)
Keywords: Information retrieval, categorization
Subjects: L. Information technology and library technology > LM. Automatic text retrieval.
Depositing user: Ángel F. Zazo Rodríguez
Date deposited: 15 Feb 2010
Last modified: 02 Oct 2014 12:15
URI: http://hdl.handle.net/10760/13924

Downloads

Downloads per month over past year

Actions (login required)

View Item View Item