Minería textual

Eíto Brun, Ricardo and Senso, José A. Minería textual. El profesional de la información, 2004, vol. 13, n. 1. [Journal article (Unpaginated)]


Download (623kB) | Preview

English abstract

This article attempts to establish a definition for "text mining" and, at the same time, to identify its relationship with other fields: text retrieval, data mining and computational linguistics. In addition, there is an analysis of the impact of text mining, a reference to existing commercial applications on the market and, lastly, a brief description of the techniques used for developing and implementing text mining systems.

Spanish abstract

: Este artículo trata de establecer una definición de minería de textos así como delimitar su relación con otras disciplinas: recuperación textual, minería de datos y lingüística computacional. Se analizan además el impacto de la minería textual, algunas de las aplicaciones comerciales existentes en el mercado y, por último, se realiza una breve descripción de las técnicas utilizadas para desarrollar e implementar sistemas de minería de textos

Item type: Journal article (Unpaginated)
Keywords: Text minig, Data mining, Information retrieval, Clustering, Categorizing, Concept classification
Subjects: A. Theoretical and general aspects of libraries and information.
Depositing user: Bernardita Alvarez
Date deposited: 05 May 2008
Last modified: 02 Oct 2014 12:11
URI: http://hdl.handle.net/10760/11491


Arenas, Lourdes; Moral, Anselmo del. “Automatic indexing of documents”. En: Nuevas tendencias en inteligencia artificial. Bilbao: Universidad de Deusto, 1992, pp. 355-367.

Ananyan, Sergei; Kharlamov, Alexander. Automated analysis of natural language texts. White paper de Megaputer. Consultado en: 27-12-03. http://www.megaputer.com/tech/wp/tm.php3

Baeza-Yates, Ricardo; Ribeiro-Neto, Berthier. Modern information retrieval. Harlow: Addison-Wesley , 1999, 514 p.

Berry, Michael W.; Drmac, Zlatko ; Jessup, Elizabetch R. “Matrices, vector spaces and information retrieval”. En: Siam Review, abril, 1999, v. 41, n. 2, pp. 335-362. Consultado en: 27-12-03. http://www.siam.org/journals/sirev/41-2/34703.html

Chakrabarti, Soumen. Mining the Web: Discovering Knowledge From Hypertext Data. Amsterdam: Morgan Kaufmann, 2003, xviii, 345 p. Cutting, Douglas R. [et al.]. “Scatter/gather: a cluster-based approach to browsing large document collections”. En: 15th Annual International Sirgi 92. Consultado en: 27-12-03. http://citeseer.nj.nec.com/cutting92scattergather.html

Darpa information access office web site. Consultado en: 27-12-03. http://www.darpa.mil/iao

Deerwester, Scott [et al.]. “Indexing by latent semantic analysis”. En: Journal of the American Society for Information Science, 1990, v. 41, n. 6, pp. 391-407. Consultado en: 27-12-03. http://lsa.colorado.edu/papers/JASIS.lsi.90.pdf

Etxeberría Murgiondo, Juan [et al.]. Análisis de datos y textos. Madrid: Ra-ma, 1995, xi, 372 p.

Kent, Allan; Lancour, Harold (ed.). Encyclopedia or library and information science. New York: Marcel Dekker, 1968-1989. FAS (Federation of American Scientist) web site. Consultado en: 27-12- 03.http://www.fas.org

Gemert, Jan Van. “Text mining tools on the internet: an overview”. En: Isis Technical Report Series, septiembre, 2000, v. 23. Consultado en: 27- 12-03. http://www.ai.mit.edu/people/jimmylin/papers/Gemert00.pdf

Hearst, Marti. “Untangling text data mining”. En: Proceedings of ACL'99: the 37th annual meeting of the Association For Computational Linguistics, junio, 1999. Consultado en: 27-12-03. http://www.sims.berkeley.edu/~hearst/papers/acl99/acl99-tdm.html

IBM. Intelligent miner for text: guía de iniciación versión 2.3. 2ª ed. [S.l.]:

IBM, diciembre, 1998. vii, 53 p. Publicación número SH 10-9238-01.

IBM. Text analysis tools: intelligent miner for text version 2.3.1. [S.l.]:

IBM, junio 2000, viii, 92 p. Publicación número SH 12-6370-01.

Frakes, William B.; Baeza-Yates, Ricardo (eds.). Information retrieval: data structures & algorithms. New Jersey: Prentice Hall, 1992, viii, 504 p.

Jain, A. K.; Murty, M. N.; Flynn, P. J. “Data clustering: a review”. En: ACM Computing Surveys, septiembre, 1999, v. 31, n. 3, pp. 265-323. Consultado en: 27-12-03. http://citeseer.nj.nec.com/jain99data.html

Klir, George J.; Yuan, Bo. Fuzzy sets and fuzzy logic: theory and applications. New Jersey: Prentice Hall, 1995, xv, 574 p.

Landauer, Thomas K.; Foltz, Peter W.; Lahan, Darrell. “An introduction to latent semantic analysis”. En: Discourse Processes, 1998, n. 25, pp. 259-284. Consultado en: 27-12-03. http://lsa.colorado.edu/papers/dp1.LSAintro.pdf

Lucas, Marty. Mining in textual mountains. [Entrevista a Marti Hearst realizada el 18 de noviembre de 1999]. Consultado en: 27-12-03. http://mappa.mundi.net/trip-m/hearst

Maron, M. E.; Kuhns, J. L. “On relevance, probabilistic indexing and information retrieval”. En: Journal of the ACM, 1960, v. 7, n. 3, pp. 216-244.

Miyamoto, Sadaaki. Fuzzy sets in information retrieval and cluster analysis. Dordrecht : Kluwer Academic Publishers , 1990, x, 258 p. National strategy for combating terrorism. Febrero 2003. Consultado en: 27-12-03. http://www.whitehouse.gov/news/releases/2003/02/counter_terrorism/counter_terrorism_strategy.pdf

Pao, Miranda Lee. Concepts of information retrieval. Englewood, Colorado: Libraries Unlimited, 1990.

Ravin, Yael. Extracting names from natural-language text: IBM research report. Almaden: IBM research division, 04/10/1997 (RC 20338 digital libraries), 30 p.

Rijsbergen, C. J. Van. Information retrieval. 2ª ed. London [etc.]: Butterworths, 1979 (reimp. 1980).

Salton, Gerard; McGill, Michael J. Introduction to modern information retrieval. New York [etc.]: McGraw Hill, 1983.

SAS Institute Inc. Getting started with SAS text miner software, release 8.2. Pubcode: 58859. Consultado en: 27-12-03. http://www.sas.com

SAS text miner: distilling textual data for competitive business advantage: a SAS white paper. Consultado en: 27-12-03. http://www.sas.com

“SAS signs text mining alliance with Inxight”. En: eWeek, 19 de enero de 2002. Consultado en: 27-12-03. http://www.eweek.com

Sebastiani, Fabrizio. “Machine learning in automated text categorization”. En: ACM Computing Surveys, marzo, 2002, v. 34, n. 1, pp. 1-47.

StatSoft textbook. Consultado en: 27-21-03 http://www.statsoft.com/textbook

Sullivan, Dan. Document warehousing and text mining. New York [etc.]: Wiley Computer Publishing, 2001, xviii, 542 p.

Swanson, Don R. “Assessing a gap in the biomedical literature: Magnesium deficiency and neurologic disease”. En: Neuroscince Research Communications, 1994, n. 15, pp. 1-9.

Swanson, Don R. “An interactive system for finding complementary literatures: a stimulus to scientific discovery”. En: Artificial Intelligence, 1997, n. 91, pp. 183-203.

Swanson, Don R. “Two medical literatures that are logically but not bibliographically connected”. En: Journal of the American Society for Information Science, 1987, v. 38, n. 4, pp. 228-233.

Tan, Ah-Hwee. “Text mining: the state of the art and the challenges”. En Proceedings Pakdd'99 workshop on knowledge discovery from advanced databases, abril, 1999, pp. 71-76. Consultado en: 27-12-03. http://textmining.krdl.org.sg/people/ahhwee/

Thomas, Timothy L. “Al qaeda and the internet: the danger of ‘cyberplanning’”. En: Parameters, primavera, 2003. Consultado en: 27-12-03. http://fmso.leavenworth.army.mil/fmsopubs/ISSUES/alqaedainternet.htm

Watkins, D. S. Fundamentals of matrix computations. New York: John Wiley & Sons, 1991. Yang, Y.; Pedersen, J. O. “A comparative study on feature selection in text categorization”. En: Proceedings of the fourteenth international conference on machine learning, 1997.


Downloads per month over past year

Actions (login required)

View Item View Item