Els models matemàtics de Recuperació de la Informació i la seva implementació en motors de cerca de propòsit general

Ardanuy, Jordi Els models matemàtics de Recuperació de la Informació i la seva implementació en motors de cerca de propòsit general., 2003 UNSPECIFIED. (Unpublished) [Other]

[thumbnail of motors.pdf]
Preview
PDF
motors.pdf

Download (370kB) | Preview

English abstract

Mathematical models of the Information Retrieval and real implementation in general search engines.

Catalan abstract

Models matemàtics per a la recuperació de la informació i la implementació real en buscadors.

Item type: Other
Keywords: Information retrieval, Search engines, Mathematical models
Subjects: L. Information technology and library technology > LS. Search engines.
Depositing user: Jordi Ardanuy
Date deposited: 01 Dec 2006
Last modified: 02 Oct 2014 12:05
URI: http://hdl.handle.net/10760/8507

References

Gianni Amati, Claudio Carpineto and Gianni Romano (2001). «FUB at TREC-10 Web Track: A Probabilistic Framework for Topic Relevance Term Weighting» [en línia]. En: NIST Special Publication 500-250: The Tenth Text REtrieval Conference (TREC 2001). NIST, National Institute of Standards and Technology, 2001. < http://trec.nist.gov/pubs/trec10/papers/fub01.pdf». [Consulta: 26 d’abril 2003]

Gianni Amati, Cornelis Joost Van Rijsbergen (2002). «Probabilistic models of information retrieval based on measuring the divergence from randomness». ACM Transactions on Information Systems, vol. 20, no 4 (October 2002), p. 357-389.

P. Anick, J. Brennan, R. Flynn, D. Hanssen, B. Aley, J. Robbins (1990). «A direct manipulation interface for Boolean information retrieval via natural language query.». En: Proceedings of the thirteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 135-150.

R. Baeza-Yates, B. Ribeiro-Neto (1999). Modern Information Retrieval. New York: ACM Press.

B. T. Bartell, G. W. Cottrell, R. K. Belew (1992). «Latent Semantic Indexing is an Optimal Special Case of Multidimensional Scaling». En: Proceedings of the fifteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 161-167.

B. T. Bartell, G. W. Cottrell, R. K. Belew (1994) «Automatic combination of multiple ranked retrieval systems». En: Proceedings of the seventeenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p.173 - 181. Disponible en línia <http://citeseer.nj.nec.com/bartell94automatic.html> [Consulta: 1 de maig de 2003].

N.J. Belkin, C. Cool, W.B. Croft J.P. Callan (1993). «The effect of multiple query representations on information retrieval performance». Proceedings of the sixteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 339-346. Disponible en línia <http://citeseer.nj.nec.com/ belkin93effect.html> [Consulta: 1 de maig de 2003].

M. Berry, S. T. Dumais, G. W. O’Brien (1995). «Using linear algebra for intelligent information retrieval». SIAM Review, vol 37, no 4 (1995), p. 573-595. Disponible en línia <http://citeseer.nj.nec.com/berry95using.html> [Consulta: 1 de maig de 2003].

A. Bookstein (1978). «On the perils of merging Boolean and weighted retrievals systems». Journal of the American Society for Information Science, v. 29, n 3 (March 1978), p. 156-178.

A. Bookstein (1980). «Fuzzy requests: an approach to weighted Boolean searches». Journal of the American Society for Information Science, vol. 31, no. 4 (July 1980), p. 240-247.

A. Bookstein (1985). «Implications of Boolean structure for probabilistic retrieval». En: Proceedings of the Eighth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 11-17.

E. H. Brenner (1996). Beyond Boolean: New approaches to information retrieval. Philadelphia: NFAIS.

J. P. Callan (1996). «Document filtering with inference networks». En: Proceedings of the Nineteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 262-269.

J. P. Callan, Z. Lu, W. B. Croft (1995). «Searching Distributed Collections With Inference Networks». En: Proceedings of the Eighteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 21-28.

W. S. Cooper (1991). «Some inconsistencies and misnomers in probabilistic information retrieval». En: Proceedings of the fourteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New York: ACM Press, p. 57-61.

W. S. Cooper (1994). «The formalism of probability theory in IR: A foundation or an encumbrance?». En: Proceedings of the seventeenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New York: ACM Press, p. 242-247.

W. S. Cooper, F.C. Gey, D.P. Dabhey. (1992). «Probabilistic Retrieval Based on Staged Logistic Regression». En: Proceedings of the Fifteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New York: ACM Press, p. 198-210.

W. B. Croft (1983). «Experiments with representations in a document retrieval system». Information Technology: Research and Development, vol. 2, no 1 (January 1983), p. 1-21.

W. B., Croft, D. J. Harper (1979). «Using probabilistic models of retrieval without relevance information». Journal of Documentation, vol. 35, no 4 (1979) p. 285--295.

J. Dowling. (2002). Information Retrieval using Latent Semantic Indexing and a Semi-Discrete Matrix Decomposition [en línia]. [Melbourne]: Monash University, October, 2002. <http://www.pcug.org.au/~jdowling/BCompHons.PDF>. [Consulta: 1 maig 2003].

S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer (1990). «Indexing by latent semantic analysis», Journal of the American Society for Information Science, vol 41, no 6 (September 1990) p. 391-407.

I. S. Dhillon, D. S. Modha (2001). «Concept Decompositions for Large Sparse Text Data Using Clustering». Machine Learning, vol 42, no 1-2 (January-February 2001), p. 143-175.

S. T. Dumais (1991). «Improving the retrieval of information from external sources». Behavior Research Methods, Instruments, & Computers, vol 23, no 2 (1991), p. 229—236.

S. T. Dumais, G. W. Furnas, T. K. Landauer, S Deerwester (1988). «Using latent semantic analysis to improve information retrieval». En: Proceedings of the SIGCHI conference on Human factors in computing systems, New York: ACM Press, p. 281-285.

D. Ellis. «Paradigms and research traditions in information retrieval research». Information Services & Use, vol 18, no 4 (1998), p. 225-241.

W. B. Frakes, Ricardo Baeza-Yates Eds. (1992). Information Retrieval: Data Structures & Algorithms. Englewood Cliffs, NJ: Prentice Hall.

N. Fuhr (1989). Models for retrieval with probabilistic indexing. Information Processing & Retrieval, vol. 25, no 1 (1989) p. 55-72.

N. Fuhr (1992). «Probabilistic models of information retrieval». Computer Journal, vol. 35, no 3 (1992) p. 244-255.

G. W. Furnas, S. Deerwester, S. T. Dumais, T. K. Landauer, R. A. Harshman, L. A. Streeter, K. E. Lochbaum (1988). Proceedings of the eleventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 465-480.

F. C. Gey (1994). «Inferring probability of relevance using the method of logistic regression». En: Proceedings of the seventeenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 222-231.

D. Haines, W. B. Croft (1993). «Relevance feedback and inference networks». En: Proceedings of the sixteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 2-11.

D. Harman (1992a). «Relevance feedback and other query modification techniques». En: W. B. Frakes, Ricardo Baeza-Yates Eds. Information Retrieval: Data Structures & Algorithms. Englewood Cliffs, NJ: Prentice Hall.

D. Harman (1992b). «Overview of the Second Text REtrieval Conference (TREC-2)». [en línia]. En: The Second Text REtrieval Conference (TREC 2). National Institute of Standards and Technology, august 2000. <http://trec.nist.gov/pubs/trec2/t2_proceedings.html>. [Consulta: 8 d’abril de 2003].

D. Harman, G. Candela (1990). «Retrieving records from a gigabyte of text on a minicomputer using statistical ranking». Journal of the American Society for Information Science, vol 41, no 8 (December 1990), p. 581-589.

Djoerd Hiemstra, Arjen de Vries (2000). «Relating the new language models of information retrieval to the traditional retrieval models». [en línia]. En: CTIT Technical Report TR-CTIT-00-09. [Enschede]: University of Twente, may 2000. <http://www.ub.utwente.nl/webdocs/ ctit/1/00000022.pdf> [Consulta: 26 d’abril de 2003].

K. L. Kwok (1995). «A Network Approach to Probabilistic Information Retrieval». ACM Transactions on Information Systems, vol. 13, no 3 (July 1995), p. 325-354.

D. H. Kraft and D. Buel (1983). «Fuzzy sets and generalised boolean retrieval systems». International Journal of Man-Machine Studies, vol. 19, no. 1 (January 1983), p. 45-56.

T. G. Kolda (1997). Limited-Memory Matrix Methods with Applications [en línia] [Maryland]: University of Maryland, College Park [1997]. PhD thesis, The Applied Mathematics Program. <http://citeseer.nj.nec.com/115586.html> . [Consulta: 1 de maig de 2003].

T. G. Kolda, D. P. O’Leary (1998). «A semidiscrete matrix decomposition for latent semantic indexing information retrieval». ACM Transactions on Information Systems, vol. 16, no 4 (octubre 1998), p. 322-346. Disponible en línia <http://citeseer.nj.nec.com/kolda97 semidiscrete.html> [Consulta: 1 de maig de 2003].

Ray R. Larson, Jerome McDonough, Paul O’ Leary, Lucy Kuntz (1996). «Cheshire II: Designing a Next-Generation Online Catalog». Journal of the American Society for Information Science, vol. 47, no. 37 (1996) p. 555-567.

J. H. Lee (1994). «Properties of extended Boolean models in information retrieval». En: Proceedings of the seventeenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 182-190.

J. H. Lee (1997). «Combining Multiple Evidence from Different Relevant Feedback Networks». En: Rodney W. Topor, Katsumi Tanaka (Eds.): Database Systems for Advanced Applications '97, Proceedings of the Fifth International Conference on Database Systems for Advanced Applications (DASFAA), Melbourne, Australia. Melbourne: World Scientific, p. 421-430. Disponible en línia <http://citeseer.nj.nec.com/9989.html> [Consulta: 1 de maig de 2003].

J. H. Lee, W. Y. Kim, Y. H. Lee, (1993). «Ranking documents in thesaurus-based Boolean retrieval systems». Information Processing and Management, vol. 30, n 1 (1993), p. 79-91.

J. H. Lee, W. Y. Kim, M. H. Kim, Y. J. Lee (1993). «On the Evaluation of Boolean Operators in the Extended Boolean Retrieval Framework». En: Proceedings of the sixteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 291-297.

J. J. Lee, P. B. Kantor (1991). «A study of probabilistic information retrieval systems in the case of inconsistent expert judgments». Journal of the American Society for Information Science, vol. 42, no 3 (April 1991), p. 166-172.

R. M. Losee (1997). «Comparing Boolean and Probabilistic Information Retrieval Systems across Queries and Disciplines». Journal of the American Society for Information Science, vol 48, no 2 (February 1997), p. 143-156.

R. M. Loose, A. Bookstein (1988). «Integrating Boolean queries in conjunctive normal form with probabilistic retrieval models». Information Processing and Management, vol. 24, no. 3 (1988), p. 315-321.

H. Luhn (1953). «A new method of recording and searching information». American Documentation, vol 4, no 1 (1953), p. 14-16.

H. Luhn (1957). «A statistical approach to mechanized encoding and searching of literary information». IBM Journal of Research and Development. Vol 1, no 4, p. 309-317.

U. Manber, S. Wu (1993). «GLIMPSE: A Tool to Search Through Entire File Systems». [en línia]. Tuckson: The University of Arizona, October 1993. «http://glimpse.cs. arizona.edu/pubs/glimpse.pdf». [Consulta 26 d’abril de 2003].

C. D. Manning, H Schütze (1999). Foundations of statistical natural language processing. Massachusetts: MIT Press.

M. E. Maron, J. L. Kuhns (1960). «On relevance, probabilistic indexing and information retrieval», Journal of the Associations of Computing Machinery, vol. 7, no.. 3 (July 1960), p. 216-244.

S. Miyamoto, T. Miyake (1986). «Fuzzy information retieval based on a fuzzy pseudothesaurus». EEE Transactions on Systems and Man Cybernetics, vol. 16, no 2 (1986), p. 278-282.

S. Miyamoto, T. Miyake, K. Nakayama (1983). «Generation of a pseudothesaurus for information retrieval based on cooccurrences and fuzzy set operations. IEEE Transactions on Systems and Man Cybernetics, vol. 13, no 1 (1983), p. 62-70.

Y. Ogawa, T. Morita, and K. Kobayashi (1991). «A fuzzy document retrieval system using the keyword connection matrix and its learning method». Fuzzy Sets and Systems, vol. 38 (1991), pp. 17-41.

Vijay V. Raghavan, S. K. M. Wong (1986). «A critical analysis of vector space model for information retrieval». Journal of the American Society for Information Science, vol. 37, no 5 (September 1986), p. 279—287.

T. Radecki (1976) « Mathematical model of information retrieval system based on the concept of Fuzzy thesaurus». Information Processing & Management, vol. 12, no. 5 1976, p. 313-318..

T. Radecki (1979), «Fuzzy Set Theoretical Approach to Document Retrieval». Information Processing & Management, vol. 15, no.5 (1979) p. 247-259.

B. A. Ribeiro-Neto, R. Muntz (1996). «A belief network model for IR». En: Proceedings of the Nineteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 253-260.

C. J. Rijsbergen (1975). Information Retrieval. London: Butterworths.

C. J. Rijsbergen (1975/79). Information Retrieval [en línia]. [Glasgow: University of Glasgow]. < http://www.dcs.gla.ac.uk/Keith/Preface.html> [Consulta: 14 abril 2003].

C. J. Rijsbergen (1979). Information Retrieval. 2a ed. London: Butterworths.

S.E. Robertson, K. Sparck Jones (1960). «Relevance weighting of search terms». Journal of the American Society for Information Science, vol. 27, no.3 (May 1976), p. 129-146.

W. M. Sachs, W. (1976). «An Approach to Associative Retrieval Through the Theory of Fuzzy Sets». Journal of the American Society for Information Science, vol. 27, no.2 (March 1976), p. 85-87.

G. Salton, C. Buckley (1988). «Term-weighting approaches in automatic text retrieval». Information Processing and Management, vol. 24, no.5 (1988) p. 513-523.

G. Salton, E. Fox, H. Wu (1983). «Extended Boolean information retrieval». Communications of the ACM, vol. 26, n11 (November 1983), p. 1022-1036.

G. Salton, M. E Lesk (1968). «Computer evaluation of indexing and text processing». Journal of the Associations of Computing Machinery, vol. 15, no.1 (January 1968), p. 8-36.

G. Salton, M. J. McGill (1983). Introduction to Modern Information Retieval. New York: McGraw-Hill.

G. Salton, E. A. Fox, H.Wu (1983). «Extended boolean information retrieval». Communications of the ACM, vol. 26, no.11 (November 1993), p. 1022-1036.

G. Salton, C.S. Yang (1973). «On the specification of term values in automatic indexing». Journal of Documentation, vol. 29, no.4 (April 1973), p- 351-372.

E. Sanchis, L. Moreno, I. Gil Eds. (2002). I Jornadas de Tratamiento i Recuperación de la Información (JOTRI). Valencia: Editorial de la UPV.

T. Saracevic (1996). «Modeling interaction in information retrieval (IR): a review and proposal». En: Proceedings of the American Society for Information Science, vol 33. Maryland: ASIS, p. 3-9.

K. Sparck Jones (1972). « A Statistical interpretation of term specificity and its application in retrieval». Journal of Documentation, vol. 28, no.1 (March 1972), p. 11-20.

K. Sparck Jones (1979a) «Experiments in relevance weighting of search terms». Information Processing and Management, vol. 15 (1979), p. 133-144.

K. Sparck Jones (1979b) «Search Term Relevance Weighting Given Little Relevance Information». Journal of Documentation, vol. 35, no.1, (March 1979), p. 30-48.

K. Spark Jones, S. Walker and S.E. Robertson [1998]. A probabilistic model of information retrieval: Development and status. [En línia]. Technical Report 446. Cambridge: University of Cambridge. <http://citeseer.nj.nec.com/jones98probabilistic.html> [Consulta: 1 de maig de 2003].

V. Tahani (1976). «A Fuzzy Model of Document Retrieval Systems». Information Processing & Management, vol. 12, no.3 (1976), p. 177-187.

H. Turtle (1991). Inference Networks for Document Retrieval. Ph. D. dissertation. [Amherst]: University of Massachusetts. Disponible en línia <http://citeseer.nj.nec. com/turtle91inference. html> [Consulta: 1 de maig de 2003].

H. Turtle, W. C. Croft. (1990). «Inference networks for document retrieval». En: Proceedings of the thirteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 1-24.

H. Turtle, W. C. Croft. (1991). «Evaluation of an inference network-based retrieval model». ACM Transactions on Information Systems, vol. 9, no 3 (July 1991), p. 187-222.

J. Verhoeff, William Goffman, Jack Belzer (1961): «Inefficiency of the use of

Boolean functions for information retrieval systems». Communications of the ACM, vol. 4 , no.12 (December 1961), p. 557-558.

E. Voorhees, D. Harman (2000). <Overview of the Ninth Text REtrieval Conference (TREC-9) [en línia]. En:. The Ninth Text REtrieval Conference (TREC 9). National Institute of Standards and Technology, February 2002. <http://trec.nist.gov/pubs/trec9/t9_proceedings.html>. [Consulta: 8 d’abril de 2003].

R. Wilkinson, P. Hingston (1991). «Using the cosine measure in a neural network for document retrieval». En: Proceedings of the fourteenth annual international ACM SIGIR Conference on Research and development in Information Retrieval. New York: ACM Press, p. 202–210.

I. H. Witten, A. Moffat, C. Bell (1994). Managing Gigabytes: Compressing and Indexing Documents and Images. New York: Van Nostrand Reinhold.

I. H. Witten, A. Moffat, C. Bell (1999). Managing Gigabytes: Compressing and Indexing Documents and Images. 2a ed. San Francisco: Morgan Kaufmann Publishing.

S. K. Wong, W. Ziarko, C. N. Wong (1985). «Generalized vector spaces model in information retrieval ». En: Proceedings of the eighteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, p. 18-25.

C.T. Yu, G. Salton (1976). «Precision weighting. an effective automatic indexing method». Journal of the Associations of Computing Machinery, vol. 23, no 1 (June 1976), p. 76-88.


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item