E-LIS, Eprints in Library and Information Science Homepage E-LIS, Eprints in Library and Information Science
   home   |   about   |   search   |   browse   |   register   |   registered users area   |   help   |   FAQ   |   JITA   

Characterising through Erasing: A Theoretical Framework for Representing Documents Inspired by Quantum Theory

Huertas-Rosero, Alvaro Francisco and Azzopardi, Leif and van Rijsbergen, C. J. (2008) Characterising through Erasing: A Theoretical Framework for Representing Documents Inspired by Quantum Theory. In Bruza, Peter D. and Lawless, William and van Rijsbergen, C. J. and Sofge, Donald, Eds. Proceedings Second Quantum Interaction Symposium, Oxford, UK.

Full text available as:
PDF (Portable Document Format) - Requires Adobe Acrobat Reader or other PDF viewer.

View statistics for this eprint

Abstract

The problem of representing text documents within an Information Retrieval system is formulated as an analogy to the problem of representing the quantum states of a physical system. Lexical measurements of text are proposed as a way of representing documents which are akin to physical measurements on quantum states. Consequently, the representation of the text is only known after measurements have been made, and because the process of measuring may destroy parts of the text, the document is characterised through erasure. The mathematical foundations of such a quantum representation of text are provided in this position paper as a starting point for indexing and retrieval within a "quantum like" Information Retrieval system.

Keywords:Indexing, Lexical Measurements, Quantum Theory
Subjects:I. Information treatment for information services > IC. Index languages, processes and schemes.
ID Code:12716
Deposited By:Huertas-Rosero, Alvaro Francisco
Deposited On:13 February 2008
Alternative Locations:http://www.dcs.gla.ac.uk/publications/PAPERS/8789/CharacterisingErasingFinal.pdf
All fields:Show all fields

Beltrametti, E. G., and Cassinelli, G. 1981. The logic of Quantum Mechanics. Addison Wesley. Chapter 9, 87.

Caves, C. M.; Fuchs, C. A.; and Schack, R. 2002. Unknown quantum states: The quantum de Finetti representation. J. Math. Phys. 43(9):4537–4559.

Gleason, A. M. 1957. Measures of the closed subspaces of the hilbert space. Journal of Mathematics and Mechanics 6:885–893.

Mana, P. G. L. 2003. Why can states and measurement outcomes be represented as vectors? http://arxiv.org/abs/quant-ph/0305117.

Ponte, J., and Croft, W. B. 1998. A language modeling approach to information retrieval. In Proc. of SIGIR’98, 275–281.

Salton, G., and Lesk, M. E. 1968. Computer evaluation of indexing and text processing. Journal of the ACM 15(1):8–36.

Song, D., and Bruza, P. D. 2003. Towards context-sensitive information inference. Journal of the American Society for Information Science and Technology (JASIST) 54:321–334.

Spärck-Jones, K. 2003. Document retrieval: Shallow data, deep theories; historical reflections, potential directions. In Sebastiani, F., ed., Advances in Information Retrieval, Springer.

van Rijsbergen, C. J. 1979. Information Retrieval. Butterworths. chapter 2: Automatic Text Analysis.

van Rijsbergen, C. J. 2004. The Geometry of Information Retrieval. Cambridge University Press.

Archive Staff Only: edit this record