Using Category Information for Relationship Exploration in Textual Data

Qu, Yan, Furnas, George and Walstrum, Ben Using Category Information for Relationship Exploration in Textual Data., 2006 . In 69th Annual Meeting of the American Society for Information Science and Technology (ASIST), Austin (US), 3-8 November 2006. [Conference paper]

[thumbnail of Qu_Using.pdf]
Preview
PDF
Qu_Using.pdf

Download (173kB) | Preview

English abstract

In the comprehension of textual data, it is critical for people to perceive relationships between topics. This work explores two approaches that use text categorizations to reveal underlying relationships: the Overlap approach, which visualizes overlaps between categories, and the Search approach, which shows topical search results in the context of categories. The effectiveness of these approaches is tested using various types of relationship questions. Our results show that the Overlap approach improves users’ performances in relationship exploration tasks. Conversely, the Search approach did not show the same effectiveness, primarily due to the Vocabulary Problem. Design implications are drawn from the experiment.

Item type: Conference paper
Keywords: text categorization ; overlap approach ; search approach
Subjects: I. Information treatment for information services > IB. Content analysis (A and I, class.)
Depositing user: Norm Medeiros
Date deposited: 16 Jan 2007
Last modified: 02 Oct 2014 12:06
URI: http://hdl.handle.net/10760/8843

References

Becker, R. A., & Cleveland, W. S. (1987) Brushing Scatterplots Technometrics 29(2):127--142

Bendix, F., Kosara, R., & Hauser, H. (2005) Parallel Sets: Visual Analysis of Categorical Data Proceedings of IEEE InfoVis '05 pp. 133-140

Burstein, J., Marcu, D., & Knight, K. (2003) Finding the WRITE Stuff: Automatic Identification of Discourse Structure in Student Essays IEEE Intelligent Systems 8(1): 32-39

Chan, S. W. K. (2004) Automatic discourse structure detection using shallow textual continuity International Journal of Human-Computer Studies 61(1): 138-164

Chen, H., Houston, A. L., Sewell, R. R., & Schatz, B. R., (1998) Internet Browsing and Searching: User Evaluations of Category Map and Concept Space Techniques Journal of the American Society for Information Science 49(7): 582-603

Eick, S. G. (1994) Graphically displaying text Journal of Computational and Graphical Statistics 3: 127-142

Friendly, M. (1994) Mosaic displays for multi-way contingency tables Journal of the American Statistical Association 89: 190-200

Friendly, M. (1999) Visualizing Categorical Data In Sirken, Monroe G. et al. (Eds.) Cognition and Survey Research 319-348. New York: John Wiley & Sons

Furnas, G. W., Landauer, T. K., Gomez, L. M., & Dumais, S. T. (1987) The vocabulary problem in human-system communication Communications of the ACM 30(11): 964-971

Graham, M., & Kennedy, J. (2001) Combining linking & focusing techniques for a multiple hierarchy visualisation Proc. of IV 2001 - 5th International Conference on Information Visualization pp 425-432

Grosz, B. J., Joshi A. K., & Weinstein, S. (1995) Centering: a framework for modeling the local coherence of discourse Computational Linguistics 21(2):203--255

Hartigan, J. A., & Kleiner, B. (1981) Mosaics for contingency tables In W. F. Eddy (Ed.) Computer science and statistics: Proceedings of the 13th symposium on the interface pp. 286-273. New York: Springer-Verlag

Hearst, M. A. (1998) Automated discovery of WordNet relations In Christiane Fellbaum (Ed.) WordNet: An Electronic Lexical Database MIT Press

Hindle, D. (1990) Noun classification from predicate-argument structures Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics pp 268-275

Mann, W. C., & Thompson, S. A. (1988) Rhetorical structure theory: toward a functional theory of text organization Text 8(3): 243-281

Marcu, D., & Echihabi, A. (2002) An unsupervised approach to recognizing discourse relations Proc. of the 40th Annual Meeting of the Association for Computational Linguistics (ACL) pp 368-375

Munzner, T., Guimbretiere, F., Tasiran, S., Zhang, L., & Zhou, Y. (2003) TreeJuxtaposer: Scalable Tree Comparison using Focus+Context with Guaranteed Visibility ACM Transactions on Graphics 22(3): 453-462

Polanyi, L. (1988) A formal model of the structure of discourse Journal of Pragmatics 12: 601-638

Qu, Y. (2003) Sensemaking-Supporting Information Gathering System Extended Abstract of Conference on Human Factors in Computing Systems (CHI 2003) pp906-907

Radev, D. (2000) A common theory of information fusion from multiple text sources, step one: Cross-document structure Proceedings of the 1st ACL SIGDIAL Workshop on Discourse and Dialogue, Hong Kong, October 2000

Stolte, C., Tang, D., & Hanrahan, P. (2002) Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases IEEE Transactions on Visualization and Computer Graphics 8(1): 52-65

Ward, M. O., & Martin, A. R. (1995) High Dimensional Brushing for Interactive Exploration of Multivariate Data Proceedings of Visualization '95 pp. 271-278

Wise, J. A., Thomas, J. J., Pennock, K., Lantrip, D., Pottier, M., & Schur, A. (1995) Visualizing the non-visual: Spatial analysis and interaction with information from text documents Proceedings of the Information Visualization Symposium pp. 51-58

Zhang, Z., Otterbacher, J., & Radev, D. (2003) Learning cross-document structural relationships using boosting Proceedings of the Twelfth International Conference on Information and Knowledge Management (CIKM 2003) pp 124-130


Downloads

Downloads per month over past year

Actions (login required)

View Item View Item