Towards a Model of Information Scatter: Implications for Search and Design

Bhavnani, Suresh K. and Peck, Frederick A. Towards a Model of Information Scatter: Implications for Search and Design., 2006 . In 69th Annual Meeting of the American Society for Information Science and Technology (ASIST), Austin (US), 3-8 November 2006. [Conference paper]


Download (190kB) | Preview

English abstract

Recent studies suggest that users often retrieve incomplete healthcare information because of the complex and skewed distribution of facts across relevant webpages. To understand the causes for such skewed distributions, this paper presents the results of two analyses: (1) A distribution analysis discusses how facts related to healthcare topics are scattered across high-quality healthcare pages. (2) A cluster analysis of the same data suggests that the skewed distribution can be explained by the existence of three page profiles that vary in information density, each of which play an important role in providing comprehensive information on a topic. The above analyses provide clues towards a model of information scatter which describes how the design decisions by individual webpage authors could collectively lead to the scatter of information as observed in the data. The analyses also suggest implications for the design of websites, search algorithms, and search interfaces to help users find comprehensive information about a topic.

Item type: Conference paper
Keywords: Healthcare ; Web searching ; Webometrics ; distributions ; information scatter
Subjects: I. Information treatment for information services > IK. Design, development, implementation and maintenance
H. Information sources, supports, channels. > HQ. Web pages.
L. Information technology and library technology > LC. Internet, including WWW.
Depositing user: Norm Medeiros
Date deposited: 06 Dec 2006
Last modified: 02 Oct 2014 12:05


Barabasi, A.-L., & Albert, R. (1999). Emergence of scaling in random networks. Science, 286, 509-512.

Bhavnani, S.K. (2001). Important cognitive components of domain-specific search knowledge. Proc. TREC 2001, 571-578.

Bhavnani, S.K. (2005). The Retrieval of Highly Scattered Facts and Architectural Images: Strategies for Search and Design. Automation in Construction, 14, 724-735.

Bhavnani, S.K. (2005). Why is it Difficult to Find Comprehensive Information? Journal of the American Society of Information Science and Technology, 56, 9, 989-1003.

Bhavnani, S.K., Bichakjian, C.K., Schwartz, J.L., Strecher, V.J., Dunn, R.L., Johnson, T.M., & Lu, X. (2002). Getting patients to the right healthcare sources: From real-world questions to Strategy Hubs. Proc. AMIA 2002, 51-55.

Bhavnani, S.K., Bichakjian, C.K., Johnson, T.M., Little, R.J., Peck, F.A., Schwartz, J.L., and Strecher, V.J. (2006). Strategy Hubs: Domain Portals to Help Find Comprehensive Information. Journal of the American Society for Information Science and Technology, 57, 1, 4-24.

Bradford, S.C. (1948). Documentation. London: Crosby Lockwood.

Brinck, T., Gergle, D., & Wood, S. (2002). Designing Websites that Work: Usability for the Web. San Francisco: Morgan Kaufmann.

Dumais, S.T., Furnas, G.W., Landauer, T.K. & Deerwester, S. (1988). Using latent semantic analysis to improve information retrieval. Proc. CHI 1988, 281-285.

Egan, D.E., Remde, J.R., Landauer, T.K., Lochbaum, C.C., & Gomez, L.M. (1989). Behavioral evaluation and analysis of a hypertext browser. Proc. CHI 1989, 205-210.

Eysenbach, G., & Köhler, C. (2002). How do consumers search for and appraise health information on the World Wide Web? Qualitative study using focus groups, usability tests, and in-depth interviews, British Medical Journal, 324, 573-577.

Eysenbach, G., Powell, J., Kuss, O., & Sa, E-R. (2002). Empirical studies assessing the quality of health information for consumers on the World Wide Web: A systematic review. Journal of the American Medical Association, 287, 20, 2691-2700.

Figueiredo, M.A.T., & Jain, A.K. (2002). Unsupervised learning of finite mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24, 3, 381-396.

Fox, S., & Fallows, F. (2003). Health searches and email have become more commonplace, but there is room for improvement in searches and overall Internet access. Pew Internet and American live project: Online life report. Avail: (Accessed July, 2006).

Hood, W., & Wilson, C. (2001). The scatter of documents over databases in different subject domains: How many databases are needed? Journal of the American Society for Information Science, 52, 14, 1242-1254.

Peck, F.A., Bhavnani, S.K., Blackmon, M.H., & Radev, D.R. (2004). Exploring the use of natural language systems for fact identification: Towards the automatic construction of healthcare portals. Proc. ASIST 2004.

Pirolli, P., & Card, S.K. (1999). Information Foraging. Psychological Review, 106, 643-675.

Pratt, W., Hearst, M., & Fagan, L.A. (1999). Knowledge-Based Approach to Organizing Retrieved Documents. Proc. AAAI 1999.

Shneiderman, B., Byrd, D., & Croft, W.B. (1997). Clarifying search: A user interface framework for text searches. D-LIB Magazine 3, 1.

Sturdee, D.W. The importance of patient education in improving compliance. Climacteric, 10, 2, 9-13.


Downloads per month over past year

Actions (login required)

View Item View Item