ABSTRACT
On the Web, users typically forage for information by navigating from page to page along Web links. Their surfing patterns or actions are guided by their information needs. Researchers need tools to explore the complex interactions between user needs, user actions, and the structures and contents of the Web. In this paper, we describe two computational methods for understanding the relationship between user needs and user actions. First, for a particular pattern of surfing, we seek to infer the associated information need. Second, given an information need, and some pages as starting pints, we attempt to predict the expected surfing patterns. The algorithms use a concept called “information scent”, which is the subjective sense of value and cost of accessing a page based on perceptual cues. We present an empirical evaluation of these two algorithms, and show their effectiveness.
- 1.Accrue Insight. (1999) http://www.accrue.comGoogle Scholar
- 2.Alexa Internet. (1999) http://www.alexa.comGoogle Scholar
- 3.Anderson, J. R., Pirolli, P. L. (1984) Spread of Activation. Journal of Experimental Psychology: Learning, Memory and Cognition, 10, 791-798.Google ScholarCross Ref
- 4.Astra SiteManager. (1999) http://www.merc-int.comGoogle Scholar
- 5.Bharat, K. and Henzinger, M. R. (1998) Improved algorithms for topic distillation in a hyperlinked environment. In Proc. of the 21 st ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 104-111). Google ScholarDigital Library
- 6.Brin, S. and Page, L. (1998) The anatomy of a large-scale hypertextual web search engine. In Proc. Of the 7 th International World Wide Web Conference (WWW7) (pp. 107- 117), Brisbane, Australia. Google ScholarDigital Library
- 7.Chakrabarti, S., B. Dom, P. Raghavan, S. Rajagopalan, D. Gibson, and J. Kleinberg. (1998) Automatic resource compilation by analyzing hyperlink structure and associated text. In Proc. Of the 7 th International World Wide Web Conference (WWW7) (pp. 65-74), Brisbane, Australia. Google ScholarDigital Library
- 8.Chi, E.H., Pitkow, J., Mackinlay, J., Pirolli, P., Gossweiler, R., and Card, S. (1998). Visualizing the Evolution of Web Ecologies. Proceedings of the Human Factors in Computing Systems, CHI '98. (pp. 400-407). Los Angles, CA. Google ScholarDigital Library
- 9.Chi, E. H., Pirolli, P., Pitkow, J. (2000) The Scent of a Site: A System for Analyzing and Predicting Information Scent, Usage, and Usability of a Web Site. Proceedings of Human Factors in Computing Systems, CHI 2000. (pp. 400-407). Hague, Netherlands. Google ScholarDigital Library
- 10.Cooley, R., Mobasher, B., Srivastava, J. (1997) Web Mining: Information and Pattern Discovery on the World Wide Web (A Survey Paper), in Proc. of the 9th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'97). Nov. 1997. Google ScholarDigital Library
- 11.Furnas, G.W. (1997) Effective view navigation. Proceedings of the Human Factors in Computing Systems, CHI '97 (pp. 367- 374), Atlanta, GA. Google ScholarDigital Library
- 12.Heer, J., and Chi, E.H. Identifying Web user types using multimodal clustering. (submitted for publication)Google Scholar
- 13.Huberman, B. A., Pirolli, P., Pitkow, J., Lukose, R. (1998) Strong Regularities in World Wide Web Surfing. Science, 280, 95-97.Google ScholarCross Ref
- 14.Kleinberg, J. M. (1998) Authoritative sources in a hyperlinked environment. In Proc. Of the 9 th Annual ACM-SIAM Symposium on Discrete Algorithms, (pp. 668-677), San Francisco, CA. Google ScholarDigital Library
- 15.Olston, C., Chi, E. H. (2000) ScentTrails: Integrating Browsing and Searching on the World Wide Web. (Submitted for publication)Google Scholar
- 16.Pirolli, P. (1997) Computational models of information scentfollowing in a very large browsable text collection. Proceedings of the Conference on Human Factors in Computing Systems, CHI '97 (pp. 3-10), Atlanta, GA. Google ScholarDigital Library
- 17.Pirolli, P. and Card, S.K. (1999) Information foraging. Psychological Review 106(4) (pp. 643-675).Google Scholar
- 18.Pirolli, P., Pitkow, J., and Rao, R. (1996) Silk from a sow's ear: Extracting usable structures from the web. Proceedings of the Conference on Human Factors in Computing Systems, CHI '96 Vancouver, Canada. Google ScholarDigital Library
- 19.Pirolli, P. and Pitkow, J.E. (1999) Distributions of surfers' paths through the World Wide Web: Empirical characterization. World Wide Web, 1, 1-17. Google ScholarDigital Library
- 20.Pirolli, P. (2000) A Web site user model should at least predict something about users. Internetworking, 3:1. http://www.sandia.gov/itg/newsletter/mar00/critique_max.htmlGoogle Scholar
- 21.Pitkow, J. and Piroll, P. (1999) Mining longest repeated subsequences to predict World Wide Web surfing. Proceedings of the USENIX Conference on Internet. Google ScholarDigital Library
- 22.Pitkow, J. and Pirolli, P. (1997) Life, death, and lawfulness on the electronic frontier. Proceedings of the Conference on Human Factors in Computing Systems, CHI '97 (pp. 383-390). Google ScholarDigital Library
- 23.Schuetze, H., Manning, C. (1999) Foundations of Statistical Natural Language Processing. Cambridge, MA: MIT Press. Google ScholarDigital Library
- 24.Silva, I., B. Ribeiro-Neto, P. Calado, E. Moura, N. Ziviani. (2000) Link-based and Content-based Evidential Information in a Belief Network Model. In Proc. of the 21 st ACM SIGIR Conference on Research and Development in Information Retrieval (pp.96-103). Athens, Greece. Google ScholarDigital Library
- 25.Turtle, H., Croft, W. (1991) Evaluation of an inference networkbased retrieval model. ACM Transactions on Information Systems, 9(3):187-222 Google ScholarDigital Library
- 26.WebCriteria SiteProfile. (1999) http://www.webcriteria.comGoogle Scholar
Index Terms
- Using information scent to model user information needs and actions and the Web
Recommendations
The scent of a site: a system for analyzing and predicting information scent, usage, and usability of a Web site
CHI '00: Proceedings of the SIGCHI conference on Human Factors in Computing SystemsDesigners and researchers of users' interactions with the World Wide Web need tools that permit the rapid exploration of hypotheses about complex interactions of user goals, user behaviors, and Web site designs. We present an architecture and system for ...
Information scent as a driver of Web behavior graphs: results of a protocol analysis method for Web usability
CHI '01: Proceedings of the SIGCHI Conference on Human Factors in Computing SystemsThe purpose of this paper is to introduce a replicable WWW protocol analysis methodology illustrated by application to data collected in the laboratory. The methodology uses instrumentation to obtain detailed recordings of user actions with a browser, ...
Web User Modeling via Negotiating Information Foraging Agent
INTERACT '09: Proceedings of the 12th IFIP TC 13 International Conference on Human-Computer Interaction: Part IInformation foraging theory lays a good foundation for web user modeling. However, the existing user modeling methods mainly focus on fixed information needs. In the real world, a user's information goal often evolves, and information foraging is a ...
Comments