ABSTRACT
One formidable problem in language technology is the word sense disambiguation (WSD) problem: disambiguating the true sense of a word as it occurs in a sentence (e.g., recognizing whether the word "bank" refers to a river bank or to a financial institution). This paper explores a strategy for harnessing the linguistic abilities of human beings to develop datasets that can be used to train machine learning algorithms for WSD. To create such datasets, we introduce a new interactive system: a fun game designed to produce valuable output by engaging human players in what they perceive to be a cooperative task of guessing the same word as another player. Our system makes a valuable contribution by tackling the knowledge acquisition bottleneck in the WSD problem domain. Rather than using conventional and costly techniques of paying lexicographers to generate training data for machine learning algorithms, we delegate the work to people who are looking to be entertained.
- A. Kilgarriff. 1998. Senseval: An exercise in evaluating word sense disambiguation programs.Google Scholar
- Miller, G. A. 1995. WORDNET: A Lexical Database for English. Communications of ACM Google ScholarDigital Library
- Ng, T. H. 1997. Getting serious about word sense disambiguation. In Proceedings of the ACL SIGLEX Workshop on Tagging Text with Lexical Semantics: Why, What, and How? (Washington D.C.). 1--7.Google Scholar
- Roberto Navigli. 2009. Word sense disambiguation: a survey. ACM Computing Surveys, 41 Google ScholarDigital Library
- Stork, D. G. The Open Mind Initiative. IEEE Intelligent Systems & Their Applications, 14--3, 1999, pages 19--20.Google Scholar
- Von Ahn, L. and Dabbish, L. 2004. Labeling images with a computer game. In Proc. ACM CHI. Google ScholarDigital Library
- Kraut, R. E. & Resnick, P. Evidence-based social design: Mining the social sciences to build online communities. Cambridge, MA: MIT Press. In preparationGoogle Scholar
- Zellweger, P. T., Bouvin, N. O., Jehøj, H., and Mackinlay, J. D. Fluid Annotations in an Open World. Proc. Hypertext 2001, ACM Press (2001), 9--18. Google ScholarDigital Library
Index Terms
- Word sense disambiguation via human computation
Recommendations
An unsupervised method for word sense disambiguation
AbstractWord sense disambiguation (WSD) finds the actual meaning of a word according to its context. This paper presents a novel WSD method to find the correct sense of a word present in a sentence. The proposed method uses both the WordNet ...
Unsupervised Word-Sense Disambiguation Using Bilingual Comparable Corpora
An unsupervised method for word-sense disambiguation using bilingual comparable corpora was developed. First, it extracts word associations, i.e., statistically significant pairs of associated words, from the corpus of each language. Then, it aligns ...
A Sense Annotated Corpus for All-Words Urdu Word Sense Disambiguation
Word Sense Disambiguation (WSD) aims to automatically predict the correct sense of a word used in a given context. All human languages exhibit word sense ambiguity, and resolving this ambiguity can be difficult. Standard benchmark resources are required ...
Comments