Annotated Bibliography on Statistical Semantics
Delavenay, E. (1960). An Introduction to Machine Translation, New York, NY: Thames and Hudson.
- This book contains one of the earliest definitions of the term statistical semantics, as "statistical study of meanings of words and their frequency and order of recurrence".
Firth, J.R. (1957). A synopsis of linguistic theory 1930-1955. In Studies in Linguistic Analysis, pp. 1-32. Oxford: Philological Society. Reprinted in F.R. Palmer (ed.), Selected Papers of J.R. Firth 1952-1959, London: Longman (1968).
- Firth wrote that "a word is characterized by the company it keeps".
Frank, E., Paynter, G.W., Witten, I.H., Gutwin, C., and Nevill-Manning, C.G. (1999). Domain-specific keyphrase extraction. In Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99), pp. 668-673. California: Morgan Kaufmann.
- A Naive Bayes supervised learning algorithm is used to extract important words and phrases from documents. The features that characterize keyphrases include early occurrence and relatively high frequency in the given document.
Furnas, G.W., Landauer, T.K., Gomez, L.M., and Dumais, S.T. (1983). Statistical semantics: Analysis of the potential performance of keyword information systems. Bell System Technical Journal, 62(6):1753-1806.
- A foundational paper on statistical semantics.
Hearst, M.A. (1992). Automatic acquisition of hyponyms from large text corpora. In Proceedings of the Fourteenth International Conference on Computational Linguistics, pages 539–545, Nantes, France.
- A widely cited paper that shows how simple lexical-syntactic patterns can be used to mine text for semantic relations, such as hyponymy.
Landauer, T.K., and Dumais, S.T. (1997). A solution to Plato's problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104(2):211–240.
- Latent Semantic Analysis (LSA) is evaluated as a measure of word similarity, using multiple-choice synonym questions from the Test of English as a Foreign Language. LSA achieves a score of 64.4%, almost the same as the 64.5% of the average non-english US college applicant.
Lund, K., Burgess, C., and Atchley, R.A. (1995). Semantic and associative priming in high-dimensional semantic space. In Proceedings of the 17th Annual Conference of the Cognitive Science Society, pages 660-665.
- A statistical semantics measure of word similarity, from the perspective of cognitive science.
Pantel, P., and Lin, D. (2002). Discovering word senses from text. In Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 613–619.
- A corpus-based algorithm that clusters similar words together. A polysemous word may belong to several different clusters, one cluster for each sense of the word.
Terra, E., and Clarke, C.L.A. (2003). Frequency estimates for statistical word similarity measures. In Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003), pages 244–251.
Turney, P.D. (2000). Learning algorithms for keyphrase extraction. Information Retrieval, 2(4), 303-336. OAI arXiv.org:cs/0212020
Turney, P.D. (2001). Answering subcognitive Turing Test questions: A reply to French. Journal of Experimental and Theoretical Artificial Intelligence, 13(4), 409-419. OAI arXiv.org:cs/0212015
Turney, P.D. (2003). Coherent keyphrase extraction via Web mining, In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-03), Acapulco, Mexico, 434-439. OAI arXiv.org:cs/0308033
Turney, P.D. (2004). Word sense disambiguation by Web mining for word co-occurrence probabilities. In Proceedings of the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (SENSEVAL-3), Barcelona, Spain, pp. 239-242. OAI arXiv.org:cs/0407065
Turney, P.D. (2006), Similarity of semantic relations. Computational Linguistics, 32(3), 379-416. OAI arXiv.org:cs/0608100
Turney, P.D., and Littman, M.L. (2003). Measuring praise and criticism: Inference of semantic orientation from association, ACM Transactions on Information Systems (TOIS), 21(4), 315-346. OAI arXiv.org:cs/0309034
Turney, P.D., and Littman, M.L. (2005). Corpus-based learning of analogies and semantic relations. Machine Learning, 60(1–3):251–278. OAI arXiv.org:cs/0508103
Turney, P.D., Littman, M.L., Bigham, J., and Shnayder, V. (2003). Combining independent modules to solve multiple-choice synonym and analogy problems. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03), Borovets, Bulgaria, pp. 482-489. OAI arXiv.org:cs/0309035
Weaver, W. (1955). Translation. In W.N. Locke and D.A. Booth (eds.), Machine Translation of Languages, Cambridge, MA: MIT Press. ISBN 0-8371-8434-7