Semi-supervised Learning for NLP Bibliography
The goal of this page is to collect all papers focusing on semi-supervised learning for natural language processing. Another good starting point for papers (divided by topic) is John Blitzer and Jerry Zhu's ACL 2008 tutorial website.
2009
Carlson, A., Betteridge, J., Hruschka Junior, E.R. & Mitchell, T.M. (2009), "Coupling Semi-Supervised Learning of Categories and Relations", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 1-9. Association for Computational Linguistics.
Veeramachaneni, S. & Kondadadi, R.K. (2009), "Surrogate Learning - From Feature Independence to Semi-Supervised Classification", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 10-18. Association for Computational Linguistics.
Goldberg, A.B. & Zhu, X. (2009), "Keepin' It Real: Semi-Supervised Learning with Realistic Tuning", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 19-27. Association for Computational Linguistics.
Zubiaga, A., Fresno, V. & Martínez, R. (2009), "Is Unlabeled Data Suitable for Multiclass SVM-based Web Page Classification?", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 28-36. Association for Computational Linguistics.
Plank, B. (2009), "A Comparison of Structural Correspondence Learning and Self-training for Discriminative Parse Selection", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 37-42. Association for Computational Linguistics.
Andrzejewski, D. & Zhu, X. (2009), "Latent Dirichlet Allocation with Topic-in-Set Knowledge", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 43-48. Association for Computational Linguistics.
Poveda, J., Surdeanu, M. & Turmo, J. (2009), "An Analysis of Bootstrapping for the Recognition of Temporal Expressions", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 49-57. Association for Computational Linguistics.
Liao, W. & Veeramachaneni, S. (2009), "A Simple Semi-supervised Algorithm For Named Entity Recognition", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 58-65. Association for Computational Linguistics.
Chen, Z. & Ji, H. (2009), "Can One Language Bootstrap the Other: A Case Study on Event Extraction", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 66-74. Association for Computational Linguistics.
Huang, J.-T. & Hasegawa-Johnson, M. (2009), "On Semi-Supervised Learning of Gaussian Mixture Models for Phonetic Classification", In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 75-83. Association for Computational Linguistics.
Dasgupta, S. & Ng, V. (2009), "Discriminative Models for Semi-Supervised Natural Language Learning", Invited Position Paper, In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 84-85. Association for Computational Linguistics.
Daumé III, H. (2009), "Semi-supervised or Semi-unsupervised?", Invited Position Paper, In Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing. Boulder, Colorado, USA. June 2009., pp. 84-85. Association for Computational Linguistics.
Fürstenau, H. & Lapata, M. (2009), "Semi-Supervised Semantic Role Labeling", In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009). Athens, Greece. March 2009., pp. 220-228. Association for Computational Linguistics.
Rao, D. & Ravichandran, D. (2009), "Semi-Supervised Polarity Lexicon Induction", In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009). Athens, Greece. March 2009., pp. 675-682. Association for Computational Linguistics.
Spoustová, D., Hajič, J., Raab, J. & Spousta, M. (2009), "Semi-Supervised Training for the Averaged Perceptron POS Tagger", In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009). Athens, Greece. March 2009., pp. 763-771. Association for Computational Linguistics.
Candito, M., Crabbé, B. & Seddah, D. (2009), "On Statistical Parsing of French with Supervised and Semi-Supervised Strategies", In Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference. Athens, Greece. March 2009., pp. 49-57. Association for Computational Linguistics.
2008
Wang, Q.I., Schuurmans, D. & Lin, D. (2008), "Semi-Supervised Convex Training for Dependency Parsing", In Proceedings of ACL-08: HLT. Columbus, Ohio. June 2008., pp. 532-540. Association for Computational Linguistics.
Koo, T., Carreras, X. & Collins, M. (2008), "Simple Semi-supervised Dependency Parsing", In Proceedings of ACL-08: HLT. Columbus, Ohio. June 2008., pp. 595-603. Association for Computational Linguistics.
Suzuki, J. & Isozaki, H. (2008), "Semi-Supervised Sequential Labeling and Segmentation Using Giga-Word Scale Unlabeled Data", In Proceedings of ACL-08: HLT. Columbus, Ohio. June 2008., pp. 665-673. Association for Computational Linguistics.
Mann, G.S. & McCallum, A. (2008), "Generalized Expectation Criteria for Semi-Supervised Learning of Conditional Random Fields", In Proceedings of ACL-08: HLT. Columbus, Ohio. June 2008., pp. 870-878. Association for Computational Linguistics.
Haffari, G. & Sarkar, A. (2008), "Homotopy-Based Semi-Supervised Hidden Markov Models for Sequence Labeling", In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008). Manchester, UK. August 2008., pp. 305-312.
Wong, K.-F., Wu, M. & Li, W. (2008), "Extractive Summarization Using Supervised and Semi-Supervised Learning", In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008). Manchester, UK. August 2008., pp. 985-992.
Xu, J., Gao, J., Toutanova, K. & Ney, H. (2008), "Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation", In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008). Manchester, UK. August 2008., pp. 1017-1024.
McClosky, D. & Charniak, E. (2008), "Self-Training for Biomedical Parsing", In Proceedings of ACL-08: HLT, Short Papers. Columbus, Ohio. June 2008., pp. 101-104. Association for Computational Linguistics.
McClosky, D., Charniak, E. & Johnson, M. (2008), "When is Self-Training Effective for Parsing?", In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008). Manchester, UK. August 2008., pp. 561-568. Coling 2008 Organizing Committee.
2007
Abney, S. (2007), "Semisupervised Learning for Computational Linguistics", Chapman & Hall/CRC.
Chang, M.-W., Ratinov, L. & Roth, D. (2007), "Guiding Semi-Supervision with Constraint-Driven Learning", In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics. Prague, Czech Republic. June 2007., pp. 280-287. Association for Computational Linguistics.
Ueffing, N., Haffari, G. & Sarkar, A. (2007), "Transductive learning for statistical machine translation", In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics. Prague, Czech Republic. June 2007., pp. 25-32. Association for Computational Linguistics.
Kate, R. & Mooney, R. (2007), "Semi-Supervised Learning for Semantic Parsing using Support Vector Machines", In Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers. Rochester, New York. April 2007., pp. 81-84. Association for Computational Linguistics.
Mann, G. & McCallum, A. (2007), "Efficient Computation of Entropy Gradient for Semi-Supervised Conditional Random Fields", In Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers. Rochester, New York. April 2007., pp. 109-112. Association for Computational Linguistics.
Nadeau, D. (2007), "Semi-Supervised Named Entity Recognition: Learning to Recognize 100 Entity Types with Little Supervision", PhD Thesis, Ottawa, University of Ottawa. December 2007.
Tratz, S. & Sanfilippo, A. (2007), "A High Accuracy Method for Semi-Supervised Information Extraction", In Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers. Rochester, New York. April 2007., pp. 169-172. Association for Computational Linguistics.
Rosenfeld, B. & Feldman, R. (2007), "Using Corpus Statistics on Entities to Improve Semi-supervised Relation Extraction from the Web", In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics. Prague, Czech Republic. June 2007., pp. 600-607. Association for Computational Linguistics.
Erkan, G., Özgür, A. & Radev, D.R. (2007), "Semi-Supervised Classification for Extracting Protein Interaction Sentences using Dependency Parsing", In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL). Prague, Czech Republic. June 2007., pp. 228-237. Association for Computational Linguistics.
Suzuki, J., Fujino, A. & Isozaki, H. (2007), "Semi-Supervised Structured Output Learning Based on a Hybrid Generative and Discriminative Approach", In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL). Prague, Czech Republic. June 2007., pp. 791-800. Association for Computational Linguistics.
2006
Duh, K. & Kirchhoff, K. (2006), "Lexicon Acquisition for Dialectal Arabic Using Transductive Learning", In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. Sydney, Australia. July 2006., pp. 399-407. Association for Computational Linguistics.
Chen, J., Ji, D., Tan, C.L. & Niu, Z. (2006), "Relation Extraction Using Label Propagation Based Semi-Supervised Learning", In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Sydney, Australia. July 2006., pp. 129-136. Association for Computational Linguistics.
Jiao, F., Wang, S., Lee, C.-H., Greiner, R. & Schuurmans, D. (2006), "Semi-Supervised Conditional Random Fields for Improved Sequence Segmentation and Labeling", In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Sydney, Australia. July 2006., pp. 209-216. Association for Computational Linguistics.
Frunza, O. & Inkpen, D. (2006), "Semi-Supervised Learning of Partial Cognates Using Bilingual Bootstrapping", In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Sydney, Australia. July 2006., pp. 441-448. Association for Computational Linguistics.
Fraser, A. & Marcu, D. (2006), "Semi-Supervised Training for Statistical Word Alignment", In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Sydney, Australia. July 2006., pp. 769-776. Association for Computational Linguistics.
Levow, G.-A. (2006), "Unsupervised and Semi-supervised Learning of Tone and Pitch Accent", In Proceedings of the Human Language Technology Conference of the NAACL, Main Conference. New York City, USA. June 2006., pp. 224-231. Association for Computational Linguistics.
2005
Mohit, B. & Hwa, R. (2005), "Syntax-based Semi-Supervised Named Entity Tagging", In Proceedings of the ACL Interactive Poster and Demonstration Sessions. Ann Arbor, Michigan. June 2005., pp. 57-60. Association for Computational Linguistics.
Niu, Z.-Y., Ji, D.-H. & Tan, C.L. (2005), "Word Sense Disambiguation Using Label Propagation Based Semi-Supervised Learning", In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05). Ann Arbor, Michigan. June 2005., pp. 395-402. Association for Computational Linguistics.
2004
Claveau, V. & Sébillot, P. (2004), "From efficiency to portability: acquisition of semantic relations by semi-supervised machine learning", In Proceedings of Coling 2004. Geneva, Switzerland. August 23-27, 2004., pp. 261-267. COLING.
Schulz, S., Markó, K., Sbrissia, E., Nohama, P. & Hahn, U. (2004), "Cognate Mapping - A Heuristic Strategy for the Semi-Supervised Acquisition of a Spanish Lexicon from a Portuguese Seed Lexicon", In Proceedings of Coling 2004. Geneva, Switzerland. August 23-27, 2004., pp. 813-819. COLING.
Su, W., Carpuat, M. & Wu, D. (2004), "Semi-supervised training of a Kernel PCA-Based Model for Word Sense Disambiguation", In Proceedings of Coling 2004. Geneva, Switzerland. August 23-27, 2004., pp. 1298-1304. COLING.