Real Multi-Sense or Pseudo Multi-Sense: An Approach to Improve Word Representation

Haoyue Shi, Caihua Li, Junfeng Hu


Abstract
Previous researches have shown that learning multiple representations for polysemous words can improve the performance of word embeddings on many tasks. However, this leads to another problem. Several vectors of a word may actually point to the same meaning, namely pseudo multi-sense. In this paper, we introduce the concept of pseudo multi-sense, and then propose an algorithm to detect such cases. With the consideration of the detected pseudo multi-sense cases, we try to refine the existing word embeddings to eliminate the influence of pseudo multi-sense. Moreover, we apply our algorithm on previous released multi-sense word embeddings and tested it on artificial word similarity tasks and the analogy task. The result of the experiments shows that diminishing pseudo multi-sense can improve the quality of word representations. Thus, our method is actually an efficient way to reduce linguistic complexity.
Anthology ID:
W16-4109
Volume:
Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity (CL4LC)
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Dominique Brunato, Felice Dell’Orletta, Giulia Venturi, Thomas François, Philippe Blache
Venue:
CL4LC
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
79–88
Language:
URL:
https://aclanthology.org/W16-4109
DOI:
Bibkey:
Cite (ACL):
Haoyue Shi, Caihua Li, and Junfeng Hu. 2016. Real Multi-Sense or Pseudo Multi-Sense: An Approach to Improve Word Representation. In Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity (CL4LC), pages 79–88, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Real Multi-Sense or Pseudo Multi-Sense: An Approach to Improve Word Representation (Shi et al., CL4LC 2016)
Copy Citation:
PDF:
https://aclanthology.org/W16-4109.pdf