Obtaining Reliable Human Ratings of Valence, Arousal, and Dominance for 20,000 English Words

Saif Mohammad


Abstract
Words play a central role in language and thought. Factor analysis studies have shown that the primary dimensions of meaning are valence, arousal, and dominance (VAD). We present the NRC VAD Lexicon, which has human ratings of valence, arousal, and dominance for more than 20,000 English words. We use Best–Worst Scaling to obtain fine-grained scores and address issues of annotation consistency that plague traditional rating scale methods of annotation. We show that the ratings obtained are vastly more reliable than those in existing lexicons. We also show that there exist statistically significant differences in the shared understanding of valence, arousal, and dominance across demographic variables such as age, gender, and personality.
Anthology ID:
P18-1017
Volume:
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2018
Address:
Melbourne, Australia
Editors:
Iryna Gurevych, Yusuke Miyao
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
174–184
Language:
URL:
https://aclanthology.org/P18-1017
DOI:
10.18653/v1/P18-1017
Bibkey:
Cite (ACL):
Saif Mohammad. 2018. Obtaining Reliable Human Ratings of Valence, Arousal, and Dominance for 20,000 English Words. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 174–184, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):
Obtaining Reliable Human Ratings of Valence, Arousal, and Dominance for 20,000 English Words (Mohammad, ACL 2018)
Copy Citation:
PDF:
https://aclanthology.org/P18-1017.pdf
Presentation:
 P18-1017.Presentation.pdf
Video:
 https://aclanthology.org/P18-1017.mp4