Semantics and Homothetic Clustering of Hafez Poetry

Arya Rahgozar, Diana Inkpen


Abstract
We have created two sets of labels for the poems of Hafez (1315-1390) using unsupervised learning. Our labels are the only semantic clustering alternative to the existing hand-labeled, gold-standard classification of Hafez’s poems, and are intended for use in literary research. We cross-referenced, measured, and analyzed the agreement of our cluster labels with Houman’s chronological classes. Our features are based on topic modeling and word embeddings. We also introduced similarity-of-similarities features, an approach we call homothetic clustering, which proved effective on Hafez’s small corpus of ghazals. Although all our experiments produced clusters that differ from Houman’s classes, we believe they are valid in their own right: they provide further insights and are useful as a contrasting alternative to Houman’s classes. Our homothetic clusterer and its feature design and engineering framework can be used for further semantic analysis of Hafez’s poetry and for similar literary research.
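The abstract does not spell out how the similarity-of-similarities features are built, so the following is only a minimal sketch of one possible reading, not the authors' implementation. It assumes each poem already has a feature vector (for example, topic proportions), represents each poem by its row of cosine similarities to every poem in the corpus, and then clusters those similarity profiles. The toy vectors, the helper names (`cosine`, `similarity_matrix`, `kmeans`), and the choice of a tiny k-means are all hypothetical and chosen for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def similarity_matrix(vectors):
    """Row i holds the similarities of item i to every item."""
    return [[cosine(u, v) for v in vectors] for u in vectors]


def kmeans(points, k, iterations=20):
    """Tiny deterministic k-means: the first k points seed the centroids."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assign each point to its nearest centroid (Euclidean distance).
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Move each centroid to the mean of its assigned points.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical toy data: four poems as topic-proportion vectors.
poem_vectors = [
    [0.9, 0.1, 0.0],
    [0.1, 0.8, 0.1],
    [0.8, 0.2, 0.0],
    [0.0, 0.9, 0.1],
]

# First-order similarities: each row is a poem's similarity profile.
similarity_profiles = similarity_matrix(poem_vectors)

# Clustering the profiles compares similarities with similarities,
# rather than comparing the raw feature vectors directly.
labels = kmeans(similarity_profiles, k=2)
print(labels)
```

In this toy setting, the first and third poems end up in one cluster and the second and fourth in the other. For a small corpus, a similarity profile has one dimension per poem, which can be far lower-dimensional than a raw vocabulary or embedding representation; whether that is the reason the approach worked for the authors is not stated in the abstract.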
Anthology ID:
W19-2511
Volume:
Proceedings of the 3rd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature
Month:
June
Year:
2019
Address:
Minneapolis, USA
Editors:
Beatrice Alex, Stefania Degaetano-Ortlieb, Anna Kazantseva, Nils Reiter, Stan Szpakowicz
Venue:
LaTeCH
SIG:
SIGHUM
Publisher:
Association for Computational Linguistics
Pages:
82–90
URL:
https://aclanthology.org/W19-2511
DOI:
10.18653/v1/W19-2511
Cite (ACL):
Arya Rahgozar and Diana Inkpen. 2019. Semantics and Homothetic Clustering of Hafez Poetry. In Proceedings of the 3rd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 82–90, Minneapolis, USA. Association for Computational Linguistics.
Cite (Informal):
Semantics and Homothetic Clustering of Hafez Poetry (Rahgozar & Inkpen, LaTeCH 2019)
PDF:
https://aclanthology.org/W19-2511.pdf