Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques

Hidetsugu Nanba, Toshiyuki Takezawa, Kiyoko Uchiyama, Akiko Aizawa


Abstract
Retrieving research papers and patents is important for any researcher assessing the scope of a field with high industrial relevance. However, the terms used in patents are often more abstract or creative than those used in research papers, because they are intended to widen the scope of claims. Therefore, a method is required for translating scholarly terms into patent terms. In this paper, we propose six methods for translating scholarly terms into patent terms using two synonym extraction methods: a statistical machine translation (SMT)-based method and a distributional similarity (DS)-based method. We conducted experiments to confirm the effectiveness of these methods using the dataset of the Patent Mining Task from the NTCIR-7 Workshop. The aim of the task was to classify Japanese-language research papers (pairs of titles and abstracts) using the International Patent Classification (IPC) system at the subclass (third level), main group (fourth level), and subgroup (fifth and most detailed level) levels. The results showed that an SMT-based method (SMT_ABST+IDF) performed best at the subgroup level, whereas a DS-based method (DS+IDF) performed best at the subclass level.
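As a rough illustration of the general idea behind distributional similarity (terms that occur in similar contexts are likely to be synonyms), the following minimal Python sketch ranks candidate patent terms for a scholarly term by the cosine similarity of their context-word counts. It is not the authors' implementation: the toy corpus, the window size, and the names context_vectors, cosine, and rank_candidates are all assumptions made for illustration, and the IDF component named in the abstract is deliberately left out because the abstract does not say how it is applied.

```python
from collections import Counter
from math import sqrt

# Toy corpus invented for illustration; it is not taken from the paper's data.
TOY_CORPUS = [
    "the disk stores digital data".split(),
    "the medium stores digital data".split(),
    "the motor rotates the spindle".split(),
]


def context_vectors(sentences, window=2):
    """Map each token to a Counter of the tokens seen within `window` positions of it."""
    vectors = {}
    for tokens in sentences:
        for i, term in enumerate(tokens):
            start = max(0, i - window)
            end = min(len(tokens), i + window + 1)
            ctx = vectors.setdefault(term, Counter())
            for j in range(start, end):
                if j != i:
                    ctx[tokens[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(count * b[key] for key, count in a.items() if key in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(term, candidates, vectors):
    """Return (candidate, score) pairs sorted by similarity to `term`, best first."""
    query = vectors.get(term, Counter())
    scored = [(c, cosine(query, vectors.get(c, Counter()))) for c in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    vectors = context_vectors(TOY_CORPUS)
    for candidate, score in rank_candidates("disk", ["motor", "medium"], vectors):
        print(f"{candidate}\t{score:.3f}")
```

In this toy corpus, "disk" and "medium" share identical contexts, so "medium" ranks above "motor". A real system would build the vectors from large paper and patent collections and would normally weight or filter context words rather than use raw counts.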
Anthology ID:
L12-1622
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
3447–3451
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/1043_Paper.pdf
Cite (ACL):
Hidetsugu Nanba, Toshiyuki Takezawa, Kiyoko Uchiyama, and Akiko Aizawa. 2012. Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3447–3451, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques (Nanba et al., LREC 2012)