International Journal of Computational Intelligence Systems

Volume 11, Issue 1, 2018, Pages 371 - 383

Learning Turkish Hypernymy Using Word Embeddings

Authors
Savaş Yıldırım1, savas.yildirim@bilgi.edu.tr, Tuğba Yıldız1, tdalyan@bilgi.edu.tr
1Department of Computer Engineering, İstanbul Bilgi University, Eski Silahtarağa Elektrik Santralı, Kazım Karabekir Cad. No: 2/13, 34060, Eyüp, İstanbul, Turkey, Tel : +90-212-3117506
Received 25 January 2017, Accepted 5 December 2017, Available Online 1 January 2018.
DOI
10.2991/ijcis.11.1.28How to use a DOI?
Keywords
Word Embeddings; Semantic Relation Projection; Semantic Relation Classification
Abstract

Recently, Neural Network Language Models have been effectively applied to many types of Natural Language Processing (NLP) tasks. One popular type of tasks is the discovery of semantic and syntactic regularities that support the researchers in building a lexicon. Word embedding representations are notably good at discovering such linguistic regularities. We argue that two supervised learning approaches based on word embeddings can be successfully applied to the hypernym problem, namely, utilizing embedding offsets between word pairs and learning semantic projection to link the words. The offset-based model classifies offsets as hypernym or not. The semantic projection approach trains a semantic transformation matrix that ideally maps a hyponym to its hypernym. A semantic projection model can learn a projection matrix provided that there is a sufficient number of training word pairs. However, we argue that such models tend to learn is-a-particular-hypernym relation rather than to generalize is-a relation. The embeddings are trained by applying both the Continuous Bag-of Words and the Skip-Gram training models using a huge corpus in Turkish text. The main contribution of the study is the development of a novel and efficient architecture that is well-suited to applying word embeddings approaches to the Turkish language domain. We report that both the projection and the offset classification models give promising and novel results for the Turkish Language.

Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article under the CC BY-NC license (http://creativecommons.org/licences/by-nc/4.0/).

Download article (PDF)
View full text (HTML)

Journal
International Journal of Computational Intelligence Systems
Volume-Issue
11 - 1
Pages
371 - 383
Publication Date
2018/01/01
ISSN (Online)
1875-6883
ISSN (Print)
1875-6891
DOI
10.2991/ijcis.11.1.28How to use a DOI?
Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article under the CC BY-NC license (http://creativecommons.org/licences/by-nc/4.0/).

Cite this article

TY  - JOUR
AU  - Savaş Yıldırım
AU  - Tuğba Yıldız
PY  - 2018
DA  - 2018/01/01
TI  - Learning Turkish Hypernymy Using Word Embeddings
JO  - International Journal of Computational Intelligence Systems
SP  - 371
EP  - 383
VL  - 11
IS  - 1
SN  - 1875-6883
UR  - https://doi.org/10.2991/ijcis.11.1.28
DO  - 10.2991/ijcis.11.1.28
ID  - Yıldırım2018
ER  -