Learning Turkish Hypernymy Using Word Embeddings

Savaş Yıldırım; Tuğba Yıldız

doi:10.2991/ijcis.11.1.28

Volume 11, Issue 1, 2018, Pages 371 - 383

Authors

Savaş Yıldırım¹ (savas.yildirim@bilgi.edu.tr), Tuğba Yıldız¹ (tdalyan@bilgi.edu.tr)

¹Department of Computer Engineering, İstanbul Bilgi University, Eski Silahtarağa Elektrik Santralı, Kazım Karabekir Cad. No: 2/13, 34060, Eyüp, İstanbul, Turkey, Tel: +90-212-3117506

Received 25 January 2017, Accepted 5 December 2017, Available Online 1 January 2018.

DOI: 10.2991/ijcis.11.1.28
Keywords: Word Embeddings; Semantic Relation Projection; Semantic Relation Classification
Abstract: Recently, Neural Network Language Models have been effectively applied to many types of Natural Language Processing (NLP) tasks. One popular type of task is the discovery of semantic and syntactic regularities that support researchers in building a lexicon. Word embedding representations are notably good at discovering such linguistic regularities. We argue that two supervised learning approaches based on word embeddings can be successfully applied to the hypernym problem: utilizing embedding offsets between word pairs, and learning a semantic projection to link the words. The offset-based model classifies offsets as hypernym or not. The semantic projection approach trains a semantic transformation matrix that ideally maps a hyponym to its hypernym. A semantic projection model can learn a projection matrix provided that there is a sufficient number of training word pairs. However, we argue that such models tend to learn an is-a-particular-hypernym relation rather than to generalize the is-a relation. The embeddings are trained by applying both the Continuous Bag-of-Words and the Skip-Gram training models to a huge corpus of Turkish text. The main contribution of the study is the development of a novel and efficient architecture that is well-suited to applying word embedding approaches to the Turkish language domain. We report that both the projection and the offset classification models give promising and novel results for the Turkish language.
Copyright: © 2018, the Authors. Published by Atlantis Press.
Open Access: This is an open access article under the CC BY-NC license (http://creativecommons.org/licences/by-nc/4.0/).
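
The two approaches named in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the 2-D vectors are made-up stand-ins for real word embeddings, the function names `train_projection` and `offset` are hypothetical, and plain gradient descent on squared error is only one possible way to fit a projection matrix; the abstract does not state how the authors trained theirs.

```python
# Toy sketch (not the authors' implementation): fit a projection matrix W
# so that W applied to a hyponym vector lands near its hypernym vector,
# and compute the embedding offset that an offset classifier would use.

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def train_projection(pairs, dim, lr=0.1, epochs=2000):
    """Fit W by gradient descent on the mean of ||W x - y||^2 over pairs."""
    W = [[0.0] * dim for _ in range(dim)]
    for _ in range(epochs):
        grad = [[0.0] * dim for _ in range(dim)]
        for x, y in pairs:
            pred = matvec(W, x)
            for i in range(dim):
                err = pred[i] - y[i]
                for j in range(dim):
                    grad[i][j] += 2.0 * err * x[j] / len(pairs)
        for i in range(dim):
            for j in range(dim):
                W[i][j] -= lr * grad[i][j]
    return W

def offset(hyponym, hypernym):
    """Embedding offset (hypernym - hyponym), the feature a classifier would label."""
    return [b - a for a, b in zip(hyponym, hypernym)]

# Hypothetical (hyponym, hypernym) training pairs in a 2-D toy space.
pairs = [
    ([1.0, 0.0], [0.0, 1.0]),
    ([0.0, 1.0], [1.0, 0.0]),
    ([1.0, 1.0], [1.0, 1.0]),
]

W = train_projection(pairs, dim=2)
projected = matvec(W, [1.0, 0.0])       # should be close to [0.0, 1.0]
first_offset = offset(*pairs[0])        # offset for the first toy pair
```

In a realistic setting the pairs would come from a labeled hypernym resource and the vectors from CBOW or Skip-Gram embeddings; the offsets would then be passed, together with negative examples, to a binary classifier.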

Journal: International Journal of Computational Intelligence Systems
Volume-Issue: 11 - 1
Pages: 371 - 383
Publication Date: 2018/01/01
ISSN (Online): 1875-6883
ISSN (Print): 1875-6891

Cite this article

TY  - JOUR
AU  - Savaş Yıldırım
AU  - Tuğba Yıldız
PY  - 2018
DA  - 2018/01/01
TI  - Learning Turkish Hypernymy Using Word Embeddings
JO  - International Journal of Computational Intelligence Systems
SP  - 371
EP  - 383
VL  - 11
IS  - 1
SN  - 1875-6883
UR  - https://doi.org/10.2991/ijcis.11.1.28
DO  - 10.2991/ijcis.11.1.28
ID  - Yıldırım2018
ER  -