Proceedings of the International Seminar on Language, Education, and Culture (ISoLEC 2021)

How to Lemmatize German Words with NLP-Spacy Lemmatizer?

Authors
M. Kharis1, *, Kisyani2, Suhartono3, Udjang Pairin4, Darni5
1, 2, 3, 4 5Universitas Negeri Surabaya, Surabaya, Indonesia
*Corresponding author. Email: mkharis.19010@mhs.unesa.ac.id
Corresponding Author
M. Kharis
Available Online 14 December 2021.
DOI
10.2991/assehr.k.211212.036How to use a DOI?
Keywords
SpaCy; German lemmatization; lemmatize; Lemmatizer
Abstract

Simple algorithms for the lemmatization process have been developed to recognize changes in a word as a result of grammatical processes and changes. Lemmatizer tools can analyze the types of word changes in the German language. Thus, this paper aims at investigating how the lemmatization of German words is aided by the Lemmatizer software. NLP Lemmatizer spacy, in cooperation with Python and Visual Studio Code, is utilized to find out the primary form of the word changes in German language. Based on the lemmatization analysis results, Lemmatizer SpaCy can analyze the shape of token, lemma, and PoS-tag of words in German. However, there are some errors identified during the process of finding out the word changes in German language.

Copyright
© 2021 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article under the CC BY-NC license.

Download article (PDF)

Volume Title
Proceedings of the International Seminar on Language, Education, and Culture (ISoLEC 2021)
Series
Advances in Social Science, Education and Humanities Research
Publication Date
14 December 2021
ISBN
978-94-6239-482-7
ISSN
2352-5398
DOI
10.2991/assehr.k.211212.036How to use a DOI?
Copyright
© 2021 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article under the CC BY-NC license.

Cite this article

TY  - CONF
AU  - M. Kharis
AU  - Kisyani
AU  - Suhartono
AU  - Udjang Pairin
AU  - Darni
PY  - 2021
DA  - 2021/12/14
TI  - How to Lemmatize German Words with NLP-Spacy Lemmatizer?
BT  - Proceedings of the International Seminar on Language, Education, and Culture (ISoLEC 2021)
PB  - Atlantis Press
SP  - 189
EP  - 193
SN  - 2352-5398
UR  - https://doi.org/10.2991/assehr.k.211212.036
DO  - 10.2991/assehr.k.211212.036
ID  - Kharis2021
ER  -