Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018)

Improvement and Implementation of Feature Weighting Algorithm TF-IDF in Text Classification

Authors
Weisi Dai
Corresponding Author
Weisi Dai
Available Online May 2018.
DOI
10.2991/ncce-18.2018.94How to use a DOI?
Keywords
Classification algorithm, TF-IDF
Abstract

Based on the introduction of the traditional feature weighting algorithm TF-IDF, based on the phenomenon that the eigenvalue extraction is not effective when the text to be classified is not uniform, an improved TF-IDF algorithm is proposed in this paper, which considers the uneven text distribution Inside. The experimental results show that the results obtained by the classification algorithm using the improved algorithm are better than the original algorithm in terms of accuracy and recall and make up for the defects of the original TF-IDF algorithm

Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018)
Series
Advances in Intelligent Systems Research
Publication Date
May 2018
ISBN
10.2991/ncce-18.2018.94
ISSN
1951-6851
DOI
10.2991/ncce-18.2018.94How to use a DOI?
Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Weisi Dai
PY  - 2018/05
DA  - 2018/05
TI  - Improvement and Implementation of Feature Weighting Algorithm TF-IDF in Text Classification
BT  - Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018)
PB  - Atlantis Press
SP  - 583
EP  - 587
SN  - 1951-6851
UR  - https://doi.org/10.2991/ncce-18.2018.94
DO  - 10.2991/ncce-18.2018.94
ID  - Dai2018/05
ER  -