Research on Bayes-based Text Automatic Classification
- DOI
- 10.2991/icadme-15.2015.104
- Keywords
- text automatic classification; Bayes; classification algorithms; feature extraction
- Abstract
The Internet holds an enormous amount of varied and complex information. Retrieval is often blind, and search results contain too much redundant information. To help users obtain the information they need more effectively, this paper studies a method for the automatic classification of web page text based on the Naive Bayes classification algorithm. Considering the structure of web pages, the paper analyses in detail which structural components within the page tags are useful for classification. The Naive Bayes algorithm is then applied to classify pages using these effective features of the HTML identifiers. By reducing the difficulty of Internet information retrieval, the method makes it easier for users to locate information more precisely.
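The approach summarized in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the tag weights in `TAG_WEIGHTS`, the class and function names, the Laplace smoothing, and the two toy training pages are all assumptions made for illustration. The sketch extracts words from selected HTML tags, repeats each word according to a hypothetical weight for its tag, and classifies the result with a multinomial Naive Bayes model.

```python
import math
import re
from collections import Counter, defaultdict
from html.parser import HTMLParser

# Hypothetical tag weights; the abstract does not state which tags or values are used.
TAG_WEIGHTS = {"title": 3, "h1": 2, "a": 1, "p": 1}


class TagTextExtractor(HTMLParser):
    """Collects lowercase words, each repeated by the weight of its enclosing tag."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open weighted tags
        self.tokens = []  # weighted word features

    def handle_starttag(self, tag, attrs):
        if tag in TAG_WEIGHTS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag in self.stack:
            # Pop up to and including the matching open tag.
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        if not self.stack:
            return  # text outside the weighted tags is ignored
        weight = TAG_WEIGHTS[self.stack[-1]]
        for word in re.findall(r"[a-z]+", data.lower()):
            self.tokens.extend([word] * weight)


def extract_features(html):
    parser = TagTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.tokens


class NaiveBayes:
    """Multinomial Naive Bayes with Laplace (add-one) smoothing."""

    def fit(self, docs, labels):
        self.class_docs = Counter(labels)
        self.total_docs = len(labels)
        self.word_counts = defaultdict(Counter)
        self.vocab = set()
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, tokens):
        best_label, best_score = None, -math.inf
        vocab_size = len(self.vocab)
        for label, n_docs in self.class_docs.items():
            counts = self.word_counts[label]
            total = sum(counts.values())
            # log P(class) + sum of log P(word | class); unseen words are skipped.
            score = math.log(n_docs / self.total_docs)
            for word in tokens:
                if word in self.vocab:
                    score += math.log((counts[word] + 1) / (total + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training pages, invented for this sketch.
pages = [
    ("<html><title>Football results</title><p>The team won the match</p></html>", "sports"),
    ("<html><title>Stock market news</title><p>Shares and bonds fell today</p></html>", "finance"),
]
model = NaiveBayes().fit(
    [extract_features(html) for html, _ in pages],
    [label for _, label in pages],
)

print(model.predict(extract_features("<h1>Match report</h1><p>The team played well</p>")))
# prints: sports
```

Repeating a word by its tag weight is one simple way to let words in tags such as `title` count more than body text; other weighting schemes would fit the same structure.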
- Copyright
- © 2015, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF
AU - Xuan Zhang
PY - 2015/10
DA - 2015/10
TI - Research on Bayes-based Text Automatic Classification
BT - Proceedings of the 5th International Conference on Advanced Design and Manufacturing Engineering
PB - Atlantis Press
SP - 519
EP - 522
SN - 2352-5401
UR - https://doi.org/10.2991/icadme-15.2015.104
DO - 10.2991/icadme-15.2015.104
ID - Zhang2015/10
ER -