A Private Cloud Document Management System with Document Clustering Algorithm
- DOI
- 10.2991/citcs.2012.244
- Keywords
- private cloud; document management; document clustering; frequent term sets
- Abstract
More and more enterprises are using virtualization and cloud computing technology to improve their information management. Private cloud document management systems are moving from the lab to practical application. We present a private cloud document management system characterized by automatic clustering of files, which enables automated management of textual content. Document clustering has been studied extensively because it is an effective way to organize large numbers of files. To overcome the main challenges of current document clustering, namely the huge number of documents, high dimensionality, and the need for comprehensible clusters, we propose a hybrid algorithm based on top-k frequent itemsets and K-Means. Experimental results on two public data sets show that the algorithm is superior in both efficiency and effectiveness to two other representative clustering algorithms. In future work, the algorithm could be further improved through a parallel implementation, semantic-based representation, and similarity measurement.
- Copyright
- © 2012, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
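The abstract names the two ingredients of the hybrid algorithm (top-k frequent itemsets and K-Means) but does not say how they are combined. The following is a minimal sketch of one plausible reading, not the authors' method: the k most frequent term sets serve as a reduced, human-readable feature space, and a plain K-Means then clusters the documents in that space. All function names, the `min_support` and `max_size` parameters, the whitespace tokenizer, and the farthest-first centroid initialization are assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def top_k_term_sets(docs, k, min_support=2, max_size=2):
    """Return the k term sets (size <= max_size) with the highest document support."""
    support = Counter()
    for terms in docs:
        for size in range(1, max_size + 1):
            for itemset in combinations(sorted(terms), size):
                support[itemset] += 1
    frequent = [(s, c) for s, c in support.items() if c >= min_support]
    # Highest support first; ties broken lexicographically for determinism.
    frequent.sort(key=lambda sc: (-sc[1], sc[0]))
    return [s for s, _ in frequent[:k]]


def vectorize(docs, term_sets):
    """Binary vector per document: 1.0 where the document contains the whole term set."""
    return [[1.0 if set(ts) <= terms else 0.0 for ts in term_sets] for terms in docs]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, n_clusters, iterations=20):
    """Plain Lloyd's K-Means with farthest-first initialization (an assumption)."""
    centroids = [list(vectors[0])]
    while len(centroids) < n_clusters:
        far = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(list(far))
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: nearest centroid for every vector.
        labels = [
            min(range(len(centroids)), key=lambda c: sq_dist(v, centroids[c]))
            for v in vectors
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(len(centroids)):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def cluster_documents(texts, k_term_sets, n_clusters):
    """Hypothetical end-to-end helper: tokenize, mine term sets, vectorize, cluster."""
    docs = [set(text.lower().split()) for text in texts]
    term_sets = top_k_term_sets(docs, k_term_sets)
    return kmeans(vectorize(docs, term_sets), n_clusters), term_sets


texts = [
    "cloud storage virtual machine",
    "virtual machine cloud server",
    "cloud server storage",
    "cluster document term frequency",
    "document term cluster algorithm",
    "term frequency document",
]
labels, term_sets = cluster_documents(texts, k_term_sets=8, n_clusters=2)
print(labels)
print(term_sets)
```

Restricting the feature space to k term sets addresses the high-dimensionality concern raised in the abstract, and the term sets that dominate a cluster can double as a readable label for it. Enumerating all term sets by brute force, as done here, only suits toy inputs; a real system would use a dedicated frequent-itemset miner.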
Cite this article
TY  - CONF
AU  - Jiajia Miao
AU  - Zhongjun Fan
AU  - Guoyou Chen
AU  - Handong Mao
AU  - Le Wang
PY  - 2012/11
DA  - 2012/11
TI  - A Private Cloud Document Management System with Document Clustering Algorithm
BT  - Proceedings of the 2012 National Conference on Information Technology and Computer Science
PB  - Atlantis Press
SP  - 959
EP  - 962
SN  - 1951-6851
UR  - https://doi.org/10.2991/citcs.2012.244
DO  - 10.2991/citcs.2012.244
ID  - Miao2012/11
ER  -