Proceedings of the 2012 National Conference on Information Technology and Computer Science

A Private Cloud Document Management System with Document Clustering Algorithm

Authors
Jiajia Miao, Zhongjun Fan, Guoyou Chen, Handong Mao, Le Wang
Corresponding Author
Jiajia Miao
Available Online November 2012.
DOI
10.2991/citcs.2012.244How to use a DOI?
Keywords
private cloud; document management; document clustering; frequent term sets
Abstract

Recently, more and more enterprises use virtualization technology and cloud computing technology to improve the level of information management. Private cloud document management system from the lab to practical application. We launched a private cloud file management system is characterized by the automatic cluster of files, so as to achieve the automated management of the text block. Document clustering has been extensively studied, because it is an effective solution, the organization of a large number of files. In order to overcome the main challenges that the current document clustering a huge number of documents, high dimensional process and comprehensible cluster, we propose a hybrid algorithm based on the top-k frequent itemsets and K-Means. The experimental results show the efficiency and effectiveness of the algorithm is superior to the other two representative clustering algorithm on two public data sets. Our algorithm can be further improved in the future parallel implementation, based on semantic representation and similarity measurement.

Copyright
© 2012, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2012 National Conference on Information Technology and Computer Science
Series
Advances in Intelligent Systems Research
Publication Date
November 2012
ISBN
978-94-91216-39-8
ISSN
1951-6851
DOI
10.2991/citcs.2012.244How to use a DOI?
Copyright
© 2012, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Jiajia Miao
AU  - Zhongjun Fan
AU  - Guoyou Chen
AU  - Handong Mao
AU  - Le Wang
PY  - 2012/11
DA  - 2012/11
TI  - A Private Cloud Document Management System with Document Clustering Algorithm
BT  - Proceedings of the 2012 National Conference on Information Technology and Computer Science
PB  - Atlantis Press
SP  - 959
EP  - 962
SN  - 1951-6851
UR  - https://doi.org/10.2991/citcs.2012.244
DO  - 10.2991/citcs.2012.244
ID  - Miao2012/11
ER  -