Study on Similarity Compute and File Filtering Based on Cloud Computing Method
- DOI
- 10.2991/ccis-13.2013.111How to use a DOI?
- Keywords
- Text Filtering; Cloud Computing; Text similarity.
- Abstract
Text similarity computing has been widely used in confidential document filtering to enhance the safety of an enterprise information system. And the accuracy rate and performance of the similarity computing has always been the crucial problem in the research of document filtering. With the approaching era of massive data, the traditional way of computing similarity can not meet the needs of enterprises any more, but new ideas can be put forward in cloud computing environment. Aiming to solve this problem, this paper presents an algorithm of computing the distributed similarity which is based on mutual information document in cloud computing environment. This algorithm can calculate the text similarity based on cloud computing environment, and the calculations can be used to achieve the document filtering function. We’ve lanuched some experiments in Hadoop cloud computing environment, and the results show that this algorithm is a high-performance and effective algorithm.
- Copyright
- © 2013, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Yuanyuan Ma AU - Bo Zhang AU - Yufei Wang PY - 2013/11 DA - 2013/11 TI - Study on Similarity Compute and File Filtering Based on Cloud Computing Method BT - Proceedings of the The 1st International Workshop on Cloud Computing and Information Security PB - Atlantis Press SP - 477 EP - 482 SN - 1951-6851 UR - https://doi.org/10.2991/ccis-13.2013.111 DO - 10.2991/ccis-13.2013.111 ID - Ma2013/11 ER -