Proceedings of the 2012 National Conference on Information Technology and Computer Science

A Clustering Algorithm towards Microblogs based on Vector Space Model

Authors
Guoyou Chen, Jiajia Miao, Handong Mao, Le Wang, Siyu Jiang
Corresponding Author
Guoyou Chen
Available Online November 2012.
DOI
https://doi.org/10.2991/citcs.2012.243How to use a DOI?
Keywords
microblogs clustering, k-means, vector space model
Abstract
Weibos have become wildly popular in China in recent years, and state media reports that there are more than 300 million registered users. The Real Name Policy[1] requires all users on Chinese weibo websites to register with the name that corresponds with their government issued ID card. With the rapid development of the web, the research of consensus encounters new problems and challenges. Is a practical method for large-scale text clustering, instant messaging, text content analysis features, and find or track the social hot topics. Unlike the file, which is not suitable for very common clustering algorithm' A new method is proposed of the named MVSM synthesis microblogging dialogue, but also enriched the words of the vector is not included in the text of the blog, but existing content is closely related. Extended vector space this MVSM perform the dialogue, k-means clustering. Experiments on public datasets show better, MVSM than traditional k-means and kmeans algorithm into two
Open Access
This is an open access article distributed under the CC BY-NC license.

Download article (PDF)

Proceedings
2012 National Conference on Information Technology and Computer Science
Part of series
Advances in Intelligent Systems Research
Publication Date
November 2012
ISBN
978-94-91216-39-8
ISSN
1951-6851
DOI
https://doi.org/10.2991/citcs.2012.243How to use a DOI?
Open Access
This is an open access article distributed under the CC BY-NC license.

Cite this article

TY  - CONF
AU  - Guoyou Chen
AU  - Jiajia Miao
AU  - Handong Mao
AU  - Le Wang
AU  - Siyu Jiang
PY  - 2012/11
DA  - 2012/11
TI  - A Clustering Algorithm towards Microblogs based on Vector Space Model
BT  - 2012 National Conference on Information Technology and Computer Science
PB  - Atlantis Press
SP  - 955
EP  - 958
SN  - 1951-6851
UR  - https://doi.org/10.2991/citcs.2012.243
DO  - https://doi.org/10.2991/citcs.2012.243
ID  - Chen2012/11
ER  -