Proceedings of the 2007 International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2007)

An Semantic Rank for Web Crawler Based on Formal Concept Analysis

Authors
Yajun Du¹, Xinchun Li
¹School of Mathematical and Computers Science, Xihua University
Corresponding Author
Yajun Du
Available Online October 2007.
DOI
10.2991/iske.2007.246
Keywords
Formal Concept Analysis, Web Crawler, Concept Similarity, Ontology, Web Log
Abstract

Web crawling is an important research topic in search engines. This paper proposes a method for measuring the similarity of Formal Concept Analysis (FCA) concepts using an information content approach based on users' Web logs. While crawling, a Web crawler must choose which pages to fetch; the semantic rank of a Web page can be determined from this similarity, rather than from an ontology built with human domain expertise. The crawler can then use the semantic rank to select Web pages.
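
The abstract does not state the similarity formula, so the following is only a rough sketch of how an information-content similarity over FCA concepts might drive a crawl ordering. It assumes Lin's information-content measure as a stand-in for the paper's own measure, treats each Web log record as a set of terms, and estimates a concept's probability as the smoothed share of log records containing every term of its intent. It relies on the standard FCA fact that the join of two concepts has the intersection of their intents as its intent. All function names, URLs, and sample data are hypothetical.

```python
import heapq
import math


def information_content(intent, log_records):
    """IC of an intent: -log of the smoothed share of log records that contain all its terms."""
    matches = sum(1 for record in log_records if intent <= record)
    # Add-one smoothing (an assumption) keeps the probability above zero.
    return -math.log((matches + 1) / (len(log_records) + 1))


def concept_similarity(intent_a, intent_b, log_records):
    """Lin-style similarity; the join's intent is the intersection of both intents."""
    ic_a = information_content(intent_a, log_records)
    ic_b = information_content(intent_b, log_records)
    if ic_a + ic_b == 0:
        return 1.0  # both intents are matched by every log record
    ic_join = information_content(intent_a & intent_b, log_records)
    return 2 * ic_join / (ic_a + ic_b)


def rank_frontier(topic_intent, candidate_pages, log_records):
    """Return candidate URLs ordered from highest to lowest similarity to the topic."""
    heap = [
        (-concept_similarity(topic_intent, terms, log_records), url)
        for url, terms in candidate_pages.items()
    ]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[1] for _ in range(len(heap))]


# Hypothetical Web log (one term set per record) and candidate pages.
log_records = [
    frozenset({"web", "crawler"}),
    frozenset({"web", "search"}),
    frozenset({"web", "crawler", "rank"}),
    frozenset({"music"}),
]
candidate_pages = {
    "http://example.org/a": frozenset({"web", "crawler", "rank"}),
    "http://example.org/b": frozenset({"music"}),
    "http://example.org/c": frozenset({"web", "search"}),
}
topic_intent = frozenset({"web", "crawler"})

print(rank_frontier(topic_intent, candidate_pages, log_records))
```

In this sketch the ranking depends only on term co-occurrence in the log, which mirrors the abstract's point that no hand-built ontology is required; a real crawler would also need to extract page terms and build the concept lattice, which is omitted here.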

Copyright
© 2007, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Volume Title
Proceedings of the 2007 International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2007)
Series
Advances in Intelligent Systems Research
Publication Date
October 2007
ISSN
1951-6851

Cite this article

TY  - CONF
AU  - Yajun Du
AU  - Xinchun Li
PY  - 2007/10
DA  - 2007/10
TI  - An Semantic Rank for Web Crawler Based on Formal Concept Analysis
BT  - Proceedings of the 2007 International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2007)
PB  - Atlantis Press
SP  - 1447
EP  - 1453
SN  - 1951-6851
UR  - https://doi.org/10.2991/iske.2007.246
DO  - 10.2991/iske.2007.246
ID  - Du2007/10
ER  -