Similarity Management for Fuzzy Data Mining
- 10.2991/iske.2007.287How to use a DOI?
Data mining is a domain difficult to cope with for various reasons. First, most of the databases are complex, large, and contain heterogeneous, imprecise, vague, uncertain, incomplete data. Furthermore, the queries may be imprecise or subjective in the case of information retrieval, the mining results must be easily understandable by a user in the case of data mining or knowledge discovery. Fuzzy logic provides an interesting tool for such tasks, mainly because of its capability to represent imperfect information, for instance by means of imprecise categories, measures of resemblance or aggregation methods. We will focus our study on the use of similarity measures which are key concepts for many steps of the process, such as clustering, construction of prototypes, utilization of expert or association rules, fuzzy querying, for instance. We will consider a general framework for measures of comparison, compatible with Tversky's contrast model, providing tools to identify similar or dissimilar descriptions of objects, for instance in a case-based reasoning or a classification approach. We present some real-world problems where these paradigms have been exploited among others to manage various types of data such as image retrieval or risk analysis.
- © 2007, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Bernadette Bouchon-Meunier PY - 2007/10 DA - 2007/10 TI - Similarity Management for Fuzzy Data Mining BT - Proceedings of the 2007 International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2007) PB - Atlantis Press SP - 1663 EP - 1663 SN - 1951-6851 UR - https://doi.org/10.2991/iske.2007.287 DO - 10.2991/iske.2007.287 ID - Bouchon-Meunier2007/10 ER -