Feature Selection for Single Nucleotide Polymorphisms using Parallal Genetic Algorithm
- DOI
- 10.2991/emcs-17.2017.249How to use a DOI?
- Keywords
- Feature selection; mutual information; Genetic Algorithm; parallel; Single Nucleotide Polymorphism
- Abstract
The application of statistical machine learning methods to study the association between large scale single nucleotide polymorphism (SNP) and complex diseases is facing the "Curse of dimensionality". The first task is to reduce the large scale SNP to smaller sets. For this reason, a multiple genetic algorithm is proposed for feature selection of single nucleotide polymorphism. For the first time, a new method is proposed to measure the degree of association between SNP and disease by mutual information, and as the fitness value of genetic algorithm (GA), the candidate feature SNP set is obtained by the use of genetic algorithm. In the SNP experiment on simulation data and the maximum entropy (ME) method for performance comparison shows that, this method can reduce the SNP collection and disease related SNP, while retaining the disease associated with SNP, SNP provides data for further research on the suitable scale, this method can be used in medium or large scale SNP set.
- Copyright
- © 2017, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Ming Zheng AU - Mugui Zhuo PY - 2017/03 DA - 2017/03 TI - Feature Selection for Single Nucleotide Polymorphisms using Parallal Genetic Algorithm BT - Proceedings of the 2017 7th International Conference on Education, Management, Computer and Society (EMCS 2017) PB - Atlantis Press SP - 1287 EP - 1291 SN - 2352-538X UR - https://doi.org/10.2991/emcs-17.2017.249 DO - 10.2991/emcs-17.2017.249 ID - Zheng2017/03 ER -