Privacy Preserving RFE-SVM for Distributed Gene Selection
The support vector machine recursive feature elimination (SVM-RFE) is one of the most effective feature selection methods which has been successfully used in selecting informative genes for cancer classification. This paper extends this well-studied algorithm to the privacy preserving distributed data mining issue. For gene selection over multiple patient data from different sites, we propose a novel RFE-SVM method which aims to learn global informative gene subset to get the highest cancer classification accuracy, with limits on sharing of information. We experiment it using Leukemia bio-medical dataset. The experimental results show that it can provide good capability of privacy preserving and generates a set of attributes that is very similar to the set produced by its centralized counterpart.
Keywords: Privacy Preserving, Gene selection, Distributed Data Mining, RFE-SVM, Cancer Diagnostic.
Download Full-Text
ABOUT THE AUTHORS
Fode Camara
PhD candidate at Cheikh Anta Diop University
Mouhamadou Lamine Samb
PhD candidate at Cheikh Anta Diop University
Samba Ndiaye
Associate professor at Cheikh Anta Diop University, Dakar, Senegal
Yahya Slimani
Professor in Computer Science at University Department of Computer Science, Faculty of Sciences of Tunis 1060 Tunis,Tunisia
Fode Camara
PhD candidate at Cheikh Anta Diop University
Mouhamadou Lamine Samb
PhD candidate at Cheikh Anta Diop University
Samba Ndiaye
Associate professor at Cheikh Anta Diop University, Dakar, Senegal
Yahya Slimani
Professor in Computer Science at University Department of Computer Science, Faculty of Sciences of Tunis 1060 Tunis,Tunisia