Publication Type : Journal Article
Publisher : Future Computing and Informatics Journal .
Source : Future Computing and Informatics Journal, Volume 3, Number 1, p.131 - 142 (2018)
Url : http://www.sciencedirect.com/science/article/pii/S2314728817300338
Keywords : Feature selection, Fuzzy rough quick reduct, Information Gain, Random forest.
Campus : Coimbatore
School : School of Engineering
Department : Computer Science
Year : 2018
Abstract : Microarray gene expression data plays a prominent role in feature selection that helps in diagnosis and treatment of a wide variety of diseases. Microarray gene expression data contains redundant feature genes of high dimensionality and smaller training and testing samples. This paper proposes a customized similarity measure using fuzzy rough quick reduct algorithm for attribute selection. Information Gain based entropy is used to reduce the dimensionality in the first stage and the proposed fuzzy rough quick reduct method that defines a customized similarity measure for selecting the minimum number of informative genes and removing the redundant genes is employed at the second stage. The proposed method is evaluated using leukemia, lung and ovarian cancer gene expression datasets on a random forest classifier. The proposed method produces 97.22%, 99.45% and 99.6% classifier accuracy on leukemia, lung and ovarian cancer gene expression datasets respectively. The research study is carried out using the R open source software package. The proposed method shows substantial improvement in the performance with respect to various statistical parameters like classification accuracy, precision, recall, f-measure and region of characteristic compared to available methods in literature.
Cite this Research Publication : A. Chinnaswamy and Ramakrishnan, S., “Attribute Selection using Fuzzy Roughset based Customized Similarity Measure for Lung Cancer Microarray Gene Expression Data”, Future Computing and Informatics Journal, vol. 3, pp. 131 - 142, 2018.