skip to main content

Locally linear embedding and neighborhood rough set-based gene selection for gene expression data classification

Sun, L ; Xu, J-C ; Wang, W ; Yin, Y

Genetics and molecular research, 2016-08, Vol.15 (3) [Periódico revisado por pares]

Brazil

Texto completo disponível

Citações Citado por
  • Título:
    Locally linear embedding and neighborhood rough set-based gene selection for gene expression data classification
  • Autor: Sun, L ; Xu, J-C ; Wang, W ; Yin, Y
  • Assuntos: Algorithms ; Biomarkers, Tumor - genetics ; Biomarkers, Tumor - metabolism ; Gene Expression ; Gene Expression Profiling - methods ; Gene Expression Profiling - standards ; Genes, Neoplasm ; Humans ; Linear Models ; Neoplasms - diagnosis ; Neoplasms - genetics ; Neoplasms - metabolism ; Reference Standards ; Software
  • É parte de: Genetics and molecular research, 2016-08, Vol.15 (3)
  • Notas: ObjectType-Article-1
    SourceType-Scholarly Journals-1
    ObjectType-Feature-2
    content type line 23
  • Descrição: Cancer subtype recognition and feature selection are important problems in the diagnosis and treatment of tumors. Here, we propose a novel gene selection approach applied to gene expression data classification. First, two classical feature reduction methods including locally linear embedding (LLE) and rough set (RS) are summarized. The advantages and disadvantages of these algorithms were analyzed and an optimized model for tumor gene selection was developed based on LLE and neighborhood RS (NRS). Bhattacharyya distance was introduced to delete irrelevant genes, pair-wise redundant analysis was performed to remove strongly correlated genes, and the wavelet soft threshold was determined to eliminate noise in the gene datasets. Next, prior optimized search processing was carried out. A new approach combining dimension reduction of LLE and feature reduction of NRS (LLE-NRS) was developed for selecting gene subsets, and then an open source software Weka was applied to distinguish different tumor types and verify the cross-validation classification accuracy of our proposed method. The experimental results demonstrated that the classification performance of the proposed LLE-NRS for selecting gene subset outperforms those of other related models in terms of accuracy, and our proposed approach is feasible and effective in the field of high-dimensional tumor classification.
  • Editor: Brazil
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.