Indexed by:
Abstract:
High-throughput techniques bring novel tools and also statistical challenges to genomic research. Identification of which type of diseases a new patient belongs to has been recognized as an important problem. For high-dimensional small sample size data, the classical discriminant methods suffer from the singularity problem and are, therefore, no longer applicable in practice. In this article, we propose a geometric diagonalization method for the regularized discriminant analysis. We then consider a bias correction to further improve the proposed method. Simulation studies show that the proposed method performs better than, or at least as well as, the existing methods in a wide range of settings. A microarray dataset and an RNA-seq dataset are also analyzed and they demonstrate the superiority of the proposed method over the existing competitors, especially when the number of samples is small or the number of genes is large. Finally, we have developed an R package called GDRDA which is available upon request.
Keyword:
Reprint Author's Address:
Email:
Source :
JOURNAL OF COMPUTATIONAL BIOLOGY
ISSN: 1066-5277
Year: 2017
Issue: 11
Volume: 24
Page: 1099-1111
1 . 7 0 0
JCR@2022
ESI Discipline: BIOLOGY & BIOCHEMISTRY;
ESI HC Threshold:215
CAS Journal Grade:3
Cited Count:
WoS CC Cited Count: 7
SCOPUS Cited Count: 8
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 5
Affiliated Colleges: