A new regularized least squares support vector regression for gene selection

Pei Chun Chen, Su Yun Huang, Wei J. Chen, Chuhsing K. Hsiao*

*Corresponding author for this work

Research output: Contribution to journalJournal Article peer-review

16 Scopus citations

Abstract

Background: Selection of influential genes with microarray data often faces the difficulties of a large number of genes and a relatively small group of subjects. In addition to the curse of dimensionality, many gene selection methods weight the contribution from each individual subject equally. This equal-contribution assumption cannot account for the possible dependence among subjects who associate similarly to the disease, and may restrict the selection of influential genes. Results: A novel approach to gene selection is proposed based on kernel similarities and kernel weights. We do not assume uniformity for subject contribution. Weights are calculated via regularized least squares support vector regression (RLS-SVR) of class levels on kernel similarities and are used to weight subject contribution. The cumulative sum of weighted expression levels are next ranked to select responsible genes. These procedures also work for multiclass classification. We demonstrate this algorithm on acute leukemia, colon cancer, small, round blue cell tumors of childhood, breast cancer, and lung cancer studies, using kernel Fisher discriminant analysis and support vector machines as classifiers. Other procedures are compared as well. Conclusion: This approach is easy to implement and fast in computation for both binary and multiclass problems. The gene set provided by the RLS-SVR weight-based approach contains a less number of genes, and achieves a higher accuracy than other procedures.

Original languageEnglish
Article number44
JournalBMC Bioinformatics
Volume10
DOIs
StatePublished - 03 02 2009
Externally publishedYes

Fingerprint

Dive into the research topics of 'A new regularized least squares support vector regression for gene selection'. Together they form a unique fingerprint.

Cite this