Abstract
High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that there needs to be massive feature selection, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used and what is the optimal number of features that should be used? The criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.
Current Genomics
Title: Performance of Feature Selection Methods
Volume: 10 Issue: 6
Author(s): Edward R. Dougherty, Jianping Hua and Chao Sima
Affiliation:
Abstract: High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that there needs to be massive feature selection, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used and what is the optimal number of features that should be used? The criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.
Export Options
About this article
Cite this article as:
Dougherty R. Edward, Hua Jianping and Sima Chao, Performance of Feature Selection Methods, Current Genomics 2009; 10 (6) . https://dx.doi.org/10.2174/138920209789177629
DOI https://dx.doi.org/10.2174/138920209789177629 |
Print ISSN 1389-2029 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5488 |
Call for Papers in Thematic Issues
Advanced AI Techniques in Big Genomic Data Analysis
The thematic issue on "Advanced AI Techniques in Big Genomic Data Analysis" aims to explore the cutting-edge methodologies and applications of artificial intelligence (AI) in the realm of genomic research, where vast amounts of data pose both challenges and opportunities. This issue will cover a broad spectrum of AI-driven strategies, ...read more
Advanced Computational Algorithms and Artificial Intelligence in Clinical Pharmacogenomics
In the era of personalized medicine, understanding the relationship between genetics and drug response is crucial. This issue delves into innovative methodologies, leveraging deep computational analysis and artificial intelligence, to enhance the field of Clinical Pharmacogenomics. The interdisciplinary approach harnesses the power of advanced high-throughput genotyping technologies, sophisticated computational analysis, ...read more
Applications of Single-cell Sequencing Technology in Reproductive Medicine
Single cell sequencing (SCS) technology utilizes individual cells' genetic material to sequence their genome, transcriptome, and epigenetics at the molecular level. It offers insights into cell heterogeneity and enables the study of limited biological materials. Since its recognition as a valuable technique in 2011, single cell sequencing has yielded numerous ...read more
Big Data in Cancer Research
Cancer is a significant threat to human life and health, remaining a highly aggressive killer. It is a leading cause of death worldwide and represents a crucial medical issue for humanity. However, in the past decade, the effectiveness of new synthetic anticancer agents has not matched the current clinical speculation. ...read more
Related Journals
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
- Announcements
Related Articles
-
Importance of ABC Transporters in Drug Development
Current Pharmaceutical Design Functional and Molecular Ultrasound Imaging: Concepts and Contrast Agents
Current Medicinal Chemistry Telomere Maintenance as Therapeutic Target in Embryonal Tumours
Anti-Cancer Agents in Medicinal Chemistry A Connecting Switch Among Aging, Diabetes and Tumor: Avenue Leading to Cancer Therapeutics
Current Cancer Therapy Reviews Application of In Vivo Electroporation to Cancer Gene Therapy
Current Gene Therapy ABC Transporters in Multidrug Resistance and Pharmacokinetics, and Strategies for Drug Development
Current Pharmaceutical Design Chemical Composition and Biological Activities of <i>Croton delpyi, Croton decalvatus</i> and <i>Croton caudatus</i>
The Natural Products Journal ENaC in the Brain - Future Perspectives and Pharmacological Implications
Current Molecular Pharmacology Targeting Malignancies with Disulfiram (Antabuse): Multidrug Resistance, Angiogenesis, and Proteasome
Current Cancer Drug Targets Editorial [Hot Topic: SOD Enzymes and Their Mimics in Cancer: Pro- vs Anti-Oxidative Mode of Action-Part I (Guest Editor: Ines Batinic-Haberle)]
Anti-Cancer Agents in Medicinal Chemistry Development and Applications of Optical Imaging Techniques in Cancer Diagnosis: Diffuse Optical Tomography and Microendoscopy
Current Medical Imaging Geniposide Attenuates Oligomeric Aβ<sub>1-42</sub>-Induced Inflammatory Response by Targeting RAGE-Dependent Signaling in BV2 Cells
Current Alzheimer Research Nitrones: A Potential New Alternative as Therapeutic Agents
Current Organic Chemistry HSP90 Inhibitors: Multi-Targeted Antitumor Effects and Novel Combinatorial Therapeutic Approaches in Cancer Therapy
Current Medicinal Chemistry Advanced Vectors for Gene Delivery
Current Drug Therapy Radiogenetic Therapy: Strategies to Overcome Tumor Resistance
Current Pharmaceutical Design Transition Metal-mediated Uncaging Chemistry in Prodrug Design
Current Topics in Medicinal Chemistry In Vitro Regulatory Effect of Epididymal Serpin CRES on Protease Activity of Proprotein Convertase PC4/PCSK4
Current Molecular Medicine The Hedgehog Knows Many Tricks
Current Drug Targets Nanoparticle-mediated Drug Delivery Systems (DDS) in the Central Nervous System
Current Organic Chemistry