Machine Learning Approaches for Biomarker Discovery Using Gene Expression Data
Chapter
Published version
View/ Open
Date
2021Metadata
Show full item recordCollections
- Department of Informatics [992]
- Registrations from Cristin [10863]
Original version
In: Nakaya HI (Ed.). 2021. Bioinformatics. https://doi.org/10.36255/exonpublications.bioinformatics.2021.ch4Abstract
Biomarkers are of great importance in many fields, such as cancer research, toxicology, diagnosis and treatment of diseases, and to better understand biological response mechanisms to internal or external intervention. High-throughput gene expression profiling technologies, such as DNA microarrays and RNA sequencing, provide large gene expression data sets which enable data-driven biomarker discovery. Traditional statistical tests have been the mainstream for identifying differentially expressed genes as biomarkers. In recent years, machine learning techniques such as feature selection have gained more popularity. Given many options, picking the most appropriate method for a particular data becomes essential. Different evaluation metrics have therefore been proposed. Being evaluated on different aspects, a method’s varied performance across different datasets leads to the idea of integrating multiple methods. Many integration strategies are proposed and have shown great potential. This chapter gives an overview of the current research advances and existing issues in biomarker discovery using machine learning approaches on gene expression data.