Best practices of feature selection in multi-omics data

dc.contributor.authorIpekten, Funda
dc.contributor.authorZararsiz, Gözde Ertürk
dc.contributor.authorDoğan, Halef Okan
dc.contributor.authorEldem, Vahap
dc.contributor.authorZararsiz, Gökmen
dc.date.accessioned2024-10-26T17:53:00Z
dc.date.available2024-10-26T17:53:00Z
dc.date.issued2024
dc.departmentSivas Cumhuriyet Üniversitesi
dc.description.abstractWith the recent advances in molecular biology techniques such as next-generation sequencing, massspectrometry, etc., a large omic data is produced. Using such data, the expression levels of thousands of molecular features (genes, proteins, metabolites, etc.) can be quantified and associated with diseases. The fact that multiple omics data contains different types of data and the number of analyzed variables increases the complexity of the models created with machine learning methods. In addition, due to many variables, the investigation of molecular variables associated with diseases is very costly. Therefore, selecting the informative and disease-related molecular features is applicable before model training and evaluation. This feature selection step is essential for obtaining accurate and generalizable models in minimum time with minimum cost. Some current methods used for feature selection are as follows: recursive feature elimination, information gain, minimum redundancy maximum relevance (mRMR), boruta, altmann, and lasso. © 2024, IGI Global. All rights reserved.
dc.identifier.doi10.4018/979-8-3693-3026-5.ch014
dc.identifier.endpage323
dc.identifier.isbn979-836933027-2
dc.identifier.isbn979-836933026-5
dc.identifier.scopus2-s2.0-85193522278
dc.identifier.startpage308
dc.identifier.urihttps://doi.org/10.4018/979-8-3693-3026-5.ch014
dc.identifier.urihttps://hdl.handle.net/20.500.12418/26656
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherIGI Global
dc.relation.ispartofResearch Anthology on Bioinformatics, Genomics, and Computational Biology
dc.relation.publicationcategoryKitap Bölümü - Uluslararası
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.titleBest practices of feature selection in multi-omics data
dc.typeBook Chapter

Dosyalar