Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Combining Multiple Hypothesis Testing with Machine Learning Increases the Statistical Power of Genome-wide Association Studies

Full metadata record
DC Field Value Language
dc.contributor.authorMieth, Bettina-
dc.contributor.authorKloft, Marius-
dc.contributor.authorRodriguez, Juan Antonio-
dc.contributor.authorSonnenburg, Soren-
dc.contributor.authorVobruba, Robin-
dc.contributor.authorMorcillo-Suarez, Carlos-
dc.contributor.authorFarre, Xavier-
dc.contributor.authorMarigorta, Urko M.-
dc.contributor.authorFehr, Ernst-
dc.contributor.authorDickhaus, Thorsten-
dc.contributor.authorBlanchard, Gilles-
dc.contributor.authorSchunk, Daniel-
dc.contributor.authorNavarro, Arcadi-
dc.contributor.authorMueller, Klaus-Robert-
dc.date.accessioned2021-09-03T16:43:54Z-
dc.date.available2021-09-03T16:43:54Z-
dc.date.created2021-06-16-
dc.date.issued2016-11-28-
dc.identifier.issn2045-2322-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/86798-
dc.description.abstractThe standard approach to the analysis of genome-wide association studies (GWAS) is based on testing each position in the genome individually for statistical significance of its association with the phenotype under investigation. To improve the analysis of GWAS, we propose a combination of machine learning and statistical testing that takes correlation structures within the set of SNPs under investigation in a mathematically well-controlled manner into account. The novel two-step algorithm, COMBI, first trains a support vector machine to determine a subset of candidate SNPs and then performs hypothesis tests for these SNPs together with an adequate threshold correction. Applying COMBI to data from a WTCCC study (2007) and measuring performance as replication by independent GWAS published within the 2008-2015 period, we show that our method outperforms ordinary raw p-value thresholding as well as other state-of-the-art methods. COMBI presents higher power and precision than the examined alternatives while yielding fewer false (i.e. non-replicated) and more true (i.e. replicated) discoveries when its results are validated on later GWAS studies. More than 80% of the discoveries made by COMBI upon WTCCC data have been validated by independent studies. Implementations of the COMBI method are available as a part of the GWASpi toolbox 2.0.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherNATURE PUBLISHING GROUP-
dc.subjectLINEAR-MIXED MODELS-
dc.subjectVARIABLE SELECTION-
dc.subjectRISK PREDICTION-
dc.subjectMISSING HERITABILITY-
dc.subjectPOPULATION-STRUCTURE-
dc.subjectCLASSIFICATION-
dc.subjectVARIANTS-
dc.subjectLIBRARY-
dc.subjectCOMMON-
dc.subjectLOCI-
dc.titleCombining Multiple Hypothesis Testing with Machine Learning Increases the Statistical Power of Genome-wide Association Studies-
dc.typeArticle-
dc.contributor.affiliatedAuthorMueller, Klaus-Robert-
dc.identifier.doi10.1038/srep36671-
dc.identifier.scopusid2-s2.0-84999740191-
dc.identifier.wosid000389039500001-
dc.identifier.bibliographicCitationSCIENTIFIC REPORTS, v.6-
dc.relation.isPartOfSCIENTIFIC REPORTS-
dc.citation.titleSCIENTIFIC REPORTS-
dc.citation.volume6-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaScience & Technology - Other Topics-
dc.relation.journalWebOfScienceCategoryMultidisciplinary Sciences-
dc.subject.keywordPlusLINEAR-MIXED MODELS-
dc.subject.keywordPlusVARIABLE SELECTION-
dc.subject.keywordPlusRISK PREDICTION-
dc.subject.keywordPlusMISSING HERITABILITY-
dc.subject.keywordPlusPOPULATION-STRUCTURE-
dc.subject.keywordPlusCLASSIFICATION-
dc.subject.keywordPlusVARIANTS-
dc.subject.keywordPlusLIBRARY-
dc.subject.keywordPlusCOMMON-
dc.subject.keywordPlusLOCI-
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE