Boston University Libraries OpenBU
    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    •   OpenBU
    • School of Public Health
    • Department of Biostatistics
    • SPH Biostatistics Papers
    • View Item
    •   OpenBU
    • School of Public Health
    • Department of Biostatistics
    • SPH Biostatistics Papers
    • View Item

    Mapping Complex Traits Using Random Forests

    Thumbnail
    License
    Copyright 2003 Bureau et al; licensee BioMed Central Ltd This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
    Date Issued
    2003-12-31
    Publisher Version
    10.1186/1471-2156-4-S1-S64
    Author(s)
    Bureau, Alexandre
    Dupuis, Josée
    Hayward, Brooke
    Falls, Kathleen
    Van Eerdewegh, Paul
    Share to FacebookShare to TwitterShare by Email
    Export Citation
    Download to BibTex
    Download to EndNote/RefMan (RIS)
    Metadata
    Show full item record
    Permanent Link
    https://hdl.handle.net/2144/3073
    Citation (published version)
    Bureau, Alexandre, Josée Dupuis, Brooke Hayward, Kathleen Falls, Paul Van Eerdewegh. "Mapping complex traits using Random Forests" BMC Genetics 4(Suppl 1):S64. (2003)
    Abstract
    Random Forest is a prediction technique based on growing trees on bootstrap samples of data, in conjunction with a random selection of explanatory variables to define the best split at each node. In the case of a quantitative outcome, the tree predictor takes on a numerical value. We applied Random Forest to the first replicate of the Genetic Analysis Workshop 13 simulated data set, with the sibling pairs as our units of analysis and identity by descent (IBD) at selected loci as our explanatory variables. With the knowledge of the true model, we performed two sets of analyses on three phenotypes: HDL, triglycerides, and glucose. The goal was to approach the mapping of complex traits from a multivariate perspective. The first set of analyses mimics a candidate gene approach with a high proportion of true genes among the predictors while the second set represents a genome scan analysis using microsatellite markers. Random Forest was able to identify a few of the major genes influencing the phenotypes, such as baseline HDL and triglycerides, but failed to identify the major genes regulating baseline glucose levels.
    Rights
    Copyright 2003 Bureau et al; licensee BioMed Central Ltd This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
    Collections
    • SPH Biostatistics Papers [126]


    Boston University
    Contact Us | Send Feedback | Help
     

     

    Browse

    All of OpenBUCommunities & CollectionsIssue DateAuthorsTitlesSubjectsThis CollectionIssue DateAuthorsTitlesSubjects

    Deposit Materials

    LoginNon-BU Registration

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Boston University
    Contact Us | Send Feedback | Help