Localizing influential genes with modified versions of Bayesian Information Criterion

Małgorzata Bogdan; Piotr Szulc

Mathematica Applicanda (2012)

  • Volume: 40, Issue: 1
  • ISSN: 1730-2668

Abstract

top
Regions of the genome that influence quantitative traits are called quantitative trait loci (QTLs) and can be located using statistical methods. For this aim scientists use genetic markers, whose genotypes are known, and look for the associations between these genotypes and trait values. The common method which can be used in this problem is a linear regression. There are many model selection criteria for the choice of predictors in a linear regression. However, in the context of QTL mapping, where the number of available markers p n is usually  bigger than the sample size n , the classical criteria overestimate the number of regressors. To solve this problem several modifications of the Bayesian Information Criterion have been proposed and it has been recently proved that at least three of them, EBIC, mBIC and mBIC2, are consistent (also in case when p n > n ). In this article we discuss these criteria and their asymptotic properties and compare them by an extensive simulation study in the genetic context.

How to cite

top

Małgorzata Bogdan, and Piotr Szulc. "Localizing influential genes with modified versions of Bayesian Information Criterion." Mathematica Applicanda 40.1 (2012): null. <http://eudml.org/doc/292708>.

@article{MałgorzataBogdan2012,
abstract = {Regions of the genome that influence quantitative traits are called quantitative trait loci (QTLs) and can be located using statistical methods. For this aim scientists use genetic markers, whose genotypes are known, and look for the associations between these genotypes and trait values. The common method which can be used in this problem is a linear regression. There are many model selection criteria for the choice of predictors in a linear regression. However, in the context of QTL mapping, where the number of available markers $p_n$ is usually  bigger than the sample size $n$, the classical criteria overestimate the number of regressors. To solve this problem several modifications of the Bayesian Information Criterion have been proposed and it has been recently proved that at least three of them, EBIC, mBIC and mBIC2, are consistent (also in case when $p_n>n$). In this article we discuss these criteria and their asymptotic properties and compare them by an extensive simulation study in the genetic context.},
author = {Małgorzata Bogdan, Piotr Szulc},
journal = {Mathematica Applicanda},
keywords = {statistical genetics, quantitative trait loci, model selection, sparse linear regression, Bayesian Information Criterion},
language = {eng},
number = {1},
pages = {null},
title = {Localizing influential genes with modified versions of Bayesian Information Criterion},
url = {http://eudml.org/doc/292708},
volume = {40},
year = {2012},
}

TY - JOUR
AU - Małgorzata Bogdan
AU - Piotr Szulc
TI - Localizing influential genes with modified versions of Bayesian Information Criterion
JO - Mathematica Applicanda
PY - 2012
VL - 40
IS - 1
SP - null
AB - Regions of the genome that influence quantitative traits are called quantitative trait loci (QTLs) and can be located using statistical methods. For this aim scientists use genetic markers, whose genotypes are known, and look for the associations between these genotypes and trait values. The common method which can be used in this problem is a linear regression. There are many model selection criteria for the choice of predictors in a linear regression. However, in the context of QTL mapping, where the number of available markers $p_n$ is usually  bigger than the sample size $n$, the classical criteria overestimate the number of regressors. To solve this problem several modifications of the Bayesian Information Criterion have been proposed and it has been recently proved that at least three of them, EBIC, mBIC and mBIC2, are consistent (also in case when $p_n>n$). In this article we discuss these criteria and their asymptotic properties and compare them by an extensive simulation study in the genetic context.
LA - eng
KW - statistical genetics, quantitative trait loci, model selection, sparse linear regression, Bayesian Information Criterion
UR - http://eudml.org/doc/292708
ER -

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

    
                

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.