Matrix eQTL: Ultra fast eQTL analysis via large matrix operations

Andrey A. Shabalin,

Department of Biostatistics, University of North Carolina at Chapel Hill

... just shortened my computing time from a year to a couple of days
Adaikalavan Ramasamy, Ph.D., King's College London external link

Key features

  • Designed for eQTL analysis of large datasets.
  • Performs testing for each transcript-SNP pair.
  • Ultra fast without loss of precision.
  • Equally fast for models with covariates.
  • Supports
    • Linear additive and ANOVA models. Includes tests for genotype-covariate interaction.
    • Covariates to account for sex, population structure, surrogate variables, etc.
    • Correlated and heteroskedastic errors.
    • Correction for multiple testing using FDR external link.
    • Separate FDR for cis- and trans- eQTLs (more info).
  • Matlab and R implementations.
Performance analyzing CF dataset with 573,337 SNPs and 22,011 transcripts over 840 samples. Tested on a quad-core PC, using additive linear models with zero and with 10 covariates.

Method No covar. 10 covar.
Matrix eQTL, Matlab 11.8 11.8 minutes
Matrix eQTL, Rev R 14.6 14.6 minutes
Matrix eQTL, R+GOTOexternal link 19.4 19.4 minutes
Plink external link 9.4 583.3 days
Merlin external link 19.6 20.0 days
R/qtl external link 1.0 4.7 days
snpMatrix external link 3.2 5.1 days
eMap external link 17.8 N/A days
FastMap external link 10.3 N/A hours

Info: Details of the testing procedure.

Note: Matrix eQTL results match those from other software.
Pdf icon Manuscript in Bioinformatics (2012)

R logo icon R code

Matlab logo icon Matlab code

help icon Manual / FAQ

7zip logo icon  Sample dataset of CF data size (7zip archive external link).

Questions, comments, concerns?
Contact me: Andrey A. Shabalin.

Support Matrix eQTL



Update history