Last updated: 2018-06-27
Code version: f6754bb
The summary statistics are from Mixed model association for biobank-scale data sets.
The \(\beta\) are coefficients from mix effects model. Since fitting the mix model with a large sample size is intractable, the authors estimate them using \(\chi^2_{BOLT\_LMM\_inf}\) statistics. There is no information about how to compute se. The p value is based on \(\chi^2_{BOLT\_LMM}\) statistics. Detail about the BOLT LMM is in BOLT_LMM
There are 23 phenotypes. The tri-allelic SNPs are excluded from the data. We took the union of the SNPs from different phenotype, so there are missing values in \(\hat{B}\) matrix. The total data set contains 11988455 SNPs.
The phenotypes are
| Abbreviation | Phenotype | 
|---|---|
| Eosinophil_Count | Eosinophil count | 
| Height | Height | 
| BMI | BMI | 
| WHR | Waist hip ratio | 
| BMD | Bone mineral density | 
| FVC | Forced vital capacity | 
| FEV1FVC | FEV1 FVC ratio | 
| Red_Count | Red blood cell count | 
| RBC_Dist_Width | RBC distribution width | 
| White_Count | White blood cell count | 
| Platelet_Count | Platelet count | 
| BP | Blood pressure (systolic) | 
| Cardiovascular | Cardiovascular disease | 
| T2D | Type 2 diabetes | 
| Respiratory | Respiratory disease | 
| Allergy_Eczema | Allergy or eczema | 
| Hypothyroidism | Hypothyroidism | 
| Neuroticism | Neuroticism | 
| MorningPerson | Chronotype (morning person) | 
| Hair | Hair color | 
| Tanning | Tanning ability | 
| Edu_Years | Years of education | 
| Smoking | Smoking status | 
The procedure to select strong SNPs subset:
The procedure to select random SNPs subset:
The strong subset contains 60070 SNPs. The random subset contains 142075 SNPs.
sessionInfo()
R version 3.4.4 (2018-03-15)
Platform: x86_64-apple-darwin15.6.0 (64-bit)
Running under: macOS High Sierra 10.13.5
Matrix products: default
BLAS: /Library/Frameworks/R.framework/Versions/3.4/Resources/lib/libRblas.0.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/3.4/Resources/lib/libRlapack.dylib
locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     
loaded via a namespace (and not attached):
 [1] compiler_3.4.4  backports_1.1.2 magrittr_1.5    rprojroot_1.3-2
 [5] tools_3.4.4     htmltools_0.3.6 yaml_2.1.19     Rcpp_0.12.17   
 [9] stringi_1.2.2   rmarkdown_1.9   knitr_1.20      git2r_0.21.0   
[13] stringr_1.3.0   digest_0.6.15   evaluate_0.10.1
This R Markdown site was created with workflowr