The optimal parameters of spline regression for SNP-set analysis in genome-wide association study

Sookkhee, S., Kirdwichai, P. and Baksh, F. ORCID: https://orcid.org/0000-0003-3107-8815 (2021) The optimal parameters of spline regression for SNP-set analysis in genome-wide association study. Science & Technology Asia, 26 (1). pp. 39-52. ISSN 2586-9027

Text (Open Access) - Published Version
· Available under License Creative Commons Attribution Non-commercial No Derivatives.

It is advisable to refer to the publisher's version if you intend to cite from this work.

To link to this item DOI: 10.14456/scitechasia.2021.5

Abstract/Summary

This research aims to develop a capable and reliable method, based on spline regression, for identifying significant regions in genome-wide association studies (GWAS). We evaluate the optimal spline parameters by smoothing and tuning the p-values obtained from two SNP-set tests: the Sequence Kernel Association Test with normal weights (SKAT normal weight) and the Generalized Higher Criticism (GHC). False positive (FP) and true positive (TP) rates were evaluated under different genetic models for disease, with significance thresholds adjusted for multiple hypothesis testing using a permutation method. The simulated data were constructed from the control data set of a study of Crohn’s disease, with 1,500 replicates of studies comprising 3,000 cases and 3,000 controls. The simulation results show that, under the one-disease-SNP model, the optimal spline parameter for the p-values of both SKAT normal weight and GHC is 1,000 degrees of freedom. GHC is preferable in terms of FP and TP rates, but it is at a disadvantage relative to SKAT in terms of computational time. Finally, the optimal parameter for both methods was applied to real data on Crohn’s disease. Both methods identified important regions of the NOD2 gene that are strongly associated with the development of Crohn’s disease, underscoring the importance of NOD2 as a causal gene for the disease.
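
The smoothing step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which spline type or software was used, so the least-squares regression spline, the quantile-based knot placement, the function name `smooth_neg_log_p`, and the toy data below are all assumptions made for illustration. Degrees of freedom are taken here to mean the number of B-spline basis functions (interior knots + degree + 1), and a small value is used because the toy data set is far smaller than a genome-wide scan, where the paper reports 1,000 as optimal.

```python
import numpy as np
from scipy.interpolate import LSQUnivariateSpline


def smooth_neg_log_p(positions, p_values, df=20, degree=3):
    """Fit a least-squares regression spline to -log10(p) along the genome.

    Assumption: df = number of interior knots + degree + 1, i.e. the number
    of B-spline basis functions. Positions must be sorted in increasing order.
    """
    positions = np.asarray(positions, dtype=float)
    y = -np.log10(np.asarray(p_values, dtype=float))
    n_interior = df - degree - 1
    if n_interior < 1:
        raise ValueError("df must be at least degree + 2")
    # Place interior knots at equally spaced quantiles of the positions.
    probs = np.linspace(0.0, 1.0, n_interior + 2)[1:-1]
    knots = np.quantile(positions, probs)
    spline = LSQUnivariateSpline(positions, y, knots, k=degree)
    return spline(positions)


# Toy data (hypothetical): 1,000 SNP-set p-values, null everywhere except
# for one region around position 500 that carries an association signal.
rng = np.random.default_rng(0)
positions = np.arange(1000.0)
p_values = rng.uniform(1e-12, 1.0, size=positions.size)
p_values[480:520] *= 1e-4  # hypothetical associated region

smoothed = smooth_neg_log_p(positions, p_values, df=20)
peak = positions[np.argmax(smoothed)]
print(f"Smoothed -log10(p) peaks near position {peak:.0f}")
```

In a full analysis the smoothed curve would be compared against a permutation-based significance threshold, as the abstract describes; that step is omitted here.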

Item Type: Article
Refereed: Yes
Divisions: Science > School of Mathematical, Physical and Computational Sciences > Department of Mathematics and Statistics
Science > School of Mathematical, Physical and Computational Sciences > Department of Mathematics and Statistics > Applied Statistics
ID Code: 97535
Publisher: Thammasat University
