首页 | 本学科首页   官方微博 | 高级检索  
     检索      

具有遗传性疾病和性状的遗传位点分析
引用本文:方兴,毕嘉琳,邢传智,张承进.具有遗传性疾病和性状的遗传位点分析[J].数学建模及其应用,2017,6(4):50-60.
作者姓名:方兴  毕嘉琳  邢传智  张承进
作者单位:山东大学(威海) 机电与信息工程学院 威海 264209,山东大学(威海) 数学与统计学院 威海 264209,山东大学(威海) 数学与统计学院 威海 264209,山东大学(威海) 机电与信息工程学院 威海 264209
摘    要:为便于进行数据分析,首先将数据中的位点信息由原来字母编码方式转换为数值编码的方式,根据位点的编码信息和患病信息,采用Logistic回归的方法,找出某种疾病最有可能的一个或几个致病位点,同时采用显著性检验进一步对建立的模型进行检验,证明了建立结果的合理性。此外,通过主成分分析,从原有的300个主成分中取出了225个主成分尽可能多地反映原来基因变量的信息,再通过主成分Logistic回归分析找出与疾病最有可能相关的一个或几个基因。最后,采用典型相关分析找出与相关性状有关联的基因位点。

关 键 词:Logistic回归分析  主成分分析  典型相关分析  遗传统计学  全基因组关联性分析(GWAS)  位点(SNPs)

Genetic loci analysis of inherited diseases and traits
Authors:FANG Xing  BI Jialin  XING Chuanzhi and ZHANG Chengjin
Institution:School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai Weihai Shandong 264209, China,School of mathematics and statistics, Shandong University, Weihai Weihai Shandong 264209, China,School of mathematics and statistics, Shandong University, Weihai Weihai Shandong 264209, China and School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai Weihai Shandong 264209, China
Abstract:For the convenience of data analysis, the SNPs information in data conversion from the original letter encoding for the numerical encoding, according to the encoding information of the SNPs and disease information , to find out one or several SNPs relating to a disease most likely by using the Logistics regression analysis, at the same time, the significance test was used to test the established model. In addition, 225 principal components can be extracted from the original 300 principal components by principal component analysis, as much as possible to reflect the information of the original gene variable, and then the principal component Logistic regression analysis is used to find one or several genes most likely to be related to the disease. Finally, the canonical correlation analysis was used to identify the SNPs associated with the correlation.
Keywords:Logistic regression analysis  principal component analysis  canonical correlation analysis  genetic statistics  genome wide association analysis (GWAS)  single nucleotide polymorphisms(SNPs)
本文献已被 CNKI 等数据库收录!
点击此处可从《数学建模及其应用》浏览原始摘要信息
点击此处可从《数学建模及其应用》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号