首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于高斯混合模型的肿瘤纯度估计
引用本文:闫占正,李玉双.基于高斯混合模型的肿瘤纯度估计[J].浙江大学学报(理学版),2020,47(2):191-195.
作者姓名:闫占正  李玉双
作者单位:燕山大学 理学院,河北 秦皇岛 066004
基金项目:国家自然科学基金资助项目(61807029).
摘    要:在癌症基因组学研究中,临床所得的肿瘤组织是由癌症和正常细胞组成的混合物,肿瘤不纯会对后续的数据分析产生严重影响。基于DNA甲基化的芯片数据,构造了一种简单的肿瘤纯度估计方法GmmPurify。首先借助公共正常样本,利用高斯混合模型定义了一个重要的统计量“信息贡献值”;然后筛选出具有高信息贡献值的DNA甲基化位点,构成差异甲基化位点集合;最后利用核密度方法估计肿瘤的纯度。将GmmPurify方法应用于9类肿瘤,得到的纯度估值与两类先进方法的结果高度一致。研究结果表明,在与肿瘤样本相匹配的正常样本缺失的情况下,借助公共正常样本,GmmPurify可以给出令人满意的肿瘤纯度估计。

关 键 词:DNA甲基化  肿瘤纯度  高斯混合模型  信息贡献值  差异甲基化位点  
收稿时间:2019-03-11

Tumor purity estimation based on Gaussian mixture model
YAN Zhanzheng,LI Yushuang.Tumor purity estimation based on Gaussian mixture model[J].Journal of Zhejiang University(Sciences Edition),2020,47(2):191-195.
Authors:YAN Zhanzheng  LI Yushuang
Institution:School of Science, Yanshan University, Qinhuangdao 066004, Hebei Province, China
Abstract:In cancer genomics or epigenomics research, tumor tissues obtained from clinic are mixtures of cancer and normal cells, and impure tumor may have a severe impact on subsequent data analyses. Based on DNA methylation microarray data, we propose a simple method, GmmPurify, for estimating tumor purity in this paper. First, we apply Gaussian mixture model on the common normal samples to derive an important statistics“information contribution value”. Then, we construct a set of differential methylation sites with high information contribution values and estimate their tumor purity by using kernel density method. To verify the performance of GmmPurify, we use it to compute the purities of nine types of tumors from The Cancer Genome Atlas (TCGA), and the obtained purity estimates are highly consistent with the results of two state of the art methods. The result shows that GmmPurify could provide a satisfactory tumor purity estimation in the absence of normal samples with match the current tumor samples.
Keywords:DNA methylation  tumor purity  Gaussian mixture model  information contribution value  differential methylation site  
本文献已被 CNKI 等数据库收录!
点击此处可从《浙江大学学报(理学版)》浏览原始摘要信息
点击此处可从《浙江大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号