首页 | 本学科首页   官方微博 | 高级检索  
     检索      

生物信息中的概率统计方法
引用本文:钱敏平,沈世镒.生物信息中的概率统计方法[J].数学进展,2004,33(6):669-684.
作者姓名:钱敏平  沈世镒
作者单位:1. 北京大学理论生物中心,北京,100871;北京大学数学科学学院,北京,100871
2. 南开大学数学系,300071,天津
摘    要:随着生物研究技术的不断改进与突破,特别是人类,水稻,大鼠,小鼠等的基因组测序与后基因组计划的迅猛推进,未来几年蛋白质和核酸的测序数据将以指数方式增加,生物信息学成为当前生物学领域的研究热点,预计在未来的若干年中它还将变得越来越重要.这是因为海量数据的获,导只是为知识的获得提供了条件,如果不能从中找出规律性的事物,数据本身往往并不显示知识.例如人类基因组测序只不过提供了用4种核酸(A,T,C,G)的序列书写成的人类机体的设计书.拿到了书并不意味着知道书中的内容.对一个识字不多的人来说,拿到书只是到达懂得书中内容这一遥远征途的第一小步.事实上,人们对于人类基因组的认识,大概也就是小学水平.另一方面,当前,生物的实验技术不仅达到了很高的精度,而且具有极高的通量.我们正在生物信息学研究的一个有活力的新时代.不少科学家还说它是人类基因组研究的收获时代,它不仅将赋予人们获得各种基础研究的重要成果的可能性,也带来取得巨大的经济效益和社会效益成果的很多机会.这是一个难得的机遇,我国应尽快综合利用它,充分发挥各学科交叉研究的威力,走向国际科学界的最前沿.在这一过程中,数学思想、模型、算法,特别是概率统计的思想与方法起到很关键的作用.

关 键 词:基因  比对拼接  基因注释  基因芯片  蛋白质  基因网络  隐马氏模型  聚类分析  关联分析
文章编号:1000-0917(2004)06-0669-16
修稿时间:2002年10月12

A Brief Survey on the Probability and Statistics Method in Bioinformatics
QIAN Min-ping,SHEN Shi-yi.A Brief Survey on the Probability and Statistics Method in Bioinformatics[J].Advances in Mathematics,2004,33(6):669-684.
Authors:QIAN Min-ping  SHEN Shi-yi
Abstract:Along with continously improving and developing of the biotechnology, especially completing of various whole genome projects of human (HGP), rice, mouse and rat, etc. In the near future, the data for amino acids, proteins, and their interaction will accumulate exponentially. The bioinformatics becomes very hot in biology, and it can be expected being hotter and hotter. This is due to that obtaining tremendous amount of data only provides conditions to reach knowledges, and which can only be acquired after rules and laws being found from data by analyzing. For example, HGP only provides the sequences of 4 amino acids (A, T, C, G), the blue print of our body, which means nothing for persons who do not know much about genes and proteins. This just like getting a Chinese book only a very little step to know what it says for a person knowing very little characters. In fact, the understanding about genes by the mankind looks like only in the elementary school level. On the other hand, we are facing an extremely active era of biotechnology, and many scientists call it the harvest age of genome projects. It not only makes it possible to obtain important results in pure science, but also provides opportunities for applications with great economical and social benefits. We should utilize these opportunities without hesitancy to exert the cooperation of multi-disciplines for going to the frontier of international science. In this marching, the mathematical modeling, ideas and algorithms, especially those of the probability theory and statistics will play key roles.
Keywords:ne  alignment and assembbly  gene annotation  microarray  protein  gene network  hidden Markov model  clustering analysis  linkage analysis
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号