期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Using Huffman coding method to visualize and analyze DNA sequences

Qi ZH Li L Qi XQ 《Journal of computational chemistry》2011,32(15):3233-3240

On the basis of the Huffman coding method, we propose a new graphical representation of DNA sequence. The representation can avoid degeneracy and loss of information in the transfer of data from a DNA sequence to its graphical representation. Then a multicomponent vector from the representation is introduced to characterize quantitatively DNA sequences. The components of the vector are derived from the graphical representation of DNA primary sequence. The examination of similarities and dissimilarities among the complete coding sequences of β-globin gene of 11 species and six ND6 proteins shows the utility of the scheme. 相似文献

2.

Primary structure similarity analysis of proteins sequences by a new graphical representation

S.C. Xu S.P. Zhang J.L. Hu 《SAR and QSAR in environmental research》2014,25(10):791-803

相似文献

3.

Physicochemical property based computational scheme for classifying DNA sequence elements of Saccharomyces cerevisiae

《Computational Biology and Chemistry》2019

GenerationE of huge “omics” data necessitates the development and application of computational methods to annotate the data in terms of biological features. In the context of DNA sequence, it is important to unravel the hidden physicochemical signatures. For this purpose, we have considered various sequence elements such as promoter, ACS, LTRs, telomere, and retrotransposon of the model organism Saccharomyces cerevisiae. Contributions due to di-nucleotides play a major role in studying the DNA conformation profile. The physicochemical parameters used are hydrogen bonding energy, stacking energy and solvation energy per base pair. Our computational study shows that all sequence elements in this study have distinctive physicochemical signatures and the same can be exploited for prediction experiments. The order that we see in a DNA sequence is dictated by biological regions and hence, there exists role of dependency in the sequence makeup, keeping this in mind we are proposing two computational schemes (a) using a windowing block size procedure and (b) using di-nucleotide transitions. We obtained better discriminating profile when we analyzed the sequence data in windowing manner. In the second novel approach, we introduced the di-nucleotide transition probability matrix (DTPM) to study the hidden layer of information embedded in the sequences. DTPM has been used as weights for scanning and predictions. This proposed computational scheme incorporates the memory property which is more realistic to study the physicochemical properties embedded in DNA sequences. Our analysis shows that the DTPM scheme performs better than the existing method in this applied region. Characterization of these elements will be a key to genome editing applications and advanced machine learning approaches may also require such distinctive profiles as useful input features. 相似文献

4.

Chun Li Jun Wang 《Journal of mathematical chemistry》2008,43(1):26-31

Based on the permutation of a binary alphabet, four generalized LZ complexities of a (0,1)-sequence are introduced. Since the logical representation of a DNA primary sequence includes four logical sequences, a DNA primary sequence can be characterized by a 16-D vector whose entries are the complexities corresponding to the logical sequences. The utility of the new quantitative characterization of DNA sequences is illustrated by an examination of the similarity among the full β-globin genes of 11 different species. 相似文献

5.

A 2D graphical representation of DNA sequence based on dual nucleotides and its application

Zanbo Liu Bo Liao Wen Zhu Guohua Huang 《International journal of quantum chemistry》2009,109(5):948-958

From the perspective of the neighboring dual nucleotides, we introduce a novel 2D graphical representation of DNA sequences based on the magic circle, which correspond to 16 dual nucleotides. So, we can reduce a DNA sequence into a plot set in two‐dimensional space and get a two‐component vector relatively to the introduced covariance matrix. The utility of our approach can be illustrated by the examination of similarities/dissimilarities among the complete coding sequences of β‐globin gene belonging to 11 species. © 2008 Wiley Periodicals, Inc. Int J Quantum Chem, 2009 相似文献

6.

MOLECULAR CHARACTERIZATION OF RICE Wx GENE 总被引：4，自引：0，他引：4

王宗阳武志亮邢彦彦郑霏琴郭小丽张伟国洪孟民《中国科学B辑(英文版)》1992,(5)

The complete nucleotide (nt) sequence of the rice waxy(Wx) gene, which is responsible for the synthesis of amylose in endosperm and pollen, has been determined by a combination of restriction mapping and nt sequence analysis of two overlapping genomic DNA clones. The entire gene is about 5.5 kb in length. The alignment of the nt sequence of the Wx gene from rice with those of maize (Klsgen, R. B. et al.) and barley (Rohde, W. et al.) revealed the presence of thirteen introns and fourteen exons. The full-length of Wx protein in cluding transit peptide is 609 amino acid (aa) residues. The calculated molecular weight of rice Wx preprotein is about 72 kD. There is no significant difference between the similarity scores of the aa sequence deduced from the rice Wx gene compared with those of maize and barley. However, the nt sequences of the 5'-end upstream, 3'-end downstream and introns of the rice Wx gene, as well as the aa sequence of the transit peptide region of the Wx preprotein have low similarity scor 相似文献

7.

Oligonucleotide fingerprinting of arrayed genomic DNA sequences using LNA-modified hybridization probes

Liu JP Drungowski M Nyársik L Schwartz R Lehrach H Herwig R Janitz M 《Combinatorial chemistry & high throughput screening》2007,10(4):269-276

Recently, we established a robust method for the detection of hybridization events using a DNA microarray deposited on a nanoporous membrane. Here, in a follow-up study, we demonstrate the performance of this approach on a larger set of LNA-modified oligoprobes and genomic DNA sequences. Twenty-six different LNA-modified 7-mer oligoprobes were hybridized to a set of 66 randomly selected human genomic DNA clones spotted on a nanoporous membrane slide. Subsequently, assay sensitivity analysis was performed using receiver operating characteristic (ROC) curves. Comparison of LNA-modified heptamers and DNA heptamers revealed that the LNA modification clearly improved sensitivity and specificity of hybridization experiment. Clustering analysis was applied in order to test practical performance of hybridization experiments with LNA-modified oligoprobes in recognizing similarity of genomic DNA sequences. Comparing the results with the theoretical sequence clusters, we conclude that the application of LNA-modified oligoprobes allows for reliable clustering of DNA sequences which reflects the underlying sequence homology. Our results show that LNA-modified oligoprobes can be used effectively to unravel sequence similarity of DNA sequences and thus, to characterize the content of unknown DNA libraries. 相似文献

8.

Use of an automated capillary DNA sequencer to investigate the interaction of cisplatin with telomeric DNA sequences

Paul M Murray V 《Biomedical chromatography : BMC》2012,26(3):350-354

The determination of the sequence selectivity of DNA-damaging agents is very important in elucidating the mechanism of action of anti-tumour drugs. The development of automated capillary DNA sequencers with fluorescent labelling has enabled a more precise method for DNA sequence specificity analysis. In this work we utilized the ABI 3730 capillary sequencer with laser-induced fluorescence to examine the sequence selectivity of cisplatin with purified DNA sequences. The use of this automated machine enabled a higher degree of precision of both position and intensity of cisplatin-DNA adducts than previously possible with manual and automated slab gel procedures. A problem with artefact bands was overcome by ethanol precipitation. It was found that cisplatin strongly formed adducts with telomeric DNA sequences. 相似文献

9.

《Chemical physics letters》2006,417(1-3):173-178

We considered constructing three 8-component vectors for a DNA primary sequence using triplets of nucleic acid bases. For two DNA sequences, using the corresponding vectors, we constructed a set of 3 × 3 matrices called the related matrix. The normalized leading eigenvalues from the constructed matrices were selected as invariants to characterize the degree of similarity between the two DNA sequences. Construction of similarity/dissimilarity tables based on this invariant for sequences of DNA of the first exon of the β-globin gene from 11 species illustrates the utility of the newly formulated matrices for DNA. 相似文献

10.

Novel numerical and graphical representation of DNA sequences and proteins

M. Randić M. Novič D. Vikić-Topić D. Plavšić 《SAR and QSAR in environmental research》2013,24(6):583-595

We have introduced novel numerical and graphical representations of DNA, which offer a simple and unique characterization of DNA sequences. The numerical representation of a DNA sequence is given as a sequence of real numbers derived from a unique graphical representation of the standard genetic code. There is no loss of information on the primary structure of a DNA sequence associated with this numerical representation. The novel representations are illustrated with the coding sequences of the first exon of β-globin gene of half a dozen species in addition to human. The method can be extended to proteins as is exemplified by humanin, a 24-aa peptide that has recently been identified as a specific inhibitor of neuronal cell death induced by familial Alzheimer's disease mutant genes. 相似文献

11.

Normalized Lempel-Ziv complexity and its application in bio-sequence analysis

Yi Zhang Junkang Hao Changjie Zhou Kai Chang 《Journal of mathematical chemistry》2009,46(4):1203-1212

In this article, we propose a new method to measure DNA similarity based on a normalized Lempel-Ziv complexity scheme. The new method can weaken the effect of sequence length on complexity measurement and save computation time. Firstly, a DNA sequence is transformed into three (0,1)-sequences based on a scheme, which considers “A” and “non-A” , “G” and “non-G”, “C” and “non-C” bases respectively. Then, the normalized Lempel-Ziv complexity of the three (0,1)-sequences constitute a 3D vector. Finally, by the 3D vector, one may characterize DNA sequences and compute similarity matrix for them. The examination of similarities of two sets of DNA sequences illustrates the utility of the method in local and global similarity analysis. 相似文献

12.

Novel method for analyzing proteome

Bo Liao Jiawei Luo Renfa Li Wen Zhu 《International journal of quantum chemistry》2007,107(6):1295-1300

We propose a 6D representation of protein sequences consisting of 20 amino acids. Based on this 6D representation, we propose a proteome distance measure for constructing phylogenic tree. And we make use of the corresponding similarity matrix to construct phylogenic tree. The examination of phylogenic tree belong to 30 mitochondrial sequence illustrates the utility of our approach. © 2006 Wiley Periodicals, Inc. Int J Quantum Chem, 2007 相似文献

13.

A graphical method to construct a phylogenetic tree

Weiping Wang Bo Liao Tianming Wang Wen Zhu 《International journal of quantum chemistry》2006,106(9):1998-2005

A 3D graphical representation of DNA sequences, which has no circuit or degeneracy, is derived for mathematical denotation of DNA sequence. Based on this graphical representation, we propose a new sequence distance measure. We make use of the corresponding similarity matrix to construct a phylogenic tree by virtue of the fuzzy theory. The examination of phylogenic tree belong to eight species illustrates the utility of our approach. © 2006 Wiley Periodicals, Inc. Int J Quantum Chem, 2006 相似文献

14.

Heat‐Flow‐Driven Oligonucleotide Gelation Separates Single‐Base Differences

下载免费PDF全文

Matthias Morasch Prof. Dieter Braun Dr. Christof B. Mast 《Angewandte Chemie (International ed. in English)》2016,55(23):6676-6679

DNA phase transitions are often induced by the addition of condensation agents or by dry concentration. Herein, we show that the non‐equilibrium setting of a moderate heat flow across a water‐filled chamber separates and gelates DNA strands with single‐base resolution. A dilute mix of DNA with two slightly different gel‐forming sequences separates into sequence‐pure hydrogels under constant physiological solvent conditions. A single base change in a 36 mer DNA inhibits gelation. Only sequences with the ability to form longer strands are concentrated, further elongated, and finally gelated by length‐dependent thermal trapping. No condensation agents, such as multivalent ions, were added. Equilibrium aggregates from dry concentration did not show any sequence separation. RNA is expected to behave identically owing to its equal thermophoretic properties. The highly sequence‐specific phase transition points towards new possibilities for non‐equilibrium origins of life. 相似文献

15.

On representation of DNA by line distance matrix

Milan Randić Jure Zupan Tomaž Pisanski 《Journal of mathematical chemistry》2008,43(2):674-692

Recently Line Distance (LD) matrix has been introduced as a novel route for characterization of DNA sequences. The approach was based on construction of four separate submatrices for the four nucleotides, the first row of each of which records the separation between the selected nucleotide and the remaining nucleotides of the same kind. In this article, we consider an alternative representation of DNA by LD matrix in which we construct a single matrix for each DNA sequence. The approach is illustrated on the DNA sequence of the first exon of human β-globin gene. 相似文献

16.

A new method for the detection of ATP using a quantum-dot-tagged aptamer 总被引：1，自引：0，他引：1

Chen Z Li G Zhang L Jiang J Li Z Peng Z Deng L 《Analytical and bioanalytical chemistry》2008,392(6):1185-1188

Fluorescence resonance energy transfer (FRET) between a quantum dot as donor and an organic fluorophore as acceptor has been widely used for detection of nucleic acids and proteins. In this paper, we developed a new method, characterized by 605-nm quantum dot (605QD) fluorescence intensity increase and corresponding Cy5 fluorescence intensity decrease, to detect adenosine triphosphate (ATP). The new method involved the use of three different oligonucleotides: 3′-biotin-modified DNA that binds to streptavidin-conjugated 605QD; 3′-Cy5-labelled DNA; and a capture DNA consisting of an ATP aptamer and a sequence which could hybridize with both 3′-biotin-modified DNA and 3′-Cy5-labelled DNA. In the absence of the target ATP, the capture DNA binds to 3′-biotin-modified DNA and 3′-Cy5-labelled DNA, bringing quantum dot and Cy5 into close proximity for greater FRET efficiency. When ATP is introduced, the release of the 3′-Cy5-labelled DNA from the hybridization complex took place, triggering 605QD fluorescence intensity increase and corresponding Cy5 fluorescence intensity decrease. Taken together, the virtue of FRET pair 605QD/Cy5 and the property of aptamer-specific conformation change caused by aptamer–ATP interaction, combined with the fluorescence intensity change of both 605QD and Cy5, provide prerequisites for simple and convenient ATP detection. Zhang Chen and Guang Li contributed equally to this work. 相似文献

17.

Testing homology with Contact Accepted mutatiOn (CAO): a contact-based Markov model of protein evolution

Lin K Kleinjung J Taylor WR Heringa J 《Computational Biology and Chemistry》2003,27(2):93-102

Point Accepted Mutation (PAM) is the Markov model of amino acid replacements in proteins introduced by Dayhoff and her co-workers (Dayhoff et al., 1978). The PAM matrices and other matrices based on the PAM model have been widely accepted as the standard scoring system of protein sequence similarity in protein sequence alignment tools. Here, we present Contact Accepted mutatiOn (CAO), a Markov model of protein residue contact mutations. The CAO model simulates the interchanging of structurally defined side-chain contacts, and introduces additional structural information into protein sequence alignments. Therefore, similarities between structurally conserved sequences can be detected even without apparent sequence similarity. CAO has been benchmarked on the HOMSTRAD database and a subset of the CATH database, by comparing sequence alignments with reference alignments derived from structural superposition. CAO yields scores that reflect coherently the structural quality of sequence alignments, which has implications particularly for homology modelling and threading techniques. 相似文献

18.

New graphical representation of a DNA sequence based on the ordered dinucleotides and its application to sequence analysis

Yujuan Huang Tianming Wang 《International journal of quantum chemistry》2012,112(6):1746-1757

Graphical representation of a DNA sequence is a powerful tool for basic biological research. Based on the ordered dinucleotides, we propose a novel three dimensional (3D) graphical representation without circuit or degeneracy. Simultaneously, we derive the projection curve of the 3D graph. These two curves have good visualization for longer DNA sequences. The utility of the proposed curves is illustrated by mutation analysis, similarity analysis, and evolutionary relationships of different species. The results indicate that our method is efficient and powerful. © 2011 Wiley Periodicals, Inc. Int J Quantum Chem, 2012 相似文献

19.

Resolution of Mixed Site DNA Complexes with Dimer‐Forming Minor‐Groove Binders by Using Electrospray Ionization Mass Spectrometry: Compound Structure and DNA Sequence Effects

下载免费PDF全文

Sarah Laughlin Dr. Siming Wang Dr. Arvind Kumar Dr. Abdelbasset A. Farahat Prof. David W. Boykin Prof. W. David Wilson 《Chemistry (Weinheim an der Bergstrasse, Germany)》2015,21(14):5528-5539

相似文献

20.

脱氧核糖核酸一级序列特殊碱基组合及相关性研究

张宝华王海水许禄《化学学报》2006,64(22):2221-2224

在脱氧核糖核酸(DNA)一级序列中, 4种碱基可以有16种两两组合的方式(例如, ct, ag, cg, tc等), 不同的组合方式在DNA的该种序列中出现的次数也不一样, 其中, cg出现的频数甚少. 本文将cg所处位置及频率作为一种数学的量, 计算了各物种的分子连接性指数, 并以之为参数进行了物种间相似性分析. 在此基础上, 进一步对10种物种做了分组和归类. 相似文献