首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
药物的水溶解度与其吸收密切相关。本文利用一种新的计算方法,VolSurf,预测药物的水溶解度并测定有利于药物水溶解度的主要分子特征。被测化合物包括26个结构不同的药物,通过偏最小二乘分析法,对药物水溶解度实验值与分子特征进行相关,得到较好的模型(r2=0.90,q2=0.77)。将化合物分为训练集和预测集进行相关分析,结果表明以18个化合物所建立的训练集模型对其余8个化合物有较好的预测能力,预测的标准偏差(SDEP)为0.59。参数分析表明分子与水相互作用的3个局部能量最小值越小,且它们之间的距离越大,对其水溶解度越有利;亲水性占主导因素的分子有高的水溶解度;分子的疏水性越强,在水中的溶解性越弱;大分子的溶解度较小分子溶解度低。  相似文献   

2.
A combination of ^1H nuclear magnetic resonance (NMR) spectroscopy and principal component analysis (PCA) has shown the potential for being a useful method for classification of type, production origin or geographic origin of wines. In this preliminary study, twenty-one bottled wines were classified/separated for their location of production in Shacheng, Changli and Yantai, and the types of the blended, medium dry, dry white and dry red wines, using the NMR-PCA method. The wines were produced by three subsidiary companies of an enterprise according to the same national standard. The separation was believed to be mainly due to the fermentation process for different wines and environmental variations, such as local climate, soil, underground water, sunlight and rainfall. The major chemicals associated with the separation were identified.  相似文献   

3.
The prediction of human intestinal absorption is a major goal in the design,optimization,and selection of candidates for the develoment of oral drugs.In this study,a computerized method(VolSurf with GRID) was used as a novel tool for predicting human intestinal absorption of test compound,and for determining the critical molecular properties needed for human intestinal absorption.The tested molecules consisted of 20 diverse drug-like compounds.Partial least squares(PLS) discriminant analysis was used to correlate the experimental data with the theoretical molecular properties of human intestinal absorption.A good correlation(r^2=0.95,q^2=0.86) between the molecular modeling results and the experimental data demonstrated that human intestinal absorption could be predicted from the three-dimensional(3D) molecular structure of a compound .Favorable structureal properties identified for the potent intestinal absorption of drugs included strong imbalance between the center of mass of a molecule and the barycentre of its hydrophilic and hydrophobic regions and a definitive hydrophobic region as well as less hydrogen bonding donors and acceptors in the molecule.  相似文献   

4.
5.
针对高维小样本质谱数据在构造模型时易产生的过拟合现象、变量间的严重共线性、及结构与性质间的非线性关系,采用了核分段逆回归(KSIR)特征提取集成线性判别分析(LDA)新技术。首先以KSIR算法完成质谱数据的非线性特征提取,然后在由新特征矢量张成的低维空间构造样本类别的线性判别函数,负责各样本个体类别的判定。将KSIR-LDA方法应用于软饮料的质谱数据分类,结果表明:该方法不仅适应质谱数据与性质间的非线性关系,而且可以更少、解释能力更强的特征变量取得更高的分类精度,并能实现在低维特征空间对数据的解释及可视化。  相似文献   

6.
This work demonstrates the potential of multivariate image analysis methods in the extraction of useful, problem dependent information from SIMS images. Specific algorithms have been developed to classify SIMS micrographs manually as well as automatically. A feature selection has been achieved by means of principal component analysis with a subsequent image classification.As an application example for these improved digital image processing tools chemical phases within a soldered industrial metal sample have been identified. This is of highly practical value as it was assumed that during the soldering process inhomogeneities occur along the joint site which cause a cracking of the brazed material under mechanical strain conditions.  相似文献   

7.
High throughput data are frequently observed in contemporary chemical studies. Classification through spectral information is an important issue in chemometrics. Linear discriminant analysis (LDA) fails in the large‐p‐small‐n situation for two main reasons: (1) the sample covariance matrix is singular when p > n and (2) there is an accumulation of noise in the estimation of the class centroid in high dimensional feature space. The Independence Rule is a class of methods used to overcome these drawbacks by ignoring the correlation information between spectral variables. However, a strong correlation is an essential characteristic of spectral data. We proposed a new correlation‐assisted nearest shrunken centroid classifier (CA‐NSC) to incorporate correlation information into the classification. CA‐NSC combines two sources of information [class centroid (mean) and correlation structure (variance)] to generate the classification. We used two real data analyses and a simulation study to verify our CA‐NSC method. In addition to NSC, we also performed a comparison with the soft independent modeling of class analogy (SIMCA) approach, which uses only correlation structure information for classification. The results show that CA‐NSC consistently improves on NSC and SIMCA. The misclassification rate of CA‐NSC is reduced by almost half compared with NSC in one of the real data analyses. Generally, correlation among variables will worsen the performance of NSC, even though the discriminatory information contained in the class centroid remains unchanged. If only correlation structure information is used (as in the case of SIMCA), the result will be satisfactory only when the correlation structure alone can provide sufficient information for classification. Copyright © 2015 John Wiley & Sons, Ltd.  相似文献   

8.
In the present work, the applicability of attenuated total reflectance-Fourier transform infrared (ATR-FTIR) spectroscopy, coupled with chemometric tools in recognizing essential oils (EOs) for routine control, was evaluated. EOs belonging to Mentha, Cymbopogon, and Lavandula families and to S. rosmarinus and T. vulgaris species were analyzed, and the performance of several untargeted approaches, based on the synergistic combination of ATR-FTIR and Partial Least Squares Discriminant Analysis (PLS-DA), was tested to classify the species and chemotypes. Different spectra pre-processing methods were employed, and the robustness of the built models was tested by means of a Receiver Operating Characteristic (ROC) curve and random permutations test. The application of these approaches revealed fruitful results in terms of sensitivity and specificity, highlighting the potentiality of ATR-FTIR and chemometrics techniques to be used as a sensitive, cost-effective, and rapid tool to differentiate EO samples according to their species and chemotype.  相似文献   

9.
As for the detection of drug body packing, skin is a typical interference factor. In this paper, multivariate data analysis was proposed to analyze the impact of fat and muscle on heroin identification based on the profile of spectra of energy dispersive X‐ray diffraction. In the space of principal components, the results showed that different pure samples (heroin, muscle, fat) clustered in different areas, whereas the location of mixture samples moved between locations of pure samples. The impact of fat and muscle lies in moving the feature points between pure materials in the space of principal components. Furthermore, the model of heroin covered by fat and muscle of different thicknesses was set up, and a linear relationship was proven to be suitable. Our findings indicate that multivariate data analysis would be a promising method in the detection of drug body packing. Copyright © 2011 John Wiley & Sons, Ltd.  相似文献   

10.
11.
12.
Multiway principal component analysis (MPCA) has been extensively applied to batch process monitoring. In the case of monitoring a two‐stage batch process, the inter‐stage variation is neglected if MPCA models for each individual stage are used. On the other hand, if two stages of reference data are combined into a large dataset that MPCA is applied to, the dimensions of the unfolded matrix will increase dramatically. In addition, when an abnormal event is detected, it is difficult to identify which stage's operation induced this alarm. In this paper, partial least squares (PLS) is applied to monitor the inter‐stage relation of a two‐stage batch process. In post‐analysis of abnormalities, PLS can clarify whether root causes are from previous stage operations or due to the changes of the inter‐stage correlations. This approach was successfully applied to a semiconductor manufacturing process. Copyright © 2008 John Wiley & Sons, Ltd.  相似文献   

13.
粗糙集方法及其在化学模式分类规则挖掘中的应用   总被引:5,自引:0,他引:5  
简要介绍了粗糙集的基本概念,决策系统的约简步骤和分类规则的挖掘原理,提出了基于信息熵的数据离散化方法,使之充分结合粗糙集特性,具有良好的推广性。又以经典的橄榄油产地判别为例,采用粗糙集方法,无需先验知识,不用设定参数,即能消除冗余的属性和属性值,约简化学系统,从样本数据中挖掘出简明直接、易于理解的产生式分类规则,构建专业意义明确的化学模式分类模型,其预报性能良好,效果令人满意。  相似文献   

14.
Water-in-oil emulsions are investigated by means of multivariate analysis of near infrared (NIR) spectroscopic profiles in the range 1100 — 2250 nm. The oil phase is a paraffin-diluted crude oil from the Norwegian Continental Shelf. The influence of water absorption and light scattering of the water droplets are shown to be strong. Despite the strong influence of the water phase, the NIR technique is still capable of predicting the composition of the investigated oil phase.  相似文献   

15.
When studying the principal component analysis (PCA) or partial least squares (PLS) modelling of batch process data, one realizes that there is a wide range of approaches. In many cases, new modelling approaches are presented just because they work properly for a particular application, for example, on‐line monitoring and a given number of processes. A clear understanding of why these approaches perform successfully and which are the advantages and disadvantages in front of the others is seldom supplied. Why does modelling after batch‐wise unfolding capture changing dynamics? What are the consequences of variable‐wise unfolding? Is there any best unfolding method? When should several models for a single process be used? In this paper, it is shown how these and other related questions can be answered by properly analyzing the dynamic covariance structures of the various approaches. Copyright © 2008 John Wiley & Sons, Ltd.  相似文献   

16.
Cistus is a plant that has been used in natural medicine for hundreds of years; it works primarily as an antioxidant and cleansing agent. Cistus × incanus leaves or herb can be an attractive source of polyphenols and flavonoids. The official protocols of active compound analysis relies on the extraction of compounds of interest from plant matter, which makes their determination long and costly. An analysis of plant material in its native state can be performed using vibrational spectroscopy. This paper presents a comparison of Raman spectroscopy, attenuated total reflection in mid-infrared and diffuse reflectance technique in the near-infrared region for the simultaneous quantification of total polyphenols (TPC) and flavonoids (TF) content, as well as the determination of FRAP antioxidant activity of C. incanus material. Utilizing vibrational spectra and using partial least squares algorithm, TPC and TF were quantified with the RSEPVAL errors in the 2.7–5.4% range, while FRAP antioxidant activity for validation sets was determined with relative errors ranged from 5.2 to 9.3%. For the analyzed parameters, the lowest errors of predictions were computed for models constructed using Raman data. The developed models allow for fast and precise quantification of the studied active compounds in C. incanus material without any chemical sample treatment.  相似文献   

17.
1H Nuclear Magnetic Resonance (1H NMR) combined with multivariate statistics was adopted to discriminate female and male flower buds of Populus tomentosa in the study. Samples of 11 female and 16 male flower buds of P. tomentosa were collected in Beijing, China. 1H NMR spectra were acquired on a 400 MHz spectrometer. In total, 30 chemical compounds were identified with standards and literature according to chemical shifts, peak areas, and multiplicity. Principal component analysis (PCA), hierarchical clustering analysis (HCA), and supervised orthogonal partial least squares-discriminant analysis (OPLS-DA) were applied to discriminate female and male flower buds. An apparent grouping trend (R2X, 0.809; Q2, 0.903) between female and male groups was exhibited with PCA and HCA. The two groups were also well discriminated with OPLS-DA (R2X, 0.808; R2Y, 0.976; Q2, 0.960). Combined with variable importance in projection (VIP) > 1.0 and p < 0.05 of OPLS-DA, it was found that the content of daucosterol, β-sitosterol, ursolic acid, and betulonic acid in male group was higher than that in female, which should be the key differences of chemical constituents in female and male flower buds of P. tomentosa. The study demonstrated that 1H NMR combined with multivariate statistics could be used to discriminate female and male plants and clarify differences, which provided a novel method to identify the gender of dioecious plants.  相似文献   

18.
Cross‐validation (CV) is a common approach for determining the optimal number of components in a principal component analysis model. To guarantee the independence between model testing and calibration, the observation‐wise k‐fold operation is commonly implemented in each cross‐validation step. This operation renders the CV algorithm computationally intensive, and it is the main limitation to apply CV on very large data sets. In this paper, we carry out an empirical and theoretical investigation of the use of this operation in the element‐wise k‐fold (ekf) algorithm, the state‐of‐the‐art CV algorithm. We show that when very large data sets need to be cross‐validated and the computational time is a matter of concern, the observation‐wise k‐fold operation can be skipped. The theoretical properties of the resulting modified algorithm, referred to as column‐wise k‐fold (ckf) algorithm, are derived. Also, its performance is evaluated with several artificial and real data sets. We suggest the ckf algorithm to be a valid alternative to the standard ekf to reduce the computational time needed to cross‐validate a data set. Copyright © 2015 John Wiley & Sons, Ltd.  相似文献   

19.
The aim of the present study was to determine the volatile compounds of three different species of chili peppers, using solid-phase microextraction (SPME) methods in combination with gas chromatography–mass spectrometry (GC-MS). The detection of marker aroma compounds could be used as a parameter to differentiate between species of chili peppers for their detection and traceability in chili pepper food. The sensorial contribution was also investigated to identify the predominant notes in each species and to evaluate how they can influence the overall aroma. Three different pepper species belonging to the Capsicum genus were analyzed: Chinense, Annuum, and Baccatum. A total of 269 volatile compounds were identified in these species of chili peppers. The Capsicum annum species were characterized by a high number of acids and ketones, while the Capsicum chinense and Capsicum baccatum were characterized by esters and aldehydes, respectively. The volatile profile of extra virgin olive oils (EVOOs) flavored with chili peppers was also investigated, and principal component analysis (PCA) and hierarchical cluster analysis (HCA) of the volatile profiles were demonstrated to be a powerful analytical strategy for building a model that highlights the potential of a volatile characterization approach for use in evaluating food traceability and authenticity.  相似文献   

20.
Both the concept and the model of snug quantitative structure-activity relationship (QSAR) were pro-posed and developed for molecular design through constructing QSAR based on some known mode of receptor/ligand interactions. Many disadvantages of traditional models can be avoided by using the proposed method because the traditional models only determined upon molecular structural features in sample sets themselves. A genetic virtual screening of peptide/protein combinations (GVSPPC) is proposed for the first time by utilizing this idea to examine peptide/protein affinity activities. A genetic algorithm (GA) was developed for screening combinative targets with an interaction mode for virtual receptors. GVSPPC succeeds in disposing difficulties in rational QSAR,in order to search for the ligand/receptor interactions on conditions of unknown structures. Some bioactive oligo-/poly-peptide systems covering 58 angiotensin converting enzyme (ACE) inhibitors and 18 double site mutation residues in camel antibody protein cAb-Lys3 were investigated by GVSPPC with satisfactory results (R 2 cu>0.91,Q 2 cv > 0.86,ERMS=0.19-0.95),respectively,which demonstrates that GVSPPC is more inter-pretable in the ligand-receptor interaction than the traditional QSAR method.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号