首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Feature vector clustering molecular pairs in computer simulations
Authors:Han-Wen Pei  Aatto Laaksonen
Institution:1. Department of Materials and Environmental Chemistry, Arrhenius Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden;2. Department of Materials and Environmental Chemistry, Arrhenius Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden

State Key Laboratory of Materials-Oriented and Chemical Engineering, Nanjing Tech University, Nanjing, 210009 China

Centre of Advanced Research in Bionanoconjugates and Biopolymers, Petru Poni Institute of Macromolecular Chemistry Aleea Grigore Ghica-Voda, 41A, 700487 Lasi, Romania

Abstract:A clustering framework is introduced to analyze the microscopic structural organization of molecular pairs in liquids and solutions. A molecular pair is represented by a representative vector (RV). To obtain RV, intermolecular atom distances in the pair are extracted from simulation trajectory as components of the key feature vector (KFV). A specific scheme is then suggested to transform KFV to RV by removing the influence of permutational molecular symmetry on the KFV as the predicted clusters should be independent of possible permutations of identical atoms in the pair. After RVs of pairs are obtained, a clustering analysis technique is finally used to classify all the RVs of molecular pairs into the clusters. The framework is applied to analyze trajectory from molecular dynamics simulations of an ionic liquid (trihexyltetradecylphosphonium bis(oxalato)borate (P6,6,6,14]BOB])). The molecular pairs are successfully categorized into physically meaningful clusters, and their effectiveness is evaluated by computing the product moment correlation coefficient (PMCC). (Willett, Winterman, and Bawden, J. Chem. Inf. Comput. Sci. 1986, 26, 109–118; Downs, Willett, and Fisanick, J. Chem. Inf. Comput. Sci. 1994, 34, 1094–1102) It is observed that representative configurations of two clusters are related to two energy local minimum structures optimized by density functional theory (DFT) calculation, respectively. Several widely used clustering analysis techniques of both nonhierarchical (k-means) and hierarchical clustering algorithms are also evaluated and compared with each other. The proposed KFV technique efficiently reveals local molecular pair structures in the simulated complex liquid. It is a method, which is highly useful for liquids and solutions in particular with strong intermolecular interactions. © 2019 Wiley Periodicals, Inc.
Keywords:data mining  ionic liquid  molecular structure ■
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号