This article discusses the problems of validating classification models, especially for datasets in which sample sizes are small and the number of variables is large. It describes the use of percentage correctly classified (%CC) as an indicator of the success of a classification model. For small datasets, %CC should not be used uncritically, and its interpretation depends on sample size. The article illustrates the use of a common classification method, discriminant partial least squares (D-PLS), on a randomly generated dataset of 200 samples and 200 variables.
An aim of the classifier is to determine whether the null hypothesis (there is no distinction between two classes) can be rejected. Autoprediction gives an 84.5% CC. It is shown that, if there is variable selection, it must be performed independently on the training set to obtain a CC close to 50% on the test set; otherwise, over-optimistic and false conclusions can be reached about the ability to classify samples into groups.
Finally, two aims of determining the quality of a model are frequently confused, namely optimisation (often used to determine the most appropriate number of components in a model) and independent validation; to overcome this, the data should be split into three groups.
Difficulties with model building often arise if validation and optimisation have been done on different groups of samples, especially when iterative methods are used, with each group being modelled using different properties, such as a different number of components or different variables.
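The effect of where variable selection is performed can be illustrated with a minimal sketch. It assumes NumPy and, for brevity, swaps D-PLS for a simple nearest-class-mean classifier, so it is not the article's method; the function names, the number of selected variables (`k=20`), the 50/50 split and the 50 repetitions are all hypothetical choices. On purely random data, selecting variables with all samples before splitting tends to inflate the test-set %CC, whereas selecting on the training set alone should leave it close to 50%.

```python
import numpy as np


def select_top_variables(X, y, k):
    """Return indices of the k variables with the largest scaled class-mean difference."""
    diff = X[y == 1].mean(axis=0) - X[y == 0].mean(axis=0)
    spread = X.std(axis=0) + 1e-12  # guard against division by zero
    return np.argsort(-np.abs(diff / spread))[:k]


def nearest_mean_predict(X_train, y_train, X_test):
    """Assign each test sample to the class whose training mean is closer.

    This is a stand-in classifier (an assumption), not D-PLS.
    """
    mean0 = X_train[y_train == 0].mean(axis=0)
    mean1 = X_train[y_train == 1].mean(axis=0)
    dist0 = ((X_test - mean0) ** 2).sum(axis=1)
    dist1 = ((X_test - mean1) ** 2).sum(axis=1)
    return (dist1 < dist0).astype(int)


def percent_cc(y_true, y_pred):
    """Percentage correctly classified (%CC)."""
    return 100.0 * np.mean(y_true == y_pred)


def run_trial(rng, n_samples=200, n_vars=200, k=20):
    """Return (%CC with leaky selection, %CC with training-only selection)."""
    X = rng.standard_normal((n_samples, n_vars))  # random data: no real class difference
    y = np.repeat([0, 1], n_samples // 2)         # arbitrary class labels

    # Stratified 50/50 split so both classes appear in training and test sets.
    idx0 = rng.permutation(np.flatnonzero(y == 0))
    idx1 = rng.permutation(np.flatnonzero(y == 1))
    half = len(idx0) // 2
    train = np.concatenate([idx0[:half], idx1[:half]])
    test = np.concatenate([idx0[half:], idx1[half:]])

    # Leaky: variables chosen using ALL samples, including the test set.
    leaky_vars = select_top_variables(X, y, k)
    leaky = percent_cc(
        y[test],
        nearest_mean_predict(X[train][:, leaky_vars], y[train], X[test][:, leaky_vars]),
    )

    # Independent: variables chosen using the training set only.
    honest_vars = select_top_variables(X[train], y[train], k)
    honest = percent_cc(
        y[test],
        nearest_mean_predict(X[train][:, honest_vars], y[train], X[test][:, honest_vars]),
    )
    return leaky, honest


rng = np.random.default_rng(0)
results = np.array([run_trial(rng) for _ in range(50)])
mean_leaky, mean_honest = results.mean(axis=0)
print(f"mean test %CC, selection on all samples:   {mean_leaky:.1f}")
print(f"mean test %CC, selection on training only: {mean_honest:.1f}")
```

Averaging over repeated random datasets is a design choice made here to smooth out the large run-to-run scatter that a single 100-sample test set would show.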
The densities of H2O, D2O, and MeOH solutions in acetonitrile, with solute concentrations up to a mole fraction of 0.07, were measured at 278.15, 288.15, 298.15, 308.15, and 318.15 K using vibrating-tube densimetry with an error of 8·10⁻⁶ g cm⁻³. The limiting partial molar volumes of the H/D isotopomers of water and MeOH in acetonitrile (V̄₂) and the isotope effects in V̄₂ and in the excess molar volumes of acetonitrile-water mixtures were calculated. Molecules of H2O, D2O, and MeOH form associates with acetonitrile molecules via hydrogen bonds. The associates have packing volumes close to those in the individual solutes. The water and methanol molecules were assumed to be incorporated into the acetonitrile structure without substantial changes in the latter. However, this process results in some compression of the system with a simultaneous increase in its expansibility.
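For orientation, volumetric quantities of this kind are commonly derived from measured densities through the standard relations sketched below. The notation is an assumption rather than the authors' own: subscript 1 denotes the solvent (acetonitrile), subscript 2 the solute, x mole fraction, M molar mass, ρ the solution density, ρ* a pure-component density, and m the solute molality, all in consistent SI units.

```latex
% Excess molar volume of a binary mixture
V^{E} = \frac{x_1 M_1 + x_2 M_2}{\rho}
        - \frac{x_1 M_1}{\rho_1^{*}} - \frac{x_2 M_2}{\rho_2^{*}}

% Apparent molar volume of the solute (m in mol per kg of solvent)
V_{\phi,2} = \frac{M_2}{\rho} - \frac{\rho - \rho_1^{*}}{m \, \rho \, \rho_1^{*}}

% Limiting partial molar volume as the infinite-dilution limit
\bar{V}_2^{\infty} = \lim_{m \to 0} V_{\phi,2}
```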
Using a precise scanning microcalorimetry technique, the heat capacity differences between water and dilute aqueous solutions of ethanol, n-propanol, n-butanol and n-pentanol were measured from 5 to 125°C, and the partial molar heat capacities of these substances in water were determined. It was found that the heat capacity increment for an alcohol dissolved in water is proportional to the number of –CH2– groups and decreases as the temperature increases. The heat capacity increment of hydration of non-polar groups is shown to be positive and large at room temperature and to decrease in magnitude as the temperature increases. In contrast, the heat capacity increment of hydration of polar groups is negative at room temperature and increases as the temperature increases. From the temperature dependence of the heat capacity increment, one can assume that the water molecules solvated by the non-polar groups of the alcohols behave in a non-cooperative manner.
Starting from the natural neo-clerodane diterpenoid teubotrin (1) several neo-clerodane derivatives (3-7,9-11) have been obtained. The naturally occurring diterpenoid teuscordinon (12) has also been synthesized from teubotrin (1), showing thereby how some of these transformations can be useful for the synthesis of other natural neo-clerodane diterpenes. The latter are of interest due to their activity as insect antifeedants and other important biological properties.
Assuming the separation of the intermolecular scattering function into radial and angular parts and using the orientational model of Egelstaff et al. for tetrachlorides, the structure of liquid vanadium tetrachloride has been studied. It has been observed that such a separation is approximate for this liquid and that the introduction of a third correction term is required to account for the molecular structure function. The chlorine-chlorine partial structure and the effective angle-averaged intermolecular chlorine-chlorine potential in the liquid have been evaluated. Without the third correction term, which was introduced to generate the molecular structure function theoretically, the centre structure function has been obtained approximately from the experimentally observed molecular structure function; from it, the centre radial distribution function, the centre direct correlation function and the angle-averaged vanadium-vanadium effective potential have been evaluated.
Cyclical bifurcated hydrogen-bonded structures that are consistent with the spectral data are proposed for aqueous solutions of hydrofluoric acid and for the bifluoride ion. The structure proposed for HF is also applicable to solutions in organic solvents. Raman spectra of tetramethylguanidinium perchlorate suggest that the corresponding Raman spectra of perchloric acid solutions may not be interpreted in terms of a completely dissociated acid. Other evidence, including activity coefficient, heat capacity and partial molal volume data, suggests that there is some association between the perchlorate ion and the hydrated proton in relatively dilute perchloric acid solutions. This association decreases in concentrated aqueous solutions.
Apparent molar heat capacities and volumes of amylamine (PentNH2, 0.02 m), capronitrile (PentCN, 0.02 m) and nitropentane (PentNO2, 0.009 m) in decyltrimethylammonium bromide (DeTAB) micellar solutions, in water and in octane were measured at 25°C. By assuming that their concentrations approach the standard infinite dilution state, the heat capacities and volumes were rationalized by means of previously reported equations, from which the distribution constant between the aqueous and the micellar phases and the heat capacity and volume of the additives in both phases are simultaneously derived. The present results are compared with those we have previously obtained for pentanol (PentOH). The thermodynamic properties of PentNH2 in water and in the micellar phase are essentially identical to those of PentOH but different from those of PentCN and PentNO2, whereas the opposite behavior was observed in their pure liquid state and in octane. The nature of the solvent medium seems to affect the thermodynamic behavior of PentNH2. Also, the apparent molar heat capacities of the amyl compounds investigated here in micellar solutions, studied as a function of surfactant concentration, show evidence of a maximum at about 0.4 m DeTAB, which can be attributed to a micellar structural transition. Accordingly, the solubilities of PentCN and PentNO2 as a function of the DeTAB concentration drop in the neighborhood of the concentration where the heat capacities display the maximum.