It is becoming increasingly common in quantitative structure/activity relationship (QSAR) analyses to use external test sets to evaluate the likely stability and predictivity of the models obtained. In some cases, such as those involving variable selection, an internal test set – i.e., a cross-validation set – is also used. Care is sometimes taken to ensure that the subsets used exhibit response and/or property distributions similar to those of the data set as a whole, but more often the individual observations are simply assigned 'at random.' In the special case of MLR without variable selection, it can be analytically demonstrated that this strategy is inferior to others. Most particularly, D-optimal design performs better if the form of the regression equation is known and the variables involved are well behaved. This report introduces an alternative, non-parametric approach termed 'boosted leave-many-out' (boosted LMO) cross-validation. In this method, relatively small training sets are chosen by applying optimizable k-dissimilarity selection (OptiSim) using a small subsample size (k = 4, in this case), with the unselected observations being reserved as a test set for the corresponding reduced model. Predictive errors for the full model are then estimated by aggregating results over several such analyses. The countervailing effects of training and test set size, diversity, and representativeness on PLS model statistics are described for CoMFA analysis of a large data set of COX2 inhibitors.
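The selection-and-aggregation loop described in this abstract can be sketched roughly as follows. This is a simplified illustration, not the authors' implementation: the function names `optisim_select` and `boosted_lmo` are hypothetical, Euclidean distance on a descriptor matrix stands in for whatever dissimilarity measure the study used, ordinary least squares stands in for PLS/CoMFA, and pooling the per-round mean squared errors is only one plausible way of "aggregating results".

```python
import numpy as np

def optisim_select(X, n_select, k=4, radius=0.0, rng=None):
    """Simplified OptiSim-style selection (illustrative, hypothetical helper).

    At each step a random subsample of up to k eligible candidates is drawn
    and the one farthest from the already selected set is kept.
    """
    rng = np.random.default_rng(rng)
    first = int(rng.integers(X.shape[0]))
    selected = [first]
    # Distance from every observation to its nearest selected observation.
    min_dist = np.linalg.norm(X - X[first], axis=1)
    while len(selected) < n_select:
        # Selected points have distance 0, so they are never eligible again.
        eligible = np.flatnonzero(min_dist > radius)
        if eligible.size == 0:
            break
        sub = rng.choice(eligible, size=min(k, eligible.size), replace=False)
        best = int(sub[np.argmax(min_dist[sub])])
        selected.append(best)
        min_dist = np.minimum(min_dist, np.linalg.norm(X - X[best], axis=1))
    return np.array(selected)

def boosted_lmo(X, y, n_train, n_rounds=10, k=4, seed=0):
    """Aggregate test-set error over several small, diverse training sets."""
    rng = np.random.default_rng(seed)
    mse_per_round = []
    for _ in range(n_rounds):
        train = optisim_select(X, n_train, k=k, rng=rng)
        test = np.setdiff1d(np.arange(len(y)), train)
        # Assumption: OLS with an intercept replaces the PLS model here.
        A_train = np.column_stack([np.ones(len(train)), X[train]])
        coef, *_ = np.linalg.lstsq(A_train, y[train], rcond=None)
        A_test = np.column_stack([np.ones(len(test)), X[test]])
        mse_per_round.append(np.mean((y[test] - A_test @ coef) ** 2))
    # Assumption: report the root of the mean of per-round MSEs.
    return float(np.sqrt(np.mean(mse_per_round)))
```

A small k keeps each training set closer to a random (representative) draw, while a larger k pushes it toward maximal diversity; the abstract's choice of k = 4 sits near the representative end of that trade-off.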
For nonautonomous dynamical systems, the principle of inheriting local properties by global Poincaré maps is developed. Using this method, a selection criterion for systems of close competitors is found: to gain competitive advantage, it suffices to outproduce other populations with a margin. The margin factor in question remains uniformly bounded as the number of competitors in the community grows.
In this paper, we introduce a new class of vector quasi-equilibrium problems with set-valued maps. Almost all the vector equilibrium models of the Blum-Oettli type in the literature are special cases of our new class of equilibrium problems under consideration. Moreover, a number of C-diagonal quasiconvexity properties are proposed for set-valued maps, which are natural generalizations of the -diagonal quasiconvexity for real functions. Together with an application of continuous selection and fixed-point theorems, these conditions enable us to prove unified existence results for solutions of such vector equilibrium problems.
A theorem of M. F. Driscoll says that, under certain restrictions, the probability that a given Gaussian process has its sample paths almost surely in a given reproducing kernel Hilbert space (RKHS) is either 0 or 1. Driscoll also found a necessary and sufficient condition for that probability to be 1.
Doing away with Driscoll's restrictions, R. Fortet generalized his condition and named it nuclear dominance. He stated a theorem claiming nuclear dominance to be necessary and sufficient for the existence of a process (not necessarily Gaussian) having its sample paths in a given RKHS. This theorem - specifically the necessity of the condition - turns out to be incorrect, as we will show via counterexamples. On the other hand, a weaker sufficient condition is available.
Using Fortet's tools along with some new ones, we correct Fortet's theorem and then find the generalization of Driscoll's result. The key idea is that of a random element in an RKHS whose values are sample paths of a stochastic process. As in Fortet's work, we make almost no assumptions about the reproducing kernels we use, and we demonstrate the extent to which one may dispense with the Gaussian assumption.
We prove that the trace of the space to an arbitrary closed subset is characterized by the following 'finiteness' property. A function belongs to the trace space if and only if the restriction to an arbitrary subset consisting of at most can be extended to a function such that
The constant is sharp.
The proof is based on a Lipschitz selection result which is interesting in its own right.
This paper deals with Lipschitz selections of set-valued maps with closed graphs. First, we characterize Lipschitzianity of a closed set-valued map in the differential games framework in terms of a discriminating property of its graph. This allows us to consider the -Lipschitz kernel of a given set-valued map as the largest -Lipschitz closed set-valued map contained in the initial one, to derive an algorithm to compute the collection of Lipschitz selections, and to extend the Pasch–Hausdorff envelope to set-valued maps.
This paper presents the experience gained by developers and users with the implementation and operation of NorTRACK™, a real-time computerized product tracking system. A Programmable Logic Controller (PLC) collects and transfers data in real time to NorTRACK’s Oracle™ database on a Windows NT™ server network. After extensive development and Beta testing at MDS Nordion’s Canadian Irradiation Centre in Montreal, Canada, NorTRACK was installed in January 1997 with a new irradiation facility in Ethicon Endo-Surgery Inc.’s Albuquerque plant in the United States. NorTRACK communicates with the irradiator control and safety system, the plant's central manufacturing database, an innovative pallet staging and tote loading robot, and an automated dosimetry reading system. This integrated system allows the sterilization facility to monitor the irradiator operation and the flow of many products, through varied processing modes, continuously and reliably. As a result of operating with NorTRACK, both MDS Nordion’s CIC facility and the Endo-Surgery manufacturing site are beginning to realize unique benefits in their respective operations. MDS Nordion is also initiating several future product enhancements and additional productivity modules. This paper describes the NorTRACK system, the various stages of the development project and Beta tests, and the experience of the users to date in their operations.
A full analogy of the celebrated Sharkovsky cycle coexistence theorem is established for lower semicontinuous (multivalued) maps on metrizable linear continua. This result is further extended to triangular maps.
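For orientation, the classical Sharkovsky ordering that this abstract builds on (3, 5, 7, ..., 2·3, 2·5, ..., 4·3, ..., 8, 4, 2, 1) can be encoded as a sort key. The sketch below covers only the classical statement for continuous single-valued maps of an interval; it does not implement the multivalued or triangular-map results, and the names `sharkovsky_key` and `forces` are hypothetical.

```python
def sharkovsky_key(n):
    """Sort key: a smaller key means earlier in the Sharkovsky ordering."""
    if n < 1:
        raise ValueError("periods are positive integers")
    twos = 0
    while n % 2 == 0:
        n //= 2
        twos += 1
    if n == 1:
        # Pure powers of two come last, in decreasing order: ..., 8, 4, 2, 1.
        return (1, -twos, 0)
    # Odd part > 1: grouped by the power of two, then by increasing odd part.
    return (0, twos, n)

def forces(m, n):
    """True if a cycle of period m implies one of period n (classical case)."""
    return sharkovsky_key(m) <= sharkovsky_key(n)

# Periods 1..12 arranged in Sharkovsky order.
ordered = sorted(range(1, 13), key=sharkovsky_key)
```

In this encoding `forces(3, n)` holds for every positive integer `n`, which is the familiar "period three implies all periods" consequence of the classical theorem.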
This paper considers a supply chain with multiple products and multiple suppliers, all of which have limited capacity. We assume that items received from suppliers are not of perfect quality. Items of imperfect quality, not necessarily defective, could be used in another inventory situation. Imperfect items are sold as a single batch, prior to receiving the next shipment, at a discounted price. The demand over a finite planning horizon is known, and an optimal procurement strategy for this multi-period horizon is to be determined. Each product can be sourced from a set of approved suppliers, and a supplier-dependent transaction cost applies for each period in which an order is placed with a supplier. A product-dependent holding cost per period applies for each product in the inventory that is carried across a period in the planning horizon. A maximum storage space for the buyer in each period is also considered. The decision maker, the buyer, needs to decide what products to order, in what quantities, with which suppliers, and in which periods. Finally, a genetic algorithm (GA) is used to solve the model.
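One way to picture the cost structure described in this abstract is as a penalized objective that a GA could minimise over candidate order plans. The sketch below is an assumption-laden illustration rather than the paper's model: every parameter name, the per-period reading of supplier capacity, the treatment of imperfect items as leaving stock in the period they arrive, and the single penalty weight are all hypothetical choices.

```python
import numpy as np

def plan_cost(Q, demand, price, trans_cost, hold_cost, defect_rate,
              discount_price, capacity, space, storage_limit, penalty=1e6):
    """Penalized cost of an order plan Q[i, j, t] (product, supplier, period).

    Shapes (all assumed): demand (P, T); price, defect_rate, capacity (P, S);
    trans_cost (S,); hold_cost, discount_price, space (P,).
    """
    purchase = np.sum(Q * price[:, :, None])
    # A supplier's transaction cost is charged once per period with any order.
    ordered = Q.sum(axis=0) > 0                      # shape (S, T)
    transaction = np.sum(ordered * trans_cost[:, None])
    # Imperfect items are sold at a discount and never enter inventory.
    imperfect = Q * defect_rate[:, :, None]
    salvage = np.sum(imperfect * discount_price[:, None, None])
    good = (Q - imperfect).sum(axis=1)               # shape (P, T)
    inventory = np.cumsum(good - demand, axis=1)     # end-of-period stock
    stock = np.maximum(inventory, 0)
    holding = np.sum(stock * hold_cost[:, None])
    # Constraint violations: unmet demand, supplier capacity, storage space.
    shortage = np.maximum(-inventory, 0).sum()
    over_capacity = np.maximum(Q - capacity[:, :, None], 0).sum()
    used_space = (stock * space[:, None]).sum(axis=0)
    over_space = np.maximum(used_space - storage_limit, 0).sum()
    violation = shortage + over_capacity + over_space
    return float(purchase + transaction + holding - salvage
                 + penalty * violation)
```

A GA would then encode `Q` as a chromosome and use `plan_cost` as its fitness; the penalty term is a common way to steer the search toward feasible plans without hard-coding repair operators.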