共查询到20条相似文献,搜索用时 15 毫秒
1.
Identifying winners of competitive events: A SVM-based classification model for horserace prediction
The aim of much horserace modelling is to appraise the informational efficiency of betting markets. The prevailing approach involves forecasting the runners’ finish positions by means of discrete or continuous response regression models. However, theoretical considerations and empirical evidence suggest that the information contained within finish positions might be unreliable, especially among minor placings. To alleviate this problem, a classification-based modelling paradigm is proposed which relies only on data distinguishing winners and losers. To assess its effectiveness, an empirical experiment is conducted using data from a UK racetrack. The results demonstrate that the classification-based model compares favourably with state-of-the-art alternatives and confirm the reservations of relying on rank ordered finishing data. Simulations are conducted to further explore the origin of the model’s success by evaluating the marginal contribution of its constituent parts. 相似文献
2.
非平行支持向量机是支持向量机的延伸,受到了广泛的关注.非平行支持向量机构造允许非平行的支撑超平面,可以描述不同类别之间的数据分布差异,从而适用于更广泛的问题.然而,对非平行支持向量机模型与支持向量机模型之间的关系研究较少,且尚未有等价于标准支持向量机模型的非平行支持向量机模型.从支持向量机出发,构造出新的非平行支持向量机模型,该模型不仅可以退化为标准支持向量机,保留了支持向量机的稀疏性和核函数可扩展性.同时,可以描述不同类别之间的数据分布差异,适用于更广泛的非平行结构数据等.最后,通过实验初步验证了所提模型的有效性. 相似文献
3.
We improve the twin support vector machine(TWSVM)to be a novel nonparallel hyperplanes classifier,termed as ITSVM(improved twin support vector machine),for binary classification.By introducing the diferent Lagrangian functions for the primal problems in the TWSVM,we get an improved dual formulation of TWSVM,then the resulted ITSVM algorithm overcomes the common drawbacks in the TWSVMs and inherits the essence of the standard SVMs.Firstly,ITSVM does not need to compute the large inverse matrices before training which is inevitable for the TWSVMs.Secondly,diferent from the TWSVMs,kernel trick can be applied directly to ITSVM for the nonlinear case,therefore nonlinear ITSVM is superior to nonlinear TWSVM theoretically.Thirdly,ITSVM can be solved efciently by the successive overrelaxation(SOR)technique or sequential minimization optimization(SMO)method,which makes it more suitable for large scale problems.We also prove that the standard SVM is the special case of ITSVM.Experimental results show the efciency of our method in both computation time and classification accuracy. 相似文献
4.
The availability of abundant data posts a challenge to integrate static customer data and longitudinal behavioral data to improve performance in customer churn prediction. Usually, longitudinal behavioral data are transformed into static data before being included in a prediction model. In this study, a framework with ensemble techniques is presented for customer churn prediction directly using longitudinal behavioral data. A novel approach called the hierarchical multiple kernel support vector machine (H-MK-SVM) is formulated. A three phase training algorithm for the H-MK-SVM is developed, implemented and tested. The H-MK-SVM constructs a classification function by estimating the coefficients of both static and longitudinal behavioral variables in the training process without transformation of the longitudinal behavioral data. The training process of the H-MK-SVM is also a feature selection and time subsequence selection process because the sparse non-zero coefficients correspond to the variables selected. Computational experiments using three real-world databases were conducted. Computational results using multiple criteria measuring performance show that the H-MK-SVM directly using longitudinal behavioral data performs better than currently available classifiers. 相似文献
5.
Chinese heavy-polluting industrial enterprises, especially petrochemical or chemical industry, labeled low carbon efficiency and high emission load, are facing the tremendous pressure of emission reduction under the background of global shortage of energy supply and constrain of carbon emission. However, due to the limited amount of theoretic and practical research in this field, problems like lacking prediction indicators or models, and the quantified standard of carbon risk remain unsolved. In this paper, the connotation of carbon risk and an assessment index system for Chinese heavy-polluting industrial enterprises (eg. coal enterprise, petrochemical enterprises, chemical enterprises et al.) based on support vector machine are presented. By using several heavy-polluting industrial enterprises’ related data, SVM model is trained to predict the carbon risk level of a specific enterprise, which allows the enterprise to identify and manage its carbon risks. The result shows that this method can predict enterprise’s carbon risk level in an efficient, accurate way with high practical application and generalization value. 相似文献
6.
Issam Dagher 《Journal of Global Optimization》2008,41(1):15-30
A new quadratic kernel-free non-linear support vector machine (which is called QSVM) is introduced. The SVM optimization problem
can be stated as follows: Maximize the geometrical margin subject to all the training data with a functional margin greater
than a constant. The functional margin is equal to W
T
X + b which is the equation of the hyper-plane used for linear separation. The geometrical margin is equal to . And the constant in this case is equal to one. To separate the data non-linearly, a dual optimization form and the Kernel
trick must be used. In this paper, a quadratic decision function that is capable of separating non-linearly the data is used.
The geometrical margin is proved to be equal to the inverse of the norm of the gradient of the decision function. The functional
margin is the equation of the quadratic function. QSVM is proved to be put in a quadratic optimization setting. This setting
does not require the use of a dual form or the use of the Kernel trick. Comparisons between the QSVM and the SVM using the
Gaussian and the polynomial kernels on databases from the UCI repository are shown. 相似文献
7.
针对同一对象从不同途径或不同层面获得的特征数据被称为多视角数据. 多视角学习是利用事物的多视角数据进行建模求解的一种新的机器学习方法. 大量研究表明, 多视角数据共同学习可以显著提高模型的学习效果, 因此许多相关模型及算法被提出. 多视角学习一般需遵循一 致性原则和互补性原则. 基于一致性原则,Farquhar 等人成功地将支持向量机(Support Vector Machine, SVM)和核典型相关分析(Kernel Canonical Correlation Analysis, KCCA)整合成一个单独的优化问题, 提出SVM-2K模型. 但是, SVM-2K模型并未充分利用多视角数据间的互补信息. 因此, 在SVM-2K模型的基础之上, 提出了基于间隔迁移的多视角支持向量机模型(Margin transfer-based multi-view support vector machine, M^2SVM), 该模型同时满足多视角学习的一致性和互补 性两原则. 进一步地, 从一致性的角度对其进行理论分析, 并 与SVM-2K比较, 揭示了 M^2SVM 比SVM-2K 更为灵活. 最后, 在大量的多视角数据集上验证了M^2SVM模型的有效性. 相似文献
8.
9.
Xin Yan Yanqin Bai Shu-Cherng Fang Jian Luo 《The Journal of the Operational Research Society》2016,67(7):1001-1011
In this paper, we propose a kernel-free semi-supervised quadratic surface support vector machine model for binary classification. The model is formulated as a mixed-integer programming problem, which is equivalent to a non-convex optimization problem with absolute-value constraints. Using the relaxation techniques, we derive a semi-definite programming problem for semi-supervised learning. By solving this problem, the proposed model is tested on some artificial and public benchmark data sets. Preliminary computational results indicate that the proposed method outperforms some existing well-known methods for solving semi-supervised support vector machine with a Gaussian kernel in terms of classification accuracy. 相似文献
10.
The paper is related to the error analysis of Multicategory Support Vector Machine (MSVM) classifiers based on reproducing kernel Hilbert spaces. We choose the polynomial kernel as Mercer kernel and give the error estimate with De La Vallée Poussin means. We also introduce the standard estimation of sample error, and derive the explicit learning rate. 相似文献
11.
Bing Zheng Li 《数学学报(英文版)》2008,24(3):511-528
The purpose of this paper is to provide an error analysis for the multicategory support vector machine (MSVM) classificaton problems. We establish the uniform convergency approach for MSVMs and estimate the misclassification error. The main difficulty we overcome here is to bound the offset vector. As a result, we confirm that the MSVM classification algorithm with polynomial kernels is always efficient when the degree of the kernel polynomial is large enough. Finally the rate of convergence and examples are given to demonstrate the main results. 相似文献
12.
Theodore B. Trafalis Olutayo O. Oladunni Michael B. Richman 《Computational Management Science》2011,8(3):281-297
A knowledge-based linear Tihkonov regularization classification model for tornado discrimination is presented. Twenty-three
attributes, based on the National Severe Storms Laboratory’s Mesoscale Detection Algorithm, are used as prior knowledge. Threshold
values for these attributes are employed to discriminate the data into two classes (tornado, non-tornado). The Weather Surveillance
Radar 1998 Doppler is used as a source of data streaming every 6 min. The combination of data and prior knowledge is used
in the development of a least squares problem that can be solved using matrix or iterative methods. Advantages of this formulation
include explicit expressions for the classification weights of the classifier and its ability to incorporate and handle prior
knowledge directly to the classifiers. Comparison of the present approach to that of Fung et al. [in Proceedings neural information
processing systems (NIPS 2002), Vancouver, BC, December 10–12, 2002], over a suite of forecast evaluation indices, demonstrates
that the Tikhonov regularization model is superior for discriminating tornadic from non-tornadic storms. 相似文献
13.
In this paper, we propose a robust L1-norm non-parallel proximal support vector machine (L1-NPSVM), which aims at giving a robust performance for binary classification in contrast to GEPSVM, especially for the problem with outliers. There are three mainly properties of the proposed L1-NPSVM. Firstly, different from the traditional GEPSVM which solves two generalized eigenvalue problems, our L1-NPSVM solves a pair of L1-norm optimal problems by using a simple justifiable iterative technique. Secondly, by introducing the L1-norm, our L1-NPSVM is more robust to outliers than GEPSVM to a great extent. Thirdly, compared with GEPSVM, no parameters need to be regularized in our L1-NPSVM. The effectiveness of the proposed method is demonstrated by tests on a simple artificial example as well as on some UCI datasets, which shows the improvements of GEPSVM. 相似文献
14.
15.
Geomechanical parameters identification by particle swarm optimization and support vector machine 总被引:1,自引:0,他引:1
Back analysis is commonly used in identifying geomechanical parameters based on the monitored displacements. Conventional back analysis method is not capable of recognizing non-linear relationship involving displacements and mechanical parameters effectively. The new intelligent displacement back analysis method proposed in this paper is the combination of support vector machine, particle swarm optimization, and numerical analysis techniques. The non-linear relationship is efficiently represented by support vector machine. Numerical analysis is used to create training and testing samples for recognition of SVMs. Then, a global optimum search on the obtained SVMs by particle swarm optimization can lead to the geomechanical parameters identification effectively. 相似文献
16.
Xiaoyuan ZhangJianzhong Zhou Changqin WangChaoshun Li Lixiang Song 《Applied mathematics and computation》2012,218(9):4973-4987
Support vector machine (SVM) is a popular tool for machine learning task. It has been successfully applied in many fields, but the parameter optimization for SVM is an ongoing research issue. In this paper, to tune the parameters of SVM, one form of inter-cluster distance in the feature space is calculated for all the SVM classifiers of multi-class problems. Inter-cluster distance in the feature space shows the degree the classes are separated. A larger inter-cluster distance value implies a pair of more separated classes. For each classifier, the optimal kernel parameter which results in the largest inter-cluster distance is found. Then, a new continuous search interval of kernel parameter which covers the optimal kernel parameter of each class pair is determined. Self-adaptive differential evolution algorithm is used to search the optimal parameter combination in the continuous intervals of kernel parameter and penalty parameter. At last, the proposed method is applied to several real word datasets as well as fault diagnosis for rolling element bearings. The results show that it is both effective and computationally efficient for parameter optimization of multi-class SVM. 相似文献
17.
Support vector regression (SVR) has been successfully applied in various domains, including predicting the prices of different financial instruments like stocks, futures, options, and indices. Because of the wide variation in financial time-series data, instead of using only a single standard prediction technique like SVR, we propose a hybrid model called USELM-SVR. It is a combination of unsupervised extreme learning machine (US-ELM)-based clustering and SVR forecasting. We assessed the feasibility and effectiveness of this hybrid model using a case study, predicting the one-, two-, and three-day ahead closing values of the energy commodity futures index traded on the Multi Commodity Exchange in India. Our experimental results show that the USELM-SVR is viable and effective, and produces better forecasts than our benchmark models (standard SVR, a hybrid of SVR with self-organizing map (SOM) clustering, and a hybrid of SVR with k-means clustering). Moreover, the proposed USELM-SVR architecture is useful as an alternative model for prediction tasks when we require more accurate predictions. 相似文献
18.
由于标准支持向量机模型是一个二次规划问题,随着数据规模的增大,求解算法过程会越来越复杂.在K-SVCR算法结构的基础上,构造了严格凸的二次规划新模型,该模型的主要特点是可以将其一阶最优化条件转化为变分不等式问题,利用Fischer-Burmeister(FB)函数将互补问题转化为光滑方程组;建立光滑快速牛顿算法求解,并证明了该算法所产生的序列是全局收敛;利用标准数据集测试提出算法的有效性,在训练正确率和运行时间上与K-SVCR算法相比都有较好的表现,实验结果表明该算法可行且有效. 相似文献
19.
Support Vector Machines (SVMs) is known to be a powerful nonparametric classification technique even for high-dimensional data. Although predictive ability is important, obtaining an easy-to-interpret classifier is also crucial in many applications. Linear SVM provides a classifier based on a linear score. In the case of functional data, the coefficient function that defines such linear score usually has many irregular oscillations, making it difficult to interpret. 相似文献
20.
The response surface method (RSM), a simple and effective approximation technique, is widely used for reliability analysis in civil engineering. However, the traditional RSM needs a considerable number of samples and is computationally intensive and time-consuming for practical engineering problems with many variables. To overcome these problems, this study proposes a new approach that samples experimental points based on the difference between the last two trial design points. This new method constructs the response surface using a support vector machine (SVM); the SVM can build complex, nonlinear relations between random variables and approximate the performance function using fewer experimental points. This approach can reduce the number of experimental points and improve the efficiency and accuracy of reliability analysis. The advantages of the proposed method were verified using four examples involving random variables with different distributions and correlation structures. The results show that this approach can obtain the design point and reliability index with fewer experimental points and better accuracy. The proposed method was also employed to assess the reliability of a numerically modeled tunnel. The results indicate that this new method is applicable to practical, complex engineering problems such as rock engineering problems. 相似文献