Found 20 similar documents; search took 15 ms
1.
Market basket prediction, which is the basis of product recommendation systems, is the task of predicting what customers will buy in their next shopping basket from an analysis of their historical shopping records. Although product recommendation systems have developed rapidly and perform well in practice, state-of-the-art algorithms still have plenty of room for improvement. In this paper, we propose a new algorithm combining pattern prediction and preference prediction. In pattern prediction, sequential rules, periodic patterns and association rules are mined, and probability models are established from their statistical characteristics, e.g., the distribution of periods of a periodic pattern, to make a more precise prediction. Products with a higher probability are recommended first. If the number of recommended products is insufficient, we make a preference prediction to select more products. Preference prediction is based on the frequency and tendency of products that appear in customers’ individual shopping records, where tendency is a new concept reflecting the evolution of customers’ shopping preferences. Experiments show that our algorithm outperforms the baseline methods and state-of-the-art methods on three of four real-world transaction sequence datasets.
2.
Heterogeneous reactions are chemical reactions that occur at the interfaces of multiple phases, and they often show nonlinear dynamical behavior due to the effect of the time-variant surface area combined with complex reaction mechanisms. It is important to specify the kinetics of heterogeneous reactions in order to elucidate the microscopic elementary processes and predict the macroscopic future evolution of the system. In this study, we propose a data-driven method based on a sparse modeling algorithm and a sequential Monte Carlo algorithm for simultaneously extracting substantial reaction terms and surface models from a number of candidates by using partial observation data. We introduce a sparse modeling approach with non-uniform sparsity levels in order to accurately estimate rate constants, and the sequential Monte Carlo algorithm is employed to estimate time courses of multi-dimensional hidden variables. The results show that the rate constants of dissolution and precipitation reactions (typical examples of surface heterogeneous reactions), the necessary surface models, and the reaction terms underlying the observable data were successfully estimated from only the observable temporal changes in the concentration of the dissolved intermediate products.
3.
4.
In many industrial domains, there is significant interest in obtaining temporal relationships among multiple variables in time-series data, given that such relationships play an auxiliary role in decision making. However, when transactions occur frequently only for a period of time, it is difficult for a traditional time-series association rules mining (TSARM) algorithm to identify this kind of relationship. In this paper, we propose a new TSARM framework and a novel algorithm named TSARM-UDP. The TSARM framework is used to mine time-series association rules (TSARs), and an up-to-date pattern (UDP) is applied to discover rare patterns that only appear in a period of time. Based on up-to-date pattern mining, the proposed TSARM-UDP method can extract temporal relationship rules with better generality. The rules can be widely used in the process industry, the stock market, etc. Experiments are performed on public stock data and real blast furnace data to verify the effectiveness of the proposed algorithm. We compare our algorithm with three state-of-the-art algorithms, and the experimental results show that our algorithm provides greater efficiency and interpretability in TSARs and that it has good prospects.
5.
6.
7.
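An information monitoring system is designed for the Internet of Things (IoT) environment to monitor and adaptively collect network information and thereby safeguard network security. To address the poor data mining accuracy of information monitoring based on traditional neural network control methods, a design method for an information monitoring system based on the IoT and the self-organizing map (SOM) algorithm is proposed. First, the overall design and a functional modular analysis of the information monitoring system are carried out. Then an improved SOM algorithm is designed and applied to data mining and classification in information monitoring, with the algorithm loaded in the program loading module. Finally, an embedded Linux kernel is built in the IoT environment for the software design and development of the information monitoring system. System simulation results show that, when the system is used to monitor data in a large-scale IoT, it mines and recognizes data with good accuracy.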
8.
Constantin P. Cristescu, Cristina Stan, Eugen I. Scarlat, Teofil Minea, Cristina M. Cristescu 《Physica A》2012
We present a novel method for the parameter-oriented analysis of mutual correlation between independent time series or between equivalent structures such as ordered data sets. The proposed method is based on the sliding window technique, defines a new type of correlation measure, and can be applied to time series from all domains of science and technology, experimental or simulated. A specific parameter that can characterize the time series is computed for each window, and a cross-correlation analysis is carried out on the sets of values obtained for the time series under investigation. We apply this method to the study of some daily currency exchange rates from the point of view of the Hurst exponent and the intermittency parameter. Interesting correlation relationships are revealed and a tentative crisis prediction is presented.
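The window-then-correlate idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names are hypothetical, the per-window parameter is a plain standard deviation used as a stand-in (the paper uses the Hurst exponent and the intermittency parameter), and the correlation measure is an ordinary Pearson coefficient rather than the new measure the paper defines.

```python
import math
import random


def window_parameter(series, width, step, param):
    """Compute `param` on each sliding window of `series`."""
    return [param(series[i:i + width])
            for i in range(0, len(series) - width + 1, step)]


def std_dev(window):
    """Stand-in parameter (assumption): population standard deviation."""
    mean = sum(window) / len(window)
    return math.sqrt(sum((x - mean) ** 2 for x in window) / len(window))


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)


def parameter_correlation(x, y, width, step=1, param=std_dev):
    """Correlate the per-window parameter values of two series."""
    px = window_parameter(x, width, step, param)
    py = window_parameter(y, width, step, param)
    return pearson(px, py)


if __name__ == "__main__":
    rng = random.Random(0)
    # Two synthetic series whose volatility rises at the same time.
    x = [rng.gauss(0, 1 + i // 50) for i in range(200)]
    y = [rng.gauss(0, 1 + i // 50) for i in range(200)]
    print(parameter_correlation(x, y, width=20))
```

Passing a different `param` callable (for example a Hurst exponent estimator) changes which property of the series is being compared, which is the sense in which the analysis is parameter oriented.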
9.
Pattern recognition and data mining software based on artificial neural networks applied to proton transfer in aqueous environments
In computational physics, proton transfer phenomena can be viewed as pattern classification problems in which a set of input features allows the proton motion to be classified into two categories: transfer 'occurred' and transfer 'not occurred'. The goal of this paper is to evaluate the use of artificial neural networks in the classification of proton transfer events, based on a feed-forward back-propagation neural network used as a classifier to distinguish between the two transfer cases. In this paper, we use a newly developed data mining and pattern recognition tool for automating, controlling, and drawing charts of the output data of an existing Empirical Valence Bond code. The study analyzes the need for pattern recognition in aqueous proton transfer processes and how the learning approach in error back propagation (multilayer perceptron algorithms) could be satisfactorily employed in the present case. We present a tool for pattern recognition and validate the code, including a real physical case study. The results of applying the artificial neural network methodology to crowd patterns based upon selected physical properties (e.g., temperature, density) show the ability of the network to learn proton transfer patterns corresponding to properties of the aqueous environments, which in turn proves to be fully compatible with previous proton transfer studies.
10.
Trend anomaly detection is the practice of comparing and analyzing current and historical data trends to detect real-time abnormalities in online industrial data streams. It has the advantages of tracking concept drift automatically and predicting trend changes in the shortest time, making it important both for algorithmic research and for industry. However, industrial data streams contain considerable noise that interferes with detecting weak anomalies. In this paper, the fastest detection algorithm, “sliding nesting”, is adopted. It is based on calculating the data weight in each window by applying variable weights, while maintaining the method of trend-effective integration accumulation. The new algorithm changes the traditional calculation method of the trend anomaly detection score, which calculates the score in a short window. This algorithm, SNWFD–DS, can detect weak trend abnormalities in the presence of noise interference. An on-site oil drilling data test shows that this method can significantly reduce delays compared with other methods and can improve the detection accuracy of weak trend anomalies under noise interference.
11.
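The traditional sliding window strategy simply and mechanically moves the oldest data out of the window and the newest data into it. To address the shortcomings of this forgetting strategy, a filter window strategy is proposed. The filter window adopts a "survival of the fittest" selection mechanism that keeps in the window the data that contribute more to the model. Combining the filter window with least squares support vector regression yields the filter-window least squares support vector regression machine. Compared with the sliding-window least squares support vector regression machine, the filter-window version has a lower computational cost and needs a shorter window length to reach almost the same prediction accuracy; a shorter window length in turn implies less computation and better real-time performance. Examples of online modeling and prediction of chaotic time series demonstrate the effectiveness and feasibility of the filter-window least squares support vector regression machine. Keywords: chaotic time series; support vector machine; sliding window; filter window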
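The difference between the two forgetting strategies can be sketched as follows. This is only an illustration of the eviction rule under stated assumptions: the function names are hypothetical, and `contribution` is a placeholder scoring function supplied by the caller. The abstract does not say how a sample's contribution to the model is measured, so no particular criterion is implied here.

```python
def sliding_update(window, new_item, capacity):
    """Sliding window: append the newest sample, evict the oldest."""
    window = window + [new_item]
    if len(window) > capacity:
        window = window[1:]  # always drop the oldest sample
    return window


def filter_update(window, new_item, capacity, contribution):
    """Filter window: append the newest sample, evict the least useful.

    `contribution` is a caller-supplied scoring function (an assumption
    of this sketch); the sample with the lowest score is removed.
    """
    window = window + [new_item]
    if len(window) > capacity:
        worst = min(range(len(window)),
                    key=lambda i: contribution(window[i]))
        window = window[:worst] + window[worst + 1:]
    return window


if __name__ == "__main__":
    # Toy scores: use the absolute value as a stand-in "contribution".
    print(sliding_update([5, 1, 3], 4, capacity=3))
    print(filter_update([5, 1, 3], 4, capacity=3, contribution=abs))
```

In an online regression setting the score would typically depend on the current model rather than on the sample alone, so it would be recomputed after each model update.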
12.
When confronted with massive data streams, summarizing data with dimension reduction methods such as PCA raises theoretical and algorithmic pitfalls. A principal curve acts as a nonlinear generalization of PCA, and the present paper proposes a novel algorithm to automatically and sequentially learn principal curves from data streams. We show that our procedure is supported by regret bounds with optimal sublinear remainder terms. A greedy local search implementation (called slpc, for sequential learning principal curves) that incorporates both sleeping experts and multi-armed bandit ingredients is presented, along with its regret computation and performance on synthetic and real-life data.
13.
We introduce a modern, optimized, and publicly available implementation of the sequential Information Bottleneck clustering algorithm, which strikes a highly competitive balance between clustering quality and speed. We describe a set of optimizations that make the algorithm computation more efficient, particularly for the common case of sparse data representation. The results are substantiated by an extensive evaluation that compares the algorithm to commonly used alternatives, focusing on the practically important use case of text clustering. The evaluation covers a range of publicly available benchmark datasets and a set of clustering setups employing modern word and sentence embeddings obtained by state-of-the-art neural models. The results show that in spite of using the more basic Term-Frequency representation, the proposed implementation provides a highly attractive trade-off between quality and speed that outperforms the alternatives considered. This new release facilitates the use of the algorithm in real-world applications of text clustering.
14.
An experimental comparison of models for performing dead‐time corrections of photon‐counting detectors at synchrotron sources is presented. The performance of several detectors in the three operating modes of the Advanced Photon Source is systematically compared, with particular emphasis on asymmetric fill patterns. Several simple and well known correction formulas are evaluated. The results demonstrate the critical importance of detector speed and synchrotron fill pattern in selecting the proper dead‐time correction.
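For orientation, the two textbook dead-time models for counting detectors are given below. The abstract does not say which formulas the paper evaluates, so these are shown only as standard examples of such corrections, not as the paper's own. They assume a continuous source and do not account for the pulsed fill pattern of a synchrotron. The symbols are introduced here: $m$ is the measured count rate, $n$ the true count rate, and $\tau$ the detector dead time.

```latex
% m: measured count rate, n: true count rate, \tau: detector dead time
\begin{align}
  \text{non-paralyzable:}\quad & m = \frac{n}{1 + n\tau}
      \quad\Longleftrightarrow\quad n = \frac{m}{1 - m\tau}, \\
  \text{paralyzable:}\quad & m = n\, e^{-n\tau}.
\end{align}
```

The non-paralyzable model can be inverted in closed form, whereas the paralyzable relation has to be solved numerically for $n$.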
15.
16.
Stratifying behaviors based on demographics and socioeconomic status is crucial for political and economic planning. Traditional methods of gathering income and demographic information, like national censuses, require large-scale surveys that are costly in terms of both the financial and the organizational resources needed for their successful collection. In this study, we use data from social media to expose how behavioral patterns in different socioeconomic groups can be used to infer an individual’s income. In particular, we look at the way people explore cities and use topics of conversation online as a means of inferring individual socioeconomic status. Privacy is preserved by using anonymized data and by abstracting human mobility and online conversation topics as aggregated high-dimensional vectors. We show that mobility and hashtag activity are good predictors of income and that the highest and lowest socioeconomic quantiles have the most differentiated behavior across groups.
17.
18.
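For the measurement of the electron charge, this paper identifies the causes of the error introduced by timing and changes the timing procedure, so that the measurement is simple to perform and the data are accurate.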
19.
Manuel Stapper 《Entropy (Basel, Switzerland)》2021,23(6)
A new software package for the Julia language, CountTimeSeries.jl, is under review, which provides likelihood based methods for integer-valued time series. The package’s functionalities are showcased in a simulation study on finite sample properties of Maximum Likelihood (ML) estimation and three real-life data applications. First, the number of newly infected COVID-19 patients is predicted. Then, previous findings on the need for overdispersion and zero inflation are reviewed in an application on animal submissions in New Zealand. Further, information criteria are used for model selection to investigate patterns in corporate insolvencies in Rhineland-Palatinate. Theoretical background and implementation details are described, and complete code for all applications is provided online. The CountTimeSeries package is available at the general Julia package registry.
20.
Xide Li 《Optics & Laser Technology》2003,35(3):203-212
A new speckle measurement technique called temporal speckle pattern interferometry, or time sequential speckle pattern interferometry, has been developed recently. Its principle is that, by capturing the temporal speckle patterns related to the object deformation or displacement, the whole-field displacement, the amplitude of the vibrating object and the shape of the tested object can be calculated through the speckle intensity fluctuation scanning technique or the Fourier-transform method. In this paper, we combine analytical and numerical methods to simulate the properties of time demodulation in temporal speckle pattern interferometry techniques. The performance of three kinds of temporal phase sequences (power, exponential and harmonic) is studied with respect to the temporal speckle intensity fluctuation, the value of the spatial phase term, the optical integration time of the recording camera and the initial phase of the temporal speckle intensities. The results indicate that the normalized value and period change of the instantaneous intensity nearly coincide with those of the integral intensity for the harmonic temporal phase sequences and differ for the power and exponential temporal phase sequences.