摘 要: | Principal component analysis(PCA)is one of the most popular multivariate data analysis techniques for dimension reduction and data mining,and is widely used in many fields ranging from industry and biology to finance and social development.When working on big data,it is of great necessity to consider the online version of PCA,in which only a small subset of samples could be stored.To handle the online PCA problem,Oja(1982)presented the stochastic power method under the assumption of zero-mean samples,and there have been lots of theoretical analysis and modified versions of this method in recent years.However,a common circumstance where the samples have nonzero mean is seldom studied.In this paper,we derive the convergence rate of a nonzero-mean version of Oja’s algorithm with diminishing stepsizes.In the analysis,we succeed in handling the dependency between each iteration,which is caused by the updated mean term for data centering.Furthermore,we verify the theoretical results by several numerical tests on both artificial and real datasets.Our work offers a way to deal with the top-1 online PCA when the mean of the given data is unknown.
|