共查询到20条相似文献,搜索用时 0 毫秒
1.
提出了一种基于压缩感知的考虑语音帧间信息的语音转换算法。根据连续多帧语音的线谱对参数所构成的矢量在离散余弦变换域具有稀疏性,利用压缩感知技术对该矢量压缩成短矢量,并将该压缩后的短矢量作为特征参数训练语音转换函数。实验测试结果表明,选择合适的语音帧数时,该算法的性能要比传统的采用加权频率卷绕的转换算法提高3.21%。这说明,充分有效地利用语音帧间的相关信息会使转换语音保持更稳定的帧间声学特性,有利于提高语音转换系统的性能, 相似文献
2.
A voice conversion algorithm,which makes use of the information between continuous frames of speech by compressed sensing,is proposed in this paper.According to the sparsity property of the concatenated vector of several continuous Linear Spectrum Pairs(LSP)in the discrete cosine transformation domain,this paper utilizes compressed sensing to extract the compressed vector from the concatenated LSPs and uses it as the feature vector to train the conversion function.The results of evaluations demonstrate that the performance of this approach can averagely improve 3.21%with the conventional algorithm based on weighted frequency warping when choosing the appropriate numbers of speech frame.The experimental results also illustrate that the performance of voice conversion system can be improved by taking full advantage of the inter-frame information,because those information can make the converted speech remain the more stable acoustic properties which is inherent in inter-frames. 相似文献
3.
针对目标说话人可能存在语料不足的情况,本文提出了一种有限语料下的统一张量字典语音转换算法。从语料库中选取N个说话人作为语音张量字典的基础说话人,通过多序列动态时间规整算法使这N个说话人的平行语音段对齐,从而建立由N个二维基础字典构成的张量字典。在语音转换阶段,源、目标说话人语音都可以通过张量字典中各基础字典的线性组合,构造出各自的语音字典,实现了语音转换。实验结果表明,当基础说话人个数达到14时,只需要极少的目标说话人语料,便可获得与传统的基于非负矩阵分解转换算法相当的转换效果,这极大地方便了语音转换系统的应用。 相似文献
4.
Under the condition of limited target speaker's corpus, this paper proposed an algorithm for voice conversion using unified tensor dictionary with limited corpus. Firstly,parallel speech of N speakers was selected randomly from the speech corpus to build the base of tensor dictionary. And then, after the operation of multi-series dynamic time warping for those chosen speech, N two-dimension basic dictionaries can be generated which constituted the unified tensor dictionary. During the conversion stage, the two dictionaries of source and target speaker were established by linear combination of the N basic dictionaries using the two speakers' speech. The experimental results showed that when the number of the basic speaker was 14, our algorithm can obtain the compared performance of the traditional NMFbased method with few target speaker corpus, which greatly facilitate the application of voice conversion system. 相似文献
5.
6.
A global inversion method for multi-dimensional NMR logging 总被引:4,自引:0,他引:4
We describe a general global inversion methodology of multi-dimensional NMR logging for pore fluid typing and quantification in petroleum exploration. Although higher dimensions are theoretically possible, for practical reasons, we limit our discussion of proton density distributions as a function of two (2D) or three (3D) independent variables. The 2D can be diffusion coefficient and T(2) relaxation time (D-T(2)), and the 3D can be diffusion coefficient, T(2), and T(1) relaxation times (D-T(2)-T(1)) of the saturating fluids in rocks. Using the contrast between the diffusion coefficients of fluids (oil and water), the oil and water phases within the rocks can be clearly identified. This 2D or 3D proton density distribution function can be obtained from either two-window or regular type multiple CPMG echo trains encoded with diffusion, T(1), and T(2) relaxation by varying echo spacing and wait time. From this 2D/3D proton density distribution function, not only the saturations of water and oil can be determined, the viscosity of the oil and the gas-oil ratio can also be estimated based on a previously experimentally determined D-T(2) relationship. 相似文献
7.
8.
9.
10.
11.
12.
G. Capdeville 《Journal of computational physics》2011,230(8):2915-2951
This article presents a numerical model that enables to solve on unstructured triangular meshes and with a high-order of accuracy, a multi-dimensional Riemann problem that appears when solving hyperbolic problems.For this purpose, we use a MUSCL-like procedure in a “cell-vertex” finite-volume framework. In the first part of this procedure, we devise a four-state bi-dimensional HLL solver (HLL-2D). This solver is based upon the Riemann problem generated at the centre of gravity of a triangular cell, from surrounding cell-averages. A new three-wave model makes it possible to solve this problem, approximately. A first-order version of the bi-dimensional Riemann solver is then generated for discretizing the full compressible Euler equations.In the second part of the MUSCL procedure, we develop a polynomial reconstruction that uses all the surrounding numerical data of a given point, to give at best third-order accuracy. The resulting over determined system is solved by using a least-square methodology. To enforce monotonicity conditions into the polynomial interpolation, we develop a simplified central WENO (CWENO) procedure.Numerical tests and comparisons with competing numerical methods enable to identify the salient features of the whole model. 相似文献
13.
14.
针对含噪语音难以实现有效的语音转换,本文提出了一种采用联合字典优化的噪声鲁棒性语音转换算法。在联合字典的构成中,语音字典采用后向剔除算法(Backward Elimination algorithm,BE)进行优化,同时引入噪声字典,使得含噪语音与联合字典相匹配。实验结果表明,在保证转换效果的前提下,后向剔除算法能够减少字典帧数,降低计算量。在低信噪比和多种噪声环境下,本文算法与传统NMF算法和基于谱减法消噪的NMF转换算法相比具有更好的转换效果,噪声字典的引入提升了语音转换系统的噪声鲁棒性。 相似文献
15.
16.
17.
An enhanced Multi-dimensional Limiting Process (e-MLP) is developed for the accurate and efficient computation of multi-dimensional flows based on the Multi-dimensional Limiting Process (MLP). The new limiting process includes a distinguishing step and an enhanced multi-dimensional limiting process. First, the distinguishing step, which is independent of high order interpolation and flux evaluation, is newly introduced. It performs a multi-dimensional search of a discontinuity. The entire computational domain is then divided into continuous, linear discontinuous and nonlinear discontinuous regions. Second, limiting functions are appropriately switched according to the type of each region; in a continuous region, there are no limiting processes and only higher order accurate interpolation is performed. In linear discontinuous and nonlinear discontinuous regions, TVD criterion and MLP limiter are respectively used to remove oscillation. Hence, e-MLP has a number of advantages, as it incorporates useful features of MLP limiter such as multi-dimensional monotonicity and straightforward extensionality to higher order interpolation. It is applicable to local extrema and prevents excessive damping in a linear discontinuous region through application of appropriate limiting criteria. It is efficient because a limiting function is applied only to a discontinuous region. In addition, it is robust against shock instability due to the strict distinction of the computational domain and the use of regional information in a flux scheme as well as a high order interpolation scheme. This new limiting process was applied to numerous test cases including one-dimensional shock/sine wave interaction problem, oblique stationary contact discontinuity, isentropic vortex flow, high speed flow in a blunt body, planar shock/density bubble interaction, shock wave/vortex interaction and, particularly, magnetohydrodynamic (MHD) cloud-shock interaction problems. Through these tests, it was verified that e-MLP substantially enhances the accuracy and efficiency with both continuous and discontinuous multi-dimensional flows. 相似文献
18.
The performance of linear prediction analysis of speech deteriorates rapidly under noisy environments.To tackle this issue,an improved noise-robust sparse linear prediction algorithm is proposed.First,the linear prediction residual of speech is modeled as Student-t distribution,and the additive noise is incorporated explicitly to increase the robustness,thus a probabilistic model for sparse linear prediction of speech is built.Furthermore,variational Bayesian inference is utilized to approximate the intractable posterior distributions of the model parameters,and then the optimal linear prediction parameters are estimated robustly.The experimental results demonstrate the advantage of the developed algorithm in terms of several different metrics compared with the traditional algorithm and the l1 norm minimization based sparse linear prediction algorithm proposed in recent years.Finally it draws to a conclusion that the proposed algorithm is more robust to noise and is able to increase the speech quality in applications. 相似文献
19.
《声学学报:英文版》2014,(5)
语音线性预测分析算法在噪声环境下性能会急剧恶化,针对这一问题,提出一种改进的噪声鲁棒稀疏线性预测算法。首先采用学生t分布对具有稀疏性的语音线性预测残差建模,并显式考虑加性噪声的影响以提高模型鲁棒性,从而构建完整的概率模型。然后采用变分贝叶斯方法推导模型参数的近似后验分布,最终实现噪声鲁棒的稀疏线性预测参数估计。实验结果表明,与传统算法以及近几年提出的基于l_1范数优化的稀疏线性预测算法相比,该算法在多项指标上具有优势,对环境噪声具有更好的鲁棒性,并且谱失真度更小,因而能够有效提高噪声环境下的语音质量。 相似文献