Similar literature
1.
罗香怡  刘海凤  贲帅  刘学深 《物理学报》2016,65(12):123201-123201
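The generation of high-order harmonics from the hydrogen molecular ion in a bowtie-shaped nanostructure element is studied by numerically solving the one-dimensional time-dependent Schrödinger equation beyond the Born-Oppenheimer approximation. The results show that the spatial position of the inhomogeneous field produced inside the bowtie-shaped nanostructure element strongly affects high-order harmonic emission. When the inhomogeneous field is shifted from 30 a.u. to -30 a.u., the harmonic cutoff is extended and a smooth supercontinuum harmonic spectrum forms. The physical mechanism of the emission is explained using time-frequency analysis, the classical three-step model, and ionization probabilities. The relation between the carrier-envelope phase and the dependence of the harmonic spectrum on the position of the inhomogeneous field is also studied: as the carrier-envelope phase varies, the harmonic spectra for different positions of the inhomogeneous field follow the same trend.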

2.
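Voice activity detection in a moving-car environment   Cited by: 1 (self-citations: 0, citations by others: 1)
Voice activity detection (VAD) is an important part of speech interaction and communication systems. Its role is to distinguish speech segments from background-noise segments in the input signal, relying mainly on the time-frequency characteristics of speech and noise; among these, the periodicity and harmonic structure of voiced speech are widely used features. In a moving car, however, the noise is non-stationary and the signal-to-noise ratio (SNR) is low, so such features are hard to detect reliably. To address this, the paper exploits the basic regularities of voiced-speech harmonic structure and the fact that the SNR differs across frequency bands in time-varying noise, and proposes a relatively robust fast harmonic detection algorithm. The algorithm takes small time-frequency blocks as its analysis unit and uses a set of harmonic templates whose fundamental frequencies vary on a logarithmic scale to search adaptively for regions with clear harmonic structure, which are then used to detect voiced speech. Experiments show that the algorithm discriminates speech from non-speech fairly reliably in a moving-car environment.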

3.
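To address the sharp performance degradation of earlier speech enhancement algorithms in non-stationary noise, a new single-channel speech enhancement algorithm based on time-frequency dictionary learning is proposed. First, time-frequency dictionary learning is used to model prior information about the spectral structure of the noise, and this model is incorporated into the convolutive non-negative matrix factorization framework. Then, with the noise time-frequency dictionary held fixed, multiplicative iterative update rules are derived for the time-varying gains and the speech time-frequency dictionary. Finally, these rules are used to update the time-varying gain coefficients of speech and noise and the speech time-frequency dictionary; the speech magnitude spectrum is reconstructed by convolving the speech time-frequency dictionary with its time-varying gains, and a binary time-frequency mask removes the noise interference. Experimental results show that the proposed algorithm achieves better results on several speech quality measures. In non-stationary noise and at low SNR, it removes noise more effectively than multi-band spectral subtraction and non-negative sparse coding denoising, and the enhanced speech has better quality.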

4.
赵立恒  汪增福 《声学学报》2012,37(2):218-224
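A monaural voiced-speech separation method based on harmonic and energy features is proposed. The method recasts voiced-speech separation as a classification problem over sound in the time-frequency domain. First, an energy feature is introduced alongside the existing harmonic features. Then, for time-frequency units with pronounced harmonic features and high energy, their features are duplicated during classifier training. Experimental results show that the method yields a higher SNR gain than previous methods. Introducing the energy feature and feature duplication improves voiced-speech separation.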

5.
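Discussion and improvement of the magneto-optical modulation frequency-doubling method   Cited by: 1 (self-citations: 0, citations by others: 1)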
底楠  徐晓鹏 《物理实验》2007,27(5):10-12,16
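The principle of magneto-optical modulation frequency doubling is proved theoretically, with emphasis on how the modulation amplitude affects the experimental waveform; it is shown that, to obtain the ideal result, the modulation amplitude θ₀ should be set to π/4. Based on the working principle of the method, modified versions of the magneto-optical modulator and of the measurement optical path are given; the improvement exploits the non-reciprocity of the Faraday effect to double the rotation angle.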

6.
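A mobile remote-sensing technique for natural gas pipeline leakage based on laser absorption spectroscopy is reported, and the key technical issues of mobile remote sensing are analyzed through simulated gas-leak experiments. To measure trace pipeline leaks quantitatively at a distance, a signal equal in magnitude and opposite in phase to the residual amplitude modulation (RAM) is introduced to compensate for the offset, reducing the influence of RAM on the harmonic signal and improving detection sensitivity. An improved soft-threshold wavelet denoising method is proposed for the absorption-spectrum features of the remote-sensing return signal; in terms of system SNR it performs more than two times better than conventional soft-threshold denoising, while preserving the shape of the second-harmonic (2f) signal well. Detection-limit calculations give a mobile remote-sensing sensitivity of 80 ppm/m.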
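
For orientation, the conventional soft-threshold rule that the abstract uses as its baseline can be sketched as below. This is only the classical shrinkage operator applied to a list of wavelet detail coefficients; the abstract does not specify the improved variant, so nothing here should be read as the authors' method. The function name `soft_threshold` and its arguments are hypothetical.

```python
def soft_threshold(coeffs, threshold):
    """Classical soft thresholding (illustrative baseline, not the paper's variant).

    Coefficients with magnitude at or below `threshold` are set to zero;
    larger ones are shrunk toward zero by `threshold`, keeping their sign.
    """
    shrunk = []
    for c in coeffs:
        magnitude = abs(c) - threshold
        if magnitude <= 0:
            shrunk.append(0.0)
        else:
            shrunk.append(magnitude if c > 0 else -magnitude)
    return shrunk
```

In a full denoiser this rule would be applied to the detail coefficients of a wavelet decomposition before reconstruction; the choice of wavelet and threshold is left open here.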

7.
张天骐  李伟  林孝康  刘林 《应用声学》2005,24(3):157-163
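A new algorithm for a voice-operated switch (VOX, Voice-Operated Transmit) based on digital spectral analysis is proposed. The algorithm is simple and practical; to some extent it overcomes the complex structure and hard-to-tune parameters of conventional VOX algorithms, is fairly robust to noise, and is easy to implement with digital signal processing. First, the average amplitude envelope of the speech is extracted by a second processing pass over the signal power spectrum; the envelope is then thresholded and amplified with limiting, which yields the VOX function. Theoretical analysis and computer simulations show that the algorithm extracts the average amplitude envelope of the speech waveform fairly accurately and works at low SNR.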
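
The envelope-then-threshold structure described above can be illustrated with a rough sketch. It makes a substitution that should be stated plainly: the abstract derives the envelope from a second pass over the power spectrum, whereas this sketch uses a simple moving average of the absolute sample values. The name `vox_gate` and the parameters `window` and `threshold` are hypothetical.

```python
def vox_gate(samples, window, threshold):
    """Return a 0/1 gate per sample from a moving-average amplitude envelope.

    Assumption: the envelope is a running mean of |x| over the last `window`
    samples, standing in for the spectral envelope estimate in the abstract.
    """
    gate = []
    running = 0.0
    for i, s in enumerate(samples):
        running += abs(s)
        if i >= window:
            # Drop the sample that just left the averaging window.
            running -= abs(samples[i - window])
        envelope = running / min(i + 1, window)
        # Hard limiting: the gate is fully open or fully closed.
        gate.append(1 if envelope > threshold else 0)
    return gate
```

Because the envelope lags the signal by up to `window` samples, the gate stays open briefly after speech ends, which is usually the desired hang-over behavior for a voice switch.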

8.
解伟超  张玲华 《声学学报》2014,39(1):130-136
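A voice conversion method is proposed that is based on self-organizing clustering and uses an improved particle swarm optimization algorithm to determine the conversion-model parameters. The feature parameters are first clustered with a self-organizing feature map network; a separate conversion rule is then built for each cluster, and particle swarm optimization with Cauchy mutation determines the model parameters of each rule. Compared with a conventional single conversion rule, the multiple rules built after clustering, with parameters determined by the improved particle swarm algorithm, improve the accuracy of the mapping and keep the parameters from being trapped in local optima. For female-to-male conversion, subjective tests show that the similarity between the converted and target speech increases by 27.6% and the mean opinion score (MOS) by 0.6; objective tests also show that the method gives the lowest spectral distortion and an envelope closer to the target.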

9.
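The rotating-vector (REV) algorithm based on Fourier analysis is introduced and applied to correcting the initial amplitude and phase errors of the elements of a phased-array antenna; a simulation analysis and an engineering implementation are given. The calibration method differs from earlier REV calibration methods and helps reduce the influence of the calibration antenna on the signals transmitted and received by the phased array. Simulations show that, after calibration, the phase and amplitude of each channel differ from the deliberately set values only by a constant, indicating that the method is effective. In measurements, the calibrated antenna pattern approaches the near-field calibration result and the EIRP is more than 1.4 dB higher than before calibration, which verifies the function and performance of the method.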

10.
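Effects of relative amplitude and phase on phased-array systems and their elimination   Cited by: 1 (self-citations: 1, citations by others: 0)
An analytical model based on the diffusion approximation is used to analyze how an amplitude-cancellation phase-modulation system localizes small heterogeneities in tissue. The effects of relative amplitude, relative phase, and the size and absorption coefficient of the heterogeneity on the localization accuracy of the system are discussed in detail. A localization method that is independent of relative amplitude and phase is proposed, and its principle and feasibility are explained theoretically.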

11.
The application of the ideal binary mask to an auditory mixture has been shown to yield substantial improvements in intelligibility. This mask is commonly applied to the time-frequency (T-F) representation of a mixture signal and eliminates portions of a signal below a signal-to-noise-ratio (SNR) threshold while allowing others to pass through intact. The factors influencing intelligibility of ideal binary-masked speech are not well understood and are examined in the present study. Specifically, the effects of the local SNR threshold, input SNR level, masker type, and errors introduced in estimating the ideal mask are examined. Consistent with previous studies, intelligibility of binary-masked stimuli is quite high even at -10 dB SNR for all maskers tested. Performance was affected the most when the masker-dominated T-F units were wrongly labeled as target-dominated T-F units. Performance plateaued near 100% correct for SNR thresholds ranging from -20 to 5 dB. The existence of the plateau region suggests that it is the pattern of the ideal binary mask that matters the most rather than the local SNR of each T-F unit. This pattern directs the listener's attention to where the target is and enables them to segregate speech effectively in multitalker environments.
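
The masking rule described in this abstract can be sketched in a few lines. The sketch assumes the target and masker powers are already available per T-F unit as nested lists (rows are frequency channels, columns are time frames); the function names `ideal_binary_mask` and `apply_mask` and the parameter `threshold_db` are hypothetical, and the T-F analysis itself is not shown.

```python
import math


def ideal_binary_mask(target_power, masker_power, threshold_db=0.0):
    """Return a 0/1 mask: 1 where the local SNR (in dB) exceeds threshold_db."""
    mask = []
    for target_row, masker_row in zip(target_power, masker_power):
        row = []
        for t, m in zip(target_row, masker_row):
            if t <= 0.0:
                keep = False   # no target energy in this unit
            elif m <= 0.0:
                keep = True    # target only: local SNR is unbounded
            else:
                keep = 10.0 * math.log10(t / m) > threshold_db
            row.append(1 if keep else 0)
        mask.append(row)
    return mask


def apply_mask(mixture, mask):
    """Zero the mixture's T-F units where the mask is 0; pass the rest intact."""
    return [[x * k for x, k in zip(x_row, k_row)]
            for x_row, k_row in zip(mixture, mask)]
```

Lowering `threshold_db` retains more units, which is the parameter whose range (-20 to 5 dB) the abstract reports as having little effect on intelligibility.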

12.
Gockel, Carlyon, and Plack [J. Acoust. Soc. Am. 116, 1092-1104 (2004)] showed that discrimination of the fundamental frequency (F0) of a target tone containing only unresolved harmonics was impaired when an interfering complex tone with fixed F0 was added to the target, but filtered into a lower frequency region. This pitch discrimination interference (PDI) was greater when the interferer contained resolved harmonics than when it contained only unresolved harmonics. Here, it is examined whether this occurred because, when the interferer contained unresolved harmonics, "pitch pulse asynchrony (PPA)" between the target and interferer provided a cue that enhanced performance; this was possible in the earlier experiment because both target and interferer had components added in sine phase. In experiment 1, it was shown that subjects were moderately sensitive to the direction of PPA across frequency regions. In experiments 2 and 3, PPA cues were eliminated by adding the components of the target only, or of both target and interferer, in random phase. For both experiments, an interferer containing resolved harmonics produced more PDI than an interferer containing unresolved harmonics. These results show that PDI is smaller for an interferer with unresolved harmonics even when cues related to PPA are eliminated.

13.
The resolution of XPS spectra is limited mainly by instrumental parameters such as the spectral line width of the exciting X-ray source and the finite energy resolution of the electron analyzer. A new algorithm for this ill-posed inverse problem is proposed, in which low-resolution experimental XPS data are enhanced by removing the instrumental functions. The regularization method, with a special fast iteration algorithm, is applied to determine the true line profile of a complex chemical compound when the excitation and instrumental functions are known. This paper is structured as follows: (I) a mathematical algorithm is described and numerical simulation results are presented. (II) These algorithms have been applied to the poorly resolved C 1s spectrum of a poly-methyl methacrylate (PMMA) film studied by the conventional technique with monochromatic and polychromatic radiation. The resulting enhancement allows previously unresolved structure to be observed without any assumptions concerning the number, position, or shape of the peaks or their ratio.

14.
When all of the components in a harmonic complex tone are shifted in frequency by delta f, the pitch of the complex shifts roughly in proportion to delta f. For tones with a small number of components, the shift is usually somewhat larger than predicted from pitch theories, which has been attributed to the influence of combination tones [Smoorenburg, J. Acoust. Soc. Am. 48, 924-941 (1970)]. Experiment 1 assessed whether combination tones influence the pitch of complex tones with more than five harmonics, by using noise to mask the combination tones. The matching stimulus was a harmonic complex. Test complexes were bandpass filtered with passbands centered on harmonic numbers 5 (resolved), 11 (intermediate), or 16 (unresolved) and fundamental frequencies (F0s) were 100, 200, or 400 Hz. For the intermediate and unresolved conditions, the matching stimuli were filtered with the same passband to minimize differences in the excitation patterns of the test and matching stimuli. For the resolved condition, the matching stimulus had a passband centered above that of the test stimulus, to avoid common partials. For resolved and intermediate conditions, pitch shifts were observed that could generally be predicted from the frequencies of the partials. The shifts were unaffected by addition of noise to mask combination tones. For the unresolved condition, no pitch shift was observed, which suggests that pitch is not based on temporal fine structure for stimuli containing only high unresolved harmonics. Experiment 2 used three-component complexes resembling those of Schouten [J. Acoust. Soc. Am. 34, 1418-1424 (1962)]. Nominal harmonic numbers were 3, 4, 5 (resolved), 8, 9, 10 (intermediate), or 13, 14, 15 (unresolved) and F0s were 50, 100, 200, or 400 Hz. Clear shifts in the matches were found for all conditions, including unresolved. For the latter, subjects may have matched the "center of gravity" of the excitation patterns of the test and matching stimuli.

15.
Carlyon and Shackleton [J. Acoust. Soc. Am. 95, 3541-3554 (1994)] suggested that fundamental-frequency (F0) discrimination performance between resolved and unresolved harmonics is limited by an internal "translation" noise between the outputs of two distinct F0 encoding mechanisms, in addition to the encoding noise associated with each mechanism. To test this hypothesis further, F0 difference limens (DLF0s) were measured in six normal-hearing listeners using sequentially presented groups of harmonics. The two groups of harmonics presented on each trial were bandpass filtered into the same or different spectral regions, in such a way that both groups contained mainly resolved harmonics, both groups contained only unresolved harmonics, or one group contained mainly resolved and the other only unresolved harmonics. Three spectral regions (low: 600-1150 Hz, mid: 1400-2500 Hz, or high: 3000-5250 Hz) and two nominal F0s (100 and 200 Hz) were used. The DLF0s measured in across-region conditions were well accounted for by a model assuming only two sources of internal noise: the encoding noise estimated on the basis of the within-region results plus a constant noise associated with F0 comparisons across different spectral regions, independent of resolvability. No evidence for an across-pitch-mechanism translation noise was found. A reexamination of previous evidence for the existence of such noise suggests that the present negative outcome is unlikely to be explained by insufficient measurement sensitivity or an unusually large across-region comparison noise in the present study. While the results do not rule out the possibility of two separate pitch mechanisms, they indicate that the F0s of sequentially presented resolved and unresolved harmonics can be compared internally at no or negligible extra cost.

16.
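A hierarchical grouping algorithm for large-scale image collections based on the bag-of-words model   Cited by: 1 (self-citations: 0, citations by others: 1)
Automatic grouping of large image collections helps users organize and grasp the content of a collection quickly, and is a prerequisite and key step for image-based 3D scene reconstruction. A hierarchical grouping algorithm based on the bag-of-words (BOW) model is proposed. Each image is represented as a very high-dimensional visual-word vector, and multi-way quantization assigns images with similar content to the same node, giving a coarse grouping. Within each group, the local feature vectors of the images are then matched one by one, and an affine-invariant constraint is used to discard unreliable matches, giving a more accurate and reliable image similarity measure and thus a fine grouping. The precision-recall curves of the grouping at different stages of the system show that fine grouping markedly improves on the coarse grouping accuracy, and that within the fine-grouping stage applying the constraint gives higher accuracy than omitting it.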
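
The visual-word representation behind the coarse stage can be illustrated with a minimal sketch. It assumes each local feature has already been quantized to an integer visual-word ID, and it compares images with cosine similarity, which is a common choice but not one the abstract names; the multi-way quantization and the affine-constrained matching are not reproduced. The names `bow_vector` and `cosine_similarity` are hypothetical.

```python
import math
from collections import Counter


def bow_vector(word_ids):
    """Sparse bag-of-words vector: visual-word ID -> occurrence count."""
    return Counter(word_ids)


def cosine_similarity(a, b):
    """Cosine similarity of two sparse count vectors (0.0 if either is empty)."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)
```

Storing the vector sparsely matters here because, as the abstract notes, the vocabulary is very high-dimensional while each image contains only a small subset of the words.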

17.
In a natural environment, speech signals are degraded by both reverberation and concurrent noise sources. While human listening is robust under these conditions using only two ears, current two-microphone algorithms perform poorly. The psychological process of figure-ground segregation suggests that the target signal is perceived as a foreground while the remaining stimuli are perceived as a background. Accordingly, the goal is to estimate an ideal time-frequency (T-F) binary mask, which selects the target if it is stronger than the interference in a local T-F unit. In this paper, a binaural segregation system that extracts the reverberant target signal from multisource reverberant mixtures by utilizing only the location information of the target source is proposed. The proposed system combines target cancellation through adaptive filtering and a binary decision rule to estimate the ideal T-F binary mask. The main observation in this work is that the target attenuation in a T-F unit resulting from adaptive filtering is correlated with the relative strength of target to mixture. A comprehensive evaluation shows that the proposed system results in large SNR gains. In addition, comparisons using SNR as well as automatic speech recognition measures show that this system outperforms standard two-microphone beamforming approaches and a recent binaural processor.

18.
Experiment 1 measured frequency modulation detection thresholds (FMTs) for harmonic complex tones as a function of modulation rate. Six complexes were used, with fundamental frequencies (F0s) of either 88 or 250 Hz, bandpass filtered into a LOW (125-625 Hz), MID (1375-1875 Hz) or HIGH (3900-5400 Hz) frequency region. The FMTs were about an order of magnitude greater for the three complexes whose harmonics were unresolved by the peripheral auditory system (F0 = 88 Hz in the MID region and both F0s in the HIGH region) than for the other three complexes, which contained some resolved harmonics. Thresholds increased with increases in FM rate above 2 Hz for all conditions. The increase was larger when the F0 was 88 Hz than when it was 250 Hz, and was also larger in the LOW than in the MID and HIGH regions. Experiment 2 measured thresholds for detecting mistuning produced by modulating the F0s of two simultaneously presented complexes out of phase by 180 degrees. The size of the resulting mistuning oscillates at a rate equal to the rate of FM applied to the two carriers. At low FM rates, thresholds were lowest when the harmonics were either resolved for both complexes or unresolved for both complexes, and highest when resolvability differed across complexes. For pairs of complexes with resolved harmonics, mistuning thresholds increased dramatically as the FM rate was increased above 2-5 Hz, in a way which could not be accounted for by the effect of modulation rate on the FMTs for the individual complexes. A third experiment, in which listeners detected constant ("static") mistuning between pairs of frequency-modulated complexes, provided evidence that this deterioration was due to the harmonics in one of the two "resolved" complexes becoming unresolved at high FM rates, when analyzed over some finite time window. It is concluded that the detection of time-varying mistuning between groups of harmonics is limited by factors that are not apparent in FM detection data.

19.
Optimal utilization of computational resources mandates spatio-temporal variation in resolution for computing complex engineering flows. Closure modeling in regions bridging between different resolutions is rendered difficult due to changing interactions between resolved and unresolved fields. We develop a closure model for the bridging region based on energy conservation principles. Then we proceed to provide a proof of concept in decaying isotropic turbulence with temporally varying resolution. The simplicity of the flow permits a thorough examination of various aspects of the proposed closure not feasible in more complex flows. The results demonstrate the potential promise of the approach, but more validation studies need to be performed. While the present development is in the context of the partially averaged Navier–Stokes (PANS) method, the closure principle should apply to other variable-resolution (VR) approaches.

20.
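Computer-aided assignment of molecular vibration-rotation spectra   Cited by: 1 (self-citations: 0, citations by others: 1)
Interactive software for computer-aided rovibrational assignment of molecular spectra is described. Based on the second-difference principle, it automatically groups spectral lines and picks out the individual branches; it then plots the selected groups as bar charts, with line frequency and intensity as axes and a different color or line style for each group, clearly revealing the rovibrational structure hidden in a complex molecular spectrum. It can also produce Loomis-Wood diagrams to assist the assignment. Applied to a complex spectrum containing several molecular species produced by a discharge in phosphorus trichloride and helium, the software gave satisfactory results and effectively assisted the rovibrational analysis. It is suitable for spectra of symmetric-top, slightly asymmetric-top, and linear molecules.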
