首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 156 毫秒
1.
赵志军  谢凌云 《声学学报》2013,38(5):624-631
视听交互的重要性日益突出,但视觉刺激对听觉感知的影响尚缺乏全面深入的研究。以视觉刺激下人耳对声音的主观听感差别阈限变化为研究对象,在主观听觉实验中施加颜色、质量、亮度、运动状态四个不同属性视觉刺激,同时测量纯音信号的响度、主观音长和音高的听感差别阈限。通过与无视觉刺激下相应差别阈限的比较,分析不同视觉条件对响度感知、主观音长感知、音高感知能力的影响。实验数据显示,施加视觉刺激后主观听觉感知的差别阈限值增大,主观音长、音高和响度的差别阈限值平均分别提高了45.1%,14.8%和12.3%。进一步分析的结果表明,施加视觉刺激后基本的听觉感知能力呈下降趋势。同一视觉属性的不同水平视觉条件对听觉感知的影响程度不同,主观听感的变化呈现出一定的规律性,即视觉刺激越舒适,听感的差别阈限变化越小。   相似文献   

2.
针对光照分布不均匀的室内环境下低动态载体速度计算实时性较差的问题,提出一种基于改进奇异值分解(SVD)-Harris的低动态载体速度快速计算的新方法。利用SVD对相邻两帧视觉图像分别进行压缩与重构,并结合改进的Harris角点检测算法对两帧图像进行特征点的检测;利用归一化互相关(NCC)模板匹配算法对相邻两帧视觉图像的特征点进行粗匹配;利用随机抽样一致性算法进行误匹配点对的剔除;利用特征匹配点对的信息对载体的速度进行计算。实验结果表明:传统算法的平均计算时间为3.07s,而改进算法的平均计算时间为0.71s,且传统算法的误匹配率远大于改进算法。与传统的NCC模板匹配方法相比,所提算法不仅保证了低动态载体速度计算的精确性,而且显著提高了载体速度在光照不均匀的室内环境下的计算效率,该研究为实现室内移动机器人实时视觉导航提供了理论依据。  相似文献   

3.
基于色貌的跨媒体颜色复制   总被引:6,自引:1,他引:5  
介绍了一种基于视觉匹配的跨媒体颜色复制方法。通过视觉匹配将一个环境下的一些色貌因素"映射"到另一个环境,是一种基于色貌的CRT特性化方法。该方法复制的22个Munsell色卡的平均视觉评价为6分制的5.17分。其中,红色调的复制色块视觉评价较好,蓝色调或蓝色占有较大比例的复制色块误差较大。sRGB作为目前流行的用于颜色通讯的标准色空间,在实验中也进行了比较。实验证明这种基于视觉匹配的特性化方法,已经包含了一些色貌因素,可以满足一般的应用要求,有着广泛的应用前景。  相似文献   

4.
针对在808 nm波段的光参量放大系统(OPA),详细分析了氘化磷酸二氢钾(DKDP)晶体非共线相位匹配的非线性过程。通过数值计算,给出了不同氘化率的DKDP晶体参量匹配特性以及匹配参数;在此基础上,利用OPA耦合波方程组分析了基于氘化率为95%的DKDP晶体的光参量啁啾脉冲放大器(OPCPA)的输出特性。结果显示,对大于30 fs压缩脉宽的超短脉冲激光系统,氘化率为90%以上的DKDP晶体具有足够的增益带宽与能量提取效率,但接收角较小,增加了工程调节难度。  相似文献   

5.
在双目立体视觉系统中,立体匹配是关键步骤之一,其精度对后续的研究有着重大影响。Census算法由于具有简单明晰、运行效果好、实时性强等优点,被广泛采用。但Census立体匹配算法存在变换窗口中心点易受外界条件干扰、深度不连续区域匹配精度低等缺点,由此提出了一种新型的基于Census变换及引导滤波器的立体匹配算法。在Census变换阶段通过计算变换窗口周围的像素的平均值,降低了外界干扰的影响,同时在代价聚合阶段引入具有包边特性且计算量不依赖于滤波核大小的引导滤波器作为自适应权重。实验结果表明:所提算法在Middlebury测试平台上平均误匹配误差为6.03%,相较于目前Census立体匹配算法16.2%的平均误匹配率,匹配效果明显提高,且算法效率较高,具有较好的辐射不变性。  相似文献   

6.
针对现有手掌静脉认证系统误拒率较高以及不支持大数据集匹配的问题,设计了基于透射式光源的双目视觉静脉三维点云重建装置,提出了基于三维点云匹配的手掌静脉认证算法。系统使用850 nm透射式发光二极管(LED)光源作为照明装置,由双目摄像机拍摄静脉视差图像进行三维重建。选择手掌静脉作为特征点描述其空间三维结构,提出了一种改进的内核相关性分析方法匹配三维点云。针对200组点云数据的实验结果验证了该方法的可行性和有效性,识别率达到了98%,误拒率2%,误识率0%,总特征维数约8000至12000维,高于尺度不变特征变换(SIFT),支持对大数据集的认证识别。  相似文献   

7.
根据准相位匹配理论计算了周期极化LiTaO3(PPLT)体中0类准相位匹配过程(e+e→e)的增益曲线.在此基础上,使用数百μJ的低抽运能量获得了~106的增益和-10.3%的转换效率,实现了中心波长位于1064nm的基于简并光学啁啾脉冲参量放大(OPCPA)技术的高增益放大,为产生超短超强激光脉冲提供了新的技术手段.实验结果与理论预期基本符合.  相似文献   

8.
自主研发了一种光笔式双目立体视觉大工件尺寸测量系统,对测量系统中特征点的提取及其匹配技术进行了研究。测量系统采用 Canny 算子和 Zernike 矩相结合的算法实现椭圆光斑的亚像素边缘提取,根据得到的亚像素边缘点,采用基于最小二乘的曲线拟合法得到椭圆光斑的中心坐标。针对特征点的匹配问题,提出了一种基于位置约束的快速匹配方法。实验结果显示:所提方法能提取到椭圆光斑亚像素边缘,可精确计算出椭圆光斑中心坐标,匹配率达到95%以上。  相似文献   

9.
提高故障诊断能力对于确保水下机器人系统的稳定运行具有重要意义,故障分类是目前水下机器人故障诊断所面临的一个重要问题。针对水下机器人推进器系统数据特征,提出一种基于信息增益率的加权朴素贝叶斯故障分类算法。首先,计算故障训练样本的先验概率,将各属性的信息增益率作为权值;其次,构建基于增益率加权的朴素贝叶斯分类模型;然后,对检测的故障数据利用分类模型获取具有最大后验概率的故障模式,实现故障分类。与朴素贝叶斯算法和决策树算法相比,仿真实验结果表明基于信息增益率加权的朴素贝叶斯算法的分类成功率更高,能够有效地实现水下机器人的故障分类。  相似文献   

10.
羊奕伟  刘荣  蒋励  鹿心鑫  王玫  严小松 《物理学报》2014,63(16):162801-162801
开展了钍样品装置内钍核参数的积分中子学基础研究.参考混合堆概念设计搭建了内部放置了钍样品的一维贫铀/聚乙烯交替系统装置,采用加速器D-T中子源模拟聚变堆芯,利用前期开发的离线伽马测量方法测定了不同位置、不同中子谱情况下的232Th(n,γ)反应率,不确定度约为5%.结果显示,聚乙烯对14.1 MeV中子的慢化作用可有效提升钍俘获率,且贫铀对钍俘获率也有显著提升作用.实验结果与主流核数据库计算结果的对比显示,ENDF/B-VI.6和JENDL-3.3数据库的计算值比实验值平均约大6%,而较新的ENDF/B-VII.0数据库的计算值比实验值平均约大4%.因此,相比于之前数据库的钍核数据,ENDF/B-VII.0的计算值与实验结果匹配得较好,可作为相关概念设计的推荐核数据库.  相似文献   

11.
胡航烨  王蔚 《应用声学》2023,42(1):76-83
情感语声合成技术对于人机交互具有重要的意义。面对儿童情感语声合成所需汉语语声数据资源缺乏以及模型训练时长较长等问题,该文提出利用迁移学习实现汉语儿童情感语声合成的方法。首先基于汉语语声数据库训练深度学习模型实现中文语声端到端合成模型,再使用高质量大样本的中文情感语料库完成情感语声合成模型,最后利用自行采样的小样本汉语儿童情感语料对模型进行迁移学习实现低资源的语声合成。客观实验结果中梅尔倒谱失真指标为4.91,主观听辨实验指标分别为3.61和4.17。通过实验对比表明,该文的方法在情感语声合成技术的应用上具有良好的性能表现,并且优于现有先进的低资源情感语声合成方法。  相似文献   

12.
Neural network modeling of emotion   总被引:1,自引:0,他引:1  
This article reviews the history and development of computational neural network modeling of cognitive and behavioral processes that involve emotion. The exposition starts with models of classical conditioning dating from the early 1970s. Then it proceeds toward models of interactions between emotion and attention. Then models of emotional influences on decision making are reviewed, including some speculative (not and not yet simulated) models of the evolution of decision rules. Through the late 1980s, the neural networks developed to model emotional processes were mainly embodiments of significant functional principles motivated by psychological data. In the last two decades, network models of these processes have become much more detailed in their incorporation of known physiological properties of specific brain regions, while preserving many of the psychological principles from the earlier models.Most network models of emotional processes so far have dealt with positive and negative emotion in general, rather than specific emotions such as fear, joy, sadness, and anger. But a later section of this article reviews a few models relevant to specific emotions: one family of models of auditory fear conditioning in rats, and one model of induced pleasure enhancing creativity in humans. Then models of emotional disorders are reviewed. The article concludes with philosophical statements about the essential contributions of emotion to intelligent behavior and the importance of quantitative theories and models to the interdisciplinary enterprise of understanding the interactions of emotion, cognition, and behavior.  相似文献   

13.
朱斯语  姬培锋  杨军 《应用声学》2017,36(6):481-489
为了客观地评价民族乐器与西洋乐器在听觉感知方面的差异,本文利用15种典型的中西方乐器声样本,建立了与音色、响度和音色明亮度有关的15种乐器的感知空间模型,通过这些模型可以预测不同乐器在音高、响度一定时,音色明亮度的感知情况。此外,根据已建立的感知空间模型分别对比弹拨乐器、拉弦乐器和不同类型的吹奏乐器中三种听觉感知属性的变化差异。结果表明,对于中西方典型乐器,音色明亮度随响度的增加而增大,但是响度对音色明亮度的影响程度受到音域和响度范围的影响。民族乐器的音色明亮度随音高的增加而增大,但是西洋乐器的音色明亮度并没有随音高的增加而发生明显的变化。  相似文献   

14.
基于听觉事件检测的汉语语音声韵切分   总被引:2,自引:0,他引:2  
张宝奇  张连海  屈丹 《声学学报》2010,35(6):701-707
提出了一种基于听觉事件检测的汉语声韵母切分方法。该方法首先使用耳蜗滤波器组对语音进行滤波,然后在每个频带上检测对应于能量突变的听觉事件,最后在不同频率范围对听觉事件进行融合以确定声韵母边界。实验结果表明,对8 kHz采样的干净语音切分准确率可达到88.9%;信噪比10 dB的语音切分准确率可达到82.9%以上。   相似文献   

15.
Despite many studies investigating auditory spatial impressions in rooms, few have addressed the impact of simultaneous visual cues on localization and the perception of spaciousness. The current research presents an immersive audiovisual environment in which participants were instructed to make auditory width judgments in dynamic bi-modal settings. The results of these psychophysical tests suggest the importance of congruent audio visual presentation to the ecological interpretation of an auditory scene. Supporting data were accumulated in five rooms of ascending volumes and varying reverberation times. Participants were given an audiovisual matching test in which they were instructed to pan the auditory width of a performing ensemble to a varying set of audio and visual cues in rooms. Results show that both auditory and visual factors affect the collected responses and that the two sensory modalities coincide in distinct interactions. The greatest differences between the panned audio stimuli given a fixed visual width were found in the physical space with the largest volume and the greatest source distance. These results suggest, in this specific instance, a predominance of auditory cues in the spatial analysis of the bi-modal scene.  相似文献   

16.
Vowel and consonant confusion matrices were collected in the hearing alone (H), lipreading alone (L), and hearing plus lipreading (HL) conditions for 28 patients participating in the clinical trial of the multiple-channel cochlear implant. All patients were profound-to-totally deaf and "hearing" refers to the presentation of auditory information via the implant. The average scores were 49% for vowels and 37% for consonants in the H condition and the HL scores were significantly higher than the L scores. Information transmission and multidimensional scaling analyses showed that different speech features were conveyed at different levels in the H and L conditions. In the HL condition, the visual and auditory signals provided independent information sources for each feature. For vowels, the auditory signal was the major source of duration information, while the visual signal was the major source of first and second formant frequency information. The implant provided information about the amplitude envelope of the speech and the estimated frequency of the main spectral peak between 800 and 4000 Hz, which was useful for consonant recognition. A speech processor that coded the estimated frequency and amplitude of an additional peak between 300 and 1000 Hz was shown to increase the vowel and consonant recognition in the H condition by improving the transmission of first formant and voicing information.  相似文献   

17.
The question of beauty has inspired philosophers and scientists for centuries. Today, the study of aesthetics is an active research topic in fields as diverse as computer science, neuroscience, and psychology. Measuring the aesthetic appeal of images is beneficial for many applications. In this paper, we will study the aesthetic assessment of simple visual patterns. The proposed approach suggests that aesthetically appealing patterns are more likely to deliver a higher amount of information over multiple levels in comparison with less aesthetically appealing patterns when the same amount of energy is used. The proposed approach is evaluated using two datasets; the results show that the proposed approach is more accurate in classifying aesthetically appealing patterns compared to some related approaches that use different complexity measures.  相似文献   

18.
Flower colours have evolved over 100 million years to address the colour vision of their bee pollinators. In a much more rapid process, cultural (and horticultural) evolution has produced images of flowers that stimulate aesthetic responses in human observers. The colour vision and analysis of visual patterns differ in several respects between humans and bees. Here, a behavioural ecologist and an installation artist present bumblebees with reproductions of paintings highly appreciated in Western society, such as Van Gogh's Sunflowers. We use this unconventional approach in the hope to raise awareness for between-species differences in visual perception, and to provoke thinking about the implications of biology in human aesthetics and the relationship between object representation and its biological connotations.  相似文献   

19.
语音是一种短时平稳时频信号,因此大多数的研究者都通过分帧来提取情感特征。然而,分帧后提取的特征为局部特征,无法准确反应情感语音动态特性,故单纯采用局部特征往往无法构建鲁棒的情感识别系统。针对这个问题,先在不分帧的语音信号里通过多尺度最优小波包分解提取语句级全局特征,分帧后再提取384维的语句级局部特征,并利用Fisher准则进行降维,最后提出一种弱尺度融合策略来将这两种语句级特征进行融合,再利用SVM进行情感分类。基于柏林情感库的实验结果表明本文方法较单纯使用语句级局部特征最后识别率提高了4.2%到13.8%,特别在小样本的情况下,语音情感识别率波动较小。   相似文献   

20.
This paper addresses the JND(Just Noticeable Difference)change of auditory perception with synchronous visual stimuli.Through psychoacoustics experimentS,loudness JND,subjective duration JND and pitch JND of pure tone were measured in auditory-only mode and visual_auditory mode with different visual stimuli which have different attributes such as color,illumination,quality and moving state.Statistical analyses of the experimental data indicare that,comparing with JND in auditory-only mode,the amount of JND with visual stimuli is often larger.The JND'S average increment of subjective duration,pitch and loudness are 45.1%,14.8%and 12.3%,respectively.The conclusion is that the ability of JNDbased auditory perception often decreases with visual stimuli.The incremental amount of JND is afiected bv the attributes of visual stimuli.If the visual stimuli make subjects feel more comfortable,the JND of auditory perception will change smaller.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号