首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 375 毫秒
1.
基于发音特征的汉语普通话语音声学建模   总被引:3,自引:0,他引:3  
将表征汉语普通话语音特点的发音特征引入汉语普通话语音识别的声学建模中,根据普通话发音特点,确定了用于区别普通话元音、辅音以及声调信息的9种发音特征,并以此为目标值训练神经网络得到语音信号属于各类发音特征的后验概率,将此概率作为语音识别的输入特征建立声学模型。在汉语普通话非特定人大词表自然口语对话识别系统中进行了实验验证,并与基于频谱特征的声学模型进行了比较,在相同解码速度下,由此方法建立的声学模型汉字错误率相对下降6.8%;将发音特征和频谱特征进行了融合实验,融合以后的识别系统相对基于频谱特征系统的汉字错误率相对下降10.1%。上述结果表明,基于发音特征的声学模型更加有效的实现了对语音特性的表征,通过利用发音特征和频谱特征的互补性,能够进一步实现对语音识别性能的提高。  相似文献   

2.
摘要为了提高计算机辅助语言学习中自动发音错误检测系统的性能,提出一种声学模型的区分性训练方法。该方法将经过正确度标注的非母语语音数据库上的发音错误检测的F1值的最大化作为模型参数的训练准则。采用Sigmoid 函数对F1值函数进行平滑构造目标函数,并利用构造弱意义辅助函数的方法以及扩展Baum-Welch 形式的参数更新公式进行优化。提出在模型参数更新与音素门限同时优化的策略保证目标函数增长的单调性。发音错误检测实验表明该方法能够有效地增大训练和测试数据检错的F1值。同时训练数据和测试数据上的精确度、召回率以及检测正确度都有明显改进。   相似文献   

3.
姜柏涛  阎守国  张碧星 《应用声学》2023,42(6):1170-1176
声波在奥氏体不锈钢焊缝中传播时声束弯曲,为超声成像带来了困难。基于Ogilvy焊缝模型,建立了奥氏体不锈钢焊缝非均匀各向异性声场仿真模型,采用Dijkstra路径搜索算法对各向异性条件下的声传播路径和声传播时间进行了数值模拟和分析。在此基础上,采用Verasonics超声相控阵成像系统,进行了奥氏体不锈钢焊缝的全聚焦成像实验,采集全矩阵回波数据,并结合理论模型计算的声传播路径和相应的传播时间,进行了成像结果修正。结果表明,与均匀介质模型的全聚焦成像结果相比,基于该文模型的焊缝全聚焦成像检测结果具有更高的缺陷定位精度和分辨率,验证了该方法的可行性,为奥氏体不锈钢焊缝成像检测提供了新的思路。  相似文献   

4.
柱面镜组的计算机辅助调校   总被引:1,自引:0,他引:1  
郭培基  余景池  张峰  王权陡  刘国淦 《光学技术》2000,26(4):323-324,328
计算机辅助装调技术是通过计算机对理论和实时检测结果的分析计算来有效地指导装调技术。本文通过计算机对由柱面反射镜组成的红外反射扩束系统的各种调整自由度的详细分析 ,以及在检测时对结果的实时分析 ,可确定合理的检测方案 ,有效的减少了调整时间 ,得到了理想的结果  相似文献   

5.
丁鹏  徐波 《声学学报》2004,29(1):23-28
分别采用基于数据聚类和基于先验知识的两种研究方法,深入探讨了性别、口音、语速、信道等非语境因素对语音数据分类与建模的影响。为了综合考虑语境、非语境因素在统一框架下建模的问题,采用非语境因素扩展决策树方法。而对于这种方法生成的多套非语境因素相关的高精度声学模型,提出一种依据最大似然准则,动态组合生成测试人相关声学模型的算法。这种方法可以使系统相对误识率平均降低8%~10%。实验结果说明为非语境因素分类建模可以提高声学模型的建模能力,而且模型组合算法可以有效解决统一建模所带来的模型选择问题。  相似文献   

6.
秦健勇  尚雪莲 《应用声学》2015,23(5):1482-1484, 1488
针对工业过程控制系统中的故障具有类型多样、时空独立和非线性等特点,使得检测与诊断效率降低,系统性能下降等问题,提出了一种基于自定义多条件约束的多传感器故障检测与诊断机制。该机制,首先考虑了系统的稳态和时空特征建立了非线性过程控制系统多故障模型,并给出了满足条件判定法则;然后对于系统中的单故障,并发故障和通信故障等类型给出了多条件约束法则及独立特性判断;最后提出了通过自定义多条件约束的多传感器故障检测与诊断机制。实验结果表明,在平均检测概率、稳态特征保持能力和系统功耗等方面明显优于无条件约束的机制,可以显著改善过程控制系统性能。  相似文献   

7.
作为乳腺癌计算机辅助诊断系统的重要环节,肿块分割的结果严重影响到肿块良恶性的判别.针对现有方法的不足,本文提出了一种基于简化型脉冲耦合神经网络和改进型矢量无边缘活动轮廓模型的乳腺X射线肿块分割方法.首先,通过数学分析计算SPCNN的相关参数与终止条件,进而利用SPCNN模型分割出肿块的初始轮廓.然后,针对传统CV模型的不足,进行相应的修正得到改进型矢量CV模型.最后,结合SPCNN分割出的初始轮廓,利用改进型的矢量CV模型处理ROI分割出肿块.采用北京大学人民医院乳腺中心提供的临床图像以及DDSM数据库的图像进行对比实验,实验结果表明,本文方法相比较现有方法分割结果更为准确,尤其是在处理东方女性致密性案例时,本文方法更有优势.  相似文献   

8.
物理实验中应用计算机辅助教学的分析与思考   总被引:1,自引:0,他引:1  
王军  周开学 《大学物理实验》2002,15(1):70-73,77
本文简要分析了物理实验教学的特点,依据实践经验探讨了计算机辅助教学在物理产验中具体的应用方向,研究了应用计算机辅助教学可能出现的问题。  相似文献   

9.
种子纯度反映种子品种在特征特性方面典型一致的程度,提高种子纯度检测的准确性和可靠性对保证种子的质量具有重要的意义。高光谱图像技术可以同时反映种子的内部特征和外部特征,在农产品无损检测中已经得到广泛应用。利用近红外高光谱图像实现农产品无损检测的实质就是建立光谱信息与农产品品质参数之间的数学模型关系。但光谱信息易受环境、时间的影响,当待测样本的产地或者年份发生改变时光谱信息也随之改变,导致建立的模型的稳定性变差、泛化能力减弱。针对这一问题,采用主动学习算法选择具有代表性的待测样本,最终以添加最少最优的样本数来扩大原模型的样本空间,从而实现模型的快速更新,提高模型的稳定性,同时与基于随机选择算法(RS)和Kennard-Stone算法(KS)的模型更新效果进行比较。实验结果表明:在不同样本集划分比例下(1∶1, 3∶1, 4∶1),利用主动学习添加40个新样本更新后的2010年的玉米种子纯度检测模型对2011年新样本的预测精度由47%,33.75%,49%提高到98.89%,98.33%,98.33%;利用主动学习添加56个新样本更新后的2011年的玉米种子纯度检测模型对2010年新样本的预测精度由50.83%,54.58%,53.75%提高到94.57%,94.02%,94.57%;同时基于主动学习算法的模型更新效果明显优于RS和KS。因此基于主动学习算法实现玉米种子纯度检测模型的更新是可行的。  相似文献   

10.
近红外光谱分析技术依赖于表征光谱向量和预测目标之间关系的化学计量学方法。然而,样品的光谱由信号和各种噪声组成,传统化学计量学方法较难直接提取光谱的有效特征,并为复杂的预测任务建立具有较强泛用性的校正模型。进一步地,受限于仪器间的差异,在一台仪器上建立的模型应用于另一台仪器时,难以取得相同的定量分析结果。为此,提出了一种基于卷积神经网络和迁移学习的定量分析建模及模型传递方案,以提高模型在单仪器和跨仪器上的预测性能。在卷积神经网络的基础上,一种结合多尺度特征融合和残差结构,名为MSRCNN的先进模型被设计,并在主仪器上展现了卓越的预测能力。然后,设计了四种的基于fine-tune模型迁移策略,将在主仪器上建立的MSRCNN模型迁移到从仪器。在药品和小麦的公开数据集上的实验结果表明,MSRCNN在主仪器上的RMSE和R2分别为2.587,0.981和0.309,0.977,优于PLS,SVM和CNN。在利用30个从仪器的样本微调主仪器建立的模型后,迁移MSRCNN中的卷积层和全连接层的方案取得了最好效果,其RMSE和R2可分别达到2.289,0.982和0.379,0.965。增加参与模型微调的从仪器样本,可进一步提高性能。  相似文献   

11.
蒿晓阳  张鹏远 《声学学报》2022,47(3):405-416
常见的多说话人语音合成有参数自适应及添加说话人标签两种方法。参数自适应方法获得的模型仅支持合成经过自适应的说话人的语音,模型不够鲁棒。传统的添加说话人标签的方法需要有监督地获得语音的说话人信息,并没有从语音信号本身无监督地学习说话人标签。为解决这些问题,提出了一种基于变分自编码器的自回归多说话人语音合成方法。方法首先利用变分自编码器无监督地学习说话人的信息并将其隐式编码为说话人标签,之后与文本的语言学特征送入到一个自回归声学参数预测网络中。此外,为了抑制多说话人语音数据引起的基频预测过拟合问题,声学参数网络采用了基频多任务学习的方法。预实验表明,自回归结构的加入降低了频谱误差1.018 dB,基频多任务学习降低了基频均方根误差6.861 Hz。在后续的多说话人对比实验中,提出的方法在3个多说话人实验的平均主观意见分(MOS)打分上分别达到3.71,3.55,3.15,拼音错误率分别为6.71%,7.54%,9.87%,提升了多说话人语音合成的音质。  相似文献   

12.
    
The spleen is one of the most frequently injured organs in blunt abdominal trauma. Computed tomography (CT) is the imaging modality of choice to assess patients with blunt spleen trauma, which may include lacerations, subcapsular or parenchymal hematomas, active hemorrhage, and vascular injuries. While computer-assisted diagnosis systems exist for other conditions assessed using CT scans, the current method to detect spleen injuries involves the manual review of scans by radiologists, which is a time-consuming and repetitive process. In this study, we propose an automated spleen injury detection method using machine learning. CT scans from patients experiencing traumatic injuries were collected from Michigan Medicine and the Crash Injury Research Engineering Network (CIREN) dataset. Ninety-nine scans of healthy and lacerated spleens were split into disjoint training and test sets, with random forest (RF), naive Bayes, SVM, k-nearest neighbors (k-NN) ensemble, and subspace discriminant ensemble models trained via 5-fold cross validation. Of these models, random forest performed the best, achieving an Area Under the receiver operating characteristic Curve (AUC) of 0.91 and an F1 score of 0.80 on the test set. These results suggest that an automated, quantitative assessment of traumatic spleen injury has the potential to enable faster triage and improve patient outcomes.  相似文献   

13.
    
The characteristic size and surface quality of O-ring seals (hereinafter referred to as “O-ring”) used in aerospace and guided weapon systems are important factors affecting the reliability of the main engine, and must be 100% fully checked. Due to the flexible characteristics of the O-ring material and the omnidirectional curved surface feature of the outer surface, the current manual measurement and detection methods have three major drawbacks: low efficiency, unstable results and high manpower consumption, which can no longer meet the requirements of the rapid development of aerospace and defense industries. With the advent of convolutional neural networks, target detection algorithms based on deep learning are widely used in the field of target detection because of their simple structure and good versatility. The micro-aerospace the O-ring studied in this paper has an inner diameter size range of Φ1.8 mm-Φ20 mm. Through the analysis of the surface topography of the defects, it is found that most of the defects have the characteristics of tiny targets and the pixels of the marked defects are less than 0.33% of the total pixels of the image, which is a typical tiny target detection. Compared with other computer vision tasks, tiny target detection has problems like fewer available features, higher positioning accuracy requirements, lower proportions of tiny targets in datasets, sample imbalance and tiny target aggregation. Because of its omnidirectional curved surface features, the O-ring presents severe bright areas and dark areas in images from any angle. The random defects are intertwined with these non-uniform areas which causes great difficulties in the detection and classification of surface defects. Especially for micro-O-ring, tiny defects impose higher demands on algorithm sensitivity and classification ability. Although the target detection algorithm based on deep learning has good detection capability but its detection efficiency is relatively low, and there is room for improvement in detection accuracy.To address the above problems, two deep-learning-based algorithms are proposed for detecting surface defects on the O-ring. By adding multi-head attention mechanisms to the inverse residual blocks of MobileNetv2, we constructed a lightweight backbone network called Efficient Model. By using the Next Hybrid strategy, we fused multiple attention mechanism modules from the industrial-grade Transformer network to build a Next Generation Vision Transformer backbone network. In each of these two backbone networks, feature extraction networks were added to design the Efficient-FPN Model and Transformer-FPN Model detection algorithms. The experimental results show that the mAP of the Efficient-FPN Model and Transformer-FPN Model detection algorithms is higher than that of YOLOv5s, YOLOv5x and YOLOv5z, among which the mAP of the Transformer-FPN model is the highest, reaching 91.4%. The Efficient-FPN Model has the fastest detection speed of the five models, reaching 110.8 frame/s. The mAP of the Efficient-FPN Model reached 86.1%, which was also higher than other YOLOv5 algorithms, and it was the detection model with the best comprehensive performance. The above algorithm is deployed in the self-developed intelligent measurement and inspection equipment of aerospace seal ring, and the purpose of detecting omnidirectional curved flexible parts is realized quickly and accurately.  相似文献   

14.
    
Shadow is one of the fundamental indicators of remote sensing image which could cause loss or interference of the target data. As a result, the detection and removal of shadow has already been the hotspot of current study because of the complicated background information. In the following passage, a model combining the Atmospheric Transport Model (hereinafter abbreviated as ATM) with the Poisson Equation, AP ShadowNet, is proposed for the shadow detection and removal of remote sensing images by unsupervised learning. This network based on a preprocessing network based on ATM, A Net, and a network based on the Poisson Equation, P Net. Firstly, corresponding mapping between shadow and unshaded area is generated by the ATM. The brightened image will then enter the Confrontation identification in the P Net. Lastly, the reconstructed image is optimized on color consistency and edge transition by Poisson Equation. At present, most shadow removal models based on neural networks are significantly data-driven. Fortunately, by the model in this passage, the unsupervised shadow detection and removal could be released from the data source restrictions from the remote sensing images themselves. By verifying the shadow removal on our model, the result shows a satisfying effect from a both qualitative and quantitative angle. From a qualitative point of view, our results have a prominent effect on tone consistency and removal of detailed shadows. From the quantitative point of view, we adopt the non-reference evaluation indicators: gradient structure similarity (NRSS) and Natural Image Quality Evaluator (NIQE). Combining various evaluation factors such as reasoning speed and memory occupation, it shows that it is outstanding among other current algorithms.  相似文献   

15.
针对当前行人检测方法计算量大、检测精度低的问题,基于YOLOv4-tiny提出一种改进的行人检测算法.引入通道注意力和空间注意力模块(CBAM)至CSPDarknet53-tiny网络中,通过学习图像的位置信息和通道信息得到更加丰富的特征;在骨干网络CSPDarknet53-tiny之后引入空间金字塔池化模块,能够极大...  相似文献   

16.
    
Future communication networks must address the scarce spectrum to accommodate extensive growth of heterogeneous wireless devices. Efforts are underway to address spectrum coexistence, enhance spectrum awareness, and bolster authentication schemes. Wireless signal recognition is becoming increasingly more significant for spectrum monitoring, spectrum management, secure communications, among others. Consequently, comprehensive spectrum awareness on the edge has the potential to serve as a key enabler for the emerging beyond 5G (fifth generation) networks. State-of-the-art studies in this domain have (i) only focused on a single task – modulation or signal (protocol) classification – which in many cases is insufficient information for a system to act on, (ii) consider either radar or communication waveforms (homogeneous waveform category), and (iii) does not address edge deployment during neural network design phase. In this work, for the first time in the wireless communication domain, we exploit the potential of deep neural networks based multi-task learning (MTL) framework to simultaneously learn modulation and signal classification tasks while considering heterogeneous wireless signals such as radar and communication waveforms in the electromagnetic spectrum. The proposed MTL architecture benefits from the mutual relation between the two tasks in improving the classification accuracy as well as the learning efficiency with a lightweight neural network model. We additionally include experimental evaluations of the model with over-the-air collected samples and demonstrate first-hand insight on model compression along with deep learning pipeline for deployment on resource-constrained edge devices. We demonstrate significant computational, memory, and accuracy improvement of the proposed model over two reference architectures. In addition to modeling a lightweight MTL model suitable for resource-constrained embedded radio platforms, we provide a comprehensive heterogeneous wireless signals dataset for public use.  相似文献   

17.
为了快速准确地自动提取和识别海面舰船疑似目标,为舰船目标精检测提供可信的数据基础,采用了二值化特征进行舰船目标粗检测,并根据舰船窄而长的几何特征提出了改进算法,按照舰船目标不同的方向分别进行模板训练。实验表明,二值化特征可以有效地提取疑似舰船目标,并且改进算法可以在提取窗口数相同时,提高查全率,更利于进一步的精检测。  相似文献   

18.
    
Insider threats are malicious acts that can be carried out by an authorized employee within an organization. Insider threats represent a major cybersecurity challenge for private and public organizations, as an insider attack can cause extensive damage to organization assets much more than external attacks. Most existing approaches in the field of insider threat focused on detecting general insider attack scenarios. However, insider attacks can be carried out in different ways, and the most dangerous one is a data leakage attack that can be executed by a malicious insider before his/her leaving an organization. This paper proposes a machine learning-based model for detecting such serious insider threat incidents. The proposed model addresses the possible bias of detection results that can occur due to an inappropriate encoding process by employing the feature scaling and one-hot encoding techniques. Furthermore, the imbalance issue of the utilized dataset is also addressed utilizing the synthetic minority oversampling technique (SMOTE). Well known machine learning algorithms are employed to detect the most accurate classifier that can detect data leakage events executed by malicious insiders during the sensitive period before they leave an organization. We provide a proof of concept for our model by applying it on CMU-CERT Insider Threat Dataset and comparing its performance with the ground truth. The experimental results show that our model detects insider data leakage events with an AUC-ROC value of 0.99, outperforming the existing approaches that are validated on the same dataset. The proposed model provides effective methods to address possible bias and class imbalance issues for the aim of devising an effective insider data leakage detection system.  相似文献   

19.
    
With the quick development of sensor technology in recent years, online detection of early fault without system halt has received much attention in the field of bearing prognostics and health management. While lacking representative samples of the online data, one can try to adapt the previously-learned detection rule to the online detection task instead of training a new rule merely using online data. As one may come across a change of the data distribution between offline and online working conditions, it is challenging to utilize the data from different working conditions to improve detection accuracy and robustness. To solve this problem, a new online detection method of bearing early fault is proposed in this paper based on deep transfer learning. The proposed method contains an offline stage and an online stage. In the offline stage, a new state assessment method is proposed to determine the period of the normal state and the degradation state for whole-life degradation sequences. Moreover, a new deep dual temporal domain adaptation (DTDA) model is proposed. By adopting a dual adaptation strategy on the time convolutional network and domain adversarial neural network, the DTDA model can effectively extract domain-invariant temporal feature representation. In the online stage, each sequentially-arrived data batch is directly fed into the trained DTDA model to recognize whether an early fault occurs. Furthermore, a health indicator of target bearing is also built based on the DTDA features to intuitively evaluate the detection results. Experiments are conducted on the IEEE Prognostics and Health Management (PHM) Challenge 2012 bearing dataset. The results show that, compared with nine state-of-the-art fault detection and diagnosis methods, the proposed method can get an earlier detection location and lower false alarm rate.  相似文献   

20.
传统变压器绝缘油品质荧光分析检测方法是利用荧光分光光度计采集油样本全谱段荧光光谱,根据不同老化程度绝缘油的全谱荧光特征变化建立变压器运行状态诊断模型.针对传统荧光方法中光度计体积大、价格昂贵以及因光谱采集时间长无法实现实时监测等问题,提出一种基于荧光双色比例的变压器绝缘油品质检测方法,提取荧光特征双波段信息并建立变压器...  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号