首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
王晓波  尹俊平  徐岩 《计算物理》2022,39(4):386-394
针对现实信号调制方式标注易发生错误, 即训练数据集中信号调制方式标签存在噪声情形, 我们选取l1模损失函数及其推广形式作为对标签噪声具有鲁棒性的损失函数, 结合深度卷积神经网络优良的自动特征提取能力, 提出一种针对信号调制方式存在误判噪声的深度学习算法。该算法在训练数据集合标签噪声率达50%情形下, 对信号调制方式的识别准确率依然保持较高水平。相反, 对于采用通常的交叉熵作为损失函数的深度卷积神经网络, 其已无法对信号调制方式进行分类识别。在公开的数据集上的数值实验表明, 所提算法对于标签有噪信号调制方式识别具有较强的鲁棒性。  相似文献   

2.
Contrast enhancement forensics techniques have always been of great interest for the image forensics community, as they can be an effective tool for recovering image history and identifying tampered images. Although several contrast enhancement forensic algorithms have been proposed, their accuracy and robustness against some kinds of processing are still unsatisfactory. In order to attenuate such deficiency, in this paper, we propose a new framework based on dual-domain fusion convolutional neural network to fuse the features of pixel and histogram domains for contrast enhancement forensics. Specifically, we first present a pixel-domain convolutional neural network to automatically capture the patterns of contrast-enhanced images in the pixel domain. Then, we present a histogram-domain convolutional neural network to extract the features in the histogram domain. The feature representations of pixel and histogram domains are fused and fed into two fully connected layers for the classification of contrast-enhanced images. Experimental results show that the proposed method achieves better performance and is robust against pre-JPEG compression and antiforensics attacks, obtaining over 99% detection accuracy for JPEG-compressed images with different QFs and antiforensics attack. In addition, a strategy for performance improvements of CNN-based forensics is explored, which could provide guidance for the design of CNN-based forensics tools.  相似文献   

3.
Convolutional neural networks utilize a hierarchy of neural network layers. The statistical aspects of information concentration in successive layers can bring an insight into the feature abstraction process. We analyze the saliency maps of these layers from the perspective of semiotics, also known as the study of signs and sign-using behavior. In computational semiotics, this aggregation operation (known as superization) is accompanied by a decrease of spatial entropy: signs are aggregated into supersign. Using spatial entropy, we compute the information content of the saliency maps and study the superization processes which take place between successive layers of the network. In our experiments, we visualize the superization process and show how the obtained knowledge can be used to explain the neural decision model. In addition, we attempt to optimize the architecture of the neural model employing a semiotic greedy technique. To the extent of our knowledge, this is the first application of computational semiotics in the analysis and interpretation of deep neural networks.  相似文献   

4.
In this paper, we propose to quantitatively compare loss functions based on parameterized Tsallis–Havrda–Charvat entropy and classical Shannon entropy for the training of a deep network in the case of small datasets which are usually encountered in medical applications. Shannon cross-entropy is widely used as a loss function for most neural networks applied to the segmentation, classification and detection of images. Shannon entropy is a particular case of Tsallis–Havrda–Charvat entropy. In this work, we compare these two entropies through a medical application for predicting recurrence in patients with head–neck and lung cancers after treatment. Based on both CT images and patient information, a multitask deep neural network is proposed to perform a recurrence prediction task using cross-entropy as a loss function and an image reconstruction task. Tsallis–Havrda–Charvat cross-entropy is a parameterized cross-entropy with the parameter α . Shannon entropy is a particular case of Tsallis–Havrda–Charvat entropy for α=1 . The influence of this parameter on the final prediction results is studied. In this paper, the experiments are conducted on two datasets including in total 580 patients, of whom 434 suffered from head–neck cancers and 146 from lung cancers. The results show that Tsallis–Havrda–Charvat entropy can achieve better performance in terms of prediction accuracy with some values of α .  相似文献   

5.
In this paper, we propose a robust wood species identification scheme by using a feature-level fusion scheme. First, a novel wood feature acquirement system is devised, which can get the curve of 1D wood spectral reflectance ratio and the 2D wood surface color image. Second, the 4 wood color features, the 4 principal texture features, the 4 secondary texture features and the 4 spectral features are established, respectively. Third, a fuzzy BP neural network is proposed to perform the classification work, which consists of 4 sub-networks based on the color feature, texture feature and spectral feature. We have experimentally proved that this scheme improves the mean recognition accuracy to approximately 90% for 5 wood species. Moreover, our feature-level fusion scheme is superior to the recognition schemes which use color feature and texture feature.  相似文献   

6.
针对复杂彩色图像中文本的特征,提出了基于小波变换和BP神经网络甄别文本区域的算法.该算法首先利用文本块的边缘特征遴选出备选图像块,而后采用小波变换提取备选图像块的纹理特征,把这些纹理特征参量连同图像块的颜色特征和笔画特征参量输入训练好的BP神经网络,判断备选图像块是否包含文本.该方法运算简单,定位时间短.采用专用的文本定位比赛用图进行实验的结果表明,定位准确率可达到92%,召回率为87.4%.  相似文献   

7.
李念永  梁艳梅  张舒  杨立  常胜江 《光子学报》2014,38(10):2712-2716
针对复杂彩色图像中文本的特征,提出了基于小波变换和BP神经网络甄别文本区域的算法.该算法首先利用文本块的边缘特征遴选出备选图像块,而后采用小波变换提取备选图像块的纹理特征,把这些纹理特征参量连同图像块的颜色特征和笔画特征参量输入训练好的BP神经网络,判断备选图像块是否包含文本.该方法运算简单,定位时间短.采用专用的文本定位比赛用图进行实验的结果表明,定位准确率可达到92%,召回率为87.4%.  相似文献   

8.
高分辨电镜图像中原子峰位置的检测具有十分重要的现实意义,通过精确定量化原子峰位置可以分析物质在微观尺度上的结构形变、电极化矢量分布等重要信息.近年来深度学习技术在图像目标检测领域取得了巨大突破,这一技术可用在高分辨电镜图像处理上,因为原子位置的检测可以看作是一个目标检测问题.本文利用先进的机器学习方法,通过制作高质量原子图像样本集,使用YOLOv3目标识别框架对原子图像进行自动检测,达到预期效果,实现了深度学习技术在高分辨电镜图像处理领域的应用.该方法的运用有望突破自动处理动态、大量电镜图片的瓶颈问题.  相似文献   

9.
基于多光谱图像的烟雾检测   总被引:2,自引:0,他引:2  
烟雾检测对于火灾早期防范非常重要,传统的智能视频和图像处理技术易受背景运动信息影响,抗干扰性差,且不容易区分森林水雾和燃烧产生的烟雾,森林防火误报率高。为此提出一种新的多光谱图像检测方法检测烟雾。采用多光谱成像系统,获取400至720 nm波段范围的烟雾、水雾光谱图像序列,对图像进行分层像素整合处理;利用欧氏距离度量不同分块光谱特征差异,获取动态区域光谱特征向量,根据目标与背景间光谱特征向量差异,提取烟雾、水雾区域。室内外试验结果表明:多光谱图像检测方法可用于烟雾检测,能够有效地检测并区分烟雾和水雾,与视频图像方法结合,可有效地用于森林火灾监测,降低森林火灾检测误报率。  相似文献   

10.
一种基于图像特征和神经网络的苹果图像分割算法   总被引:7,自引:1,他引:7  
张亚静  李民赞  乔军  刘刚 《光学学报》2008,28(11):2104-2108
苹果识别是开发苹果采摘机器人的关键环节,利用图像处理技术和神经网络分类器探索苹果图像分割算法.从苹果树图片中选取苹果图像样本和背景网像样本.分别计算这两类图像样本的颜色特征和纹理特征.颜色特征的计算基于RGB色彩模型,纹理特征的计算基于灰度共生矩阵.选取适当的颜色特征(R/B值)和纹理特征(对比度值和相关性值)作为输入节点,利用反向传播神经网络分类器建模,输出值是一个O~1之间的计算值.通过阈值将输出结果分类为苹果或背景.试验结果表明,该算法正确率大于87.6%,对光照的影响不敏感,是一利较为实用的苹果分割算法.  相似文献   

11.
目标果实的精准识别是实现果园测产和机器自动采摘的基本保障.然而受复杂的非结构化果园环境、绿色苹果与枝叶背景颜色接近等因素的影响,制约着可见光谱范围下目标果实的检测精度,给机器视觉识别带来极大挑战.针对复杂果园环境下的不同光照环境和果实姿态,提出一种优化的一阶全卷积(FCOS)神经网络绿色苹果识别模型.首先,新模型在FC...  相似文献   

12.
Diabetic macular edema (DME) is the most common cause of irreversible vision loss in diabetes patients. Early diagnosis of DME is necessary for effective treatment of the disease. Visual detection of DME in retinal screening images by ophthalmologists is a time-consuming process. Recently, many computer-aided diagnosis systems have been developed to assist doctors by detecting DME automatically. In this paper, a new deep feature transfer-based stacked autoencoder neural network system is proposed for the automatic diagnosis of DME in fundus images. The proposed system integrates the power of pretrained convolutional neural networks as automatic feature extractors with the power of stacked autoencoders in feature selection and classification. Moreover, the system enables extracting a large set of features from a small input dataset using four standard pretrained deep networks: ResNet-50, SqueezeNet, Inception-v3, and GoogLeNet. The most informative features are then selected by a stacked autoencoder neural network. The stacked network is trained in a semi-supervised manner and is used for the classification of DME. It is found that the introduced system achieves a maximum classification accuracy of 96.8%, sensitivity of 97.5%, and specificity of 95.5%. The proposed system shows a superior performance over the original pretrained network classifiers and state-of-the-art findings.  相似文献   

13.
Multiple magnetic resonance images of different contrasts are normally acquired for clinical diagnosis. Recently, research has shown that the previously acquired multi-contrast (MC) images of the same patient can be used as anatomical prior to accelerating magnetic resonance imaging (MRI). However, current MC-MRI networks are based on the assumption that the images are perfectly registered, which is rarely the case in real-world applications. In this paper, we propose an end-to-end deep neural network to reconstruct highly accelerated images by exploiting the shareable information from potentially misaligned reference images of an arbitrary contrast. Specifically, a spatial transformation (ST) module is designed and integrated into the reconstruction network to align the pre-acquired reference images with the images to be reconstructed. The misalignment is further alleviated by maximizing the normalized cross-correlation (NCC) between the MC images. The visualization of feature maps demonstrates that the proposed method effectively reduces the misalignment between the images for shareable information extraction when applied to the publicly available brain datasets. Additionally, the experimental results on these datasets show the proposed network allows the robust exploitation of shareable information across the misaligned MC images, leading to improved reconstruction results.  相似文献   

14.
Markov Chain Monte Carlo (MCMC) methods sample from unnormalized probability distributions and offer guarantees of exact sampling. However, in the continuous case, unfavorable geometry of the target distribution can greatly limit the efficiency of MCMC methods. Augmenting samplers with neural networks can potentially improve their efficiency. Previous neural network-based samplers were trained with objectives that either did not explicitly encourage exploration, or contained a term that encouraged exploration but only for well structured distributions. Here we propose to maximize proposal entropy for adapting the proposal to distributions of any shape. To optimize proposal entropy directly, we devised a neural network MCMC sampler that has a flexible and tractable proposal distribution. Specifically, our network architecture utilizes the gradient of the target distribution for generating proposals. Our model achieved significantly higher efficiency than previous neural network MCMC techniques in a variety of sampling tasks, sometimes by more than an order magnitude. Further, the sampler was demonstrated through the training of a convergent energy-based model of natural images. The adaptive sampler achieved unbiased sampling with significantly higher proposal entropy than a Langevin dynamics sample. The trained sampler also achieved better sample quality.  相似文献   

15.
Significant progress has been made in generating counterfeit images and videos. Forged videos generated by deepfaking have been widely spread and have caused severe societal impacts, which stir up public concern about automatic deepfake detection technology. Recently, many deepfake detection methods based on forged features have been proposed. Among the popular forged features, textural features are widely used. However, most of the current texture-based detection methods extract textures directly from RGB images, ignoring the mature spectral analysis methods. Therefore, this research proposes a deepfake detection network fusing RGB features and textural information extracted by neural networks and signal processing methods, namely, MFF-Net. Specifically, it consists of four key components: (1) a feature extraction module to further extract textural and frequency information using the Gabor convolution and residual attention blocks; (2) a texture enhancement module to zoom into the subtle textural features in shallow layers; (3) an attention module to force the classifier to focus on the forged part; (4) two instances of feature fusion to firstly fuse textural features from the shallow RGB branch and feature extraction module and then to fuse the textural features and semantic information. Moreover, we further introduce a new diversity loss to force the feature extraction module to learn features of different scales and directions. The experimental results show that MFF-Net has excellent generalization and has achieved state-of-the-art performance on various deepfake datasets.  相似文献   

16.
红外与可见光图像融合一直是图像领域研究的热点,融合技术能弥补单一传感器的不足,为图像理解与分析提供良好的成像基础。因生产工艺以及成本的限制,红外探测器的分辨率远低于可见光探测器,并在一定程度上因源图像分辨率的差异阻碍了实际应用。针对红外与可见光图像分辨率不一致的问题,提出了用于红外图像超分辨率重建与融合的多任务卷积网络框架,应用于多分辨率图像融合。在网络结构方面,首先设计了双通道网络分别提取红外与可见光特征,使算法不受源图像分辨率的限制;其次提出了特征上采样模块,先用双线性插值方法增加像素个数,再通过多层感知器精细化拟合像素平滑空间与高频空间的映射关系,无需重新训练模型即可实现任意尺度的红外图像上采样;接着将线性注意力引入网络,学习特征空间位置间的非线性关系,抑制无关信息并增强网络对全局信息的表达。在损失函数方面,提出了梯度损失,保留红外与可见光图像中绝对值较大的滤波器响应值,并计算该值与重建的融合图像响应值的Frobenius范数,无需理想的融合图像作为真值监督网络学习就能生成融合图像;此外,在梯度损失、像素损失的共同作用下对多任务模型进行优化,可以同时重建融合图像和高分辨率红外图像...  相似文献   

17.
The health condition of the rolling bearing seriously affects the operation of the whole mechanical system. When the rolling bearing parts fail, the time series collected in the field generally shows strong nonlinearity and non-stationarity. To obtain the faulty characteristics of mechanical equipment accurately, a rolling bearing fault detection technique based on k-optimized adaptive local iterative filtering (ALIF), improved multiscale permutation entropy (improved MPE), and BP neural network was proposed. In the ALIF algorithm, a k-optimized ALIF method based on permutation entropy (PE) is presented to select the number of ALIF decomposition layers adaptively. The completely average coarse-graining method was proposed to excavate more hidden information. The performance analysis of the simulation signal shows that the improved MPE can more accurately dig out the depth information of the time series, and the entropy value obtained is more consistent and stable. In the research application, rolling bearing time series are decomposed by k-optimized ALIF to obtain a certain number of intrinsic mode functions (IMFs). Then the improved MPE value of effective IMF is calculated and input into backpropagation (BP) neural network as the feature vector for automatic fault identification. The comparative analysis of simulation signals shows that this method can extract fault information effectively. At the same time, the experimental part shows that this scheme not only effectively extracts the fault features, but also realizes the classification and identification of different fault modes and faults of different degrees, which has a certain application prospect in the research and application direction of rolling bearing fault identification.  相似文献   

18.
Cross-modality person re-identification is the study of images of people matching under different modalities (RGB modality, IR modality). Given one RGB image of a pedestrian collected under visible light in the daytime, cross-modality person re-identification aims to determine whether the same pedestrian appears in infrared images (IR images) collected by infrared cameras at night, and vice versa. Cross-modality person re-identification can solve the task of pedestrian recognition in low light or at night. This paper aims to improve the degree of similarity for the same pedestrian in two modalities by improving the feature expression ability of the network and designing appropriate loss functions. To implement our approach, we introduce a deep neural network structure combining heterogeneous center loss (HC loss) and a non-local mechanism. On the one hand, this can heighten the performance of feature representation of the feature learning module, and, on the other hand, it can improve the similarity of cross-modality within the class. Experimental data show that the network achieves excellent performance on SYSU-MM01 datasets.  相似文献   

19.
Multi-scale analysis is a powerful tool in the field of signal processing. In this paper, we propose an efficient small target detection algorithm that is mainly based on the dual multi-scale filters which work sequentially. The algorithm consists of two stages: at the first stage, Spectrum Scale-Space (SSS) is used as the pre-process procedure to obtain the multi-scale saliency maps, which can suppress the low frequency background noise and make the target region prominently at different scale levels. As a result, the more detail information and feature information can be exhibited in the different decomposition image level. After then, the least information entropy is used as the criterion to select the optimal salient map out; At the second stage, the Gabor wavelets (GW) algorithm is utilized to suppress the high frequency noise remained in the optimal salient map and match the feature of size and direction of small target at different scales and angles, and next, to ensure the robustness of the target detection, Non-negative Matrix Factorization (NMF) is applied to fuse all the GW multi-scale images into one optimal target image, which is the final output of the presented method. Experimental results show that, compared with the contrast method, the proposed algorithm has high SCRG and high correct target detection rate, and works well in different types of complex backgrounds.  相似文献   

20.
在分形理论和脊波神经网络的基础上,综合利用彩色遥感图像的光谱、纹理和形状特征,提出了一种彩色遥感图像的分类新方法.该方法把彩色图像的蓝、绿、红波段作为3个光谱特征,由分形理论计算的DBC维和多重分形维数作为2个纹理特征,平均不变矩作为1个形状特征,并利用对曲线具有极强方向识别能力的脊波神经网络作为分类器.实验结果表明,提出的彩色遥感图像分类方法具有较高的分类准确率和较强的抗噪音能力.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号