期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

王晓波尹俊平徐岩《计算物理》2022,39(4):386-394

针对现实信号调制方式标注易发生错误, 即训练数据集中信号调制方式标签存在噪声情形, 我们选取l₁模损失函数及其推广形式作为对标签噪声具有鲁棒性的损失函数, 结合深度卷积神经网络优良的自动特征提取能力, 提出一种针对信号调制方式存在误判噪声的深度学习算法。该算法在训练数据集合标签噪声率达50%情形下, 对信号调制方式的识别准确率依然保持较高水平。相反, 对于采用通常的交叉熵作为损失函数的深度卷积神经网络, 其已无法对信号调制方式进行分类识别。在公开的数据集上的数值实验表明, 所提算法对于标签有噪信号调制方式识别具有较强的鲁棒性。相似文献

2.

Dual-Domain Fusion Convolutional Neural Network for Contrast Enhancement Forensics

Pengpeng Yang 《Entropy (Basel, Switzerland)》2021,23(10)

Contrast enhancement forensics techniques have always been of great interest for the image forensics community, as they can be an effective tool for recovering image history and identifying tampered images. Although several contrast enhancement forensic algorithms have been proposed, their accuracy and robustness against some kinds of processing are still unsatisfactory. In order to attenuate such deficiency, in this paper, we propose a new framework based on dual-domain fusion convolutional neural network to fuse the features of pixel and histogram domains for contrast enhancement forensics. Specifically, we first present a pixel-domain convolutional neural network to automatically capture the patterns of contrast-enhanced images in the pixel domain. Then, we present a histogram-domain convolutional neural network to extract the features in the histogram domain. The feature representations of pixel and histogram domains are fused and fed into two fully connected layers for the classification of contrast-enhanced images. Experimental results show that the proposed method achieves better performance and is robust against pre-JPEG compression and antiforensics attacks, obtaining over 99% detection accuracy for JPEG-compressed images with different QFs and antiforensics attack. In addition, a strategy for performance improvements of CNN-based forensics is explored, which could provide guidance for the design of CNN-based forensics tools. 相似文献

3.

Semiotic Aggregation in Deep Learning

Bogdan Muat R&#x;zvan Andonie 《Entropy (Basel, Switzerland)》2020,22(12)

Convolutional neural networks utilize a hierarchy of neural network layers. The statistical aspects of information concentration in successive layers can bring an insight into the feature abstraction process. We analyze the saliency maps of these layers from the perspective of semiotics, also known as the study of signs and sign-using behavior. In computational semiotics, this aggregation operation (known as superization) is accompanied by a decrease of spatial entropy: signs are aggregated into supersign. Using spatial entropy, we compute the information content of the saliency maps and study the superization processes which take place between successive layers of the network. In our experiments, we visualize the superization process and show how the obtained knowledge can be used to explain the neural decision model. In addition, we attempt to optimize the architecture of the neural model employing a semiotic greedy technique. To the extent of our knowledge, this is the first application of computational semiotics in the analysis and interpretation of deep neural networks. 相似文献

4.

A Quantitative Comparison between Shannon and Tsallis–Havrda–Charvat Entropies Applied to Cancer Outcome Prediction

Thibaud Brochet Jrme Lapuyade-Lahorgue Alexandre Huat Sbastien Thureau David Pasquier Isabelle Gardin Romain Modzelewski David Gibon Juliette Thariat Vincent Grgoire Pierre Vera Su Ruan 《Entropy (Basel, Switzerland)》2022,24(4)

In this paper, we propose to quantitatively compare loss functions based on parameterized Tsallis–Havrda–Charvat entropy and classical Shannon entropy for the training of a deep network in the case of small datasets which are usually encountered in medical applications. Shannon cross-entropy is widely used as a loss function for most neural networks applied to the segmentation, classification and detection of images. Shannon entropy is a particular case of Tsallis–Havrda–Charvat entropy. In this work, we compare these two entropies through a medical application for predicting recurrence in patients with head–neck and lung cancers after treatment. Based on both CT images and patient information, a multitask deep neural network is proposed to perform a recurrence prediction task using cross-entropy as a loss function and an image reconstruction task. Tsallis–Havrda–Charvat cross-entropy is a parameterized cross-entropy with the parameter

α

. Shannon entropy is a particular case of Tsallis–Havrda–Charvat entropy for

α = 1

. The influence of this parameter on the final prediction results is studied. In this paper, the experiments are conducted on two datasets including in total 580 patients, of whom 434 suffered from head–neck cancers and 146 from lung cancers. The results show that Tsallis–Havrda–Charvat entropy can achieve better performance in terms of prediction accuracy with some values of

α

. 相似文献

5.

Wood species identification using feature-level fusion scheme

Peng Zhao Gang Dou Guang-Sheng Chen 《Optik》2014

In this paper, we propose a robust wood species identification scheme by using a feature-level fusion scheme. First, a novel wood feature acquirement system is devised, which can get the curve of 1D wood spectral reflectance ratio and the 2D wood surface color image. Second, the 4 wood color features, the 4 principal texture features, the 4 secondary texture features and the 4 spectral features are established, respectively. Third, a fuzzy BP neural network is proposed to perform the classification work, which consists of 4 sub-networks based on the color feature, texture feature and spectral feature. We have experimentally proved that this scheme improves the mean recognition accuracy to approximately 90% for 5 wood species. Moreover, our feature-level fusion scheme is superior to the recognition schemes which use color feature and texture feature. 相似文献

6.

基于BP神经网络的复杂彩色图像文本定位

李念永梁艳梅张舒杨立常胜江《光子学报》2009,38(10)

针对复杂彩色图像中文本的特征,提出了基于小波变换和BP神经网络甄别文本区域的算法.该算法首先利用文本块的边缘特征遴选出备选图像块,而后采用小波变换提取备选图像块的纹理特征,把这些纹理特征参量连同图像块的颜色特征和笔画特征参量输入训练好的BP神经网络,判断备选图像块是否包含文本.该方法运算简单,定位时间短.采用专用的文本定位比赛用图进行实验的结果表明,定位准确率可达到92%,召回率为87.4%. 相似文献

7.

基于BP神经网络的复杂彩色图像文本定位*

李念永梁艳梅张舒杨立常胜江《光子学报》2014,38(10):2712-2716

针对复杂彩色图像中文本的特征,提出了基于小波变换和BP神经网络甄别文本区域的算法.该算法首先利用文本块的边缘特征遴选出备选图像块,而后采用小波变换提取备选图像块的纹理特征,把这些纹理特征参量连同图像块的颜色特征和笔画特征参量输入训练好的BP神经网络,判断备选图像块是否包含文本.该方法运算简单,定位时间短.采用专用的文本定位比赛用图进行实验的结果表明,定位准确率可达到92％,召回率为87.4%. 相似文献

8.

基于YOLOv3框架的高分辨电镜图像原子峰位置检测

下载免费PDF全文

南虎麻晓晶赵海博汤少杰刘卫华王大威贾春林《物理学报》2021,(7):285-292

高分辨电镜图像中原子峰位置的检测具有十分重要的现实意义,通过精确定量化原子峰位置可以分析物质在微观尺度上的结构形变、电极化矢量分布等重要信息.近年来深度学习技术在图像目标检测领域取得了巨大突破,这一技术可用在高分辨电镜图像处理上,因为原子位置的检测可以看作是一个目标检测问题.本文利用先进的机器学习方法,通过制作高质量原子图像样本集,使用YOLOv3目标识别框架对原子图像进行自动检测,达到预期效果,实现了深度学习技术在高分辨电镜图像处理领域的应用.该方法的运用有望突破自动处理动态、大量电镜图片的瓶颈问题. 相似文献

9.

An Enhanced Steganography Network for Concealing and Protecting Secret Image Data

Feng Chen Qinghua Xing Bing Sun Xuehu Yan Jingwen Cheng 《Entropy (Basel, Switzerland)》2022,24(9)

The development of Internet technology has provided great convenience for data transmission and sharing, but it also brings serious security problems that are related to data protection. As is detailed in this paper, an enhanced steganography network was designed to protect secret image data that contains private or confidential information; this network consists of a concealing network and a revealing network in order to achieve image embedding and recovery separately. To reduce the system’s computation complexity, we constructed the network’s framework using a down–up structure in order to compress the intermediate feature maps. In order to mitigate the input’s information loss caused by a sequence of convolution blocks, the long skip concatenation method was designed to pass the raw information to the top layer, thus synthesizing high-quality hidden images with fine texture details. In addition, we propose a novel strategy called non-activated feature fusion (NAFF), which is designed to provide stronger supervision for synthetizing higher-quality hidden images and recovered images. In order to further boost the hidden image’s visual quality and enhance its imperceptibility, an attention mechanism-based enhanced module was designed to reconstruct and enhance the salient target, thus covering up and obscuring the embedded secret content. Furthermore, a hybrid loss function that is composed of pixel domain loss and structure domain loss was designed to boost the hidden image’s structural quality and visual security. Our experimental results demonstrate that, due to the elaborate design of the network structure and loss function, our proposed method achieves high levels of imperceptibility and security. 相似文献

10.

基于多光谱图像的烟雾检测 总被引：2，自引：0，他引：2

卫颖卓张绍武刘彦伟《光谱学与光谱分析》2010,30(4):1061-1064

烟雾检测对于火灾早期防范非常重要,传统的智能视频和图像处理技术易受背景运动信息影响,抗干扰性差,且不容易区分森林水雾和燃烧产生的烟雾,森林防火误报率高。为此提出一种新的多光谱图像检测方法检测烟雾。采用多光谱成像系统,获取400至720 nm波段范围的烟雾、水雾光谱图像序列,对图像进行分层像素整合处理;利用欧氏距离度量不同分块光谱特征差异,获取动态区域光谱特征向量,根据目标与背景间光谱特征向量差异,提取烟雾、水雾区域。室内外试验结果表明：多光谱图像检测方法可用于烟雾检测,能够有效地检测并区分烟雾和水雾,与视频图像方法结合,可有效地用于森林火灾监测,降低森林火灾检测误报率。相似文献

11.

一种基于图像特征和神经网络的苹果图像分割算法 总被引：7，自引：1，他引：7

张亚静李民赞乔军刘刚《光学学报》2008,28(11):2104-2108

苹果识别是开发苹果采摘机器人的关键环节,利用图像处理技术和神经网络分类器探索苹果图像分割算法.从苹果树图片中选取苹果图像样本和背景网像样本.分别计算这两类图像样本的颜色特征和纹理特征.颜色特征的计算基于RGB色彩模型,纹理特征的计算基于灰度共生矩阵.选取适当的颜色特征(R/B值)和纹理特征(对比度值和相关性值)作为输入节点,利用反向传播神经网络分类器建模,输出值是一个O～1之间的计算值.通过阈值将输出结果分类为苹果或背景.试验结果表明,该算法正确率大于87.6%,对光照的影响不敏感,是一利较为实用的苹果分割算法. 相似文献

12.

优化FCOS网络复杂果园环境下绿色苹果检测模型

张中华贾伟宽邵文静侯素娟 Ji Ze 郑元杰《光谱学与光谱分析》2022,42(2):647-653

目标果实的精准识别是实现果园测产和机器自动采摘的基本保障.然而受复杂的非结构化果园环境、绿色苹果与枝叶背景颜色接近等因素的影响,制约着可见光谱范围下目标果实的检测精度,给机器视觉识别带来极大挑战.针对复杂果园环境下的不同光照环境和果实姿态,提出一种优化的一阶全卷积(FCOS)神经网络绿色苹果识别模型.首先,新模型在FC... 相似文献

13.

DFTSA-Net: Deep Feature Transfer-Based Stacked Autoencoder Network for DME Diagnosis

Ghada Atteia Nagwan Abdel Samee Hassan Zohair Hassan 《Entropy (Basel, Switzerland)》2021,23(10)

Diabetic macular edema (DME) is the most common cause of irreversible vision loss in diabetes patients. Early diagnosis of DME is necessary for effective treatment of the disease. Visual detection of DME in retinal screening images by ophthalmologists is a time-consuming process. Recently, many computer-aided diagnosis systems have been developed to assist doctors by detecting DME automatically. In this paper, a new deep feature transfer-based stacked autoencoder neural network system is proposed for the automatic diagnosis of DME in fundus images. The proposed system integrates the power of pretrained convolutional neural networks as automatic feature extractors with the power of stacked autoencoders in feature selection and classification. Moreover, the system enables extracting a large set of features from a small input dataset using four standard pretrained deep networks: ResNet-50, SqueezeNet, Inception-v3, and GoogLeNet. The most informative features are then selected by a stacked autoencoder neural network. The stacked network is trained in a semi-supervised manner and is used for the classification of DME. It is found that the introduced system achieves a maximum classification accuracy of 96.8%, sensitivity of 97.5%, and specificity of 95.5%. The proposed system shows a superior performance over the original pretrained network classifiers and state-of-the-art findings. 相似文献

14.

Deep unregistered multi-contrast MRI reconstruction

《Magnetic resonance imaging》2021

Multiple magnetic resonance images of different contrasts are normally acquired for clinical diagnosis. Recently, research has shown that the previously acquired multi-contrast (MC) images of the same patient can be used as anatomical prior to accelerating magnetic resonance imaging (MRI). However, current MC-MRI networks are based on the assumption that the images are perfectly registered, which is rarely the case in real-world applications. In this paper, we propose an end-to-end deep neural network to reconstruct highly accelerated images by exploiting the shareable information from potentially misaligned reference images of an arbitrary contrast. Specifically, a spatial transformation (ST) module is designed and integrated into the reconstruction network to align the pre-acquired reference images with the images to be reconstructed. The misalignment is further alleviated by maximizing the normalized cross-correlation (NCC) between the MC images. The visualization of feature maps demonstrates that the proposed method effectively reduces the misalignment between the images for shareable information extraction when applied to the publicly available brain datasets. Additionally, the experimental results on these datasets show the proposed network allows the robust exploitation of shareable information across the misaligned MC images, leading to improved reconstruction results. 相似文献

15.

A Neural Network MCMC Sampler That Maximizes Proposal Entropy

Zengyi Li Yubei Chen Friedrich T. Sommer 《Entropy (Basel, Switzerland)》2021,23(3)

Markov Chain Monte Carlo (MCMC) methods sample from unnormalized probability distributions and offer guarantees of exact sampling. However, in the continuous case, unfavorable geometry of the target distribution can greatly limit the efficiency of MCMC methods. Augmenting samplers with neural networks can potentially improve their efficiency. Previous neural network-based samplers were trained with objectives that either did not explicitly encourage exploration, or contained a term that encouraged exploration but only for well structured distributions. Here we propose to maximize proposal entropy for adapting the proposal to distributions of any shape. To optimize proposal entropy directly, we devised a neural network MCMC sampler that has a flexible and tractable proposal distribution. Specifically, our network architecture utilizes the gradient of the target distribution for generating proposals. Our model achieved significantly higher efficiency than previous neural network MCMC techniques in a variety of sampling tasks, sometimes by more than an order magnitude. Further, the sampler was demonstrated through the training of a convergent energy-based model of natural images. The adaptive sampler achieved unbiased sampling with significantly higher proposal entropy than a Langevin dynamics sample. The trained sampler also achieved better sample quality. 相似文献

16.

MFF-Net: Deepfake Detection Network Based on Multi-Feature Fusion

Lei Zhao Mingcheng Zhang Hongwei Ding Xiaohui Cui 《Entropy (Basel, Switzerland)》2021,23(12)

Significant progress has been made in generating counterfeit images and videos. Forged videos generated by deepfaking have been widely spread and have caused severe societal impacts, which stir up public concern about automatic deepfake detection technology. Recently, many deepfake detection methods based on forged features have been proposed. Among the popular forged features, textural features are widely used. However, most of the current texture-based detection methods extract textures directly from RGB images, ignoring the mature spectral analysis methods. Therefore, this research proposes a deepfake detection network fusing RGB features and textural information extracted by neural networks and signal processing methods, namely, MFF-Net. Specifically, it consists of four key components: (1) a feature extraction module to further extract textural and frequency information using the Gabor convolution and residual attention blocks; (2) a texture enhancement module to zoom into the subtle textural features in shallow layers; (3) an attention module to force the classifier to focus on the forged part; (4) two instances of feature fusion to firstly fuse textural features from the shallow RGB branch and feature extraction module and then to fuse the textural features and semantic information. Moreover, we further introduce a new diversity loss to force the feature extraction module to learn features of different scales and directions. The experimental results show that MFF-Net has excellent generalization and has achieved state-of-the-art performance on various deepfake datasets. 相似文献

17.

基于多任务卷积神经网络的红外与可见光多分辨率图像融合

朱雯青张宁李争刘鹏汤心溢《光谱学与光谱分析》2023,43(1):289-296

红外与可见光图像融合一直是图像领域研究的热点,融合技术能弥补单一传感器的不足,为图像理解与分析提供良好的成像基础。因生产工艺以及成本的限制,红外探测器的分辨率远低于可见光探测器,并在一定程度上因源图像分辨率的差异阻碍了实际应用。针对红外与可见光图像分辨率不一致的问题,提出了用于红外图像超分辨率重建与融合的多任务卷积网络框架,应用于多分辨率图像融合。在网络结构方面,首先设计了双通道网络分别提取红外与可见光特征,使算法不受源图像分辨率的限制;其次提出了特征上采样模块,先用双线性插值方法增加像素个数,再通过多层感知器精细化拟合像素平滑空间与高频空间的映射关系,无需重新训练模型即可实现任意尺度的红外图像上采样;接着将线性注意力引入网络,学习特征空间位置间的非线性关系,抑制无关信息并增强网络对全局信息的表达。在损失函数方面,提出了梯度损失,保留红外与可见光图像中绝对值较大的滤波器响应值,并计算该值与重建的融合图像响应值的Frobenius范数,无需理想的融合图像作为真值监督网络学习就能生成融合图像;此外,在梯度损失、像素损失的共同作用下对多任务模型进行优化,可以同时重建融合图像和高分辨率红外图像... 相似文献

18.

A Rolling Bearing Fault Classification Scheme Based on k-Optimized Adaptive Local Iterative Filtering and Improved Multiscale Permutation Entropy

Yi Zhang Yong Lv Mao Ge 《Entropy (Basel, Switzerland)》2021,23(2)

The health condition of the rolling bearing seriously affects the operation of the whole mechanical system. When the rolling bearing parts fail, the time series collected in the field generally shows strong nonlinearity and non-stationarity. To obtain the faulty characteristics of mechanical equipment accurately, a rolling bearing fault detection technique based on k-optimized adaptive local iterative filtering (ALIF), improved multiscale permutation entropy (improved MPE), and BP neural network was proposed. In the ALIF algorithm, a k-optimized ALIF method based on permutation entropy (PE) is presented to select the number of ALIF decomposition layers adaptively. The completely average coarse-graining method was proposed to excavate more hidden information. The performance analysis of the simulation signal shows that the improved MPE can more accurately dig out the depth information of the time series, and the entropy value obtained is more consistent and stable. In the research application, rolling bearing time series are decomposed by k-optimized ALIF to obtain a certain number of intrinsic mode functions (IMFs). Then the improved MPE value of effective IMF is calculated and input into backpropagation (BP) neural network as the feature vector for automatic fault identification. The comparative analysis of simulation signals shows that this method can extract fault information effectively. At the same time, the experimental part shows that this scheme not only effectively extracts the fault features, but also realizes the classification and identification of different fault modes and faults of different degrees, which has a certain application prospect in the research and application direction of rolling bearing fault identification. 相似文献

19.

Multi-Stage Attentive Network for Motion Deblurring via Binary Cross-Entropy Loss

Cai Guo Xinan Chen Yanhua Chen Chuying Yu 《Entropy (Basel, Switzerland)》2022,24(10)

In this paper, we present the multi-stage attentive network (MSAN), an efficient and good generalization performance convolutional neural network (CNN) architecture for motion deblurring. We build a multi-stage encoder–decoder network with self-attention and use the binary cross-entropy loss to train our model. In MSAN, there are two core designs. First, we introduce a new attention-based end-to-end method on top of multi-stage networks, which applies group convolution to the self-attention module, effectively reducing the computing cost and improving the model’s adaptability to different blurred images. Secondly, we propose using binary cross-entropy loss instead of pixel loss to optimize our model to minimize the over-smoothing impact of pixel loss while maintaining a good deblurring effect. We conduct extensive experiments on several deblurring datasets to evaluate the performance of our solution for deblurring. Our MSAN achieves superior performance while also generalizing and compares well with state-of-the-art methods. 相似文献

20.

Cross-Modality Person Re-Identification Based on Heterogeneous Center Loss and Non-Local Features

Chengmei Han Peng Pan Aihua Zheng Jin Tang 《Entropy (Basel, Switzerland)》2021,23(7)

Cross-modality person re-identification is the study of images of people matching under different modalities (RGB modality, IR modality). Given one RGB image of a pedestrian collected under visible light in the daytime, cross-modality person re-identification aims to determine whether the same pedestrian appears in infrared images (IR images) collected by infrared cameras at night, and vice versa. Cross-modality person re-identification can solve the task of pedestrian recognition in low light or at night. This paper aims to improve the degree of similarity for the same pedestrian in two modalities by improving the feature expression ability of the network and designing appropriate loss functions. To implement our approach, we introduce a deep neural network structure combining heterogeneous center loss (HC loss) and a non-local mechanism. On the one hand, this can heighten the performance of feature representation of the feature learning module, and, on the other hand, it can improve the similarity of cross-modality within the class. Experimental data show that the network achieves excellent performance on SYSU-MM01 datasets. 相似文献