
1.

Information-Theoretic Generalization Bounds for Meta-Learning and Applications

Sharu Theresa Jose Osvaldo Simeone 《Entropy (Basel, Switzerland)》2021,23(1)

Meta-learning, or “learning to learn”, refers to techniques that infer an inductive bias from data corresponding to multiple related tasks with the goal of improving the sample efficiency for new, previously unobserved, tasks. A key performance measure for meta-learning is the meta-generalization gap, that is, the difference between the average loss measured on the meta-training data and on a new, randomly selected task. This paper presents novel information-theoretic upper bounds on the meta-generalization gap. Two broad classes of meta-learning algorithms are considered that use either separate within-task training and test sets, like model-agnostic meta-learning (MAML), or joint within-task training and test sets, like Reptile. Extending the existing work for conventional learning, an upper bound on the meta-generalization gap is derived for the former class that depends on the mutual information (MI) between the output of the meta-learning algorithm and its input meta-training data. For the latter, the derived bound includes an additional MI between the output of the per-task learning procedure and the corresponding data set to capture within-task uncertainty. Tighter bounds are then developed for the two classes via novel individual task MI (ITMI) bounds. Applications of the derived bounds are finally discussed, including a broad class of noisy iterative algorithms for meta-learning.

2.

Meta-Learning-Based Physics-Informed Neural Network: Numerical Simulations of Initial Value Problems of Nonlinear Dynamical Systems without Labeled Data and Correlation Analyses

Worrawat Duanyai Weon Keun Song Thanadol Chitthamler Girish Kumar 《Journal of Nonlinear Modeling and Analysis》2024,6(2):485-513

There are several main challenges in solving nonlinear differential equations with artificial neural networks (ANNs), such as a nonlinear system's sensitivity to its initial values, discretization, and strategies for incorporating physics-based information into ANNs. As for the first issue, this paper addresses the initial value problems of nonlinear dynamical systems (a Duffing oscillator and a Burger's equation), which cause large global truncation errors in sub-domains with a significant reduction in the influence of initial constraints, using meta-learning-based physics-informed neural networks (MPINNs). The MPINNs with dual learners outperform physics-informed neural networks with a single learner (no fine reinitialization capability). As a result, the former approach improves solution convergence by 98.83% in the sub-time domain (III) of a Duffing oscillator, and by 85.89% at $t = 45$ in a Burger's equation problem, compared to the latter one. Model accuracy is highly dependent on the adaptability of the initial parameters in the first hidden layers of the meta-models. The correlation analyses show that the parameters become less correlated (the Duffing oscillator) or more correlated (the Burger's equation) during fine reinitialization, depending on whether the update manner differs from or is similar to the one used in pre-initialization. In the first example, the MPINN achieves both the mitigation of model sensitivity to its output and the improvement of model accuracy. Conversely, the second example shows that the proposed approach is not enough to solve both issues simultaneously, as increased model sensitivity to its output leads to higher model accuracy. The application of transfer learning reduces the number of iterative pre-meta-trainings.

3.

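Hyperspectral Detection of Soybean Pod Borer Infestation Using a Three-Dimensional Few-Shot Meta-Learning Model (cited by: 1; self-citations: 0; citations by others: 1)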

Gui Jiangsheng Fei Jingyi Fu Xiaping 《Spectroscopy and Spectral Analysis (光谱学与光谱分析)》2021,41(7):2171-2174

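To reduce the impact of the soybean pod borer on soybean yield and quality, and to enable rapid detection of soybean pod borer infestation, a detection method based on a three-dimensional relation network few-shot meta-learning (3D-RN) model is proposed. First, hyperspectral images were acquired for 20 soybeans in each of four groups (with attached eggs, with attached pod borer larvae, gnawed, and normal), regions of interest were extracted, and a 3D-RN model based on the hyperspectral images was built. The final model reached an accuracy of 82%...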

4.

Meta-Strategy for Learning Tuning Parameters with Guarantees

Dimitri Meunier Pierre Alquier 《Entropy (Basel, Switzerland)》2021,23(10)

Online learning methods, such as the online gradient algorithm (OGA) and exponentially weighted aggregation (EWA), often depend on tuning parameters that are difficult to set in practice. We consider an online meta-learning scenario, and we propose a meta-strategy to learn these parameters from past tasks. Our strategy is based on the minimization of a regret bound. It allows us to learn the initialization and the step size in OGA with guarantees. It also allows us to learn the prior or the learning rate in EWA. We provide a regret analysis of the strategy. It allows us to identify settings where meta-learning indeed improves on learning each task in isolation.
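
As a rough illustration of why the initialization and step size matter in an online gradient method, the sketch below runs plain online gradient steps on a stream of squared losses. It is not the meta-strategy of the paper: the function name `online_gradient`, the squared loss, and the toy target values are all assumptions made for this example.

```python
from typing import Sequence, Tuple


def online_gradient(
    targets: Sequence[float], init: float, step: float
) -> Tuple[float, float]:
    """Run online gradient steps on the losses (theta - y_t)^2.

    Hypothetical helper for illustration only. Returns the final
    parameter and the cumulative loss suffered along the way.
    """
    theta = init
    total_loss = 0.0
    for y in targets:
        # The loss is suffered with the current parameter, before updating.
        total_loss += (theta - y) ** 2
        grad = 2.0 * (theta - y)  # gradient of (theta - y)^2 w.r.t. theta
        theta -= step * grad      # online gradient update
    return theta, total_loss


# Toy stream of targets clustered around 1.0 (made-up values).
targets = [1.0, 1.1, 0.9, 1.0, 1.05]

# Same step size, two different initializations.
_, loss_far = online_gradient(targets, init=0.0, step=0.1)
_, loss_near = online_gradient(targets, init=1.0, step=0.1)
print(loss_far, loss_near)
```

On this toy stream, starting near the targets gives a lower cumulative loss than starting far away, which is the kind of gain a well-chosen initialization from past tasks is meant to provide.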

5.

Robust Spike-Based Continual Meta-Learning Improved by Restricted Minimum Error Entropy Criterion

Shuangming Yang Jiangtong Tan Badong Chen 《Entropy (Basel, Switzerland)》2022,24(4)

The spiking neural network (SNN) is regarded as a promising candidate to deal with the great challenges presented by current machine learning techniques, including the high energy consumption induced by deep neural networks. However, there is still a great gap between SNNs and the online meta-learning performance of artificial neural networks. Importantly, existing spike-based online meta-learning models do not target the robust learning based on spatio-temporal dynamics and superior machine learning theory. In this invited article, we propose a novel spike-based framework with minimum error entropy, called MeMEE, using the entropy theory to establish the gradient-based online meta-learning scheme in a recurrent SNN architecture. We examine the performance based on various types of tasks, including autonomous navigation and the working memory test. The experimental results show that the proposed MeMEE model can effectively improve the accuracy and the robustness of the spike-based meta-learning performance. More importantly, the proposed MeMEE model emphasizes the application of the modern information theoretic learning approach on the state-of-the-art spike-based learning algorithms. Therefore, in this invited paper, we provide new perspectives for further integration of advanced information theory in machine learning to improve the learning performance of SNNs, which could be of great merit to applied developments with spike-based neuromorphic systems.

6.

Speeding up Training of Linear Predictors for Multi-Antenna Frequency-Selective Channels via Meta-Learning

Sangwoo Park Osvaldo Simeone 《Entropy (Basel, Switzerland)》2022,24(10)

An efficient data-driven prediction strategy for multi-antenna frequency-selective channels must operate based on a small number of pilot symbols. This paper proposes novel channel-prediction algorithms that address this goal by integrating transfer and meta-learning with a reduced-rank parametrization of the channel. The proposed methods optimize linear predictors by utilizing data from previous frames, which are generally characterized by distinct propagation characteristics, in order to enable fast training on the time slots of the current frame. The proposed predictors rely on a novel long short-term decomposition (LSTD) of the linear prediction model that leverages the disaggregation of the channel into long-term space-time signatures and fading amplitudes. We first develop predictors for single-antenna frequency-flat channels based on transfer/meta-learned quadratic regularization. Then, we introduce transfer and meta-learning algorithms for LSTD-based prediction models that build on equilibrium propagation (EP) and alternating least squares (ALS). Numerical results under the 3GPP 5G standard channel model demonstrate the impact of transfer and meta-learning on reducing the number of pilots for channel prediction, as well as the merits of the proposed LSTD parametrization.

7.

An Optimized Black-Box Adversarial Simulator Attack Based on Meta-Learning

Zhiyu Chen Jianyu Ding Fei Wu Chi Zhang Yiming Sun Jing Sun Shangdong Liu Yimu Ji 《Entropy (Basel, Switzerland)》2022,24(10)

Much research on adversarial attacks has proved that deep neural networks have certain security vulnerabilities. Among potential attacks, black-box adversarial attacks are considered the most realistic based on the natural hidden nature of deep neural networks. Such attacks have become a critical academic emphasis in the current security field. However, current black-box attack methods still have shortcomings, resulting in incomplete utilization of query information. Our research, based on the newly proposed Simulator Attack, proves the correctness and usability of feature layer information in a simulator model obtained by meta-learning for the first time. Then, we propose an optimized Simulator Attack+ based on this discovery. Our optimization methods used in Simulator Attack+ include: (1) a feature attentional boosting module that uses the feature layer information of the simulator to enhance the attack and accelerate the generation of adversarial examples; (2) a linear self-adaptive simulator-predict interval mechanism that allows the simulator model to be fully fine-tuned in the early stage of the attack and dynamically adjusts the interval for querying the black-box model; and (3) an unsupervised clustering module to provide a warm-start for targeted attacks. Results from experiments on the CIFAR-10 and CIFAR-100 datasets clearly show that Simulator Attack+ can further reduce the number of queries consumed to improve query efficiency while maintaining the attack.

8.

Survey of Chemistry Meta-Learning Ability among First-Year Senior High School Students and Countermeasures

Luo Yinxian Xu Yuding Zheng Liuping 《Chemical Education (化学教育)》2018,39(11):38-42

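A questionnaire survey of first-year senior high school students at two secondary schools found that their chemistry meta-learning ability is at a medium level, with relatively low proportions of positive responses on the dimensions of planning, awareness, method, and summarization. Boys' chemistry meta-learning ability is overall slightly higher than girls', mainly in the three dimensions of planning, remediation, and summarization, whereas girls score slightly higher than boys on feedback. Chemistry meta-learning ability differs significantly among high-achieving students, average students, and students with learning difficulties. Measures for cultivating meta-learning ability are proposed, including encouraging students to set their own learning goals to develop planning, raising awareness of meta-learning, and strengthening direct instruction in meta-learning knowledge and strategies.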