Abstract
Most gradient-based approaches to meta-learning do not explicitly account for
the fact that different parts of the underlying model adapt by different
amounts when applied to a new task. For example, the input layers of an image
classification convnet typically adapt very little, while the output layers can
change significantly. This can cause parts of the model to begin to overfit
while others underfit. To address this, we introduce a hierarchical Bayesian
model with per-module shrinkage parameters, which we propose to learn by
maximizing an approximation of the predictive likelihood using implicit
differentiation. Our algorithm subsumes Reptile and outperforms variants of
MAML on two synthetic few-shot meta-learning problems.
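To make the core idea concrete, here is a minimal sketch (not the paper's implementation) of inner-loop adaptation with per-module shrinkage: each module's task parameters are regularized toward the shared meta-parameters, with a per-module strength. All names (`adapt`, `phi`, `sigma2`, the toy quadratic task loss) are illustrative assumptions; the paper learns the shrinkage parameters rather than fixing them by hand.

```python
import numpy as np

def adapt(phi, sigma2, grad_fn, lr=0.01, steps=500):
    """Illustrative inner loop: gradient descent on
    task_loss(theta) + sum_m ||theta_m - phi_m||^2 / (2 * sigma2_m).
    Small sigma2_m pins module m to the meta-parameters phi_m (little
    adaptation); large sigma2_m lets it fit the task data freely."""
    theta = {m: p.copy() for m, p in phi.items()}
    for _ in range(steps):
        grads = grad_fn(theta)  # per-module task-loss gradients
        for m in theta:
            # shrinkage term pulls theta_m back toward phi_m
            g = grads[m] + (theta[m] - phi[m]) / sigma2[m]
            theta[m] -= lr * g
    return theta

# Toy task: each module's loss is 0.5 * ||theta_m - 1||^2, so without
# shrinkage every module would adapt to 1. Strong shrinkage on "input"
# keeps it near phi = 0; weak shrinkage lets "output" adapt.
phi = {"input": np.zeros(2), "output": np.zeros(2)}
sigma2 = {"input": 0.01, "output": 10.0}
grad_fn = lambda th: {m: th[m] - 1.0 for m in th}
theta = adapt(phi, sigma2, grad_fn)
```

After adaptation, `theta["input"]` stays close to 0 while `theta["output"]` moves most of the way to 1, mirroring the abstract's convnet example where input layers adapt little and output layers change substantially.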