A. Achille and S. Soatto. (2017). arXiv:1706.01350. Comment: Deep learning, neural network, representation, flat minima, information bottleneck, overfitting, generalization, sufficiency, minimality, sensitivity, information complexity, stochastic gradient descent, regularization, total correlation, PAC-Bayes.
S. Chen, E. Dobriban, and J. Lee. (2019). arXiv:1907.10905. Comment: Changed title. Added results on overparametrized 2-layer nets. Added error bars to experiments. Numerous other minor improvements.
G. Dziugaite and D. Roy. (2017). arXiv:1703.11008. Comment: 14 pages, 1 table, 2 figures. Corresponds to the UAI camera-ready version and supplement. Includes additional references and related experiments.
J. Frankle, G. Dziugaite, D. Roy, and M. Carbin. (2019). arXiv:1912.05671. Comment: This submission subsumes arXiv:1903.01611 ("Stabilizing the Lottery Ticket Hypothesis" and "The Lottery Ticket Hypothesis at Scale").
S. Mei and A. Montanari. (2019). arXiv:1908.05355. Comment: Version 3 adds two sections: one provides the precise asymptotics of the training error; the other describes a Gaussian covariate model, which gives the same asymptotic test error as the random features model.
J. Negrea, M. Haghifam, G. Dziugaite, A. Khisti, and D. Roy. (2019). arXiv:1911.02151. Comment: 23 pages, 1 figure. To appear in Advances in Neural Information Processing Systems 32 (NeurIPS 2019).
N. Tishby and N. Zaslavsky. (2015). arXiv:1503.02406. Comment: 5 pages, 2 figures. Invited paper at the 2015 IEEE Information Theory Workshop (ITW 2015).
C. Wei, J. Lee, Q. Liu, and T. Ma. (2018). arXiv:1810.05369. Comment: Version 2: title changed from "On the Margin Theory of Feedforward Neural Networks"; substantial changes from the previous version, including a new lower bound on NTK sample complexity. Version 3: reorganized the NTK lower-bound proof.