Abstract
While deeper and wider neural networks are actively pushing the performance
limits of various computer vision and machine learning tasks, they often
require large sets of labeled data for effective training and suffer from
extremely high computational complexity. In this paper, we develop a new
framework for training deep neural networks on datasets with limited labeled
samples using cross-network knowledge projection, which improves network
performance while significantly reducing the overall computational
complexity. Specifically, a large pre-trained teacher network is used to
observe samples from the training data. A projection matrix is learned to
project the teacher's knowledge, encoded as visual representations at an
intermediate layer of the teacher network, onto an intermediate layer of a
thinner and faster student network, guiding and regulating the student's
training process. Both the intermediate layers of the teacher network and the
injection layers of the student network are adaptively selected during
training by iteratively evaluating a joint loss function. This knowledge
projection framework
allows us to use crucial knowledge learned by large networks to guide the
training of thinner student networks, avoiding over-fitting, achieving better
network performance, and significantly reducing computational complexity. Extensive
experimental results on benchmark datasets have demonstrated that our proposed
knowledge projection approach outperforms existing methods, improving accuracy
by up to 4% while reducing network complexity by 4 to 10 times, which is very
attractive for practical applications of deep neural networks.
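
To make the projection step concrete, below is a minimal sketch of the core
idea, assuming a PyTorch-style implementation; the 1x1-convolution form of the
projection, the feature dimensions, and the loss weight alpha are illustrative
assumptions rather than the paper's exact formulation.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class KnowledgeProjection(nn.Module):
        """Learned projection from a teacher feature map into the student's
        feature space."""
        def __init__(self, teacher_channels, student_channels):
            super().__init__()
            # A 1x1 convolution plays the role of the projection matrix over
            # channels (an assumed choice; any learnable linear map fits the
            # description in the abstract).
            self.proj = nn.Conv2d(teacher_channels, student_channels,
                                  kernel_size=1)

        def forward(self, teacher_feat):
            return self.proj(teacher_feat)

    def joint_loss(student_logits, labels, student_feat,
                   projected_teacher_feat, alpha=0.5):
        # Task loss on labeled samples plus a guidance term pulling the
        # student's intermediate features toward the projected teacher
        # features. `alpha` is an assumed weighting; per the abstract, the
        # teacher/student layer pair is selected adaptively by iteratively
        # evaluating a joint loss of this kind.
        task = F.cross_entropy(student_logits, labels)
        guide = F.mse_loss(student_feat, projected_teacher_feat)
        return task + alpha * guide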