Group Sparse Regularization for Deep Neural Networks

Scardapane, Simone; Comminiello, Danilo; Hussain, Amir; Uncini, Aurelio

doi:10.1016/j.neucom.2017.02.029

Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/24942

Full metadata record

DC Field	Value	Language
dc.contributor.author	Scardapane, Simone	en_UK
dc.contributor.author	Comminiello, Danilo	en_UK
dc.contributor.author	Hussain, Amir	en_UK
dc.contributor.author	Uncini, Aurelio	en_UK
dc.date.accessioned	2017-05-16T00:31:51Z	-
dc.date.available	2017-05-16T00:31:51Z	-
dc.date.issued	2017-06-07	en_UK
dc.identifier.uri	http://hdl.handle.net/1893/24942	-
dc.description.abstract	In this paper, we address the challenging task of simultaneously optimizing (i) the weights of a neural network, (ii) the number of neurons for each hidden layer, and (iii) the subset of active input features (i.e., feature selection). While these problems are traditionally dealt with separately, we propose an efficient regularized formulation enabling their simultaneous parallel execution, using standard optimization routines. Specifically, we extend the group Lasso penalty, originally proposed in the linear regression literature, to impose group-level sparsity on the network's connections, where each group is defined as the set of outgoing weights from a unit. Depending on the specific case, the weights can be related to an input variable, to a hidden neuron, or to a bias unit, thus performing simultaneously all the aforementioned tasks in order to obtain a compact network. We carry out an extensive experimental evaluation, in comparison with classical weight decay and Lasso penalties, both on a toy dataset for handwritten digit recognition, and multiple realistic mid-scale classification benchmarks. Comparative results demonstrate the potential of our proposed sparse group Lasso penalty in producing extremely compact networks, with a significantly lower number of input features, with a classification accuracy which is equal or only slightly inferior to standard regularization terms.	en_UK
dc.language.iso	en	en_UK
dc.publisher	Elsevier	en_UK
dc.relation	Scardapane S, Comminiello D, Hussain A & Uncini A (2017) Group Sparse Regularization for Deep Neural Networks. Neurocomputing, 241, pp. 81-89. https://doi.org/10.1016/j.neucom.2017.02.029	en_UK
dc.rights	This item has been embargoed for a period. During the embargo please use the Request a Copy feature at the foot of the Repository record to request a copy directly from the author. You can only request a copy if you wish to use this work for your own research or private study. Accepted refereed manuscript of: Scardapane S, Comminiello D, Hussain A & Uncini A (2017) Group Sparse Regularization for Deep Neural Networks, Neurocomputing, 241, pp. 81-89. DOI: 10.1016/j.neucom.2017.02.029 © 2017, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/	en_UK
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	en_UK
dc.subject	Deep networks	en_UK
dc.subject	Group sparsity	en_UK
dc.subject	Pruning	en_UK
dc.subject	Feature selection	en_UK
dc.title	Group Sparse Regularization for Deep Neural Networks	en_UK
dc.type	Journal Article	en_UK
dc.rights.embargodate	2018-02-11	en_UK
dc.rights.embargoreason	[Scardapane_etal_Manuscript.pdf] Publisher requires embargo of 12 months after formal publication.	en_UK
dc.identifier.doi	10.1016/j.neucom.2017.02.029	en_UK
dc.citation.jtitle	Neurocomputing	en_UK
dc.citation.issn	0925-2312	en_UK
dc.citation.volume	241	en_UK
dc.citation.spage	81	en_UK
dc.citation.epage	89	en_UK
dc.citation.publicationstatus	Published	en_UK
dc.citation.peerreviewed	Refereed	en_UK
dc.type.status	AM - Accepted Manuscript	en_UK
dc.author.email	ahu@cs.stir.ac.uk	en_UK
dc.citation.date	10/02/2017	en_UK
dc.contributor.affiliation	Sapienza University of Rome	en_UK
dc.contributor.affiliation	Sapienza University of Rome	en_UK
dc.contributor.affiliation	Computing Science	en_UK
dc.contributor.affiliation	Sapienza University of Rome	en_UK
dc.identifier.isi	WOS:000398752700008	en_UK
dc.identifier.scopusid	2-s2.0-85013055161	en_UK
dc.identifier.wtid	536294	en_UK
dc.contributor.orcid	0000-0002-8080-082X	en_UK
dc.date.accepted	2017-02-07	en_UK
dcterms.dateAccepted	2017-02-07	en_UK
dc.date.filedepositdate	2017-02-08	en_UK
rioxxterms.apc	not required	en_UK
rioxxterms.type	Journal Article/Review	en_UK
rioxxterms.version	AM	en_UK
local.rioxx.author	Scardapane, Simone\|	en_UK
local.rioxx.author	Comminiello, Danilo\|	en_UK
local.rioxx.author	Hussain, Amir\|0000-0002-8080-082X	en_UK
local.rioxx.author	Uncini, Aurelio\|	en_UK
local.rioxx.project	Internal Project\|University of Stirling\|https://isni.org/isni/0000000122484331	en_UK
local.rioxx.freetoreaddate	2018-02-11	en_UK
local.rioxx.licence	http://www.rioxx.net/licenses/under-embargo-all-rights-reserved\|\|2018-02-10	en_UK
local.rioxx.licence	http://creativecommons.org/licenses/by-nc-nd/4.0/\|2018-02-11\|	en_UK
local.rioxx.filename	Scardapane_etal_Manuscript.pdf	en_UK
local.rioxx.filecount	1	en_UK
local.rioxx.source	0925-2312	en_UK
Appears in Collections:	Computing Science and Mathematics Journal Articles

Files in This Item:

File	Description	Size	Format
Scardapane_etal_Manuscript.pdf	Fulltext - Accepted Version	564.69 kB	Adobe PDF	View/Open

This item is protected by original copyright

View License

Show simple item record

A file in this item is licensed under a Creative Commons License

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.

STORRE

STORRE: Stirling Online Research Repository