Automated search for configurations of convolutional neural network architectures

Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Citations (Scopus)

Abstract

Convolutional Neural Networks (CNNs) are intensively used to solve a wide variety of complex problems. Although powerful, such systems require manual configuration and tuning. To this end, we view CNNs as configurable systems and propose an end-to-end framework that allows the configuration, evaluation and automated search for CNN architectures. Therefore, our contribution is threefold. First, we model the variability of CNN architectures with a Feature Model (FM) that generalizes over existing architectures. Each valid configuration of the FM corresponds to a valid CNN model that can be built and trained. Second, we implement, on top of Tensorflow, an automated procedure to deploy, train and evaluate the performance of a configured model. Third, we propose a method to search for configurations and demonstrate that it leads to good CNN models. We evaluate our method by applying it on image classification tasks (MNIST, CIFAR-10) and show that, with limited amount of computation and training, our method can identify high-performing architectures (with high accuracy). We also demonstrate that we outperform existing state-of-the-art architectures handcrafted by ML researchers. Our FM and framework have been released to support replication and future research.

Original languageEnglish
Title of host publicationSPLC 2019 - 23rd International Systems and Software Product Line Conference
EditorsThorsten Berger, Philippe Collet, Laurence Duchien, Thomas Fogdal, Patrick Heymans, Timo Kehrer, Jabier Martinez, Raul Mazo, Leticia Montalvillo, Camille Salinesi, Xhevahire Ternava, Thomas Thum, Tewfik Ziadi
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450371384
DOIs
Publication statusPublished - 9 Sept 2019
Externally publishedYes
Event23rd International Systems and Software Product Line Conference, SPLC 2019, co-located with the 13th European Conference on Software Architecture, ECSA 2019 - Paris, France
Duration: 9 Sept 201913 Sept 2019

Publication series

NameACM International Conference Proceeding Series
VolumeA

Conference

Conference23rd International Systems and Software Product Line Conference, SPLC 2019, co-located with the 13th European Conference on Software Architecture, ECSA 2019
Country/TerritoryFrance
CityParis
Period9/09/1913/09/19

Keywords

  • AutoML
  • Configuration search
  • Feature model
  • NAS
  • Neural architecture search

Fingerprint

Dive into the research topics of 'Automated search for configurations of convolutional neural network architectures'. Together they form a unique fingerprint.

Cite this