TY - GEN
T1 - Automated search for configurations of convolutional neural network architectures
AU - Ghamizi, Salah
AU - Cordy, Maxime
AU - Papadakis, Mike
AU - Le Traon, Yves
N1 - Publisher Copyright:
© 2019 Association for Computing Machinery.
PY - 2019/9/9
Y1 - 2019/9/9
N2 - Convolutional Neural Networks (CNNs) are intensively used to solve a wide variety of complex problems. Although powerful, such systems require manual configuration and tuning. To this end, we view CNNs as configurable systems and propose an end-to-end framework that allows the configuration, evaluation and automated search for CNN architectures. Therefore, our contribution is threefold. First, we model the variability of CNN architectures with a Feature Model (FM) that generalizes over existing architectures. Each valid configuration of the FM corresponds to a valid CNN model that can be built and trained. Second, we implement, on top of Tensorflow, an automated procedure to deploy, train and evaluate the performance of a configured model. Third, we propose a method to search for configurations and demonstrate that it leads to good CNN models. We evaluate our method by applying it on image classification tasks (MNIST, CIFAR-10) and show that, with limited amount of computation and training, our method can identify high-performing architectures (with high accuracy). We also demonstrate that we outperform existing state-of-the-art architectures handcrafted by ML researchers. Our FM and framework have been released to support replication and future research.
AB - Convolutional Neural Networks (CNNs) are intensively used to solve a wide variety of complex problems. Although powerful, such systems require manual configuration and tuning. To this end, we view CNNs as configurable systems and propose an end-to-end framework that allows the configuration, evaluation and automated search for CNN architectures. Therefore, our contribution is threefold. First, we model the variability of CNN architectures with a Feature Model (FM) that generalizes over existing architectures. Each valid configuration of the FM corresponds to a valid CNN model that can be built and trained. Second, we implement, on top of Tensorflow, an automated procedure to deploy, train and evaluate the performance of a configured model. Third, we propose a method to search for configurations and demonstrate that it leads to good CNN models. We evaluate our method by applying it on image classification tasks (MNIST, CIFAR-10) and show that, with limited amount of computation and training, our method can identify high-performing architectures (with high accuracy). We also demonstrate that we outperform existing state-of-the-art architectures handcrafted by ML researchers. Our FM and framework have been released to support replication and future research.
KW - AutoML
KW - Configuration search
KW - Feature model
KW - NAS
KW - Neural architecture search
UR - http://www.scopus.com/inward/record.url?scp=85123043273&partnerID=8YFLogxK
U2 - 10.1145/3336294.3336306
DO - 10.1145/3336294.3336306
M3 - Conference contribution
AN - SCOPUS:85123043273
T3 - ACM International Conference Proceeding Series
BT - SPLC 2019 - 23rd International Systems and Software Product Line Conference
A2 - Berger, Thorsten
A2 - Collet, Philippe
A2 - Duchien, Laurence
A2 - Fogdal, Thomas
A2 - Heymans, Patrick
A2 - Kehrer, Timo
A2 - Martinez, Jabier
A2 - Mazo, Raul
A2 - Montalvillo, Leticia
A2 - Salinesi, Camille
A2 - Ternava, Xhevahire
A2 - Thum, Thomas
A2 - Ziadi, Tewfik
PB - Association for Computing Machinery
T2 - 23rd International Systems and Software Product Line Conference, SPLC 2019, co-located with the 13th European Conference on Software Architecture, ECSA 2019
Y2 - 9 September 2019 through 13 September 2019
ER -