Semantic Scholar Open Access 2016 1538 sitasi

Designing Neural Network Architectures using Reinforcement Learning

Bowen Baker O. Gupta Nikhil Naik R. Raskar

Lihat Sumber

Abstrak

At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using $Q$-learning with an $\epsilon$-greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.

Topik & Kata Kunci

Computer Science

Penulis (4)

Bowen Baker

O. Gupta

Nikhil Naik

R. Raskar

Format Sitasi

APA MLA BibTeX

Baker, B., Gupta, O., Naik, N., Raskar, R. (2016). Designing Neural Network Architectures using Reinforcement Learning. https://www.semanticscholar.org/paper/6cd5dfccd9f52538b19a415e00031d0ee4e5b181

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2016
Bahasa: en
Total Sitasi: 1538×
Sumber Database: Semantic Scholar
Akses: Open Access ✓