Semantic Scholar Open Access 2015 1880 citations

Highway Networks

R. Srivastava Klaus Greff J. Schmidhuber

Abstract

There is plenty of theoretical and empirical evidence that depth of neural networks is a crucial ingredient for their success. However, network training becomes more difficult with increasing depth, and training of very deep networks remains an open problem. In this extended abstract, we introduce a new architecture designed to ease gradient-based training of very deep networks. We refer to networks with this architecture as highway networks, since they allow unimpeded information flow across several layers on "information highways". The architecture is characterized by the use of gating units which learn to regulate the flow of information through a network. Highway networks with hundreds of layers can be trained directly using stochastic gradient descent and with a variety of activation functions, opening up the possibility of studying extremely deep and efficient architectures.
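
The abstract describes the gating mechanism only in words. As a rough illustration, the sketch below assumes the formulation commonly cited from the full paper, y = H(x) * T(x) + x * (1 - T(x)), where H is a nonlinear transform and T is a sigmoid "transform gate". The choice of tanh for H, the weight scale, the layer width, and all function and variable names here are assumptions made for illustration, not details taken from the abstract. The negative gate bias reflects the initialization the full paper suggests, which biases each layer toward carrying its input through unchanged.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(weights, bias, x):
    """Return W x + b, with W stored as a list of rows."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def highway_layer(x, w_h, b_h, w_t, b_t):
    """One highway layer: y = H(x) * T(x) + x * (1 - T(x)), elementwise."""
    h = [math.tanh(z) for z in affine(w_h, b_h, x)]  # candidate transform H(x)
    t = [sigmoid(z) for z in affine(w_t, b_t, x)]    # transform gate T(x) in (0, 1)
    return [hi * ti + xi * (1.0 - ti) for hi, ti, xi in zip(h, t, x)]


def init_layer(dim, gate_bias, rng):
    """Hypothetical initializer: small random weights, constant gate bias."""
    w_h = [[rng.gauss(0.0, 0.1) for _ in range(dim)] for _ in range(dim)]
    w_t = [[rng.gauss(0.0, 0.1) for _ in range(dim)] for _ in range(dim)]
    return w_h, [0.0] * dim, w_t, [gate_bias] * dim


def highway_stack(x, layers):
    """Apply the layers in sequence; input and output widths must match."""
    for params in layers:
        x = highway_layer(x, *params)
    return x


rng = random.Random(0)
dim, depth = 4, 100
layers = [init_layer(dim, gate_bias=-3.0, rng=rng) for _ in range(depth)]
x0 = [0.5, -1.0, 0.25, 2.0]
y = highway_stack(x0, layers)
print(y)
```

When the gate is near 0 the layer passes its input through almost unchanged, and when it is near 1 the layer behaves like an ordinary nonlinear layer. Because the carry path multiplies x elementwise, this sketch requires every layer to keep the same width.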

Authors (3)

R. Srivastava

Klaus Greff

J. Schmidhuber

Citation Format

Srivastava, R., Greff, K., Schmidhuber, J. (2015). Highway Networks. https://www.semanticscholar.org/paper/e0945081b5b87187a53d4329cf77cd8bff635795

Journal Information
Year Published: 2015
Language: en
Total Citations: 1880
Database Source: Semantic Scholar
Access: Open Access