Semantic Scholar Open Access 2015 1880 citations

Highway Networks

R. Srivastava Klaus Greff J. Schmidhuber

Abstract

There is plenty of theoretical and empirical evidence that depth of neural networks is a crucial ingredient for their success. However, network training becomes more difficult with increasing depth, and training of very deep networks remains an open problem. In this extended abstract, we introduce a new architecture designed to ease gradient-based training of very deep networks. We refer to networks with this architecture as highway networks, since they allow unimpeded information flow across several layers on "information highways". The architecture is characterized by the use of gating units which learn to regulate the flow of information through a network. Highway networks with hundreds of layers can be trained directly using stochastic gradient descent and with a variety of activation functions, opening up the possibility of studying extremely deep and efficient architectures.
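
The abstract describes the gating mechanism only in words. As a rough illustration, the commonly cited highway layer formulation is y = T(x) * H(x) + (1 - T(x)) * x, where H is an ordinary nonlinear transform and T is a sigmoid "transform gate". The sketch below is a minimal pure-Python version under stated assumptions: the helper names (`affine`, `highway_layer`, `init_layer`), the tanh nonlinearity, the layer width, and the gate bias of -20 are all illustrative choices, not values taken from the abstract. The bias is deliberately extreme so that the gates stay nearly closed and the input is carried almost unchanged through 100 stacked layers.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(weights, bias, x):
    """Return weights @ x + bias, with the square matrix given as nested lists."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def highway_layer(x, w_h, b_h, w_t, b_t):
    """One highway layer: y = T(x) * H(x) + (1 - T(x)) * x."""
    h = [math.tanh(v) for v in affine(w_h, b_h, x)]  # candidate transform H(x)
    t = [sigmoid(v) for v in affine(w_t, b_t, x)]    # transform gate T(x) in (0, 1)
    return [ti * hi + (1.0 - ti) * xi for ti, hi, xi in zip(t, h, x)]


def init_layer(dim, gate_bias, rng):
    """Small random weights; gate_bias is an illustrative assumption."""
    def small_matrix():
        return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(dim)]
    return small_matrix(), [0.0] * dim, small_matrix(), [gate_bias] * dim


rng = random.Random(0)
dim, depth = 4, 100
x = [rng.uniform(-1.0, 1.0) for _ in range(dim)]

# Stack many layers with nearly closed gates (very negative gate bias).
y = x
for _ in range(depth):
    y = highway_layer(y, *init_layer(dim, gate_bias=-20.0, rng=rng))

# How far the output drifted from the input after `depth` layers.
drift = max(abs(a - b) for a, b in zip(x, y))
print(drift)
```

With the gates nearly closed, each layer behaves almost like the identity, which is one way to picture the "information highways" the abstract refers to. In a trained network the gate parameters are learned, so each unit can choose between carrying its input forward and transforming it.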

Topics & Keywords

Computer Science

Authors (3)

R. Srivastava

Klaus Greff

J. Schmidhuber

Citation Format

Srivastava, R., Greff, K., & Schmidhuber, J. (2015). Highway Networks. https://www.semanticscholar.org/paper/e0945081b5b87187a53d4329cf77cd8bff635795

Journal Information

Publication Year: 2015
Language: en
Total Citations: 1880
Source Database: Semantic Scholar
Access: Open Access ✓