DOAJ Open Access 2022

Direct nonlinear acceleration

Aritra Dutta El Houcine Bergou Yunming Xiao Marco Canini Peter Richtárik

Abstrak

Optimization acceleration techniques such as momentum play a key role in state-of-the-art machine learning algorithms. Recently, generic vector sequence extrapolation techniques, such as regularized nonlinear acceleration (RNA) of Scieur et al. [22], were proposed and shown to accelerate fixed point iterations. In contrast to RNA which computes extrapolation coefficients by (approximately) setting the gradient of the objective function to zero at the extrapolated point, we propose a more direct approach, which we call direct nonlinear acceleration (DNA). In DNA, we aim to minimize (an approximation of) the function value at the extrapolated point instead. We adopt a regularized approach with regularizers designed to prevent the model from entering a region in which the functional approximation is less precise. While the computational cost of DNA is comparable to that of RNA, our direct approach significantly outperforms RNA on both synthetic and real-world datasets. While the focus of this paper is on convex problems, we obtain very encouraging results in accelerating the training of neural networks.

Topik & Kata Kunci

Applied mathematics. Quantitative methods Electronic computers. Computer science

Penulis (5)

Aritra Dutta

El Houcine Bergou

Yunming Xiao

Marco Canini

Peter Richtárik

Format Sitasi

APA MLA BibTeX

Dutta, A., Bergou, E.H., Xiao, Y., Canini, M., Richtárik, P. (2022). Direct nonlinear acceleration. https://doi.org/10.1016/j.ejco.2022.100047

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →

Lihat di Sumber doi.org/10.1016/j.ejco.2022.100047

Informasi Jurnal

Tahun Terbit: 2022
Sumber Database: DOAJ
DOI: 10.1016/j.ejco.2022.100047
Akses: Open Access ✓