arXiv Open Access 2024

The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks

Bum Jun Kim Yoshinobu Kawahara Sang Woo Kim
Lihat Sumber

Abstrak

Dynamical systems are often time-varying, whose modeling requires a function that evolves with respect to time. Recent studies such as the neural ordinary differential equation proposed a time-dependent neural network, which provides a neural network varying with respect to time. However, we claim that the architectural choice to build a time-dependent neural network significantly affects its time-awareness but still lacks sufficient validation in its current states. In this study, we conduct an in-depth analysis of the architecture of modern time-dependent neural networks. Here, we report a vulnerability of vanishing timestep embedding, which disables the time-awareness of a time-dependent neural network. Furthermore, we find that this vulnerability can also be observed in diffusion models because they employ a similar architecture that incorporates timestep embedding to discriminate between different timesteps during a diffusion process. Our analysis provides a detailed description of this phenomenon as well as several solutions to address the root cause. Through experiments on neural ordinary differential equations and diffusion models, we observed that ensuring alive time-awareness via proposed solutions boosted their performance, which implies that their current implementations lack sufficient time-dependency.

Topik & Kata Kunci

Penulis (3)

B

Bum Jun Kim

Y

Yoshinobu Kawahara

S

Sang Woo Kim

Format Sitasi

Kim, B.J., Kawahara, Y., Kim, S.W. (2024). The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks. https://arxiv.org/abs/2405.14126

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓