DOAJ Open Access 2024

Block-Wise Separable Convolutions: An Alternative Way to Factorize Standard Convolutions

Yan-Jen Huang Hsin-Lung Wu Ching-Chen

Abstrak

In this paper, we introduce block-wise separable convolutions (BlkSConv) to replace the standard convolutions for compressing deep CNN models. First, BlkSConv expresses the standard convolutional kernel as an ordered set of block vectors each of which is a linear combination of fixed basis block vectors. Then it eliminates most basis block vectors and their corresponding coefficients to obtain an approximated convolutional kernel. Moreover, the proposed BlkSConv operation can be efficiently realized via a combination of pointwise and group-wise convolutions. Thus the constructed networks have smaller model size and fewer multiply-adds operations while keeping comparable prediction accuracy. We also develop a hyperparameter search framework based on principal component analysis (PCA) to determine a qualified hyperparameter setting of the block depth and number of basis block vectors. By this search framework, we construct networks which achieve nice prediction performance while simultaneously satisfying the constraints of model size and model efficiency. Our code, data, and models are available at <uri>https://github.com/yanjenhuang/blksconv</uri>.

Penulis (3)

Y

Yan-Jen Huang

H

Hsin-Lung Wu

Ching-Chen

Format Sitasi

Huang, Y., Wu, H., Ching-Chen (2024). Block-Wise Separable Convolutions: An Alternative Way to Factorize Standard Convolutions. https://doi.org/10.1109/ACCESS.2024.3358620

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1109/ACCESS.2024.3358620
Informasi Jurnal
Tahun Terbit
2024
Sumber Database
DOAJ
DOI
10.1109/ACCESS.2024.3358620
Akses
Open Access ✓