Low-Rank Optimization for Efficient Compression of CNN Models
Abstract
Tensor decomposition is an important method for compressing convolutional neural network (CNN) models. However, the decomposition process requires appropriate rank parameters to be configured for each convolutional kernel tensor. To address the difficulty of setting these ranks, we propose a low-rank optimization algorithm based on information entropy. By solving the resulting optimization problem, the algorithm automatically learns the low-rank structure and rank parameters of the convolutional kernel tensors, achieving global automatic rank configuration while preserving model accuracy. Moreover, we design a weight generator for the network after tensor decomposition, which dynamically assesses the importance of the filters of the low-dimensional convolutional kernel tensors on a global scale. Pruning in this low-dimensional space can further enhance compression with minimal loss in accuracy. Experiments with various CNN models on different datasets show that the proposed low-rank optimization algorithm obtains all rank parameters in a single training process, with an average accuracy loss of the decomposed model of no more than 1%. Meanwhile, the pruning method in the low-dimensional space achieves a compression ratio of over 4.7× with an accuracy loss of less than 1.3%.
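As a minimal sketch of entropy-guided rank selection (this is not the paper's algorithm; the function names, the mode-1 matricization, and the use of the "effective rank" heuristic of Roy and Vetterli are assumptions made here for illustration), a convolutional kernel tensor can be matricized, factored by SVD, and truncated at a rank derived from the Shannon entropy of its normalized singular-value distribution:

```python
import numpy as np

def entropy_rank(s, eps=1e-12):
    # "Effective rank" heuristic: exponential of the Shannon entropy
    # of the normalized singular-value distribution (Roy & Vetterli).
    # A flat spectrum gives a rank near len(s); a spiky one gives ~1.
    p = s / (s.sum() + eps)
    h = -np.sum(p * np.log(p + eps))
    return max(1, int(np.ceil(np.exp(h))))

def low_rank_approx(kernel):
    # kernel: 4-D convolutional kernel of shape (out_ch, in_ch, kh, kw).
    out_ch = kernel.shape[0]
    mat = kernel.reshape(out_ch, -1)            # mode-1 matricization
    u, s, vt = np.linalg.svd(mat, full_matrices=False)
    r = entropy_rank(s)                         # entropy-driven rank choice
    approx = (u[:, :r] * s[:r]) @ vt[:r]        # rank-r reconstruction
    return approx.reshape(kernel.shape), r
```

In an actual compression pipeline, the two truncated factors would be kept as a pair of smaller convolutions (a 1×1 projection followed by the spatial convolution) rather than multiplied back together; the reconstruction above only illustrates how the entropy of the spectrum drives the rank choice.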