语音与音频技术实验室
论文推荐
Unstructured pruning and low rank factorisation of self-supervised pre-trained speech models
- DOI码:
- 10.1109/JSTSP.2024.3433616
- 发表刊物:
- IEEE Journal of Selected Topics in Signal Processing
- 摘要:
- Self-supervised pre-trained speech models require significant memory and computational resources, limiting their applicability to many speech tasks. Unstructured pruning is a compression method that can achieve minimal performance degradation, while the resulting sparse matrix mandates special hardware or computational operators for acceleration. In this study, we propose a novel approach that leverages the potential low-rank structures of the unstructured sparse matrices by applying truncated singular value decomposition (SVD), thus converting them into parameter-efficient dense models. Moreover, we introduce nuclear norm regularisation to ensure lower rank and a learnable singular value selection strategy to determine the approximate truncation rate for each matrix. Experiments on multiple speech tasks demonstrate that the proposed method can convert an unstructured sparse model into a light-weight and hardware-friendly dense model with comparable or superior performance.
- 第一作者:
- Haoyu Wang
- 论文类型:
- 期刊论文
- 通讯作者:
- Wei-Qiang Zhang
- 是否译文:
- 否
- 发表时间:
- 2024-07-25
- 发布期刊链接:
- https://ieeexplore.ieee.org/document/10609479