A new speech enhancement method based on Swin-UNet model

U-shaped Network (UNet) has shown excellent performance in a variety of speech enhancement tasks. However, owing to the intrinsic locality of the convolution operation, a traditional UNet built from convolutional neural network (CNN) blocks cannot learn global and long-range information well. In this work, we propose a new Swin-UNet-based speech enhancement method. Unlike the traditional UNet model, all CNN blocks are replaced with Swin-Transformer blocks to capture richer multi-scale contextual information. The Swin-UNet model employs a shifted-window mechanism that not only overcomes the high computational complexity of the standard Transformer but also strengthens global information interaction by exploiting the Transformer's global modeling capability. Through hierarchical Swin-Transformer blocks, global and local speech features can be fully leveraged to improve speech reconstruction. Experimental results confirm that the proposed method removes more background noise while maintaining good objective speech quality.
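To make the shifted-window idea concrete, the following is a minimal, illustrative sketch (not the authors' implementation) of windowed self-attention over a 1-D sequence of frame features, as a Swin-Transformer block applies it: the sequence is split into fixed-size windows, attention is computed only within each window, and alternate blocks cyclically shift the sequence by half a window so that information flows across window boundaries. All function names here are hypothetical; a real model would add learned projections, multiple heads, relative position bias, and residual/MLP sublayers.

```python
import math

def window_partition(seq, window_size):
    """Split a sequence of feature vectors into non-overlapping windows."""
    return [seq[i:i + window_size] for i in range(0, len(seq), window_size)]

def cyclic_shift(seq, shift):
    """Roll the sequence so successive blocks see different window boundaries."""
    return seq[shift:] + seq[:shift]

def window_attention(window):
    """Plain (single-head, unprojected) scaled dot-product attention in one window."""
    out = []
    for q in window:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(len(q))
                  for k in window]
        m = max(scores)                      # subtract max for numerical stability
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        weights = [e / z for e in exps]
        # Weighted sum of the window's vectors, per feature dimension.
        out.append([sum(w * k[d] for w, k in zip(weights, window))
                    for d in range(len(q))])
    return out

def swin_block(seq, window_size, shifted):
    """One (shifted-)window attention pass; shifted blocks offset by half a window."""
    shift = window_size // 2 if shifted else 0
    x = cyclic_shift(seq, shift)
    windows = window_partition(x, window_size)
    x = [v for w in windows for v in window_attention(w)]
    return cyclic_shift(x, -shift)           # undo the shift
```

Because each token attends only to the `window_size` tokens in its window, the attention cost is linear in sequence length rather than quadratic, which is the complexity advantage the abstract refers to; the alternating shift is what restores cross-window (global) interaction over stacked blocks.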

Keywords: 74.3; 74.8

Document Type: Research Article

Affiliations: 1: School of Information, Nanchang Hangkong University 2: College of Physics and Electronics, Shandong Normal University

Publication date: 01 July 2023
