
Environmental noise tagging via audio spectrogram transformer
Environmental noise pollution is a significant concern for public health and well-being. It is important to accurately identify and classify environmental noise sources to develop effective noise reduction strategies. This paper examines using an audio spectrogram transformer (AST)
for environmental noise tagging tasks. The AST is a pure attention-based model that takes the spectrograms of the audio signals as input and calculates the self-attention without convolutions. Previously, it was pre-trained on large datasets such as ImageNet and AudioSet, showing higher precision
than prior work. Although the hyperparameters were given, many have not been clear from the previous literature. Results show that there are a few choices for the patch split overlap, more overlap does not result in significantly improved performance. It is also shown that instead of the default
128 frequency bins, 96 is another choice, which can reduce the computations. The results further show that 30% - 40% can be masked for the frequency, 20% - 50% for the time dimension. The trained model is further tested on the dataset of different environmental noise sources collected by SiteHive
Hexanodes across Australia and New Zealand. Results show that the AST model can achieve high accuracy in identifying different environmental noises.
The requested document is freely available to subscribers. Users without a subscription can purchase this article.
- Sign in below if you have already registered for online access
Sign in
Document Type: Research Article
Affiliations: 1: Centre for Audio, Acoustics and Vibration, Faculty of Engineering and IT, University of Technology Sydney 2: SiteHive Pty Ltd
Publication date: 30 November 2023
The Noise-Con conference proceedings are sponsored by INCE/USA and the Inter-Noise proceedings by I-INCE. NOVEM (Noise and Vibration Emerging Methods) conference proceedings are included. All NoiseCon Proceedings one year or older are free to download. InterNoise proceedings from outside the USA older than 10 years are free to download. Others are free to INCE/USA members and member societies of I-INCE.
- Membership Information
- INCE Subject Classification
- Ingenta Connect is not responsible for the content or availability of external websites
- Access Key
- Free content
- Partial Free content
- New content
- Open access content
- Partial Open access content
- Subscribed content
- Partial Subscribed content
- Free trial content