
Prediction-segmentation tasks for self-supervision of anomaly detection networks under noisy conditions
For detecting anomalies from sounds generated by electronic devices, self-supervised learning of deep neural networks (DNNs) has been popularly employed. In self-supervised learning, a DNN model is trained over normal data to solve some pretext tasks, and test data giving reduced task
performance are regarded as anomalies. Popular choices for the pretext task are the reconstruction and the classification tasks, where a model is trained to predict masked parts of the spectrogram and to classify the internal classes of normal data, respectively. However, the reconstruction
task is hard to distinguish anomalies from noises in noisy conditions, and the classification task often fails to learn meaningful features when the diversity across internal classes is too small or too evident. We propose a combination of prediction and segmentation tasks to overcome these
limitations. For the proposed tasks, two different machine sounds are mixed with a constant ratio, and a model is trained to predict both the mixed spectrogram of future time and mixing ratio based on the present and past sound mixture. We train a WaveNet-based model using dual tasks simultaneously,
which shows remarkable performance improvements over the conventional models and achieves state-of-the-art performance on the DCASE 2020 Task 2 dataset.
The requested document is freely available to subscribers. Users without a subscription can purchase this article.
- Sign in below if you have already registered for online access
Sign in
Document Type: Research Article
Affiliations: School of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST)
Publication date: 30 November 2023
The Noise-Con conference proceedings are sponsored by INCE/USA and the Inter-Noise proceedings by I-INCE. NOVEM (Noise and Vibration Emerging Methods) conference proceedings are included. All NoiseCon Proceedings one year or older are free to download. InterNoise proceedings from outside the USA older than 10 years are free to download. Others are free to INCE/USA members and member societies of I-INCE.
- Membership Information
- INCE Subject Classification
- Ingenta Connect is not responsible for the content or availability of external websites
- Access Key
- Free content
- Partial Free content
- New content
- Open access content
- Partial Open access content
- Subscribed content
- Partial Subscribed content
- Free trial content