Enhancing the Recognition of Speakers in Different Distances using Voice Features

This paper proposes a method to improve speaker recognition at varying distances by adjusting the reference voice according to voice features. Speaker recognition identifies an individual from their voice by analyzing acoustic features of the voice and comparing them with the reference voice in the database. Conventional speaker recognition techniques lose accuracy when speakers are at varying distances from the microphone. In this work, we found that high-frequency signal components decline faster than low-frequency ones as speaker distance increases. Based on this finding, we propose a method that uses a support vector machine (SVM) to classify speaker distance from sound features such as the amplitude sum of high-frequency components and the dynamic range. Once the speaker distance is determined, the reference signal in the database is adjusted for that distance before it is used for speaker recognition. Mel-Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) were used as the recognition algorithm. Experiments were conducted with speakers placed at three distances from the microphone: 0.1, 1, and 2.5 meters. The experimental results show that signal components at 4 kHz and above decline in amplitude faster with increasing distance than lower-frequency components. The recognition results also demonstrate a significant improvement in accuracy.
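
As a rough illustration of two building blocks named in the abstract, the sketch below computes distance-related features and a DTW score in plain Python. It is not the authors' implementation. The function names are hypothetical, the 4 kHz cutoff is taken from the abstract only as a default, peak-to-peak amplitude is an assumed stand-in for "dynamic range" (the abstract does not define it), and the DTW uses a Euclidean local cost as one common choice. The SVM classifier, the MFCC extraction, and the adjustment of the reference signal are omitted.

```python
import cmath
import math


def high_freq_amplitude_sum(signal, sample_rate, cutoff_hz=4000.0):
    """Sum of DFT magnitudes for bins at or above cutoff_hz.

    Uses a naive O(n^2) DFT so the sketch needs no third-party library;
    a real pipeline would use an FFT. The cutoff default is an assumption.
    """
    n = len(signal)
    total = 0.0
    for k in range(n // 2 + 1):
        if k * sample_rate / n < cutoff_hz:
            continue  # skip low-frequency bins
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(signal))
        total += abs(coeff)
    return total


def peak_to_peak(signal):
    """Assumed stand-in for 'dynamic range': max sample minus min sample."""
    return max(signal) - min(signal)


def dtw_distance(a, b):
    """Dynamic Time Warping cost between two sequences of feature vectors.

    Each element of a and b is a vector (for example one MFCC frame);
    the local cost is the Euclidean distance between frames.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(cost[i - 1][j],      # step in a only
                                     cost[i][j - 1],      # step in b only
                                     cost[i - 1][j - 1])  # step in both
    return cost[n][m]
```

In a setup like the one described, the two feature values would be fed to a distance classifier, and `dtw_distance` would compare the MFCC frames of an utterance against the distance-adjusted reference, with the lowest cost indicating the best-matching speaker.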

Document Type: Research Article

Affiliations: Department of Mechanical Engineering, National Yang Ming Chiao Tung University

Publication date: 30 November 2023

More about this publication?
  • The Noise-Con conference proceedings are sponsored by INCE/USA and the Inter-Noise proceedings by I-INCE. NOVEM (Noise and Vibration Emerging Methods) conference proceedings are included. All NoiseCon Proceedings one year or older are free to download. InterNoise proceedings from outside the USA older than 10 years are free to download. Others are free to INCE/USA members and member societies of I-INCE.
