Open Access Open Access  Restricted Access Subscription or Fee Access

Real-time Implementation of Binary Mask Algorithm for Hearing Prosthetics

Swati R. Pawar, Hemantkumar B. Mali

Abstract


As the current speech enhancement algorithms can give results for improved speech audibility only. So, our main motivation is to develop speech enhancement algorithm that would improve performance in speech for hearing impaired persons. In this paper, an algorithm is presented for improving speech intelligibility. Various speech enhancement algorithms were developed but only some of them can be used for real time hearing aid applications. This proposed algorithm can be used for practical hearing prosthetic devices. Implementation of the binary asking algorithm uses a bank of band-pass filters to perform mapping of signals. Also, classification is performed with a signal-to-noise (SNR) estimate and a comparator. This includes spatial filtering method, classification of signals such as original and noisy signal. After this based on SNR threshold, level signals are recombined to obtain reduced noise level in speech signal. In this, Matlab implementation of binary mask algorithm is provided which shows better results for speech intelligibility as compared to other algorithms.

 


Full Text:

PDF

References


Wang DL, Brown GJ. Computational auditory scene analysis: Principles, algorithms, and applications, Hoboken, NJ: Wiley/IEEE Press, 2006.

Scalart P, Filho J. Speech enhancement based on a priori signal to noise estimation, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. 1996, 629–632p.

Gustafsson H, Nordholm S, Claesson I. Spectral subtraction using reduced delay convolution and adaptive averaging, IEEE T Audio process. Nov. 2001; 9(8): 799–807p.

Journal of Microelectronics and Solid State Devices

Volume 3, Issue 1

ISSN: 2455-3336(online)

JoMSD (2016) 36-42 © STM Journals 2016. All Rights Reserved Page 42

Li J, Sakamoto S, Hongo S, et al. Two- Stage Bin- aural Speech Enhancement With Wiener Filter Based On Equalization-Cancellation Model, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 18-21, Oct. 2009, 133–136p.

Hanson V, Odame K. Real-Time Embedded Implementation of the Binary Mask Algorithm for Hearing Prosthetics, IEEE T Biomed Circ Syst. August 2014; 8(4): 465–473p.

Hanson V, Odame K. Real-time source separation on a field programmable gate array platform, in Proc. Annual International Conference of IEEE Engineering in Medicine and Biology Soc. Aug. 2012, 2925–2928p.

Loizou PC, Kim G. Reasons why current speech-enhancement algorithms do not improve speech intelligibility and suggested solutions, IEEE T Audio Speech Language Process. Jan. 2011; 19(1): 4756p.

Taal CH, Hendriks RC, Heusdens R, et al. An algorithm for intelligibility prediction of time-frequency weighted noisy speech, IEEE T Audio Speech Language Process. Sep. 2011; 19(7): 2125–2136p.

Li N, Loizou PC. Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction, J Acoust Soc Amer. Mar. 2008; 123(3): 1673–1682p.

Boldt J, Kjems U, Pedersen MS, et al. Estimation of the ideal binary mask using directional systems, in Proc. 11th Int. Workshop Acoustic Echo and Noise Control, Sep. 2008; 395–401p.

Marrone N, Mason CR, Kidd G Jr. The effects of hearing loss and age on the benefit of spatial separation between multiple talkers in reverberant rooms, J Acoust Soc Amer. Nov. 2008; 124(5): 3064–3075p.

Wang D. Time-frequency masking for speech separation and its potential for hearing aid design, Trends Amplification. Dec. 2008; 12(4): 332–353p.

Wang D, Kjems U, Pedersen M, et al. Speech intelligibility in background noise with ideal binary time-frequency masking, J Acoust Soc Amer. April 2009; 125(4): 2336–2347.

Oppenheim AV, et al. Discrete-Time Signal Processing. Upper Saddle River, NJ, USA: Prentice-Hall, Vol. 5, 1999.

Elko GW, Pong ATN. A steerable and variable first-order differential microphone array, in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Munich, Apr. 1997, 223–226p.

Elko GW, Pong ATN. A simple adaptive first-order differential microphone, in Proc. IEEE ASSP Workshop Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, Oct. 1995, 169–172p.

DeLiang Wang, Time-frequency Masking for Speech Separation and Its Potential for Hearing Aid Design, Trends Amplification, Oct. 2008.




DOI: https://doi.org/10.37591/jomsd.v3i1.5204

Refbacks

  • There are currently no refbacks.


Copyright (c) 2021 Journal of Microelectronics and Solid State Devices