Articles with "speech separation" as a keyword



Photo from wikipedia

Single-channel speech separation using empirical mode decomposition and multi pitch information with estimation of number of speakers

Sign Up to like & get
recommendations!
Published in 2017 at "International Journal of Speech Technology"

DOI: 10.1007/s10772-016-9392-y

Abstract: Speech separation is an essential part of any voice recognition system like speaker recognition, speech recognition and hearing aids etc. When speech separation is applied at the front-end of any voice recognition system increases the… read more here.

Keywords: speech; information; speech separation; multi pitch ... See more keywords
Photo from wikipedia

Monaural speech separation using GA-DNN integration scheme

Sign Up to like & get
recommendations!
Published in 2020 at "Applied Acoustics"

DOI: 10.1016/j.apacoust.2019.107140

Abstract: Abstract In this research work, we propose the model based on the Genetic Algorithm (GA) and Deep Neural Network (DNN) to enhance the quality and intelligibility of the noisy speech. In this proposed model, the… read more here.

Keywords: quality intelligibility; mask; speech; model ... See more keywords
Photo from wikipedia

Deep neural networks based binary classification for single channel speaker independent multi-talker speech separation

Sign Up to like & get
recommendations!
Published in 2020 at "Applied Acoustics"

DOI: 10.1016/j.apacoust.2020.107385

Abstract: Abstract Speech separation is an important task of separating a target speech from the mixture signals. Speaker-independent multi-talker speech separation is a challenging task due to unpredictability of the target and interfering speech in the… read more here.

Keywords: speech; target; speaker independent; speech separation ... See more keywords
Photo from wikipedia

Dual-Path Hybrid Attention Network for Monaural Speech Separation

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Access"

DOI: 10.1109/access.2022.3193245

Abstract: Recent advances in the time-domain speech separation methods, particularly those specialized in using attention mechanisms to model sequences, have significantly improved speech separation performance. In this paper, we address monaural (one microphone) speaker separation, mainly… read more here.

Keywords: hybrid attention; dual path; path hybrid; attention ... See more keywords
Photo from wikipedia

Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain

Sign Up to like & get
recommendations!
Published in 2021 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2021.3076374

Abstract: To date, mainstream target speech separation (TSS) approaches are formulated to estimate the complex ratio mask (cRM) of target speech in time-frequency domain under supervised deep learning framework. However, the existing methods are designed in… read more here.

Keywords: complex domain; target; multi channel; speech separation ... See more keywords
Photo from wikipedia

Distributed Microphones Speech Separation by Learning Spatial Information With Recurrent Neural Network

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2022.3188178

Abstract: With the development of deep neural networks, multi-channel speech separation techniques with fixed array geometries have achieved remarkable performance. However, distributed microphone array processing remains a challenging problem because it requires the network to be… read more here.

Keywords: neural network; separation; speech separation; recurrent neural ... See more keywords
Photo by historyhd from unsplash

Alias-and-Separate: Wideband Speech Coding Using Sub-Nyquist Sampling and Speech Separation

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2022.3207381

Abstract: Decimation of a discrete-time signal below the Nyquist rate without applying an appropriate lowpass filter results in a distortion called aliasing. If wideband speech sampled at 16 kHz is decimated by 2 to result in… read more here.

Keywords: speech; wideband; speech coding; wideband speech ... See more keywords
Photo by pinjasaur from unsplash

Multichannel Variational Autoencoder-Based Speech Separation in Designated Speaker Order

Sign Up to like & get
recommendations!
Published in 2022 at "Symmetry"

DOI: 10.3390/sym14122514

Abstract: The multichannel variational autoencoder (MVAE) integrates the rule-based update of a separation matrix and the deep generative model and proves to be a competitive speech separation method. However, the output (global) permutation ambiguity still exists… read more here.

Keywords: speaker; multichannel variational; variational autoencoder; separation ... See more keywords