Non-negative Tensor Factorization for Speech Enhancement

Liang He; Weiqiang Zhang; Mengnan Shi

doi:10.2991/icaita-16.2016.5

<Previous Article In Volume

Next Article In Volume>

Non-negative Tensor Factorization for Speech Enhancement

Authors

Liang He, Weiqiang Zhang, Mengnan Shi

Corresponding Author

Liang He

Available Online January 2016.

DOI: 10.2991/icaita-16.2016.5 How to use a DOI?
Keywords: non-negative tensor factorization (NTF); speech enhancement; sound source separation
Abstract: This paper proposes an algorithm for speech enhancement by non-negative tensor factorisation. We group adjacent time-frequency matrices in the spectrograms together to form a tensor as a basic input in our algorithm. The non-negative tensor factorisation is followed to perform sound source separation between speeches and noises. The proposed strategy benefits from both short time spectral analysis and long term information. From the consideration of auditory theory and linguistics, the latter preserves the temporal dynamics information and intrinsic structure of speech, which are important for the continuity and integrity of hearing. We collected several types of real-life noises and conducted experiments on the TIMIT database. Experimental results demonstrated that the segmental signal to noise ratio (SSNR) and the perceptual evaluation of speech quality (PESQ) were significantly improved respectively.
Copyright: © 2016, the Authors. Published by Atlantis Press.
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the 2016 International Conference on Artificial Intelligence: Technologies and Applications
Series: Advances in Intelligent Systems Research
Publication Date: January 2016
ISBN: 10.2991/icaita-16.2016.5
ISSN: 1951-6851
DOI: 10.2991/icaita-16.2016.5 How to use a DOI?
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

ris enw bib

TY  - CONF
AU  - Liang He
AU  - Weiqiang Zhang
AU  - Mengnan Shi
PY  - 2016/01
DA  - 2016/01
TI  - Non-negative Tensor Factorization for Speech Enhancement
BT  - Proceedings of the 2016 International Conference on Artificial Intelligence: Technologies and Applications
PB  - Atlantis Press
SP  - 18
EP  - 22
SN  - 1951-6851
UR  - https://doi.org/10.2991/icaita-16.2016.5
DO  - 10.2991/icaita-16.2016.5
ID  - He2016/01
ER  -

download .riscopy to clipboard