《应用声学》编辑部

文章摘要

康坊,杨飞然,杨军.子带t分布的快速独立向量分析在语声盲源分离中的应用*[J].,2022,41(2):173-182

子带t分布的快速独立向量分析在语声盲源分离中的应用*

Fast independent vector analysis based on clique t-distribution source model for blind speech separation

投稿时间：2021-03-15 修订日期：2022-03-01

中文摘要:

为了提高独立向量分析算法在盲语音分离任务中的分离性能,降低算法计算复杂度,并改善目前尚未完全解决的顺序模糊性的问题,本文提出一种基于子带t分布的快速独立向量分析算法。在声源模型方面,该算法首先利用语音信号重尾分布的特性,假设声源概率密度函数服从t分布,同时采用子带建模的方法来增强同一声源相邻频点的高阶依赖性,进而减轻频点间的顺序不一致问题。在空间模型方面,该算法采用秩1更新的方式估计声源信号,避免矩阵求逆操作和分离矩阵的估计,从而降低计算复杂度。实验结果表明,与现有的基于独立向量分析的盲源分离算法相比,该算法能够在相同的迭代次数下取得更优的语音分离性能。

英文摘要:

We propose a novel fast independent vector analysis based on clique t-distribution source model. The proposed method assumes that the source generative model follows the complex Student’s t distribution which is known to be more suited for speech signals. We independently model sources in each overlapping frequency clique to improve the dependence among neighboring frequencies and further solve the permutation problem. Furthermore, we integrate the modified source model into the iterative source steering algorithm to reduce the computational cost via rank-1 update rules. The experimental results show that the proposed method achieves a better speech separation performance than the conventional methods.

DOI：10.11684/j.issn.1000-310X.2022.02.001

中文关键词: 盲源分离秩1更新独立向量分析子带t分布

英文关键词: Blind source separation, Rank-1 update, Independent vector analysis, Clique t-distribution

基金项目:国家自然科学基金(编号: 62171438)，中国科学院青年促进会(编号: 2018027)资助项目

作者	单位	E-mail
康坊	中国科学院声学研究所噪声与振动重点实验室	kangfang@mail.ioa.ac.cn
杨飞然	中国科学院声学研究所噪声与振动重点实验室	feiran@mail.ioa.ac.cn
杨军^*	中国科学院声学研究所噪声与振动重点实验室,中国科学院大学	jyang@mail.ioa.ac.cn

摘要点击次数: 1249

全文下载次数: 1211

查看全文查看/发表评论下载PDF阅读器

关闭