Publication number: US20160073198A1
Publication date: 2016-03-10
Application number: US14777825
Filing date: 2013-03-20
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Miikka VILERMO, Mikko TAMMI, Joonas NIKUNEN, Tuomas VIRTANEN
CPC classification number: H04R5/027, G10L21/028, H04N7/15, H04R1/406, H04R3/005, H04R2201/401, H04R2430/23
Abstract: An apparatus comprising: an input configured to receive at least two audio signals; a frequency domain transformer configured to transform the at least two audio signals into a frequency domain representation of the at least two signals; a spatial covariance processor configured to generate an observed spatial covariance matrix from the frequency domain representations of the at least two audio signals; a beamformer configured to generate a spatial covariance matrix model comprising at least one beamformer kernel; a matrix factorizer configured to generate a linear magnitude model of audio objects; to combine the spatial covariance matrix model and the linear magnitude model; and further configured to determine at least one combination parameter, such that the at least one parameter for the combination attempts to optimise the combination; and a separator configured to cluster the audio objects based on the at least one combination parameter to create separated audio sources.
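Illustrative sketch (not part of the patent record): the first two stages named in the abstract, the frequency domain transform and the observed spatial covariance matrix, might look roughly like the following. The function names, the Hann window, the frame length and hop size, and the use of a plain outer product per time-frequency bin are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np

def stft(x, frame_len=512, hop=256):
    """Short-time Fourier transform of a 1-D signal; returns (frames, bins)."""
    window = np.hanning(frame_len)  # assumed window choice
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop:i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)

def observed_spatial_covariance(signals, frame_len=512, hop=256):
    """Hypothetical helper: signals has shape (channels, samples).

    Returns an array of shape (frames, bins, channels, channels) holding
    the outer product x x^H for every time-frequency bin.
    """
    # Stack per-channel STFTs so the channel axis is last: (frames, bins, channels)
    X = np.stack([stft(ch, frame_len, hop) for ch in signals], axis=-1)
    # Outer product of the channel vector with its conjugate in each bin
    return X[..., :, None] * np.conj(X[..., None, :])
```

Each resulting matrix is Hermitian, with the per-channel power on the diagonal and the inter-channel phase and level relationships off the diagonal. The beamformer kernels, the matrix factorization, and the clustering stages described in the abstract are not covered by this sketch.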