-
1.
Publication No.: US12062381B2
Publication date: 2024-08-13
Application No.: US17995081
Filing date: 2021-04-08
Applicant: VOICEAGE CORPORATION
Inventor: Vladimir Malenovsky
IPC classification: G10L19/22 , G10L15/02 , G10L19/008 , G10L19/12 , G10L19/20 , G10L25/18 , G10L25/21 , G10L25/81 , G10L25/84 , G10L25/90
CPC classification: G10L19/22 , G10L15/02 , G10L19/008 , G10L19/12 , G10L19/20 , G10L25/18 , G10L25/21 , G10L25/81 , G10L25/84 , G10L25/90
Abstract: Two-stage speech/music classification device and method classify an input sound signal and select a core encoder for encoding the sound signal. A first stage classifies the input sound signal into one of a number of final classes. A second stage extracts high-level features of the input sound signal and selects the core encoder for encoding the input sound signal in response to the extracted high-level features and the final class selected in the first stage.
-
2.
Publication No.: US20230215448A1
Publication date: 2023-07-06
Application No.: US17995081
Filing date: 2021-04-08
Applicant: VOICEAGE CORPORATION
Inventor: Vladimir MALENOVSKY
IPC classification: G10L19/22 , G10L15/02 , G10L25/81 , G10L25/84 , G10L19/12 , G10L19/008 , G10L25/21 , G10L25/18 , G10L25/90 , G10L19/20
CPC classification: G10L19/22 , G10L15/02 , G10L19/008 , G10L19/12 , G10L19/20 , G10L25/18 , G10L25/21 , G10L25/81 , G10L25/84 , G10L25/90
Abstract: Two-stage speech/music classification device and method classify an input sound signal and select a core encoder for encoding the sound signal. A first stage classifies the input sound signal into one of a number of final classes. A second stage extracts high-level features of the input sound signal and selects the core encoder for encoding the input sound signal in response to the extracted high-level features and the final class selected in the first stage.
-
3.
Publication No.: USRE49363E1
Publication date: 2023-01-10
Application No.: US15877829
Filing date: 2018-01-23
Applicant: VoiceAge Corporation
Inventors: Philippe Gournay , Bruno Bessette , Redwan Salami
Abstract: A device and a method for quantizing a LPC filter in the form of an input vector in a quantization domain, comprises a calculator of a first-stage approximation of the input vector, a subtractor of the first-stage approximation from the input vector to produce a residual vector, a calculator of a weighting function from the first-stage approximation, a warper of the residual vector with the weighting function, and a quantizer of the weighted residual vector to supply a quantized weighted residual vector. A device and a method for inverse quantizing of a LPC filter, comprises means for receiving coded indices representative of a first-stage approximation of a vector representative of the LPC filter in a quantization domain and of a quantized weighted residual version of the vector, a calculator of an inverse weighting function from the first-stage approximation, an inverse quantizer of the quantized weighted residual version of the vector to produce a weighted residual vector, a multiplier of the weighted residual vector by the inverse weighting function to produce a residual vector, and an adder of the first-stage approximation with the residual vector to produce the vector representative of the LPC filter in the quantization domain.
-
4.
Publication No.: US11276412B2
Publication date: 2022-03-15
Application No.: US16648623
Filing date: 2018-09-20
Applicant: VOICEAGE CORPORATION
Inventor: Vaclav Eksler
IPC classification: G10L19/002 , G10L19/038 , G10L19/12 , G10L19/24
Abstract: A method and device allocates a bit-budget to a plurality of first parts of a CELP core module of (a) an encoder for encoding a sound signal or (b) a decoder for decoding the sound signal. In the method and device, bit-budget allocation tables assign, for each of a plurality of intermediate bit rates, respective bit-budgets to the first CELP core module parts. A CELP core module bit rate is determined and one of the intermediate bit rates is selected based on the determined CELP core module bit rate. The respective bit-budgets assigned by the bit-budget allocation tables for the selected intermediate bit rate are allocated to the first CELP core module parts.
-
5.
Publication No.: US20210027794A1
Publication date: 2021-01-28
Application No.: US17071299
Filing date: 2020-10-15
Applicant: VOICEAGE CORPORATION
Inventors: Tommy Vaillancourt , Milan Jelinek
IPC classification: G10L19/008
Abstract: A stereo sound decoding method and system decode left and right channels of a stereo sound signal, using received encoding parameters comprising encoding parameters of a primary channel, encoding parameters of a secondary channel, and a factor β. The primary channel encoding parameters comprise LP filter coefficients of the primary channel. The primary channel is decoded in response to the primary channel encoding parameters. The secondary channel is decoded using one of a plurality of coding models, wherein at least one of the coding models uses the primary channel LP filter coefficients to decode the secondary channel. The decoded primary and secondary channels are time domain up-mixed using the factor β to produce the decoded left and right channels of the stereo sound signal, wherein the factor β determines respective contributions of the primary and secondary channels upon production of the left and right channels.
-
6.
Publication No.: US10573327B2
Publication date: 2020-02-25
Application No.: US16369156
Filing date: 2019-03-29
Applicant: VoiceAge Corporation
Inventors: Tommy Vaillancourt , Milan Jelinek
IPC classification: G10L19/008 , G10L19/06 , G10L19/24 , G10L19/09 , G10L25/51 , G10L25/03 , G10L19/002 , G10L19/032 , G10L25/21 , H04S1/00
Abstract: A stereo sound signal encoding method and system for time domain down mixing right and left channels of an input stereo sound signal into primary and secondary channels, determine normalized correlations of the left channel and right channel in relation to a monophonic signal version of the sound. A long-term correlation difference is determined on the basis of the normalized correlation of the left channel and the normalized correlation of the right channel. The long-term correlation difference is converted into a factor β, and the left and right channels are mixed to produce the primary and secondary channels using the factor β, wherein the factor β determines respective contributions of the left and right channels upon production of the primary and secondary channels.
-
7.
Publication No.: US10522157B2
Publication date: 2019-12-31
Application No.: US15761895
Filing date: 2016-09-22
Applicant: VOICEAGE CORPORATION
Inventors: Tommy Vaillancourt , Milan Jelinek
IPC classification: G10L19/008 , G10L19/002 , G10L19/06 , G10L19/09 , G10L25/03 , G10L25/21 , G10L25/51 , H04S1/00 , G10L19/24 , G10L19/032
Abstract: A method and system are implemented in a stereo sound signal encoding system for time domain down mixing right and left channels of an input stereo sound signal into primary and secondary channels. Correlation of the primary and secondary channels of previous frames is determined, and an out-of-phase condition of the left and right channels is detected based on the correlation of the primary and secondary channels of the previous frames. The left and right channels are time domain down mixed, as a function of the detection, to produce the primary and secondary channels using a factor β, wherein the factor β determines respective contributions of the left and right channels upon production of the primary and secondary channels.
-
8.
Publication No.: US20190237087A1
Publication date: 2019-08-01
Application No.: US16381706
Filing date: 2019-04-11
Applicant: VoiceAge Corporation
Inventors: Tommy Vaillancourt , Milan Jelinek
IPC classification: G10L19/008 , G10L25/51 , G10L25/21 , H04S1/00 , G10L19/032 , G10L19/002 , G10L25/03 , G10L19/09 , G10L19/24 , G10L19/06
CPC classification: G10L19/008 , G10L19/002 , G10L19/032 , G10L19/06 , G10L19/09 , G10L19/24 , G10L25/03 , G10L25/21 , G10L25/51 , H04S1/007 , H04S2400/01 , H04S2400/03
Abstract: A stereo sound encoding method and system for encoding left and right channels of a stereo sound signal, down mix the left and right channels of the stereo sound signal to produce primary and secondary channels, encode the primary channel, and encode the secondary channel. Encoding the secondary channel comprises analyzing coherence between coding parameters calculated during the secondary channel encoding and coding parameters calculated during the primary channel encoding to decide if the coding parameters calculated during the primary channel encoding are sufficiently close to the coding parameters calculated during the secondary channel encoding to be re-used during the secondary channel encoding.
-
9.
Publication No.: US08930198B2
Publication date: 2015-01-06
Application No.: US13004385
Filing date: 2011-01-11
Applicants: Bernhard Grill , Roch Lefebvre , Bruno Bessette , Jimmy Lapierre , Philippe Gournay , Redwan Salami , Stefan Bayer , Guillaume Fuchs , Stefan Geyersberger , Ralf Geiger , Johannes Hilpert , Ulrich Kraemer , Jeremie Lecomte , Markus Multrus , Max Neuendorf , Harald Popp , Nikolaus Rettelbach
Inventors: Bernhard Grill , Roch Lefebvre , Bruno Bessette , Jimmy Lapierre , Philippe Gournay , Redwan Salami , Stefan Bayer , Guillaume Fuchs , Stefan Geyersberger , Ralf Geiger , Johannes Hilpert , Ulrich Kraemer , Jeremie Lecomte , Markus Multrus , Max Neuendorf , Harald Popp , Nikolaus Rettelbach
IPC classification: G10L19/00 , G10L19/18 , G10L19/16 , G10L19/008 , G10L19/02
CPC classification: G10L19/008 , G10L19/0017 , G10L19/0212 , G10L19/173 , G10L19/18 , G10L2019/0008
Abstract: An audio encoder has a first information sink oriented encoding branch, a second information source or SNR oriented encoding branch, and a switch for switching between the first encoding branch and the second encoding branch, wherein the second encoding branch has a converter into a specific domain different from the spectral domain, and wherein the second encoding branch furthermore has a specific domain coding branch, and a specific spectral domain coding branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder for decoding a signal, and a third domain decoder and two cascaded switches for switching between the decoders.
-
10.
Publication No.: US20130121508A1
Publication date: 2013-05-16
Application No.: US13667921
Filing date: 2012-11-02
Applicant: Voiceage Corporation
Inventors: Tommy Vaillancourt , Milan Jelinek
IPC classification: H03G3/20
CPC classification: H03G3/20 , G10L19/08 , G10L19/20 , G10L19/22 , G10L19/26 , G10L25/78 , G10L25/81 , G10L25/93
Abstract: A method and device for modifying a synthesis of a time-domain excitation decoded by a time-domain decoder, wherein the synthesis of the decoded time-domain excitation is classified into one of a number of categories. The decoded time-domain excitation is converted into a frequency-domain excitation, and the frequency-domain excitation is modified as a function of the category in which the synthesis of the decoded time-domain excitation is classified. The modified frequency-domain excitation is converted into a modified time-domain excitation, and a synthesis filter is supplied with the modified time-domain excitation to produce a modified synthesis of the decoded time-domain excitation.