-
公开(公告)号:US11922956B2
公开(公告)日:2024-03-05
申请号:US17653332
申请日:2022-03-03
发明人: Sascha Disch , Frederik Nagel , Ralf Geiger , Balaji Nagendran Thoshkahna , Konstantin Schmidt , Stefan Bayer , Christian Neukam , Bernd Edler , Christian Helmrich
IPC分类号: G10L19/00 , G10L19/008 , G10L19/02 , G10L19/022 , G10L19/025 , G10L19/03 , G10L19/032 , G10L19/06 , G10L21/0388 , G10L25/06 , G10L25/18 , G10L25/21 , H04S1/00
CPC分类号: G10L19/008 , G10L19/0204 , G10L19/0208 , G10L19/0212 , G10L19/022 , G10L19/025 , G10L19/03 , G10L19/032 , G10L19/06 , G10L21/0388 , G10L25/06 , G10L25/18 , G10L25/21 , H04S1/007 , G10L19/02
摘要: An apparatus for decoding an encoded audio signal, includes a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution being lower than the first spectral resolution; a frequency regenerator for regenerating every constructed second spectral portion having the first spectral resolution using a first spectral portion and spectral envelope information for the second spectral portion; and a spectrum time converter for converting the first decoded representation and the reconstructed second spectral portion into a time representation.
-
公开(公告)号:US11875805B2
公开(公告)日:2024-01-16
申请号:US17495184
申请日:2021-10-06
IPC分类号: G10L19/00 , G10L19/02 , G10L21/038 , G10L21/0388 , G10L19/26
CPC分类号: G10L19/0208 , G10L19/02 , G10L19/0212 , G10L19/26 , G10L21/038 , G10L21/0388
摘要: There is provided methods and apparatuses for decoding and encoding of audio signals. In particular, a method for decoding includes receiving a waveform-coded signal having a spectral content corresponding to a subset of the frequency range above a cross-over frequency. The waveform-coded signal is interleaved with a parametric high frequency reconstruction of the audio signal above the cross-over frequency. In this way an improved reconstruction of the high frequency bands of the audio signal is achieved.
-
公开(公告)号:US20230410832A1
公开(公告)日:2023-12-21
申请号:US18033025
申请日:2020-11-05
IPC分类号: G10L21/0388 , G10L19/008
CPC分类号: G10L21/0388 , G10L19/008
摘要: For each frame, an n-th channel compensated decoded sound signal {tilde over ( )}X′n is obtained that is a signal obtained by compensating a high frequency of an n-th channel purified decoded sound signal {tilde over ( )}Xn obtained by performing signal processing in a time domain on an n-th channel decoded sound signal {circumflex over ( )}Xn that is a decoded sound signal of each channel of stereo obtained by decoding a stereo code CS. At this time, for the each frame with respect to the each channel, an n-th channel high-frequency compensation gain ρn that is a value for bringing high-frequency energy of {tilde over ( )}X′n close to high-frequency energy of {circumflex over ( )}Xn is obtained, and for the each frame with respect to the each channel, a signal obtained by adding {tilde over ( )}Xn and a signal obtained by multiplying a high-frequency component of a monaural decoded sound signal that is obtained by decoding a monaural code CM that is a code different from the stereo code CS or a signal obtained by upmixing, for the each channel, the monaural decoded sound signal by the n-th channel high-frequency compensation gain ρn is obtained and output as the n-th channel compensated decoded sound signal {tilde over ( )}X′n.
-
公开(公告)号:US11769512B2
公开(公告)日:2023-09-26
申请号:US17217533
申请日:2021-03-30
发明人: Christian Neukam , Sascha Disch , Frederik Nagel , Andreas Niedermeier , Konstantin Schmidt , Balaji Nagendran Thoshkahna
IPC分类号: G10L19/00 , G10L19/008 , G10L25/18 , G10L25/21 , G10L21/0388 , G10L19/025 , G10L19/03 , G10L19/02 , G10L19/022 , G10L19/032 , G10L19/06 , G10L25/06 , H04S1/00
CPC分类号: G10L19/008 , G10L19/022 , G10L19/0204 , G10L19/025 , G10L19/0208 , G10L19/0212 , G10L19/03 , G10L19/032 , G10L19/06 , G10L21/0388 , G10L25/06 , G10L25/18 , G10L25/21 , H04S1/007 , G10L19/02
摘要: An apparatus for decoding an encoded signal includes: an audio decoder for decoding an encoded representation of a first set of first spectral portions to obtain a decoded first set of first spectral portions; a parametric decoder for decoding an encoded parametric representation of a second set of second spectral portions to obtain a decoded representation of the parametric representation, wherein the parametric information includes, for each target frequency tile, a source region identification as a matching information; and a frequency regenerator for regenerating a target frequency tile using a source region from the first set of first spectral portions identified by the matching information.
-
公开(公告)号:US20230197103A1
公开(公告)日:2023-06-22
申请号:US18113406
申请日:2023-02-23
IPC分类号: G10L21/0388 , G10L19/008 , G10L19/02 , H04S3/00
CPC分类号: G10L21/0388 , G10L19/02 , G10L19/008 , H04S3/008
摘要: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
-
公开(公告)号:US11657788B2
公开(公告)日:2023-05-23
申请号:US17549661
申请日:2021-12-13
发明人: Per Ekstrand , Lars Villemoes , Per Hedelin
IPC分类号: G10L21/038 , G10H1/00 , G10L19/26 , G10H1/12 , G10L21/0388
CPC分类号: G10H1/0091 , G10H1/125 , G10L19/265 , G10L21/038 , G10H2210/311 , G10L21/0388
摘要: The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular, a system configured to generate a high frequency component of a signal from a low frequency component of the signal is described. The system may comprise an analysis filter bank (501) configured to provide a set of analysis subband signals from the low frequency component of the signal; wherein the set of analysis subband signals comprises at least two analysis subband signals; wherein the analysis filter bank (501) has a frequency resolution of Δf. The system further comprises a nonlinear processing unit (502) configured to determine a set of synthesis subband signals from the set of analysis subband signals using a transposition order P; wherein the set of synthesis subband signals comprises a portion of the set of analysis subband signals phase shifted by an amount derived from the transposition order P; and a synthesis filter bank (504) configured to generate the high frequency component of the signal from the set of synthesis subband signals; wherein the synthesis filter bank (504) has a frequency resolution of FΔf; with F being a resolution factor, with F≥1; wherein the transposition order P is different from the resolution factor F.
-
公开(公告)号:US20230142711A1
公开(公告)日:2023-05-11
申请号:US18096655
申请日:2023-01-13
申请人: Staton Techiya LLC
发明人: John Usher , Dan Ellis
IPC分类号: G10L21/0388 , H04R3/00
CPC分类号: G10L21/0388 , H04R3/00 , H04R1/028
摘要: A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the input narrowband audio signal. Other embodiments are disclosed.
-
公开(公告)号:US20230087552A1
公开(公告)日:2023-03-23
申请号:US17987545
申请日:2022-11-15
IPC分类号: G10L21/0388 , G10L19/008 , G10L19/02 , H04S3/00
摘要: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
-
公开(公告)号:US11527256B2
公开(公告)日:2022-12-13
申请号:US17050669
申请日:2019-04-25
IPC分类号: G10L21/0388 , G10L19/008 , G10L19/02 , H04S3/00
摘要: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
-
公开(公告)号:US11430464B2
公开(公告)日:2022-08-30
申请号:US16962060
申请日:2018-12-03
IPC分类号: G10L21/038 , G10L21/0388 , G10L19/032
摘要: A decoding apparatus includes: a bandwidth extending part 25 obtaining a decoded extended frequency spectrum sequence by arranging samples based on K samples included in a frequency-domain sample sequence obtained by decoding, on a higher side than the frequency-domain sample sequence; and a fricative sound adjustment releasing part 23 obtaining, if inputted information indicating whether a hissing sound or not indicates being a hissing sound, what is obtained by exchanging all or a part of a low-side frequency sample sequence existing on a lower side than a predetermined frequency in the decoded extended frequency spectrum sequence for all or a part of a high-side frequency sample sequence existing on a higher side than the predetermined frequency in the decoded extended frequency spectrum sequence as an adjusted frequency spectrum sequence, the number of all or the part of the high-side frequency spectrum sequence being the same as the number of all or the part of the low-side frequency spectrum sequence.
-
-
-
-
-
-
-
-
-