-
Publication No.: US12106765B2
Publication date: 2024-10-01
Application No.: US17757968
Filing date: 2020-11-09
Inventors: Xianchun Zhang, Jinyun Zhong
IPC classification: G10L21/0208, G10L19/02, G10L21/0324, G10L21/038, H04R1/10, H04R3/00
CPC classification: G10L21/0208, G10L19/02, G10L21/0324, G10L21/038, H04R1/10, H04R3/005, G10L2021/02082
Abstract: A speech signal processing method and apparatus. The method includes preprocessing a speech signal that is in a first frequency band and that is collected by an ear canal speech collector, to obtain a first speech signal; preprocessing a speech signal that is in a second frequency band and that is collected by at least one external speech collector, to obtain an external speech signal, where frequency ranges of the first frequency band and the second frequency band are different; performing correlation processing on the first speech signal and the external speech signal to obtain a second speech signal; and outputting a target speech signal, where the target speech signal includes the first speech signal and the second speech signal.
-
Publication No.: US20240313729A1
Publication date: 2024-09-19
Application No.: US18672224
Filing date: 2024-05-23
IPC classification: H03G3/30, G10L21/038, G10L25/21, G10L25/51, H03G1/00, H03G7/00, H03G9/00, H04R3/00, H04R3/04
CPC classification: H03G3/3005, G10L21/038, G10L25/21, G10L25/51, H03G1/00, H03G3/3089, H03G7/007, H03G9/005, H04R3/00, H04R3/04, H04R2430/03
Abstract: In some embodiments, a method for processing an audio signal in an audio processing apparatus is disclosed. The method includes receiving an audio signal and a parameter, the parameter indicating a location of an auditory event boundary. An audio portion between consecutive auditory event boundaries constitutes an auditory event. The method further includes applying a modification to the audio signal based in part on an occurrence of the auditory event. The parameter may be generated by monitoring a characteristic of the audio signal and identifying a change in the characteristic.
-
Publication No.: US12094477B2
Publication date: 2024-09-17
Application No.: US18318443
Filing date: 2023-05-16
Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
IPC classification: G10L19/16, G10L19/035, G10L19/24, G10L21/038
CPC classification: G10L19/167, G10L19/035, G10L19/24, G10L21/038
Abstract: Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.
-
Publication No.: US20240282315A1
Publication date: 2024-08-22
Application No.: US18653833
Filing date: 2024-05-02
Inventors: Kristofer Kjoerling
IPC classification: G10L19/00, G10L19/02, G10L21/038
CPC classification: G10L19/0017, G10L19/0204, G10L21/038
Abstract: The application relates to HFR (High Frequency Reconstruction/Regeneration) of audio signals. In particular, the application relates to a method and system for performing HFR of audio signals having large variations in energy level across the low frequency range which is used to reconstruct the high frequencies of the audio signal. A system configured to generate a plurality of high frequency subband signals covering a high frequency interval from a plurality of low frequency subband signals is described. The system comprises means for receiving the plurality of low frequency subband signals; means for receiving a set of target energies, each target energy covering a different target interval within the high frequency interval and being indicative of the desired energy of one or more high frequency subband signals lying within the target interval; means for generating the plurality of high frequency subband signals from the plurality of low frequency subband signals and from a plurality of spectral gain coefficients associated with the plurality of low frequency subband signals, respectively; and means for adjusting the energy of the plurality of high frequency subband signals using the set of target energies.
-
Publication No.: US12067996B2
Publication date: 2024-08-20
Application No.: US18179139
Filing date: 2023-03-06
IPC classification: G10L19/04, G10L19/18, G10L21/038
CPC classification: G10L19/04, G10L19/18, G10L21/038
Abstract: A codec allowing for switching between different coding modes is improved by, responsive to a switching instance, performing temporal smoothing and/or blending at a respective transition.
-
Publication No.: US20240271217A1
Publication date: 2024-08-15
Application No.: US18637814
Filing date: 2024-04-17
Inventors: Lars VILLEMOES, Per EKSTRAND
IPC classification: C12Q1/6883, G10L19/02, G10L19/022, G10L19/26, G10L21/038
CPC classification: C12Q1/6883, G10L19/0204, G10L19/022, G10L19/265, G10L21/038, C12Q2600/118, C12Q2600/156
Abstract: The present invention relates to coding of audio signals, and in particular to high frequency reconstruction methods including a frequency domain harmonic transposer. A system and method for generating a high frequency component of a signal from a low frequency component of the signal is described. The system comprises an analysis filter bank (501) comprising an analysis transformation unit (601) having a frequency resolution of Δf; and an analysis window (611) having a duration of DA; the analysis filter bank (501) being configured to provide a set of analysis subband signals from the low frequency component of the signal; a nonlinear processing unit (502, 650) configured to determine a set of synthesis subband signals based on a portion of the set of analysis subband signals, wherein the portion of the set of analysis subband signals is phase shifted by a transposition order T; and a synthesis filter bank (504) comprising a synthesis transformation unit (602) having a frequency resolution of QΔf; and a synthesis window (612) having a duration of DS; the synthesis filter bank (504) being configured to generate the high frequency component of the signal from the set of synthesis subband signals; wherein Q is a frequency resolution factor with Q≥1 and smaller than the transposition order T; and wherein the value of the product of the frequency resolution Δf and the duration DA of the analysis filter bank is selected based on the frequency resolution factor Q.
-
Publication No.: US20240205582A1
Publication date: 2024-06-20
Application No.: US18594004
Filing date: 2024-03-04
Inventors: Lei ZHANG, Junjiang FU, Bingyan YAN, Fengyun LIAO, Xin QI
IPC classification: H04R1/10, G02C11/00, G02C11/06, G06F3/16, G10L21/0208, G10L21/0216, G10L21/038, H04M1/03, H04M1/78, H04R1/02, H04R1/22, H04R1/24, H04R1/26, H04R1/28, H04R1/34, H04R1/38, H04R1/44, H04R3/00, H04R3/02, H04R5/02, H04R5/033, H04R9/06, H04S7/00, H04W4/80
CPC classification: H04R1/1016, G02C11/00, G02C11/10, G06F3/16, G06F3/162, G06F3/165, G10L21/0208, G10L21/038, H04M1/03, H04M1/035, H04M1/78, H04R1/02, H04R1/026, H04R1/028, H04R1/10, H04R1/1008, H04R1/1025, H04R1/1041, H04R1/105, H04R1/1075, H04R1/1083, H04R1/22, H04R1/24, H04R1/245, H04R1/26, H04R1/28, H04R1/2803, H04R1/2807, H04R1/2811, H04R1/2896, H04R1/34, H04R1/342, H04R1/345, H04R1/347, H04R1/38, H04R1/44, H04R3/00, H04R3/005, H04R3/02, H04R5/02, H04R5/033, H04R5/0335, H04R9/06, H04S7/304, H04W4/80, G02C11/06, G10L2021/02166, H04R2201/103, H04R2410/05, H04R2420/07, H04S2400/11
Abstract: The present disclosure may provide an acoustic device. The acoustic device may include a housing, at least one low-frequency acoustic driver, at least one high-frequency acoustic driver, and a noise reduction assembly. The housing may be configured to be rested on a shoulder of a user. The at least one low-frequency acoustic driver may be carried by the housing and configured to output first sound from at least two first sound guiding holes. The at least one high-frequency acoustic driver may be carried by the housing and configured to output second sound from at least two second sound guiding holes. The noise reduction assembly may be configured to receive third sound and reduce noise of the third sound.
-
Publication No.: USRE49999E1
Publication date: 2024-06-04
Application No.: US17845607
Filing date: 2022-06-21
Inventors: Markus Schnell, Manfred Lutzky, Markus Lohwasser, Markus Schmidt, Marc Gayer, Michael Mellar, Bernd Edler, Markus Multrus, Gerald Schuller, Ralf Geiger, Bernhard Grill
IPC classification: G10L19/00, G10L19/02, G10L19/022, G10L25/45, H03H17/02, G10L21/038
CPC classification: G10L19/0204, G10L19/022, G10L25/45, H03H17/0266, G10L21/038
Abstract: An embodiment of an apparatus for generating audio subband values in audio subband channels includes an analysis windower for windowing a frame of time-domain audio input samples being in a time sequence extending from an early sample to a later sample using an analysis window function including a sequence of window coefficients to obtain windowed samples. The analysis window function includes a first number of window coefficients derived from a larger window function including a sequence of a larger second number of window coefficients, wherein the window coefficients of the window function are derived by an interpolation of window coefficients of the larger window function. The apparatus further includes a calculator for calculating the audio subband values using the windowed samples.
-
Publication No.: US12002476B2
Publication date: 2024-06-04
Application No.: US18145797
Filing date: 2022-12-22
Inventors: Kristofer Kjoerling
IPC classification: G10L21/038, G10L19/00, G10L19/02
CPC classification: G10L19/0017, G10L19/0204, G10L21/038
Abstract: The application relates to HFR (High Frequency Reconstruction/Regeneration) of audio signals. In particular, the application relates to a method and system for performing HFR of audio signals having large variations in energy level across the low frequency range which is used to reconstruct the high frequencies of the audio signal. A system configured to generate a plurality of high frequency subband signals covering a high frequency interval from a plurality of low frequency subband signals is described. The system comprises means for receiving the plurality of low frequency subband signals; means for receiving a set of target energies, each target energy covering a different target interval within the high frequency interval and being indicative of the desired energy of one or more high frequency subband signals lying within the target interval; means for generating the plurality of high frequency subband signals from the plurality of low frequency subband signals and from a plurality of spectral gain coefficients associated with the plurality of low frequency subband signals, respectively; and means for adjusting the energy of the plurality of high frequency subband signals using the set of target energies.
-
Publication No.: US11935508B2
Publication date: 2024-03-19
Application No.: US18194414
Filing date: 2023-03-31
Inventors: Per Ekstrand, Lars Villemoes, Per Hedelin
IPC classification: G10L21/038, G10H1/00, G10H1/12, G10L19/26, G10L21/0388
CPC classification: G10H1/0091, G10H1/125, G10L19/265, G10L21/038, G10H2210/311, G10L21/0388
Abstract: The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular, a system configured to generate a high frequency component of a signal from a low frequency component of the signal is described. The system may comprise an analysis filter bank (501) configured to provide a set of analysis subband signals from the low frequency component of the signal; wherein the set of analysis subband signals comprises at least two analysis subband signals; wherein the analysis filter bank (501) has a frequency resolution of Δf. The system further comprises a nonlinear processing unit (502) configured to determine a set of synthesis subband signals from the set of analysis subband signals using a transposition order P; wherein the set of synthesis subband signals comprises a portion of the set of analysis subband signals phase shifted by an amount derived from the transposition order P; and a synthesis filter bank (504) configured to generate the high frequency component of the signal from the set of synthesis subband signals; wherein the synthesis filter bank (504) has a frequency resolution of FΔf; with F being a resolution factor, with F≥1; wherein the transposition order P is different from the resolution factor F.