-
公开(公告)号:US20230386485A1
公开(公告)日:2023-11-30
申请号:US18448020
申请日:2023-08-10
发明人: Sascha DISCH , Martin DIETZ , Markus MULTRUS , Guillaume FUCHS , Emmanuel RAVELLI , Matthias NEUSINGER , Markus SCHNELL , Benjamin SCHUBERT , Bernhard GRILL
IPC分类号: G10L19/02 , G10L19/18 , G10L19/24 , G10L19/022
CPC分类号: G10L19/0208 , G10L19/18 , G10L19/028 , G10L19/022 , G10L19/24
摘要: An audio encoder for encoding an audio signal includes: a first encoding processor for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor includes: a time frequency converter for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; a spectral encoder for encoding the frequency domain representation; a second encoding processor for encoding a second different audio signal portion in the time domain; a cross-processor for calculating, from the encoded spectral representation of the first audio signal portion, initialization data of the second encoding processor, so that the second encoding processing is initialized to encode the second audio signal portion immediately following the first audio signal portion in time in the audio signal; a controller configured for analyzing the audio signal and for determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion of the audio signal is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal including a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second audio signal portion.
-
公开(公告)号:US11830512B2
公开(公告)日:2023-11-28
申请号:US17816447
申请日:2022-08-01
申请人: BlackBerry Limited
发明人: Joe Mammone , Michael Mead Truman
IPC分类号: G10L19/012 , G10L19/26 , H04H60/11 , G10L19/008 , G10L19/02 , G10L19/005
CPC分类号: G10L19/26 , H04H60/11 , G10L19/005 , G10L19/008 , G10L19/012 , G10L19/0208
摘要: In some examples, an audio sending device receives a stream of application audio data, encodes the stream of application audio data, and in response to detecting an end of the stream of application audio data, provides pre-encoded filler audio data from a buffer in the audio sending device as an encoded stream of filler audio data. The audio sending device transmits the encoded stream of application audio data and the encoded stream of filler audio data in an encoded output data stream over a transport to an audio receiving device.
-
公开(公告)号:US11830506B2
公开(公告)日:2023-11-28
申请号:US16386863
申请日:2019-04-17
发明人: Anisse Taleb , Gustaf Ullberg
IPC分类号: G10L19/025 , G10L19/02 , G10L25/21
CPC分类号: G10L19/025 , G10L19/0212
摘要: A transient detector analyzes a given frame n of the input audio signal to determine, based on audio signal characteristics of the given frame n, a transient hangover indicator for a following frame n+1, and signals the determined transient hangover indicator to an associated audio encoder to enable proper encoding of the following frame n+1.
-
公开(公告)号:US11825020B2
公开(公告)日:2023-11-21
申请号:US17432056
申请日:2020-12-22
CPC分类号: H04M3/5116 , H04M3/42221 , H04M2203/553
摘要: Systems and methods for processing emergency communications are provided. A system may receive an emergency communication initiated by an emergency communicator. The system may detect a data field action in response to an emergency receiver entering a data input based on the emergency communication. The system may capture a timestamp of when the data field action occurred. The system may generate a communication snippet based on the action timestamp and a snippet length. The communication snippet may be configured to provide context from the emergency communication to the data input. The system may transmit the communication snippet and the data input to an emergency responder.
-
公开(公告)号:US20230368804A1
公开(公告)日:2023-11-16
申请号:US18144413
申请日:2023-05-08
申请人: Google LLC
CPC分类号: G10L19/0204 , G10L25/30
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for coding speech using neural networks. One of the methods includes obtaining a bitstream of parametric coder parameters characterizing spoken speech; generating, from the parametric coder parameters, a conditioning sequence; generating a reconstruction of the spoken speech that includes a respective speech sample at each of a plurality of decoder time steps, comprising, at each decoder time step: processing a current reconstruction sequence using an auto-regressive generative neural network, wherein the auto-regressive generative neural network is configured to process the current reconstruction to compute a score distribution over possible speech sample values, and wherein the processing comprises conditioning the auto-regressive generative neural network on at least a portion of the conditioning sequence; and sampling a speech sample from the possible speech sample values.
-
公开(公告)号:US20230360655A1
公开(公告)日:2023-11-09
申请号:US18246030
申请日:2021-08-13
申请人: Apple Inc.
发明人: Moo Young KIM , Sina ZAMANI , Dipanjan SEN
IPC分类号: G10L19/02 , G10L19/008 , G10L19/24
CPC分类号: G10L19/0204 , G10L19/008 , G10L19/24
摘要: Encoding and decoding of higher order ambisonics, HOA, data for purposes of bitrate reduction. One aspect uses principal components analysis to produce spatial descriptors. Other aspects include various spatial descriptor quantization techniques.
-
公开(公告)号:US11810582B2
公开(公告)日:2023-11-07
申请号:US17560295
申请日:2021-12-23
发明人: Heiko Purnhagen , Pontus Carlsson , Lars Villemoes
CPC分类号: G10L19/008 , G10L19/0212 , G10L19/06 , G10L19/167 , G10L19/18 , H04S3/008 , G10L25/12 , H04S2400/01
摘要: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of:
(i) computing a second frequency-domain representation of a first input channel; and
(ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.-
公开(公告)号:US11790927B2
公开(公告)日:2023-10-17
申请号:US17571237
申请日:2022-01-07
发明人: Florin Ghido , Andreas Niedermeier
IPC分类号: G10L19/06 , G10L19/032 , G10L19/02 , G10L21/038 , G10L19/038 , G10L19/00 , G10L19/028
CPC分类号: G10L19/06 , G10L19/00 , G10L19/02 , G10L19/0204 , G10L19/028 , G10L19/032 , G10L19/038 , G10L21/038
摘要: An improved concept for coding sample values of a spectral envelope is obtained by combining spectrotemporal prediction on the one hand and context-based entropy coding the residuals, on the other hand, while particularly determining the context for a current sample value dependent on a measure of a deviation between a pair of already coded/decoded sample values of the spectral envelope in a spectrotemporal neighborhood of the current sample value. The combination of the spectrotemporal prediction on the one hand and the context-based entropy coding of the prediction residuals with selecting the context depending on the deviation measure on the other hand harmonizes with the nature of spectral envelopes.
-
公开(公告)号:US20230326471A1
公开(公告)日:2023-10-12
申请号:US17658094
申请日:2022-04-06
IPC分类号: G10L19/02 , G10L19/08 , G10L19/005
CPC分类号: G10L19/0212 , G10L19/08 , G10L19/005
摘要: A computer-implemented method, a computer program product, and a computer system for system monitoring. A computer system convert a time domain representation of first sound into a frequency domain representation, maps monitoring datasets of a monitored system to frequencies in the frequency domain representation, modifies the amplitudes of the respective frequencies according mapping rules defined by a user, convert the frequency domain representation into the time domain representation and generate a sound wave in a digital format which has modified amplitudes, converts the sound wave in the digital format to a sound wave in an analog format, and feeds the sound wave in the analog format to a sound system to play second sound. Performance of the monitored system is monitored by the user listening to the second sound and comparing the first and second sound.
-
公开(公告)号:US20230326470A1
公开(公告)日:2023-10-12
申请号:US18301194
申请日:2023-04-14
发明人: Markus Multrus , Bemhard Grill , Guillaume Fuchs , Stefan Geyersberger , Nikolaus Rettelbach , Virgilio Bacigalupo
IPC分类号: G10L19/02 , G10L19/022 , H03M7/30
CPC分类号: G10L19/02 , G10L19/022 , H03M7/30
摘要: An audio encoder for encoding segments of coefficients, the segments of coefficients representing different time or frequency resolutions of a sampled audio signal, the audio encoder including a processor for deriving a coding context for a currently encoded coefficient of a current segment based on a previously encoded coefficient of a previous segment, the previously encoded coefficient representing a different time or frequency resolution than the currently encoded coefficient. The audio encoder further includes an entropy encoder for entropy encoding the current coefficient based on the coding context to obtain an encoded audio stream.
-
-
-
-
-
-
-
-
-