Selective adjustment of sound playback

    Publication No.: US12141498B2

    Publication date: 2024-11-12

    Application No.: US17755912

    Filing date: 2020-11-17

    Abstract: A device for managing sound playback includes one or more processors configured to receive an indication of a user-device interaction between a user and an audio interface device during a sound playback operation of a multi-speaker audio playback system. The one or more processors are also configured to, based on receiving the indication of the user-device interaction, initiate a selective adjustment of the sound playback operation to reduce a playback sound of the multi-speaker audio playback system based on a position of the user.
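The abstract does not say how the playback sound is reduced. As a minimal sketch, assume each speaker has a known 2D position and that any speaker within a fixed radius of the user is attenuated. The function name `adjust_gains`, the `radius` and `reduced_gain` parameters, and the room layout are all hypothetical and do not come from the patent.

```python
import math

def adjust_gains(speakers, user_pos, radius=1.5, reduced_gain=0.3):
    """Return a gain per speaker, attenuating speakers close to the user.

    speakers: dict mapping a speaker name to its (x, y) position in metres.
    user_pos: (x, y) position of the user who touched the audio interface.
    radius and reduced_gain are illustrative values, not taken from the patent.
    """
    gains = {}
    for name, pos in speakers.items():
        distance = math.dist(pos, user_pos)
        # Only speakers near the user are turned down; the rest keep full gain.
        gains[name] = reduced_gain if distance <= radius else 1.0
    return gains

# Hypothetical four-speaker room, user standing near the front-left speaker.
speakers = {
    "front_left": (0.0, 0.0),
    "front_right": (4.0, 0.0),
    "rear_left": (0.0, 4.0),
    "rear_right": (4.0, 4.0),
}
gains = adjust_gains(speakers, user_pos=(0.5, 0.5))
```

In this sketch the adjustment is selective because only the speaker nearest the user changes gain. A real system could just as well steer or fade the sound rather than apply a hard threshold.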

    Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding

    Publication No.: US12142285B2

    Publication date: 2024-11-12

    Application No.: US16907934

    Filing date: 2020-06-22

    Abstract: In general, techniques are described for quantizing spatial components based on bit allocations determined for psychoacoustic audio coding. A device comprising a memory and one or more processors may perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing the psychoacoustic audio decoding, a first bit allocation for the encoded foreground audio signal. The one or more processors may also determine, based on the first bit allocation, a second bit allocation, and dequantize, based on the second bit allocation, the quantized spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.
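The decoder-side flow in this abstract (derive a second bit allocation from the first, then dequantize the spatial component) can be illustrated as follows. The abstract does not give the derivation rule, so this sketch assumes a fixed total bit budget and a uniform quantizer over [-1, 1]. `derive_spatial_bits`, `total_bits`, and `min_bits` are hypothetical names.

```python
def derive_spatial_bits(audio_bits, total_bits=16, min_bits=2):
    """Derive the second bit allocation from the first.

    Assumed rule (not from the patent): the spatial component gets
    whatever remains of a fixed budget, with a lower bound.
    """
    return max(min_bits, total_bits - audio_bits)

def dequantize(indices, bits, lo=-1.0, hi=1.0):
    """Map uniform quantizer indices back to values in [lo, hi]."""
    levels = (1 << bits) - 1
    return [lo + (hi - lo) * index / levels for index in indices]

# The first allocation is observed during psychoacoustic decoding (assumed value).
audio_bits = 10
spatial_bits = derive_spatial_bits(audio_bits)
spatial_component = dequantize([0, 63], spatial_bits)
```

Because the decoder recomputes the second allocation from information it already has, this scheme would not need to transmit the spatial bit depth separately, which appears to be the point of tying the two allocations together.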

    XR RENDERING FOR 3D AUDIO CONTENT AND AUDIO CODEC

    Publication No.: US20230051841A1

    Publication date: 2023-02-16

    Application No.: US17444138

    Filing date: 2021-07-30

    Abstract: A device includes a memory configured to store instructions and also includes one or more processors configured to execute the instructions to obtain audio data corresponding to a sound source and metadata indicative of a direction of the sound source. The one or more processors are configured to execute the instructions to obtain direction data indicating a viewing direction associated with a user of a playback device. The one or more processors are configured to execute the instructions to determine a resolution setting based on a similarity between the viewing direction and the direction of the sound source. The one or more processors are also configured to execute the instructions to process the audio data based on the resolution setting to generate processed audio data.
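One plausible reading of "similarity between the viewing direction and the direction of the sound source" is the cosine of the angle between two direction vectors. The sketch below assumes that measure and a two-level resolution setting. The threshold value and the labels "high" and "low" are illustrative and do not come from the patent.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero 3D vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def resolution_setting(view_dir, source_dir, threshold=0.5):
    """Pick a higher resolution for sources the user is looking toward.

    threshold is a hypothetical tuning value.
    """
    similarity = cosine_similarity(view_dir, source_dir)
    return "high" if similarity >= threshold else "low"

in_view = resolution_setting((1.0, 0.0, 0.0), (1.0, 0.0, 0.0))
behind = resolution_setting((1.0, 0.0, 0.0), (-1.0, 0.0, 0.0))
```

The resulting setting could then select, for example, a bit depth or a rendering order for the audio data. That downstream step is left out here because the abstract does not specify it.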

    Correlating scene-based audio data for psychoacoustic audio coding

    Publication No.: US11538489B2

    Publication date: 2022-12-27

    Application No.: US16908032

    Filing date: 2020-06-22

    Abstract: In general, techniques are described by which to correlate scene-based audio data for psychoacoustic audio coding. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including a plurality of encoded correlated components of a soundfield represented by scene-based audio data. The one or more processors may perform psychoacoustic audio decoding with respect to one or more of the plurality of encoded correlated components to obtain a plurality of correlated components, and obtain, from the bitstream, an indication representative of how the one or more of the plurality of correlated components were reordered in the bitstream. The one or more processors may reorder, based on the indication, the plurality of correlated components to obtain a plurality of reordered components, and reconstruct, based on the plurality of reordered components, the scene-based audio data.
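The reordering step on the decoder side amounts to inverting a permutation. The sketch below assumes the indication is a list in which entry i gives the original index of the component received at position i. The patent abstract does not define the format of the indication, so this encoding and the name `restore_order` are hypothetical.

```python
def restore_order(received, order):
    """Undo the encoder's reordering of correlated components.

    received: components in the order they appear in the bitstream.
    order: order[i] is the original index of received[i] (assumed format).
    """
    if sorted(order) != list(range(len(received))):
        raise ValueError("order must be a permutation of the component indices")
    restored = [None] * len(received)
    for position, original_index in enumerate(order):
        restored[original_index] = received[position]
    return restored

# Components labelled by name for readability; real data would be sample arrays.
reordered = restore_order(["Y", "W", "X"], [1, 0, 2])
```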

    Soundfield adaptation for virtual reality audio

    Publication No.: US11317236B2

    Publication date: 2022-04-26

    Application No.: US16951662

    Filing date: 2020-11-18

    Abstract: An example device includes a memory configured to store at least one spatial component and at least one audio source within a plurality of audio streams. The device also includes one or more processors coupled to the memory. The one or more processors are configured to receive, from motion sensors, rotation information. The one or more processors are configured to rotate the at least one spatial component based on the rotation information to form at least one rotated spatial component. The one or more processors are also configured to reconstruct ambisonic signals from the at least one rotated spatial component and the at least one audio source, wherein the at least one spatial component describes spatial characteristics associated with the at least one audio source in a spherical harmonic domain representation.
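The rotate-then-reconstruct flow can be sketched for the simplest case. This example assumes a first-order spatial component stored as (W, X, Y, Z), a rotation about the vertical axis only, and reconstruction as the product of each coefficient with the source samples. The function names are hypothetical, and whether to rotate by the head yaw or by its negative depends on a convention the abstract does not state.

```python
import math

def rotate_yaw(component, yaw_radians):
    """Rotate a first-order spatial component about the vertical (z) axis.

    W is omnidirectional and Z lies on the rotation axis, so only X and Y change.
    """
    w, x, y, z = component
    c, s = math.cos(yaw_radians), math.sin(yaw_radians)
    return (w, c * x - s * y, s * x + c * y, z)

def reconstruct(component, source_samples):
    """Form one ambisonic channel per coefficient: coefficient times source signal."""
    return [[coeff * sample for sample in source_samples] for coeff in component]

# A source straight ahead (+X), rotated by 90 degrees, ends up on the +Y axis.
rotated = rotate_yaw((1.0, 1.0, 0.0, 0.0), math.pi / 2)
ambisonic = reconstruct(rotated, [0.5, -0.5])
```

Rotating the small spatial component instead of every ambisonic channel keeps the per-sample work limited to the final multiplication. Higher orders would need full spherical-harmonic rotation matrices, which this sketch does not cover.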

    SPATIALLY FORMATTED ENHANCED AUDIO DATA FOR BACKWARD COMPATIBLE AUDIO BITSTREAMS

    Publication No.: US20190392845A1

    Publication date: 2019-12-26

    Application No.: US16450514

    Filing date: 2019-06-24

    Abstract: In general, techniques are described by which to specify spatially formatted enhanced audio data for backward compatible audio bitstreams. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the backward compatible bitstream that conforms to a legacy transport format. The processor(s) may obtain, from the backward compatible bitstream, legacy audio data that conforms to a legacy audio format and a spatially formatted extended audio stream. The processor(s) may process the spatially formatted extended audio stream to obtain extended audio data that enhances the legacy audio data. The processor(s) may next obtain, based on the legacy audio data and the extended audio data, enhanced audio data that conforms to an enhanced audio format. The processor(s) may output the enhanced audio data to one or more speakers.

    Coding scaled spatial components
    Granted invention patent

    Publication No.: US11361776B2

    Publication date: 2022-06-14

    Application No.: US16907969

    Filing date: 2020-06-22

    Abstract: In general, techniques are described by which to code scaled spatial components. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing psychoacoustic audio decoding, a bit allocation for the encoded foreground audio signal. The one or more processors may dequantize the quantized spatial component to obtain a scaled spatial component, and descale, based on the bit allocation, the scaled spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.
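The descaling step can be illustrated with a toy rule. The abstract does not say how the scale relates to the bit allocation, so this sketch assumes the encoder multiplied the spatial component by `bit_allocation / max_bits` and the decoder divides by the same factor. Both the rule and the names `scale_factor` and `descale` are hypothetical.

```python
def scale_factor(bit_allocation, max_bits=16):
    """Assumed mapping from bit allocation to scale (not from the patent)."""
    if not 0 < bit_allocation <= max_bits:
        raise ValueError("bit_allocation must be in the range 1..max_bits")
    return bit_allocation / max_bits

def descale(scaled_component, bit_allocation):
    """Recover the spatial component from its scaled, dequantized form."""
    factor = scale_factor(bit_allocation)
    return [value / factor for value in scaled_component]

# The bit allocation is observed during psychoacoustic decoding (assumed value).
spatial_component = descale([0.25, -0.5], bit_allocation=8)
```

Since the decoder obtains the bit allocation while decoding the foreground audio signal, a scheme like this would let it recompute the scale locally rather than read it from the bitstream.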

    Spatially formatted enhanced audio data for backward compatible audio bitstreams

    Publication No.: US11062713B2

    Publication date: 2021-07-13

    Application No.: US16450514

    Filing date: 2019-06-24

    Abstract: In general, techniques are described by which to specify spatially formatted enhanced audio data for backward compatible audio bitstreams. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the backward compatible bitstream that conforms to a legacy transport format. The processor(s) may obtain, from the backward compatible bitstream, legacy audio data that conforms to a legacy audio format and a spatially formatted extended audio stream. The processor(s) may process the spatially formatted extended audio stream to obtain extended audio data that enhances the legacy audio data. The processor(s) may next obtain, based on the legacy audio data and the extended audio data, enhanced audio data that conforms to an enhanced audio format. The processor(s) may output the enhanced audio data to one or more speakers.
