INTERPOLATING AUDIO STREAMS
    Invention application

    Publication No.: US20200029164A1

    Publication date: 2020-01-23

    Application No.: US16513436

    Filing date: 2019-07-16

    Abstract: In general, various aspects of the techniques are described for interpolating audio streams. A device comprising a memory and a processor may be configured to perform the techniques. The memory may store one or more audio streams. The processor may obtain one or more microphone locations, each identifying the location of the respective one or more microphones that captured the corresponding audio stream. The processor may also obtain a listener location identifying a location of a listener, and perform interpolation, based on the one or more microphone locations and the listener location, with respect to the audio streams to obtain an interpolated audio stream. The processor may next obtain, based on the interpolated audio stream, one or more speaker feeds, and output the one or more speaker feeds.
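The abstract above does not say how the interpolation weights are derived. As a minimal sketch only, the following assumes inverse-distance weighting between each microphone location and the listener location, and treats each audio stream as a plain list of samples of equal length. The function name `interpolate_streams` and the `eps` parameter are hypothetical and do not come from the application.

```python
import math


def interpolate_streams(streams, mic_locations, listener_location, eps=1e-9):
    """Blend equal-length audio streams using inverse-distance weights.

    Assumption (not stated in the abstract): a stream captured closer to
    the listener contributes more to the interpolated stream.
    """
    if not streams or len(streams) != len(mic_locations):
        raise ValueError("need one microphone location per audio stream")
    length = len(streams[0])
    if any(len(s) != length for s in streams):
        raise ValueError("all audio streams must have the same length")

    # Distance from each microphone location to the listener location.
    distances = [math.dist(m, listener_location) for m in mic_locations]
    # eps avoids division by zero when the listener sits on a microphone.
    raw = [1.0 / (d + eps) for d in distances]
    total = sum(raw)
    weights = [w / total for w in raw]

    # Weighted sum of the streams, sample by sample.
    return [
        sum(w * s[i] for w, s in zip(weights, streams))
        for i in range(length)
    ]
```

With two microphones at equal distance from the listener, both weights are 0.5, so the interpolated stream is the sample-wise average of the two captured streams. Obtaining speaker feeds from the interpolated stream is a separate rendering step that this sketch does not cover.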

    EMBEDDING ENHANCED AUDIO TRANSPORTS IN BACKWARD COMPATIBLE AUDIO BITSTREAMS

    Publication No.: US20200013414A1

    Publication date: 2020-01-09

    Application No.: US16450698

    Filing date: 2019-06-24

    Abstract: In general, techniques are described by which to embed enhanced audio transports in backward compatible bitstreams. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the backward compatible bitstream, which conforms to a legacy transport format. The processor(s) may obtain, from the backward compatible bitstream, legacy audio data that conforms to a legacy audio format, and obtain, from the backward compatible bitstream, extended audio data that enhances the legacy audio data. The processor(s) may also obtain, based on the legacy audio data and the extended audio data, enhanced audio data that conforms to an enhanced audio format, and output the enhanced audio data to one or more speakers.

    DEMIXING DATA FOR BACKWARD COMPATIBLE RENDERING OF HIGHER ORDER AMBISONIC AUDIO

    Publication No.: US20190392846A1

    Publication date: 2019-12-26

    Application No.: US16450625

    Filing date: 2019-06-24

    Abstract: In general, techniques are described by which to obtain demixing data for backward compatible rendering of higher order ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the higher order ambisonic (HOA) audio data. The processor(s) may obtain, from a bitstream, legacy audio data that conforms to a legacy audio format, and obtain, from the bitstream, demixing data. The processor(s) may process, based on the demixing data, the legacy audio data to obtain a first portion of the HOA audio data. The processor(s) may next obtain, from the bitstream, a second portion of the HOA audio data. The processor(s) may render the first portion and the second portion to obtain a speaker feed. Further, the processor(s) may output the speaker feed to a speaker to reproduce a soundfield represented by the HOA audio data.
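The abstract does not describe the form of the demixing data. One plausible reading, shown here purely as an illustrative sketch, is a matrix applied to each frame of legacy channels to recover a first portion of the HOA coefficients. The function name `apply_demixing` and the 2x2 matrix values below are hypothetical.

```python
def apply_demixing(demix_matrix, legacy_frames):
    """Apply a demixing matrix to each frame of legacy audio samples.

    Assumption (not stated in the abstract): the demixing data is a
    matrix with one row per recovered HOA coefficient and one column
    per legacy channel.
    """
    recovered = []
    for frame in legacy_frames:
        if any(len(row) != len(frame) for row in demix_matrix):
            raise ValueError("matrix columns must match legacy channel count")
        # Each output coefficient is a weighted sum of the legacy channels.
        recovered.append(
            [sum(c * x for c, x in zip(row, frame)) for row in demix_matrix]
        )
    return recovered


# Hypothetical example: stereo (left, right) frames demixed into a
# sum-like and a difference-like coefficient.
EXAMPLE_MATRIX = [[0.5, 0.5], [0.5, -0.5]]
first_portion = apply_demixing(EXAMPLE_MATRIX, [(1.0, 0.0), (0.2, 0.2)])
```

In this reading, the second portion of the HOA audio data would be read from the bitstream directly and combined with `first_portion` before rendering.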

    Multimedia device for processing spatialized audio based on movement

    Publication No.: US10514887B2

    Publication date: 2019-12-24

    Application No.: US16148837

    Filing date: 2018-10-01

    Abstract: In a particular aspect, a multimedia device is configured to compensate for navigational movement within a visual environment. The multimedia device includes a processor configured to generate a first version of a spatialized audio signal of a sound field in a first audio frame based on a first position within a visual environment. The processor is configured to generate a second version of the spatialized audio signal of the sound field in a second audio frame by compensating for a speed of movement of the multimedia device. That movement indicates a change from the first position to a second position within the visual environment, and its speed differs from the speed of movement in changing from a first location of the sound field to a second location of the sound field.

    Rendering for computer-mediated reality systems

    Publication No.: US10469968B2

    Publication date: 2019-11-05

    Application No.: US15782252

    Filing date: 2017-10-12

    Abstract: In general, techniques are described for adapting higher order ambisonic audio data to include three degrees of freedom plus effects. An example device configured to perform the techniques includes a memory, and a processor coupled to the memory. The memory may be configured to store higher order ambisonic audio data representative of a soundfield. The processor may be configured to obtain a translational distance representative of a translational head movement of a user interfacing with the device. The processor may further be configured to adapt, based on the translational distance, the higher order ambisonic audio data to provide three degrees of freedom plus effects that adapt the soundfield to account for the translational head movement, and generate speaker feeds based on the adapted higher order ambisonic audio data.

    SPATIAL RELATION CODING OF HIGHER ORDER AMBISONIC COEFFICIENTS

    Publication No.: US20190110148A1

    Publication date: 2019-04-11

    Application No.: US16152153

    Filing date: 2018-10-04

    Abstract: In general, techniques are described by which to perform spatial relation coding of higher order ambisonic coefficients using expanded parameters. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store at least a portion of a bitstream, the bitstream including a first indication representative of an HOA coefficient associated with a spherical basis function having an order of zero, and a second indication representative of one or more parameters. The processor may be configured to perform parameter expansion with respect to the one or more parameters to obtain one or more expanded parameters, and synthesize, based on the one or more expanded parameters and the HOA coefficient associated with the spherical basis function having the order of zero, one or more HOA coefficients associated with one or more spherical basis functions having an order greater than zero.

    SPATIAL RELATION CODING USING VIRTUAL HIGHER ORDER AMBISONIC COEFFICIENTS

    Publication No.: US20190110147A1

    Publication date: 2019-04-11

    Application No.: US16152130

    Filing date: 2018-10-04

    Abstract: In general, techniques are described by which to perform spatial relation coding using virtual higher order ambisonic coefficients. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store audio data, the audio data representative of a zero-ordered higher order ambisonic (HOA) coefficient and one or more greater-than-zero-ordered HOA coefficients. The processor may be configured to obtain, based on the one or more greater-than-zero-ordered HOA coefficients, a virtual zero-ordered HOA coefficient. The processor may also be configured to obtain, based on the virtual zero-ordered HOA coefficient, one or more parameters from which to synthesize the one or more greater-than-zero-ordered HOA coefficients. The processor may further be configured to generate a bitstream that includes a first indication representative of the zero-ordered HOA coefficient, and a second indication representative of the one or more parameters.

    Signaling audio rendering information in a bitstream

    Publication No.: US10178489B2

    Publication date: 2019-01-08

    Application No.: US14174769

    Filing date: 2014-02-06

    Abstract: In general, techniques are described for specifying audio rendering information in a bitstream. A device configured to generate the bitstream may perform various aspects of the techniques. The bitstream generation device may comprise one or more processors configured to specify audio rendering information that includes a signal value identifying an audio renderer used when generating the multi-channel audio content. A device configured to render multi-channel audio content from a bitstream may also perform various aspects of the techniques. The rendering device may comprise one or more processors configured to determine audio rendering information that includes a signal value identifying an audio renderer used when generating the multi-channel audio content, and render a plurality of speaker feeds based on the audio rendering information.
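The abstract describes a signal value that identifies the audio renderer, without giving a bitstream layout. The sketch below assumes a toy layout invented for illustration: a one-byte renderer identifier, a four-byte big-endian payload length, then the payload. The `RENDERERS` table, its names, and both function names are hypothetical and are not taken from the patent.

```python
import struct

# Hypothetical table mapping signal values to renderer names.
RENDERERS = {0: "stereo", 1: "surround_5_1"}

# Toy header: 1-byte renderer ID, 4-byte big-endian payload length.
HEADER_FORMAT = ">BI"
HEADER_SIZE = struct.calcsize(HEADER_FORMAT)


def write_bitstream(renderer_id: int, payload: bytes) -> bytes:
    """Encoder side: prepend the signal value identifying the renderer."""
    if renderer_id not in RENDERERS:
        raise ValueError(f"unknown renderer id: {renderer_id}")
    return struct.pack(HEADER_FORMAT, renderer_id, len(payload)) + payload


def read_bitstream(bitstream: bytes):
    """Decoder side: recover the renderer name and the audio payload."""
    renderer_id, length = struct.unpack_from(HEADER_FORMAT, bitstream, 0)
    if renderer_id not in RENDERERS:
        raise ValueError(f"unknown renderer id: {renderer_id}")
    payload = bitstream[HEADER_SIZE:HEADER_SIZE + length]
    return RENDERERS[renderer_id], payload
```

A rendering device following this sketch would call `read_bitstream`, select the named renderer, and use it to produce the speaker feeds from the payload.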
