-
11.
公开(公告)号:US20230260525A1
公开(公告)日:2023-08-17
申请号:US18138684
申请日:2023-04-24
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Shankar THAGADUR SHIVAPPA , S M Akramus SALEHIN , Shuhua ZHANG , Erik VISSER
IPC: G10L19/038 , H04R5/00 , G10L19/002
CPC classification number: G10L19/038 , H04R5/00 , G10L19/002 , H04S2420/11 , H04R2430/21 , G10L19/008
Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are configured to apply one adaptive network, based on a constraint that includes preservation of a spatial direction of one or more audio sources in the soundfield at the different time segments, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint. The one or more processors are also configured to apply an additional adaptive network.
-
公开(公告)号:US20210092543A1
公开(公告)日:2021-03-25
申请号:US16743275
申请日:2020-01-15
Applicant: QUALCOMM Incorporated
Inventor: S M Akramus SALEHIN , Shankar THAGADUR SHIVAPPA , Sanghyun CHI , Nils Gunther PETERS
Abstract: An apparatus includes one or more processors configured to receive orientation data and to select, based on the orientation data, a particular filter from among multiple filters. The one or more processors are configured to perform signal processing operations associated with three-dimensional (3D) sound data based on the particular filter.
-
公开(公告)号:US20250024219A1
公开(公告)日:2025-01-16
申请号:US18762208
申请日:2024-07-02
Applicant: QUALCOMM Incorporated
Inventor: Isaac Garcia MUNOZ , Shankar THAGADUR SHIVAPPA
IPC: H04S7/00
Abstract: A device includes a memory configured to store audio data associated with an immersive audio environment. The device also includes one or more processors configured to obtain a listener pose in the immersive audio environment associated with a first time and determine whether the listener pose is associated with a pre-rendered asset. The one or more processors are configured to obtain a rendered asset by selecting, based on the determination, between obtaining the pre-rendered asset and performing a rendering operation to generate the rendered asset. The one or more processors are also configured to generate an output audio signal based on the rendered asset.
-
公开(公告)号:US20220246133A1
公开(公告)日:2022-08-04
申请号:US17166250
申请日:2021-02-03
Applicant: QUALCOMM Incorporated
Inventor: Ferdinando OLIVIERI , Reid WESTBURG , Shankar THAGADUR SHIVAPPA
IPC: G10L13/08 , G10L13/027 , H04N7/15 , H04N7/14
Abstract: A device for communication includes one or more processors configured to receive, during an online meeting, a speech audio stream representing speech of a first user. The one or more processors are also configured to receive a text stream representing the speech of the first user. The one or more processors are further configured to selectively generate an output based on the text stream in response to an interruption in the speech audio stream.
-
公开(公告)号:US20220217425A1
公开(公告)日:2022-07-07
申请号:US17142022
申请日:2021-01-05
Applicant: QUALCOMM Incorporated
Inventor: Shankar THAGADUR SHIVAPPA , Reid WESTBURG , Ferdinando OLIVIERI
IPC: H04N21/233 , G06F3/16 , H04N21/218 , H04N21/2387 , H04N21/24
Abstract: A device includes one or more processors configured to, during a call, receive a sequence of audio frames from a first device. The one or more processors are configured to, in response to determining that no audio frame of the sequence has been received for a threshold duration since a last received audio frame of the sequence, initiate transmission of a frame loss indication to the first device. The one or more processors are also configured to, responsive to the frame loss indication, receive a set of audio frames of the sequence and an indication of a second playback speed from the first device. The one or more processors are configured to initiate playback, via a speaker, of the set of audio frames based on the second playback speed. The second playback speed is greater than a first playback speed of a first set of audio frames of the sequence.
-
公开(公告)号:US20220201395A1
公开(公告)日:2022-06-23
申请号:US17127421
申请日:2020-12-18
Applicant: QUALCOMM Incorporated
Inventor: S M Akramus SALEHIN , Lae-Hoon KIM , Vasudev NAYAK , Shankar THAGADUR SHIVAPPA , Isaac Garcia MUNOZ , Sanghyun CHI , Erik VISSER
Abstract: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.
-
-
-
-
-