Patent search ap:("SAMSUNG ELECTRONICS CO., LTD.") AND inv:"Sunil Bharitkar"

1.

Invention publication
SIGNAL NORMALIZATION USING LOUDNESS METADATA FOR AUDIO PROCESSING
Legal status: Under examination (published)

Publication (announcement) number: US20240276143A1

Publication (announcement) date: 2024-08-15

Application number: US18398821

Application date: 2023-12-28

Applicant: Samsung Electronics Co., Ltd.

Inventor: Sunil Bharitkar

IPC: H04R3/00, G06N3/0464

CPC classification number: H04R3/00, G06N3/0464, H04R2430/01

Abstract: One embodiment provides a method of signal normalization. The method comprises receiving an input content with a corresponding audio signal, and extracting loudness metadata from an audio signal corresponding to the input content. The method further comprises estimating, using a machine learning model, a peak-level amplitude based on the loudness metadata. The peak-level amplitude represents a maximum linear amplitude of the audio signal over an entire duration of the input content. The method further comprises determining a gain based at least on the peak-level amplitude, and applying the gain to the audio signal. The resulting gain-scaled audio signal is provided to one or more speakers coupled to or integrated in an electronic device for audio playback.
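The gain step in this abstract can be illustrated with a minimal sketch. It assumes plain peak normalization toward a target level, which the abstract does not specify (it only says the gain is based "at least" on the peak-level amplitude). The machine learning estimator is out of scope here, so `estimated_peak` is a hard-coded stand-in for its output. The names `peak_normalization_gain`, `target_peak`, and `max_gain` are hypothetical.

```python
def peak_normalization_gain(estimated_peak, target_peak=1.0, max_gain=10.0):
    """Linear gain that moves the estimated peak to target_peak.

    Assumption: simple peak normalization with a safety cap (max_gain).
    The patent abstract does not give the actual gain rule.
    """
    if estimated_peak <= 0.0:
        # Silent content or an invalid estimate: leave the signal unchanged.
        return 1.0
    return min(target_peak / estimated_peak, max_gain)


def apply_gain(samples, gain):
    """Scale every sample by the same linear gain."""
    return [gain * s for s in samples]


# Toy audio signal in linear amplitude.
signal = [0.1, -0.25, 0.2, -0.05]

# Stand-in for the model's peak-level estimate over the whole content.
estimated_peak = 0.25

gain = peak_normalization_gain(estimated_peak, target_peak=0.5)
scaled = apply_gain(signal, gain)
print(gain, scaled)
```

Because the peak is estimated from metadata for the entire duration, a single gain can be chosen before playback starts, without scanning the audio itself.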

2.

Invention publication
VIDEO-DERIVED AUDIO PROCESSING
Legal status: Under examination (published)

Publication (announcement) number: US20240244386A1

Publication (announcement) date: 2024-07-18

Application number: US18154678

Application date: 2023-01-13

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Sunil Bharitkar, SeongNam Oh, Carlos Tejeda Ocampo

IPC: H04S7/00, G06V20/40, G11B27/036

CPC classification number: H04S7/30, G06V20/46, G11B27/036, G06V2201/10, H04S2400/11

Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.

3.

Invention application
SPECTROGRAM BASED TIME ALIGNMENT FOR INDEPENDENT RECORDING AND PLAYBACK SYSTEMS
Legal status: In force

Publication (announcement) number: US20250048050A1

Publication (announcement) date: 2025-02-06

Application number: US18790528

Application date: 2024-07-31

Applicant: Samsung Electronics Co., Ltd.

Inventor: Sunil Bharitkar, Allan Devantier, Ashish Y. Rawat

IPC: H04S7/00, H04R29/00

Abstract: One embodiment provides a computer-implemented method that includes sending a stimulus signal to a loudspeaker. A measurement signal is received via a microphone. The stimulus signal is transformed into a stimulus time-frequency representation. The measured signal is transformed into a measured time-frequency representation. At least one frequency value is selected between the stimulus time-frequency representation and the measured time-frequency representation. Correlation analysis is performed using the selected at least one frequency value. Based on the correlation analysis, a statistical mode is determined to produce a start-time of the stimulus signal.
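The steps in this abstract can be sketched as follows. This is one plausible reading, not the patented procedure: the abstract does not say how frequencies are selected, which correlation is used, or over what the statistical mode is taken. The sketch assumes non-overlapping rectangular frames, an unnormalized correlation of per-bin magnitude tracks, and a mode taken over the per-bin lag estimates. All names (`FRAME`, `tone`, `bin_tracks`, `best_lag`, `estimate_start`) are hypothetical.

```python
import cmath
import math
from statistics import mode

FRAME = 16  # samples per frame; frames do not overlap in this sketch


def tone(k, n_frames=1):
    """Sine at DFT bin k lasting n_frames whole frames."""
    return [math.sin(2 * math.pi * k * n / FRAME) for n in range(FRAME * n_frames)]


def bin_tracks(signal, bins):
    """Time-frequency representation restricted to the selected bins.

    tracks[i][t] is the DFT magnitude of bins[i] in frame t.
    """
    tracks = [[] for _ in bins]
    for start in range(0, len(signal) - FRAME + 1, FRAME):
        frame = signal[start:start + FRAME]
        for i, k in enumerate(bins):
            acc = sum(x * cmath.exp(-2j * math.pi * k * n / FRAME)
                      for n, x in enumerate(frame))
            tracks[i].append(abs(acc))
    return tracks


def best_lag(ref, meas):
    """Lag in frames that maximizes the correlation of two magnitude tracks."""
    best, best_score = 0, float("-inf")
    for lag in range(len(meas) - len(ref) + 1):
        score = sum(r * meas[lag + t] for t, r in enumerate(ref))
        if score > best_score:
            best, best_score = lag, score
    return best


def estimate_start(stimulus, measured, bins):
    """Start sample of the stimulus inside the measurement."""
    ref_tracks = bin_tracks(stimulus, bins)
    meas_tracks = bin_tracks(measured, bins)
    lags = [best_lag(r, m) for r, m in zip(ref_tracks, meas_tracks)]
    # Statistical mode across the per-frequency lag estimates.
    return mode(lags) * FRAME


# Stimulus: stepped tones. Measurement: 3 frames of silence, an attenuated
# copy of the stimulus, then 2 frames of silence (noise-free toy case).
stimulus = tone(2) + tone(5) + tone(2) + tone(3)
measured = [0.0] * (3 * FRAME) + [0.5 * s for s in stimulus] + [0.0] * (2 * FRAME)

start_sample = estimate_start(stimulus, measured, bins=[2, 3, 5])
print(start_sample)
```

In this toy case every selected bin votes for a lag of 3 frames, so the estimated start is sample 48. Taking the mode rather than a mean means a minority of bins with a wrong lag, for example because of noise at that frequency, would not shift the estimate. The time resolution here is one frame; a real system would likely use overlapping frames for finer resolution.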

4.

Invention publication
DEEP LEARNING BASED VOICE EXTRACTION AND PRIMARY-AMBIENCE DECOMPOSITION FOR STEREO TO SURROUND UPMIXING WITH DIALOG-ENHANCED CENTER CHANNEL
Legal status: Under examination (published)

Publication (announcement) number: US20240267701A1

Publication (announcement) date: 2024-08-08

Application number: US18399148

Application date: 2023-12-28

Applicant: Samsung Electronics Co., Ltd.

Inventor: Sunil Bharitkar, Ricardo Thaddeus Páez Amaro, Carlos Tejeda Ocampo, Luis Madrid Herrera

IPC: H04S7/00, H04S3/00

CPC classification number: H04S7/307, H04S3/008, H04S7/302, H04S2400/01, H04S2400/03, H04S2400/05, H04S2400/11, H04S2400/13

Abstract: One embodiment provides a computer-implemented method that includes determining directional sounds from a content mix using a machine learning unmixing model. The directional sounds are panned in an upmixed signal. Signal-dependent upmixing gains for specific frequency bins are computed on a frame-basis using a machine learning model for the upmixed signal. Dedicated voice clarity gains are computed using a hearing impairment model for multiple hearing-impaired profiles for achieving dialog enhancement. The signal dependent upmixing gains and voice clarity gains are transmitted as metadata with a downmixed signal representing the content mix.

5.

Invention grant
Perceptual bass extension with loudness management and artificial intelligence (AI)
Legal status: In force

Publication (announcement) number: US11950089B2

Publication (announcement) date: 2024-04-02

Application number: US17689744

Application date: 2022-03-08

Applicant: Samsung Electronics Co., Ltd.

Inventor: Sunil Bharitkar, William I. Saba

IPC: H04S7/00, G06N20/00, H04R5/02, H04R5/04, H04S3/00

CPC classification number: H04S7/307, G06N20/00, H04R5/02, H04R5/04, H04S3/008, H04S2400/01, H04S2400/07, H04S2400/13

Abstract: One embodiment provides a computer-implemented method that includes implementing a customizable compressor for at least one sidechain processing associated with a loudspeaker. Machine learning is applied to automatically tune one or more parameters of the at least one sidechain processing. One or more channels are extracted, including a low-frequency effects (LFE) channel, for nonlinear signal synthesis. A proportional power-sum-based mix-in of an LFE sidechain channel is applied into a non-LFE sidechain. The LFE sidechain channel is maintained within a specified threshold of being level, before and after nonlinear signal synthesis.

6.

Invention grant
Video-derived audio processing
Legal status: In force

Publication (announcement) number: US12231865B2

Publication (announcement) date: 2025-02-18

Application number: US18154678

Application date: 2023-01-13

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Sunil Bharitkar, Seongnam Oh, Carlos Tejeda Ocampo

IPC: G11B27/036, G06V20/40, H04S7/00

Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.

7.

Invention publication
DEEP LEARNING FOR MULTIMEDIA CLASSIFICATION
Legal status: Under examination (published)

Publication (announcement) number: US20240126990A1

Publication (announcement) date: 2024-04-18

Application number: US18480166

Application date: 2023-10-03

Applicant: Samsung Electronics Co., Ltd.

Inventor: Sunil Bharitkar

IPC: G06F40/284, G06N3/0442

CPC classification number: G06F40/284, G06N3/0442

Abstract: One embodiment provides a computer-implemented method that includes utilizing text information obtained from a title of a media content item and a trainable model for improving accuracy for classification of the media content item. The trainable model is utilized using a sequence of text to numeric-vector embeddings for classification of the media content item. At least one of a word embedding model parameter or a latent semantic analysis dimension is jointly optimized using the text information, and a classifier model for maximizing accuracy of the classification of the media content item.

8.

Invention publication
BAYESIAN OPTIMIZATION FOR SIMULTANEOUS DECONVOLUTION OF ROOM IMPULSE RESPONSES
Legal status: Under examination (published)

Publication (announcement) number: US20230353938A1

Publication (announcement) date: 2023-11-02

Application number: US18054059

Application date: 2022-11-09

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Sunil Bharitkar

IPC: H04R3/04, H04R29/00, H04S7/00

CPC classification number: H04R3/04, H04R29/002, H04S7/301, H04S7/305

Abstract: One embodiment provides a method comprising optimizing one or more stimuli parameters by applying machine learning to training data. The method further comprises determining, based on the one or more optimized stimuli parameters, stimuli for simultaneously exciting a plurality of speakers within a spatial area. The stimuli has a shortest possible duration that is accurate for simultaneous deconvolution of a plurality of impulse responses of the plurality of speakers. The method further comprises simultaneously exciting the plurality of speakers by providing the stimuli to the plurality of speakers at the same time for reproduction. The method further comprises simultaneously deconvolving the plurality of impulse responses based on the stimuli and one or more measurements of sound recorded during the reproduction and arriving at one or more microphones within the spatial area.

9.

Invention grant
Simultaneous deconvolution of loudspeaker-room impulse responses with linearly-optimal techniques
Legal status: In force

Publication (announcement) number: US11792594B2

Publication (announcement) date: 2023-10-17

Application number: US17584181

Application date: 2022-01-25

Applicant: Samsung Electronics Co., Ltd.

Inventor: Sunil Bharitkar

IPC: H04S7/00, H04R5/02, H04R5/04, H04S3/00, H04R5/027

CPC classification number: H04S7/301, H04R5/02, H04R5/027, H04R5/04, H04S3/008, H04S7/305, H04S2400/01, H04S2400/15

Abstract: One embodiment provides a method comprising determining stimuli for simultaneously exciting a plurality of speakers within a spatial area. The method further comprises simultaneously exciting the plurality of speakers by providing the stimuli to the plurality of speakers at the same time for reproduction. The method further comprises recording, during the reproduction, one or more measurements of sound arriving at one or more microphones within the spatial area. The method further comprises simultaneously deconvolving a plurality of impulse responses of the plurality of speakers based on the stimuli and the one or more measurements.

10.

Invention publication
SURROUND SOUND TO IMMERSIVE AUDIO UPMIXING BASED ON VIDEO SCENE ANALYSIS
Legal status: Under examination (published)

Publication (announcement) number: US20240196158A1

Publication (announcement) date: 2024-06-13

Application number: US18476172

Application date: 2023-09-27

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Allan Devantier, Sunil Bharitkar, Seongnam Oh, Carlos Tejeda Ocampo

IPC: H04S7/00, G06V20/40

CPC classification number: H04S7/305, G06V20/49

Abstract: One embodiment provides a method of audio upmixing comprising performing video scene analysis by segmenting visual objects from video frames of a video, and performing audio analysis by extracting audio signals from an audio corresponding to the video. The method further comprises determining whether any of the audio signals correspond to any of the visual objects, and estimating a video-based trajectory of a visual object if the visual object is in motion and transitions from on-screen to off-screen, or vice versa, during the video. The method further comprises positioning an audio trajectory of an audio signal from at least one speaker associated with the display to at least one other speaker associated with providing surround sound. The audio trajectory is automatically matched with the video. The audio signal is delivered to the at least one speaker and the at least one other speaker for audio reproduction during the presentation.
