-
公开(公告)号:US20240276143A1
公开(公告)日:2024-08-15
申请号:US18398821
申请日:2023-12-28
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sunil Bharitkar
IPC: H04R3/00 , G06N3/0464
CPC classification number: H04R3/00 , G06N3/0464 , H04R2430/01
Abstract: One embodiment provides a method of signal normalization. The method comprises receiving an input content with a corresponding audio signal, and extracting loudness metadata from an audio signal corresponding to the input content. The method further comprises estimating, using a machine learning model, a peak-level amplitude based on the loudness metadata. The peak-level amplitude represents a maximum linear amplitude of the audio signal over an entire duration of the input content. The method further comprises determining a gain based at least on the peak-level amplitude, and applying the gain to the audio signal. The resulting gain-scaled audio signal is provided to one or more speakers coupled to or integrated in an electronic device for audio playback.
-
公开(公告)号:US20240244386A1
公开(公告)日:2024-07-18
申请号:US18154678
申请日:2023-01-13
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sunil Bharitkar , SeongNam Oh , Carlos Tejeda Ocampo
IPC: H04S7/00 , G06V20/40 , G11B27/036
CPC classification number: H04S7/30 , G06V20/46 , G11B27/036 , G06V2201/10 , H04S2400/11
Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.
-
公开(公告)号:US20250048050A1
公开(公告)日:2025-02-06
申请号:US18790528
申请日:2024-07-31
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sunil Bharitkar , Allan Devantier , Ashish Y. Rawat
Abstract: One embodiment provides a computer-implemented method that includes sending a stimulus signal to a loudspeaker. A measurement signal is received via a microphone. The stimulus signal is transformed into a stimulus time-frequency representation. The measured signal is transformed into a measured time-frequency representation. At least one frequency value is selected between the stimulus time-frequency representation and the measured time-frequency representation. Correlation analysis is performed using the selected at least one frequency value. Based on the correlation analysis, a statistical mode is determined to produce a start-time of the stimulus signal.
-
公开(公告)号:US20240267701A1
公开(公告)日:2024-08-08
申请号:US18399148
申请日:2023-12-28
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sunil Bharitkar , Ricardo Thaddeus Páez Amaro , Carlos Tejeda Ocampo , Luis Madrid Herrera
CPC classification number: H04S7/307 , H04S3/008 , H04S7/302 , H04S2400/01 , H04S2400/03 , H04S2400/05 , H04S2400/11 , H04S2400/13
Abstract: One embodiment provides a computer-implemented method that includes determining directional sounds from a content mix using a machine learning unmixing model. The directional sounds are panned in an upmixed signal. Signal-dependent upmixing gains for specific frequency bins are computed on a frame-basis using a machine learning model for the upmixed signal. Dedicated voice clarity gains are computed using a hearing impairment model for multiple hearing-impaired profiles for achieving dialog enhancement. The signal dependent upmixing gains and voice clarity gains are transmitted as metadata with a downmixed signal representing the content mix.
-
公开(公告)号:US11950089B2
公开(公告)日:2024-04-02
申请号:US17689744
申请日:2022-03-08
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sunil Bharitkar , William I. Saba
CPC classification number: H04S7/307 , G06N20/00 , H04R5/02 , H04R5/04 , H04S3/008 , H04S2400/01 , H04S2400/07 , H04S2400/13
Abstract: One embodiment provides a computer-implemented method that includes implementing a customizable compressor for at least one sidechain processing associated with a loudspeaker. Machine learning is applied to automatically tune one or more parameters of the at least one sidechain processing. One or more channels are extracted, including a low-frequency effects (LFE) channel, for nonlinear signal synthesis. A proportional power-sum-based mix-in of an LFE sidechain channel is applied into a non-LFE sidechain. The LFE sidechain channel is maintained within a specified threshold of being level, before and after nonlinear signal synthesis.
-
公开(公告)号:US12231865B2
公开(公告)日:2025-02-18
申请号:US18154678
申请日:2023-01-13
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sunil Bharitkar , Seongnam Oh , Carlos Tejeda Ocampo
IPC: G11B27/036 , G06V20/40 , H04S7/00
Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.
-
公开(公告)号:US20240126990A1
公开(公告)日:2024-04-18
申请号:US18480166
申请日:2023-10-03
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sunil Bharitkar
IPC: G06F40/284 , G06N3/0442
CPC classification number: G06F40/284 , G06N3/0442
Abstract: One embodiment provides a computer-implemented method that includes utilizing text information obtained from a title of a media content item and a trainable model for improving accuracy for classification of the media content item. The trainable model is utilized using a sequence of text to numeric-vector embeddings for classification of the media content item. At least one of a word embedding model parameter or a latent semantic analysis dimension is jointly optimized using the text information, and a classifier model for maximizing accuracy of the classification of the media content item.
-
公开(公告)号:US20230353938A1
公开(公告)日:2023-11-02
申请号:US18054059
申请日:2022-11-09
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sunil Bharitkar
CPC classification number: H04R3/04 , H04R29/002 , H04S7/301 , H04S7/305
Abstract: One embodiment provides a method comprising optimizing one or more stimuli parameters by applying machine learning to training data. The method further comprises determining, based on the one or more optimized stimuli parameters, stimuli for simultaneously exciting a plurality of speakers within a spatial area. The stimuli has a shortest possible duration that is accurate for simultaneous deconvolution of a plurality of impulse responses of the plurality of speakers. The method further comprises simultaneously exciting the plurality of speakers by providing the stimuli to the plurality of speakers at the same time for reproduction. The method further comprises simultaneously deconvolving the plurality of impulse responses based on the stimuli and one or more measurements of sound recorded during the reproduction and arriving at one or more microphones within the spatial area.
-
9.
公开(公告)号:US11792594B2
公开(公告)日:2023-10-17
申请号:US17584181
申请日:2022-01-25
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sunil Bharitkar
CPC classification number: H04S7/301 , H04R5/02 , H04R5/027 , H04R5/04 , H04S3/008 , H04S7/305 , H04S2400/01 , H04S2400/15
Abstract: One embodiment provides a method comprising determining stimuli for simultaneously exciting a plurality of speakers within a spatial area. The method further comprises simultaneously exciting the plurality of speakers by providing the stimuli to the plurality of speakers at the same time for reproduction. The method further comprises recording, during the reproduction, one or more measurements of sound arriving at one or more microphones within the spatial area. The method further comprises simultaneously deconvolving a plurality of impulse responses of the plurality of speakers based on the stimuli and the one or more measurements.
-
公开(公告)号:US20240196158A1
公开(公告)日:2024-06-13
申请号:US18476172
申请日:2023-09-27
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Allan Devantier , Sunil Bharitkar , Seongnam Oh , Carlos Tejeda Ocampo
Abstract: One embodiment provides a method of audio upmixing comprising performing video scene analysis by segmenting visual objects from video frames of a video, and performing audio analysis by extracting audio signals from an audio corresponding to the video. The method further comprises determining whether any of the audio signals correspond to any of the visual objects, and estimating a video-based trajectory of a visual object if the visual object is in motion and transitions from on-screen to off-screen, or vice versa, during the video. The method further comprises positioning an audio trajectory of an audio signal from at least one speaker associated with the display to at least one other speaker associated with providing surround sound. The audio trajectory is automatically matched with the video. The audio signal is delivered to the at least one speaker and the at least one other speaker for audio reproduction during the presentation.
-
-
-
-
-
-
-
-
-