Patent search ap:("Samsung Electronics Co., Ltd.") AND inv:"Carlos Tejeda Ocampo"

1.

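Invention publication
DEEP LEARNING BASED VOICE EXTRACTION AND PRIMARY-AMBIENCE DECOMPOSITION FOR STEREO TO SURROUND UPMIXING WITH DIALOG-ENHANCED CENTER CHANNEL [Under examination, published]

Publication (announcement) number: US20240267701A1

Publication (announcement) date: 2024-08-08

Application number: US18399148

Application date: 2023-12-28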

Applicant: Samsung Electronics Co., Ltd.

Inventor: Sunil Bharitkar, Ricardo Thaddeus Páez Amaro, Carlos Tejeda Ocampo, Luis Madrid Herrera

IPC: H04S7/00, H04S3/00

CPC classification number: H04S7/307, H04S3/008, H04S7/302, H04S2400/01, H04S2400/03, H04S2400/05, H04S2400/11, H04S2400/13

Abstract: One embodiment provides a computer-implemented method that includes determining directional sounds from a content mix using a machine learning unmixing model. The directional sounds are panned in an upmixed signal. Signal-dependent upmixing gains for specific frequency bins are computed on a frame-basis using a machine learning model for the upmixed signal. Dedicated voice clarity gains are computed using a hearing impairment model for multiple hearing-impaired profiles for achieving dialog enhancement. The signal dependent upmixing gains and voice clarity gains are transmitted as metadata with a downmixed signal representing the content mix.

2.

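Invention publication
SURROUND SOUND TO IMMERSIVE AUDIO UPMIXING BASED ON VIDEO SCENE ANALYSIS [Under examination, published]

Publication (announcement) number: US20240196158A1

Publication (announcement) date: 2024-06-13

Application number: US18476172

Application date: 2023-09-27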

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Allan Devantier, Sunil Bharitkar, Seongnam Oh, Carlos Tejeda Ocampo

IPC: H04S7/00, G06V20/40

CPC classification number: H04S7/305, G06V20/49

Abstract: One embodiment provides a method of audio upmixing comprising performing video scene analysis by segmenting visual objects from video frames of a video, and performing audio analysis by extracting audio signals from an audio corresponding to the video. The method further comprises determining whether any of the audio signals correspond to any of the visual objects, and estimating a video-based trajectory of a visual object if the visual object is in motion and transitions from on-screen to off-screen, or vice versa, during the video. The method further comprises positioning an audio trajectory of an audio signal from at least one speaker associated with the display to at least one other speaker associated with providing surround sound. The audio trajectory is automatically matched with the video. The audio signal is delivered to the at least one speaker and the at least one other speaker for audio reproduction during the presentation.

3.

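Invention grant
Video-derived audio processing [In force]

Publication (announcement) number: US12231865B2

Publication (announcement) date: 2025-02-18

Application number: US18154678

Application date: 2023-01-13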

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Sunil Bharitkar, Seongnam Oh, Carlos Tejeda Ocampo

IPC: G11B27/036, G06V20/40, H04S7/00

Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.

4.

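Invention application
CODEC BITRATE SELECTION IN AUDIO OBJECT CODING [In force]

Publication (announcement) number: US20250046321A1

Publication (announcement) date: 2025-02-06

Application number: US18412878

Application date: 2024-01-15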

Applicant: Samsung Electronics Co., Ltd.

Inventor: Toni M. Hirvonen, Carlos Tejeda Ocampo

IPC: G10L19/032, G10L19/002, G10L19/008, G10L19/20, G10L19/24

Abstract: One embodiment provides a computer-implemented method that includes analyzing, by a computing device, spatial object-based audio content associated with one or more objects. One or more relative perceptual importance metrics of the one or more objects are determined, by the computing device, based on modeling. Based on the one or more relative perceptual importance metrics, resources are allocated, by the computing device, for improving overall audio quality relative to bitrate.

5.

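Invention publication
VIDEO-DERIVED AUDIO PROCESSING [Under examination, published]

Publication (announcement) number: US20240244386A1

Publication (announcement) date: 2024-07-18

Application number: US18154678

Application date: 2023-01-13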

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Sunil Bharitkar, SeongNam Oh, Carlos Tejeda Ocampo

IPC: H04S7/00, G06V20/40, G11B27/036

CPC classification number: H04S7/30, G06V20/46, G11B27/036, G06V2201/10, H04S2400/11

Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.
