-
Publication number: US20240267701A1
Publication date: 2024-08-08
Application number: US18399148
Application date: 2023-12-28
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sunil Bharitkar, Ricardo Thaddeus Páez Amaro, Carlos Tejeda Ocampo, Luis Madrid Herrera
CPC classification number: H04S7/307, H04S3/008, H04S7/302, H04S2400/01, H04S2400/03, H04S2400/05, H04S2400/11, H04S2400/13
Abstract: One embodiment provides a computer-implemented method that includes determining directional sounds from a content mix using a machine learning unmixing model. The directional sounds are panned in an upmixed signal. Signal-dependent upmixing gains for specific frequency bins are computed on a frame basis using a machine learning model for the upmixed signal. Dedicated voice clarity gains are computed using a hearing impairment model for multiple hearing-impaired profiles for achieving dialog enhancement. The signal-dependent upmixing gains and voice clarity gains are transmitted as metadata with a downmixed signal representing the content mix.
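The abstract above describes per-frame, per-frequency-bin gains carried as metadata next to a downmixed signal. The following is a minimal sketch of how a receiver might apply such gains to one frame, not the method claimed in the application. The function name `apply_upmix_gains` and the metadata layout (one list of real gains per output channel) are assumptions made for illustration, and the machine learning models that would produce the gains are out of scope here.

```python
def apply_upmix_gains(downmix_bins, gains_per_channel):
    """Scale one frame of downmix frequency bins by per-channel, per-bin gains.

    downmix_bins: complex spectrum of one frame of the downmixed signal.
    gains_per_channel: one list of real gains per output channel, as might be
        read from the transmitted metadata (hypothetical layout).
    """
    upmixed = []
    for gains in gains_per_channel:
        if len(gains) != len(downmix_bins):
            raise ValueError("each gain vector needs one gain per frequency bin")
        # Element-wise product: one gain applied to one frequency bin.
        upmixed.append([g * x for g, x in zip(gains, downmix_bins)])
    return upmixed


# One frame with three frequency bins, upmixed to two output channels.
frame = [1 + 0j, 2 + 0j, 0.5j]
metadata_gains = [
    [0.5, 0.0, 1.0],  # assumed gains for output channel 0
    [0.5, 1.0, 0.0],  # assumed gains for output channel 1
]
channels = apply_upmix_gains(frame, metadata_gains)
```

Voice clarity gains selected for a listener profile could presumably be applied in the same element-wise way, but the abstract does not say where in the chain they act.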
-
Publication number: US20240196158A1
Publication date: 2024-06-13
Application number: US18476172
Application date: 2023-09-27
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Allan Devantier, Sunil Bharitkar, Seongnam Oh, Carlos Tejeda Ocampo
Abstract: One embodiment provides a method of audio upmixing comprising performing video scene analysis by segmenting visual objects from video frames of a video, and performing audio analysis by extracting audio signals from an audio corresponding to the video. The method further comprises determining whether any of the audio signals correspond to any of the visual objects, and estimating a video-based trajectory of a visual object if the visual object is in motion and transitions from on-screen to off-screen, or vice versa, during the video. The method further comprises positioning an audio trajectory of an audio signal from at least one speaker associated with the display to at least one other speaker associated with providing surround sound. The audio trajectory is automatically matched with the video. The audio signal is delivered to the at least one speaker and the at least one other speaker for audio reproduction during the presentation.
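The abstract above describes moving an audio signal from a display speaker toward a surround speaker as the matching visual object leaves the screen. As an illustration only, the sketch below uses conventional constant-power panning, which is a stand-in technique and not necessarily what the application uses. The function names and the convention that position 0.0 means "fully at the display speaker" and 1.0 means "fully at the surround speaker" are assumptions.

```python
import math


def constant_power_gains(position):
    """Return (display_gain, surround_gain) for a position in [0, 1]."""
    p = min(max(position, 0.0), 1.0)  # clamp out-of-range trajectory points
    theta = p * math.pi / 2
    # cos^2 + sin^2 = 1, so the summed power stays constant along the path.
    return math.cos(theta), math.sin(theta)


def pan_along_trajectory(samples, positions):
    """Split a mono signal into two speaker feeds, one position per sample."""
    display_feed, surround_feed = [], []
    for sample, position in zip(samples, positions):
        g_display, g_surround = constant_power_gains(position)
        display_feed.append(g_display * sample)
        surround_feed.append(g_surround * sample)
    return display_feed, surround_feed


# A constant signal that moves from on-screen to off-screen in five steps.
signal = [1.0] * 5
trajectory = [0.0, 0.25, 0.5, 0.75, 1.0]
display_feed, surround_feed = pan_along_trajectory(signal, trajectory)
```

In a real system the trajectory would come from the video analysis, and positions would typically be interpolated per audio sample between video frames.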
-
Publication number: US12231865B2
Publication date: 2025-02-18
Application number: US18154678
Application date: 2023-01-13
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sunil Bharitkar, Seongnam Oh, Carlos Tejeda Ocampo
IPC: G11B27/036, G06V20/40, H04S7/00
Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.
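The abstract above splits the work into a production step, which derives audio-object metadata from motion vectors, and a playback step, which only parses that metadata. The sketch below illustrates that split under loud assumptions: the JSON layout, the field names (`object_id`, `track`, `frame`, `x`, `y`), and both function names are hypothetical, and the patent does not specify this format or how the metadata is carried in the encoder.

```python
import json


def build_object_metadata(start_xy, motion_vectors, frame_size):
    """Production side: integrate per-frame motion vectors into a position track.

    start_xy: initial object position in pixels.
    motion_vectors: one (dx, dy) pixel displacement per video frame.
    frame_size: (width, height) of the video frame in pixels.
    """
    width, height = frame_size
    x, y = start_xy
    track = []
    for index, (dx, dy) in enumerate(motion_vectors):
        x += dx
        y += dy
        track.append({
            "frame": index,
            "x": 2.0 * x / width - 1.0,   # -1 is the left edge, +1 the right edge
            "y": 1.0 - 2.0 * y / height,  # -1 is the bottom edge, +1 the top edge
        })
    return json.dumps({"object_id": 0, "track": track})


def positions_for_playback(metadata_json):
    """Playback side: recover positions from metadata alone, with no video analysis."""
    metadata = json.loads(metadata_json)
    return [(point["x"], point["y"]) for point in metadata["track"]]


# An object starting at the centre of a 1920x1080 frame and moving right.
metadata_json = build_object_metadata((960, 540), [(96, 0), (96, 0)], (1920, 1080))
positions = positions_for_playback(metadata_json)
```

The point of the design is that the costly image analysis happens once during production, so the playback device needs only a parser and a renderer.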
-
Publication number: US20250046321A1
Publication date: 2025-02-06
Application number: US18412878
Application date: 2024-01-15
Applicant: Samsung Electronics Co., Ltd.
Inventor: Toni M. Hirvonen, Carlos Tejeda Ocampo
IPC: G10L19/032, G10L19/002, G10L19/008, G10L19/20, G10L19/24
Abstract: One embodiment provides a computer-implemented method that includes analyzing, by a computing device, spatial object-based audio content associated with one or more objects. One or more relative perceptual importance metrics of the one or more objects are determined, by the computing device, based on modeling. Based on the one or more relative perceptual importance metrics, resources are allocated, by the computing device, for improving overall audio quality relative to bitrate.
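The abstract above describes allocating resources to audio objects according to their relative perceptual importance. One simple way to picture this, offered as a sketch and not as the application's method, is to split a bit budget in proportion to non-negative importance scores. The function name `allocate_bits`, the `floor_bits` parameter, and the proportional rule itself are assumptions. How the importance scores are modeled is not covered here.

```python
def allocate_bits(importance, total_bits, floor_bits=0):
    """Split total_bits across objects in proportion to importance scores.

    importance: non-negative relative importance, one value per object.
    floor_bits: minimum number of bits every object receives.
    """
    count = len(importance)
    if count == 0:
        return []
    if floor_bits * count > total_bits:
        raise ValueError("budget too small for the requested floor")
    remaining = total_bits - floor_bits * count
    weight_sum = sum(importance)
    if weight_sum <= 0:
        shares = [remaining / count] * count  # no preference: split evenly
    else:
        shares = [remaining * w / weight_sum for w in importance]
    # Largest-remainder rounding so the integer budgets sum to total_bits.
    bits = [floor_bits + int(share) for share in shares]
    leftover = total_bits - sum(bits)
    by_fraction = sorted(
        range(count), key=lambda i: shares[i] - int(shares[i]), reverse=True
    )
    for i in by_fraction[:leftover]:
        bits[i] += 1
    return bits


# Three objects; the first is judged three times as important as the others.
budget = allocate_bits([3.0, 1.0, 1.0], total_bits=1000, floor_bits=50)
```

A floor keeps low-importance objects from being starved entirely, which is a common safeguard in codec bit allocation, though the abstract does not mention one.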
-
Publication number: US20240244386A1
Publication date: 2024-07-18
Application number: US18154678
Application date: 2023-01-13
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sunil Bharitkar, SeongNam Oh, Carlos Tejeda Ocampo
IPC: H04S7/00, G06V20/40, G11B27/036
CPC classification number: H04S7/30, G06V20/46, G11B27/036, G06V2201/10, H04S2400/11
Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.