SURROUND SOUND TO IMMERSIVE AUDIO UPMIXING BASED ON VIDEO SCENE ANALYSIS

    Publication No.: US20240196158A1

    Publication date: 2024-06-13

    Application No.: US18476172

    Filing date: 2023-09-27

    CPC classification number: H04S7/305 G06V20/49

    Abstract: One embodiment provides a method of audio upmixing comprising performing video scene analysis by segmenting visual objects from video frames of a video, and performing audio analysis by extracting audio signals from an audio corresponding to the video. The method further comprises determining whether any of the audio signals correspond to any of the visual objects, and estimating a video-based trajectory of a visual object if the visual object is in motion and transitions from on-screen to off-screen, or vice versa, during the video. The method further comprises positioning an audio trajectory of an audio signal from at least one speaker associated with the display to at least one other speaker associated with providing surround sound. The audio trajectory is automatically matched with the video. The audio signal is delivered to the at least one speaker and the at least one other speaker for audio reproduction during the presentation.
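
    The positioning step in this abstract (moving an audio signal from a display speaker to a surround speaker along a trajectory) can be illustrated with a minimal sketch. The abstract does not say how gains are computed, so the constant-power pan law used here is an assumption, and the names `constant_power_gains`, `linear_trajectory`, and `pan_along_trajectory` are hypothetical.

```python
import math


def constant_power_gains(position):
    """Return (front_gain, surround_gain) for a position in [0, 1].

    0.0 means fully at the display (front) speaker, 1.0 means fully at the
    surround speaker. The constant-power pan law is an assumption, not
    something stated in the abstract.
    """
    p = min(max(position, 0.0), 1.0)  # clamp out-of-range positions
    theta = p * math.pi / 2
    return math.cos(theta), math.sin(theta)


def linear_trajectory(n, start=0.0, end=1.0):
    """Hypothetical stand-in for a video-estimated trajectory: n evenly
    spaced positions from start to end."""
    if n == 1:
        return [start]
    return [start + (end - start) * i / (n - 1) for i in range(n)]


def pan_along_trajectory(samples, trajectory):
    """Split a mono signal into front and surround feeds, one position per sample."""
    front, surround = [], []
    for sample, position in zip(samples, trajectory):
        g_front, g_surround = constant_power_gains(position)
        front.append(sample * g_front)
        surround.append(sample * g_surround)
    return front, surround


if __name__ == "__main__":
    signal = [1.0] * 5  # placeholder audio samples
    path = linear_trajectory(len(signal))  # on-screen to off-screen
    front_feed, surround_feed = pan_along_trajectory(signal, path)
    print(front_feed)
    print(surround_feed)
```

    In a real system the trajectory would come from the video scene analysis rather than from `linear_trajectory`, and gains would typically be updated per audio block rather than per sample.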

    Video-derived audio processing
    3.
    Granted invention patent

    Publication No.: US12231865B2

    Publication date: 2025-02-18

    Application No.: US18154678

    Filing date: 2023-01-13

    Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.
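
    The split between production and playback described in this abstract can be sketched as follows. This is an illustration under stated assumptions only: the JSON metadata layout, the function names `build_metadata` and `render_gains`, and the simple linear left/right gain mapping are all hypothetical and do not come from the patent.

```python
import json


def build_metadata(object_id, start_xy, motion_vectors, frame_size):
    """Production side (hypothetical): integrate per-frame motion vectors,
    given in pixels, into normalized positions and serialize them as
    metadata for an audio object."""
    width, height = frame_size
    x, y = start_xy
    positions = [{"frame": 0, "x": x / width, "y": y / height}]
    for frame, (dx, dy) in enumerate(motion_vectors, start=1):
        x += dx
        y += dy
        positions.append({"frame": frame, "x": x / width, "y": y / height})
    return json.dumps({"object_id": object_id, "positions": positions})


def render_gains(metadata_json):
    """Playback side (hypothetical): derive per-frame (left, right) gains
    from the parsed metadata alone, with no image frame analysis."""
    metadata = json.loads(metadata_json)
    gains = []
    for position in metadata["positions"]:
        x = min(max(position["x"], 0.0), 1.0)  # clamp off-screen positions
        gains.append((1.0 - x, x))  # assumed linear pan across the frame width
    return gains


if __name__ == "__main__":
    # An object starting at the left edge and moving right by 50 px per frame.
    encoded = build_metadata("car", (0, 50), [(50, 0), (50, 0)], (100, 100))
    print(render_gains(encoded))
```

    The point of the sketch is the division of labor: motion analysis happens once, when the metadata is created, and the renderer only parses the result.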

    VIDEO-DERIVED AUDIO PROCESSING
    5.
    Published invention application

    Publication No.: US20240244386A1

    Publication date: 2024-07-18

    Application No.: US18154678

    Filing date: 2023-01-13

    Abstract: One embodiment provides a computer-implemented method that includes creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video. The method can include, during the content production, inserting the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder. The method can include, during content playback, rendering the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.
