-
公开(公告)号:US20210112295A1
公开(公告)日:2021-04-15
申请号:US16613961
申请日:2017-12-12
Applicant: Google LLC
Inventor: Neil Birkbeck , Balineedu Adsumilli , Damien Kelly
IPC: H04N21/2662 , H04N21/81 , H04N21/233 , H04N21/234 , H04N21/4728
Abstract: Signals of an immersive multimedia item are jointly considered for optimizing the quality of experience for the immersive multimedia item. During encoding, portions of available bitrate are allocated to the signals (e.g., a video signal and an audio signal) according to the overall contribution of those signals to the immersive experience for the immersive multimedia item. For example, in the spatial dimension, multimedia signals are processed to determine spatial regions of the immersive multimedia item to render using greater bitrate allocations, such as based on locations of audio content of interest, video content of interest, or both. In another example, in the temporal dimension, multimedia signals are processed in time intervals to adjust allocations of bitrate between the signals based on the relative importance of such signals during those time intervals. Other techniques for bitrate optimizations for immersive multimedia streaming are also described herein.
-
12.
公开(公告)号:US12206914B2
公开(公告)日:2025-01-21
申请号:US18021636
申请日:2022-06-08
Applicant: Google LLC
Inventor: Yilin Wang , Balineedu Adsumilli , Junjie Ke , Hossein Talebi , Joong Yim , Neil Birkbeck , Peyman Milanfar , Feng Yang
IPC: H04N21/266 , G06N3/045 , H04N17/02 , H04N19/154 , H04N21/234 , H04N21/434 , H04N21/44 , H04N21/466
Abstract: Methods, systems, and media for determining perceptual quality indicators of video content items are provided. In some embodiments, the method comprises: receiving a video content item; extracting a plurality of frames from the video content item; determining, using a first subnetwork of a deep neural network, a content quality indicator for each frame of the plurality of frames of the video content item; determining, using a second subnetwork of the deep neural network, a video distortion indicator for each frame of the plurality of frames of the video content item; determining, using a third subnetwork of the deep neural network, a compression sensitivity indicator for each frame of the plurality of frames of the video content item; generating a quality level for each frame of the plurality of frames of the video content item that concatenates the content quality indicator, the video distortion indicator, and the compression sensitivity indicator for that frame of the video content item; generating an overall quality level for video content item by aggregating the quality level of each frame of the plurality of frames; and causing a video recommendation to be presented based on the overall quality level of the video content item.
-
公开(公告)号:US20240086041A1
公开(公告)日:2024-03-14
申请号:US18508092
申请日:2023-11-13
Applicant: Google LLC
Inventor: Neil Birkbeck , Isasi Inguva , Damien Kelly , Andrew Crawford , Hugh Denman , Perry Tobin , Steve Benting , Anil Kokaram , Jeremy Doig
IPC: G06F3/0484 , G11B27/10 , G11B27/34 , H04N21/2387 , H04N21/242 , H04N21/45 , H04N21/845 , H04N21/8549
CPC classification number: G06F3/0484 , G11B27/10 , G11B27/34 , H04N21/2387 , H04N21/242 , H04N21/45 , H04N21/8455 , H04N21/8456 , H04N21/8549
Abstract: An interactive multi-view module identifies a plurality of media items associated with an event. Each of the plurality of media items is created by capturing the event. The interactive multi-view module synchronizes the audio portions of the media items according to a common reference timeline. The interactive multi-view module provides the media items for presentation in an interactive multi-view player interface based on the synchronized audio portions and multiple relative geographic locations. The interactive multi-view player interface allows a user of a plurality of users to switch between the plurality of media items, and indicates a video density indicating a quantity of media items available at a given point in time and a popularity indicator of one of the media items at the given point in time. The popularity indicator is determined using factors comprising a number of viewers of the media items at the given point in time.
-
公开(公告)号:US11854164B2
公开(公告)日:2023-12-26
申请号:US17708983
申请日:2022-03-30
Applicant: Google LLC
Inventor: Damien Kelly , Neil Birkbeck , Balineedu Adsumilli , Mohammad Izadi
IPC: G06T5/00
CPC classification number: G06T5/002 , G06T2207/10016 , G06T2207/20021
Abstract: Processing a spherical video using denoising is described. Video content comprising the spherical video is received. Whether a camera geometry or a map projection, or both, used to generate the spherical video is available is then determined. The spherical video is denoised using a first technique responsive to a determination that the camera geometry, the map projection, or both is available. Otherwise, the spherical video is denoised using a second technique. At least some steps of the second technique can be different from steps of the first technique. The denoised spherical video can be encoded for transmission or storage using less data than encoding the spherical video without denoising.
-
公开(公告)号:US11843814B2
公开(公告)日:2023-12-12
申请号:US17462286
申请日:2021-08-31
Applicant: Google LLC
Inventor: Neil Birkbeck , Balineedu Adsumilli , Damien Kelly
IPC: H04N21/2662 , H04N21/233 , H04N21/234 , H04N21/4728 , H04N21/81
CPC classification number: H04N21/2662 , H04N21/233 , H04N21/23418 , H04N21/4728 , H04N21/816
Abstract: Signals of an immersive multimedia item are jointly considered for optimizing the quality of experience for the immersive multimedia item. During encoding, portions of available bitrate are allocated to the signals (e.g., a video signal and an audio signal) according to the overall contribution of those signals to the immersive experience for the immersive multimedia item. For example, in the spatial dimension, multimedia signals are processed to determine spatial regions of the immersive multimedia item to render using greater bitrate allocations, such as based on locations of audio content of interest, video content of interest, or both. In another example, in the temporal dimension, multimedia signals are processed in time intervals to adjust allocations of bitrate between the signals based on the relative importance of such signals during those time intervals. Other techniques for bitrate optimizations for immersive multimedia streaming are also described herein.
-
公开(公告)号:US11748854B2
公开(公告)日:2023-09-05
申请号:US17722720
申请日:2022-04-18
Applicant: Google LLC
Inventor: Neil Birkbeck , Balineedu Adsumilli , Mohammad Izadi
IPC: G06T5/00
CPC classification number: G06T5/002 , G06T2207/10016 , G06T2207/10024 , G06T2207/20081 , G06T2207/20084
Abstract: Denoising video content includes identifying a three-dimensional flat frame block of multiple frames of the video content, wherein the three-dimensional flat frame block comprises flat frame blocks, each flat frame block is located within a respective frame of the multiple frames, and the flat frame blocks have a spatial and temporal intensity variance that is less than a threshold. Denoising video content also includes determining an average intensity value of the three-dimensional flat frame block, determining a noise model that represents noise characteristics of the three-dimensional flat frame block, generating a denoising function using the average intensity value and the noise model, and denoising the multiple frames using the denoising function.
-
-
-
-
-