-
Publication No.: US12036036B2
Publication date: 2024-07-16
Application No.: US17649915
Filing date: 2022-02-03
IPC classification: H04N21/435, A61B5/00, A61M5/172, G06N3/04, G06N3/0455, G16H20/17, G16H50/30, H04N19/70, H04N21/44
CPC classification: A61B5/4839, A61B5/7275, A61M5/1723, G06N3/04, G06N3/0455, G16H20/17, G16H50/30, H04N19/70, H04N21/435, H04N21/44, A61M2230/201
Abstract: An example method is provided that includes receiving a media bitstream comprising one or more media units and a first enhancement information message, wherein the first enhancement information message comprises at least two independently parsable structures, a first independently parsable structure comprising information about at least one purpose of one or more neural networks (NNs) to be applied to the one or more media units, and a second independently parsable structure comprising or identifying one or more neural networks; decoding the one or more media units; and using the one or more neural networks to enhance or filter one or more frames of the decoded one or more media units, based on the at least one purpose. Corresponding apparatuses and computer program products are also provided.
-
Publication No.: US12015805B2
Publication date: 2024-06-18
Application No.: US17795631
Filing date: 2021-01-21
IPC classification: H04N21/218, H04N21/472, H04N21/81
CPC classification: H04N21/21805, H04N21/472, H04N21/816
Abstract: The embodiments relate to a method including determining a foreground area covering a viewport of 360-degree video and one or more other areas of the 360-degree video, not containing the foreground area in its entirety; concluding a first set of tile streams among available tile streams of the 360-degree video to cover the foreground area; concluding a second set of tile streams among the available tile streams of the 360-degree video to cover the one or more other areas; and requesting transmission of a first set of portions of the first set of tile streams and a second set of portions of the second set of tile streams, wherein the portions in the first set of portions have a shorter duration than the portions in the second set of portions.
-
Publication No.: US20240048737A1
Publication date: 2024-02-08
Application No.: US18379386
Filing date: 2023-10-12
IPC classification: H04N19/30, H04N19/70, H04N19/172, H04N19/167, H04N19/184, H04N19/169, H04N19/132
CPC classification: H04N19/30, H04N19/70, H04N19/172, H04N19/167, H04N19/184, H04N19/188, H04N19/132
Abstract: An apparatus comprising: at least one processor; and at least one non-transitory memory storing instructions that, when executed by the at least one processor, cause the apparatus at least to: indicate a mixed network abstraction layer unit type pictures sample group used to merge video base tracks having a subpicture track; indicate a sample group description entry of the mixed network abstraction layer unit type pictures sample group, the sample group description entry indicating a group of pairs of mixed network abstraction layer unit type track reference indices, which reference video subpicture tracks or track groups; and wherein when a video bitstream is resolved from a video base track containing a mixed network abstraction layer unit type sample group with merging pairs of video subpicture tracks signaled in a mixed network abstraction layer unit type pictures sample group entry, then there is mixing of different network abstraction layer unit types.
-
Publication No.: US11575938B2
Publication date: 2023-02-07
Application No.: US17137609
Filing date: 2020-12-30
Inventors: Hamed Rezazadegan Tavakoli, Francesco Cricri, Miska Matias Hannuksela, Emre Baris Aksu, Honglei Zhang, Nam Le
IPC classification: H04N19/124, H04N19/61, H04N19/192, H04N19/176, H04N19/134
Abstract: Data may be encoded to minimize distortion after decoding, but the quality required for presentation of the decoded data to a machine and the quality required for presentation to a human may be different. To accommodate different quality requirements, video data may be encoded to produce a first set of encoded data and a second set of encoded data, where the first set may be decoded for use by one of a machine consumer or a human consumer, and a combination of the first set and the second set may be decoded for use by the other of a machine consumer or a human consumer. The first and second sets may be produced with a neural encoder and a neural decoder, and/or may be produced with the use of prediction and transform neural network modules. A human-targeted structure and a machine-targeted structure may produce the sets of encoded data.
-
Publication No.: US11412266B2
Publication date: 2022-08-09
Application No.: US17140512
Filing date: 2021-01-04
Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: encode or decode a high-level bitstream syntax for at least one neural network; wherein the high-level bitstream syntax comprises at least one information unit having metadata or compressed neural network data of a portion of the at least one neural network; and wherein a serialized bitstream comprises one or more of the at least one information unit.
-
Publication No.: US11341688B2
Publication date: 2022-05-24
Application No.: US17038439
Filing date: 2020-09-30
IPC classification: G06T9/00, H04N19/154, G06N3/08, H04N19/172, H04N19/103
Abstract: Optimization of a neural network, for example in a video codec at the decoder side, may be guided to limit overfitting. The encoder may encode video(s) with different qualities for different frames in the video. Low-quality frames may be used as both input and ground-truth during optimization. High-quality frames may be used to optimize the neural network so that higher-quality versions of lower-quality inputs may be predicted. The neural network may be trained to make such predictions by making a prediction based on a constructed low-quality input for which the corresponding high-quality version is known, comparing the prediction to the high-quality version, and fine-tuning the neural network to improve its ability to predict a high-quality version of a low-quality input. To limit overfitting, the neural network may be trained, concurrently or in an alternating fashion, with low-quality input for which a higher-quality version of the low-quality input is known.
-
Publication No.: US11212546B2
Publication date: 2021-12-28
Application No.: US16812977
Filing date: 2020-03-09
IPC classification: H04N19/16, H04N19/463, H04N19/58, H04N19/46, H04N19/52
Abstract: A reference picture marking process and a reference picture list management process are handled in a unified reference picture marking and reference picture list management process. A new idle reference picture list may be used for handling reference pictures that are not used for reference in the current picture. Differential coding of picture order count may be used to increase coding efficiency. The reference picture management syntax structure may be sent in the picture parameter set for improved coding efficiency, e.g., in regular GOP (group of pictures) arrangements.
-
Publication No.: US11184634B2
Publication date: 2021-11-23
Application No.: US16834429
Filing date: 2020-03-30
Abstract: There are disclosed various methods, apparatuses and computer program products for video encoding and decoding. In some embodiments a method comprises at least one of the following: encoding into a bitstream an indication that motion fields are stored, but only for inter-layer motion prediction; encoding into a bitstream an indication of a limited scope of motion field usage; encoding into a bitstream an indication of whether or not to use the motion field for prediction; encoding into a bitstream an indication of storage parameters for storing motion information.
-
Publication No.: US11172005B2
Publication date: 2021-11-09
Application No.: US15261192
Filing date: 2016-09-09
IPC classification: H04L29/06, H04N21/845, H04N21/2343, H04N21/472
Abstract: A method, apparatus and computer program product are provided for rendering audiovisual content, such as 360-degree virtual reality content, in a manner that allows for control over whether, and to what degree, the content presented to a viewer should take into account the relative positioning of the content with respect to the viewer. In particular, implementations are presented that allow for situational control over the rendering of content based on an initial observation setup associated with a segment or subsegment of content, the orientation of the viewing device, and/or the manner in which the segment or subsegment is accessed by a playback device.
-
Publication No.: US20210105492A1
Publication date: 2021-04-08
Application No.: US17061610
Filing date: 2020-10-02
IPC classification: H04N19/20, H04N19/184, H04N19/115, H04N19/136
Abstract: Described are methods, apparatuses and computer program products for signaling and storing compressed point clouds. Sub-sample entries associated with sequences of sub-samples within sequences of samples may indicate whether sequences of sub-samples were encapsulated alone in a track, without other sub-samples or additional header data. Sub-sample entry types can be indexed at track-level sub-sample description boxes. Point cloud compression coded bitstream component types may be signaled by including respective point cloud unit header information in a codec-specific parameters-related field of track-level sub-sample description boxes. Sub-sample information boxes may indicate sub-sample entry indices for respective sub-samples. A flag in such information boxes may indicate the presence of sub-sample description entry indices. Description index boxes can contain sub-sample description entry indices in the same container as sub-sample information boxes. Track fragment header boxes can include sub-sample description entry indices that apply to samples of a track fragment.