-
公开(公告)号:US12132919B2
公开(公告)日:2024-10-29
申请号:US17987844
申请日:2022-11-15
Applicant: QUALCOMM Incorporated
Inventor: Yang Yang , Hoang Cong Minh Le , Yinhao Zhu , Reza Pourreza , Amir Said , Yizhe Zhang , Taco Sebastiaan Cohen
IPC: H04N19/124 , H04N19/119 , H04N19/147 , H04N19/17 , H04N19/436
CPC classification number: H04N19/436 , H04N19/119 , H04N19/124 , H04N19/147 , H04N19/17
Abstract: A processor-implemented method for image compression using an artificial neural network (ANN) includes receiving, at an encoder of the ANN, an image and a spatial segmentation map corresponding to the image. The spatial segmentation map indicates one or more regions of interest. The encoder compresses the image according to a controllable spatial bit allocation. The controllable spatial bit allocation is based on a learned quantization bin size.
-
公开(公告)号:US20240244246A1
公开(公告)日:2024-07-18
申请号:US18192343
申请日:2023-03-29
Applicant: QUALCOMM Incorporated
Inventor: Amir Said , Hilmi Enes Egilmez , Marta Karczewicz , Vadim Seregin
IPC: H04N19/44 , G06F17/16 , H04N19/117 , H04N19/12 , H04N19/147 , H04N19/176 , H04N19/18 , H04N19/463 , H04N19/61 , H04N19/625 , H04N19/96
CPC classification number: H04N19/45 , G06F17/16 , H04N19/117 , H04N19/12 , H04N19/147 , H04N19/176 , H04N19/18 , H04N19/463 , H04N19/61 , H04N19/625 , H04N19/96
Abstract: A device may perform a first prediction process for a first block of video data to produce a first residual. The device may apply a first transform process to the first residual to generate first transform coefficients for the first block of video data and encode the first transform coefficients. The device may perform a second prediction process for a second block of video data to produce a second residual. The device may determine that a second transform process, which includes the first transform process and at least one of a pre-adjustment operation or a post-adjustment operation, is to be applied to the second residual. The device may apply the first transform process and the pre- or post-adjustment operation to the second residual to generate second transform coefficients for the second block. The coding device may code the first and second transform coefficients.
-
33.
公开(公告)号:US20240121392A1
公开(公告)日:2024-04-11
申请号:US18534073
申请日:2023-12-08
Applicant: QUALCOMM Incorporated
Inventor: Amir Said
IPC: H04N19/13 , H04N19/124 , H04N19/134 , H04N19/136 , H04N19/42
CPC classification number: H04N19/13 , H04N19/124 , H04N19/134 , H04N19/136 , H04N19/42
Abstract: This disclosure describes entropy coding techniques for media data coded using neural-based techniques. A media coder is configured to determine a probability distribution function parameter for a data element of a data stream coded by a neural-based media compression technique, wherein the probability distribution function parameter is a function of a standard deviation of a probability distribution function of the data stream, determine a code vector based on the probability distribution function parameter, and entropy code the data element using the code vector.
-
公开(公告)号:US20230262267A1
公开(公告)日:2023-08-17
申请号:US17650728
申请日:2022-02-11
Applicant: QUALCOMM Incorporated
Inventor: Amir Said , Yinhao Zhu
IPC: H04N19/91 , H04N19/124 , G06T9/00 , G06N3/04
CPC classification number: H04N19/91 , H04N19/124 , G06T9/002 , G06N3/0472
Abstract: This disclosure describes entropy coding techniques for media data coded using neural-based techniques. A media coder is configured to determine a probability distribution function parameter for a data element of a data stream coded by a neural-based media compression technique, wherein the probability distribution function parameter is a logarithmic function of a standard deviation of a probability distribution function of the data stream, determine a code vector based on the probability distribution function parameter, and entropy code the data element using the code vector.
-
公开(公告)号:US20230179776A1
公开(公告)日:2023-06-08
申请号:US18162962
申请日:2023-02-01
Applicant: QUALCOMM Incorporated
Inventor: Hilmi Enes Egilmez , Vadim Seregin , Amir Said , Marta Karczewicz
IPC: H04N19/159 , H04N19/176 , H04N19/60 , H04N19/96
CPC classification number: H04N19/159 , H04N19/176 , H04N19/60 , H04N19/96
Abstract: An example device applies a primary transform to a current block of video data to create primary transform coefficients. The device determines whether intra sub-partitioning is applied to the current block of video data. The device applies a primary transform to the current block. The device also determines whether a primary transform size for the current block of video data is at least a predetermined size. Based on intra sub-partitioning being applied and the primary transform size being at least the predetermined size, the device applies a secondary transform to primary transform coefficients and codes the current block of video data based on the secondary transform.
-
公开(公告)号:US11240534B2
公开(公告)日:2022-02-01
申请号:US16838553
申请日:2020-04-02
Applicant: QUALCOMM Incorporated
Inventor: Hilmi Enes Egilmez , Vadim Seregin , Amir Said , Marta Karczewicz
IPC: H04N19/61 , H04N19/176 , H04N19/625
Abstract: An example device for coding video data includes a memory configured to store video data; and one or more processors implemented in circuitry and configured to: code a first codeword representing a selected transform scheme of a set of transform candidates of a multiple transform selection (MTS) scheme for a current block of video data, the selected transform scheme being a secondary transform of a set of available secondary transforms to be applied in addition to a primary transform; code a second codeword representing the secondary transform from the set of available secondary transforms; and apply the primary transform and the secondary transform during coding of residual data for the current block. The second codeword may be a value for a low-frequency non-separable transform (LFNST) syntax element.
-
公开(公告)号:US20220030278A1
公开(公告)日:2022-01-27
申请号:US17450509
申请日:2021-10-11
Applicant: QUALCOMM Incorporated
Inventor: Hilmi Enes Egilmez , Vadim Seregin , Amir Said , Marta Karczewicz
IPC: H04N19/625 , H04N19/61 , H04N19/176
Abstract: An example device for coding video data includes a memory configured to store video data; and one or more processors implemented in circuitry and configured to: code a first codeword representing a selected transform scheme of a set of transform candidates of a multiple transform selection (MTS) scheme for a current block of video data, the selected transform scheme being a secondary transform of a set of available secondary transforms to be applied in addition to a primary transform; code a second codeword representing the secondary transform from the set of available secondary transforms; and apply the primary transform and the secondary transform during coding of residual data for the current block. The second codeword may be a value for a low-frequency non-separable transform (LFNST) syntax element.
-
公开(公告)号:US20210329267A1
公开(公告)日:2021-10-21
申请号:US17070589
申请日:2020-10-14
Applicant: QUALCOMM Incorporated
Inventor: Dana Kianfar , Auke Joris Wiggers , Amir Said , Taco Sebastiaan Cohen , Reza Pourreza Shahri
IPC: H04N19/176 , H04N19/61 , H04N19/13 , H04N19/18 , H04N19/124
Abstract: A video encoder determines scaled transform coefficients, wherein determining the scaled transform coefficients comprises scaling transform coefficients of a block of the video data according to a given quantization step. The video encoder determines scalar quantized coefficients, wherein determining the scalar quantized coefficients comprises applying scalar quantization to the scaled transform coefficients of the block. Additionally, the video encoder applies a neural network that determines a respective set of probabilities for each respective transform coefficient of the block. The respective set of probabilities for the respective transform coefficient includes a respective probability value for each possible adjustment value in a plurality of possible adjustment values. Inputs to the neural network include the scaled transform coefficients and the scalar quantized coefficients. The video encoder determines, based on the set of probabilities for a particular transform coefficient of the block, a quantization level for the particular transform coefficient.
-
公开(公告)号:US11082703B2
公开(公告)日:2021-08-03
申请号:US16715756
申请日:2019-12-16
Applicant: QUALCOMM Incorporated
Inventor: Vadim Seregin , Xin Zhao , Amir Said , Marta Karczewicz , Nan Hu , Wei-Jung Chien , Vijayaraghavan Thirumalai
IPC: H04N19/11 , H04N19/159 , H04N19/463 , H04N19/176 , H04N19/91 , H04N19/157 , H04N19/593 , H04N1/417 , H04N5/926 , H04N9/804 , H04N9/808
Abstract: A device for coding video data is configured to determine that a current block of video data is coded using an intra prediction mode; add an intra prediction mode of a first neighboring block of the current block to a most probable mode candidate list for the current block; add an intra prediction mode for a second neighboring block of the current block to the most probable mode candidate list for the current block; add an intra prediction mode for a third neighboring block of the current block to the most probable mode candidate list for the current block; and code the current block of video data using an intra prediction mode.
-
公开(公告)号:US20210092381A1
公开(公告)日:2021-03-25
申请号:US17025529
申请日:2020-09-18
Applicant: QUALCOMM Incorporated
Inventor: Hilmi Enes Egilmez , Amir Said , Vadim Seregin , Marta Karczewicz
IPC: H04N19/129 , H04N19/159 , H04N19/176
Abstract: An example device for decoding video data includes a memory configured to store the video data and one or more processors coupled to the memory. The one or more processors are configured to reorganize 2-D dequantized coefficients according to a first ordering. The one or more processors are configured to apply an inverse low-frequency non-separable transform (LFNST) to the reorganized 2-D dequantized coefficients to create inverse transformed coefficients. The one or more processors are configured to reorganize the inverse transformed coefficients according to a second ordering, the second ordering being based on an array including values, wherein each value in the array corresponds to a position in a 2-D block and the values in the array denote indices of the 2-D block in a defined order. The one or more processors are configured to decode the video data based on the second ordered inverse transformed coefficients.
-
-
-
-
-
-
-
-
-