-
Publication number: US20180047193A1
Publication date: 2018-02-15
Application number: US15285441
Filing date: 2016-10-04
Applicant: QUALCOMM Incorporated
Inventor: Jinglun Gao, Ying Chen, Lei Wang, Ning Bi
CPC classification number: G06T7/248, G06T2207/10016, G06T2207/10024, G06T2207/30232, G06T2207/30236, G06T2207/30241, G06T2210/12
Abstract: Provided are methods, apparatuses, and computer-readable media for content-adaptive bounding box merging. A system using content-adaptive bounding box merging can adapt its merging criteria according to the objects typically present in a scene. When two bounding boxes overlap, the content-adaptive merge engine can consider the overlap ratio, and compare the merged bounding box against a minimum object size. The minimum object size can be adapted to the size of the blobs detected in the scene. When two bounding boxes do not overlap, the system can consider the horizontal and vertical distances between the bounding boxes. The system can further compare the distances against content-adaptive thresholds. Using a content-adaptive bounding box merge engine, a video content analysis system may be able to more accurately merge (or not merge) bounding boxes and their associated blobs.
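The merge decision outlined in this abstract could be sketched roughly as follows. This is only an illustration under stated assumptions, not the claimed method: the `Box` type, the definition of the overlap ratio (intersection area over the smaller box's area), the way the minimum object size enters the decision, and all threshold values are hypothetical.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned bounding box (hypothetical representation)."""
    x: float
    y: float
    w: float
    h: float


def union(a: Box, b: Box) -> Box:
    """Smallest box that contains both a and b."""
    x1, y1 = min(a.x, b.x), min(a.y, b.y)
    x2 = max(a.x + a.w, b.x + b.w)
    y2 = max(a.y + a.h, b.y + b.h)
    return Box(x1, y1, x2 - x1, y2 - y1)


def intersection_area(a: Box, b: Box) -> float:
    """Area shared by a and b (0 when they do not overlap)."""
    iw = min(a.x + a.w, b.x + b.w) - max(a.x, b.x)
    ih = min(a.y + a.h, b.y + b.h) - max(a.y, b.y)
    return max(iw, 0.0) * max(ih, 0.0)


def gaps(a: Box, b: Box) -> tuple:
    """Horizontal and vertical distances between two boxes (0 if they overlap on that axis)."""
    dx = max(a.x, b.x) - min(a.x + a.w, b.x + b.w)
    dy = max(a.y, b.y) - min(a.y + a.h, b.y + b.h)
    return max(dx, 0.0), max(dy, 0.0)


def should_merge(a: Box, b: Box, min_object_area: float,
                 overlap_thresh: float = 0.5,
                 max_dx: float = 5.0, max_dy: float = 5.0) -> bool:
    """Assumed rule: overlapping boxes merge on a high overlap ratio or a small
    merged box; non-overlapping boxes merge when both gaps are small.
    In a content-adaptive system the thresholds would be derived from the scene."""
    inter = intersection_area(a, b)
    if inter > 0:
        ratio = inter / min(a.w * a.h, b.w * b.h)
        merged = union(a, b)
        return ratio >= overlap_thresh or merged.w * merged.h <= min_object_area
    dx, dy = gaps(a, b)
    return dx <= max_dx and dy <= max_dy
```

In this sketch, `min_object_area`, `max_dx`, and `max_dy` stand in for the content-adaptive values; a real system would presumably update them from the sizes of the blobs it has detected.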
-
Publication number: US20180047171A1
Publication date: 2018-02-15
Application number: US15384802
Filing date: 2016-12-20
Applicant: QUALCOMM Incorporated
Inventor: Ying Chen, Lei Wang, Jinglun Gao, Ning Bi
CPC classification number: G06T7/20, G06K9/00711, G06K9/4671, G06T7/13, G06T7/246, G06T7/30, G06T7/60, G06T2207/30232, G06T2207/30241
Abstract: Techniques and systems are provided for processing video data. For example, techniques and systems are provided for maintaining blob trackers for one or more video frames. A blob tracker can be associated with a blob generated for a video frame. The blob includes pixels of at least a portion of one or more foreground objects in the video frame. The blob tracker can be determined to be a first type of tracker or a second type of tracker. A first type of tracker has a first bounding box and a second bounding box with an overlapping ratio greater than an alignment threshold for the first type of tracker. A second type of tracker has an irregular size change or an irregular motion change over a threshold duration. The blob tracker can be removed from the plurality of blob trackers maintained for the one or more video frames when the blob tracker is the first type of tracker or the second type of tracker.
-
Publication number: US20180033152A1
Publication date: 2018-02-01
Application number: US15262700
Filing date: 2016-09-12
Applicant: QUALCOMM Incorporated
Inventor: Ying Chen, Lei Wang, Jinglun Gao, Ning Bi
CPC classification number: G06K9/00711, G06K9/00771, G06K9/38, G06K2009/00738, G06T5/004, G06T5/30, G06T7/11, G06T7/155, G06T7/194, G06T2207/10016, G06T2207/10024, G06T2207/20012, G06T2207/20024, G06T2207/20036, G06T2207/20192
Abstract: Techniques and systems are provided for processing video data. For example, techniques and systems are provided for performing content-adaptive morphology operations. A first erosion function can be performed on a foreground mask of a video frame, including setting one or more foreground pixels of the frame to one or more background pixels. A temporary foreground mask can be generated based on the first erosion function being performed on the foreground mask. One or more connected components can be generated for the frame by performing connected component analysis to connect one or more neighboring foreground pixels. A complexity of the frame (or of the foreground mask of the frame) can be determined by comparing a number of the one or more connected components to a threshold number. A second erosion function can be performed on the temporary foreground mask when the number of the one or more connected components is higher than the threshold number. The one or more connected components can be output for blob processing when the number of the one or more connected components is lower than the threshold number.
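The flow described in this abstract (erode, count connected components, erode again only if the frame looks complex) might be sketched as below. This is a minimal pure-Python illustration under assumptions the abstract does not state: a 3x3 structuring element, 4-connectivity, out-of-bounds pixels treated as background, and re-running the component analysis after the second erosion. The function names are hypothetical.

```python
from collections import deque


def erode(mask):
    """3x3 erosion: a pixel stays foreground only if its full 3x3
    neighbourhood is in bounds and foreground."""
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            out[r][c] = int(all(
                0 <= r + dr < h and 0 <= c + dc < w and mask[r + dr][c + dc]
                for dr in (-1, 0, 1) for dc in (-1, 0, 1)))
    return out


def connected_components(mask):
    """Group foreground pixels into 4-connected components (lists of (row, col))."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    components = []
    for r in range(h):
        for c in range(w):
            if not mask[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue, pixels = deque([(r, c)]), []
            while queue:
                cr, cc = queue.popleft()
                pixels.append((cr, cc))
                for nr, nc in ((cr - 1, cc), (cr + 1, cc), (cr, cc - 1), (cr, cc + 1)):
                    if 0 <= nr < h and 0 <= nc < w and mask[nr][nc] and not seen[nr][nc]:
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            components.append(pixels)
    return components


def adaptive_erosion(mask, max_components):
    """Erode once; erode a second time only when the component count
    exceeds max_components (the assumed complexity test)."""
    temp = erode(mask)
    comps = connected_components(temp)
    if len(comps) > max_components:
        temp = erode(temp)
        # Assumption: components are recomputed after the second erosion.
        comps = connected_components(temp)
    return temp, comps
```

Here `max_components` plays the role of the threshold number; the components returned would then be handed on to blob processing.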
-
Publication number: US09883197B2
Publication date: 2018-01-30
Application number: US14592819
Filing date: 2015-01-08
Applicant: QUALCOMM Incorporated
Inventor: Ying Chen, Chao Pang, Li Zhang, Joel Sole Rojals, Marta Karczewicz
IPC: H04N7/12, H04N19/513, H04N19/52, H04N19/593, H04N19/186, H04N19/11, H04N19/154
CPC classification number: H04N19/513, H04N19/11, H04N19/154, H04N19/186, H04N19/52, H04N19/593
Abstract: A device for coding video data is configured to: determine a coding unit of a picture of the video data is coded using an intra block copy mode; determine a vector for a first chroma block of the coding unit; locate a first chroma reference block using the vector, the first chroma reference block being in the picture; predict the first chroma block based on the first chroma reference block; locate a second chroma reference block using the vector, the second chroma reference block being in the picture; and predict a second chroma block of the coding unit based on the second chroma reference block.
-
Publication number: US09860540B2
Publication date: 2018-01-02
Application number: US14584351
Filing date: 2014-12-29
Applicant: QUALCOMM Incorporated
Inventor: Ye-Kui Wang, Ying Chen
IPC: H04N19/70, H04N19/187, H04N19/103, H04N19/146, H04N19/186, H04N19/172, H04N19/423
CPC classification number: H04N19/187, H04N19/103, H04N19/146, H04N19/172, H04N19/186, H04N19/423, H04N19/70
Abstract: An apparatus for coding video information according to certain aspects includes a processor configured to determine a value of a flag associated with a current picture of a current layer to be decoded, the flag indicating whether pictures in a decoded picture buffer (DPB) should be output, wherein the current picture is an intra random access point (IRAP) picture that starts a new coded video sequence (CVS) and wherein the determination of the value of the flag is based on at least one of: (1) the chroma format of the current picture and the chroma format of the preceding picture, (2) the bit depth of the luma samples of the current picture and the bit depth of the luma samples of the preceding picture, or (3) the bit depth of the chroma samples of the current picture and the bit depth of the chroma samples of the preceding picture.
-
Publication number: US09854234B2
Publication date: 2017-12-26
Application number: US13803736
Filing date: 2013-03-14
Applicant: QUALCOMM Incorporated
Inventor: Ying Chen, Ye-Kui Wang
IPC: H04N19/00, H04N19/70, H04N19/573, H04N19/503, H04N19/597, H04N19/58, H04N19/51
CPC classification number: H04N19/573, H04N19/503, H04N19/51, H04N19/58, H04N19/597, H04N19/70
Abstract: The techniques of this disclosure may be generally related to the reference status of pictures. The techniques may store the reference status information of reference pictures of a picture, at an instance when the picture is being coded. The techniques may then utilize the reference status information of the reference pictures of the picture, at the instance when the picture was coded, to inter-predict video blocks of a subsequent picture.
-
Publication number: US20170345179A1
Publication date: 2017-11-30
Application number: US15229456
Filing date: 2016-08-05
Applicant: QUALCOMM Incorporated
Inventor: Jinglun Gao, Ning Bi, Ying Chen, Lei Wang
CPC classification number: G06T7/60, G06K9/00, G06K9/4671, G06T7/251, G06T7/277, G06T2207/10004, G06T2207/30196, G06T2207/30232
Abstract: Techniques and systems are provided for processing video data. For example, techniques and systems are provided for determining costs for blob trackers and blobs. A blob can be detected in a video frame. The blob includes pixels of at least a portion of a foreground object. A physical distance between a blob tracker and the blob can be determined. A size ratio between the blob tracker and the blob can also be determined. A cost between the blob tracker and the blob can then be determined using the physical distance and the size ratio. In some cases, a spatial relationship between the blob tracker and the blob is determined, in which case the physical distance can be determined based on the spatial relationship. Blob trackers can be associated with blobs based on the determined costs between the blob trackers and the blobs.
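One way to picture the cost and association steps in this abstract is the sketch below. The abstract does not say how distance and size ratio are combined or how the association is solved, so everything here is an assumption: boxes are `(x, y, w, h)` tuples with positive sizes, distance is measured between box centres, the cost is the product of distance and size ratio, and a greedy lowest-cost-first matching stands in for whatever assignment method the patent actually uses.

```python
import math


def center(box):
    """Centre point of an (x, y, w, h) box."""
    x, y, w, h = box
    return x + w / 2, y + h / 2


def cost(tracker_box, blob_box):
    """Assumed cost: centre distance scaled by the area ratio (always >= 1).
    Boxes are assumed to have positive width and height."""
    (tx, ty), (bx, by) = center(tracker_box), center(blob_box)
    distance = math.hypot(tx - bx, ty - by)
    tracker_area = tracker_box[2] * tracker_box[3]
    blob_area = blob_box[2] * blob_box[3]
    size_ratio = max(tracker_area, blob_area) / min(tracker_area, blob_area)
    return distance * size_ratio


def associate(trackers, blobs, max_cost):
    """Greedy matching (a stand-in technique): repeatedly take the cheapest
    unused tracker/blob pair whose cost does not exceed max_cost."""
    pairs = sorted(
        (cost(t, b), i, j)
        for i, t in enumerate(trackers)
        for j, b in enumerate(blobs))
    used_trackers, used_blobs, matches = set(), set(), {}
    for c, i, j in pairs:
        if c > max_cost:
            break  # remaining pairs are even more expensive
        if i in used_trackers or j in used_blobs:
            continue
        matches[i] = j
        used_trackers.add(i)
        used_blobs.add(j)
    return matches
```

For example, `associate([(0, 0, 10, 10), (100, 100, 10, 10)], [(98, 101, 10, 10), (1, 1, 10, 10)], max_cost=50)` pairs tracker 0 with blob 1 and tracker 1 with blob 0. Note that a product-style cost is zero whenever the centres coincide, regardless of size; a weighted sum would be another plausible choice.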
-
Publication number: US09813736B2
Publication date: 2017-11-07
Application number: US14496807
Filing date: 2014-09-25
Applicant: QUALCOMM Incorporated
Inventor: Ying Chen, Ye-Kui Wang
IPC: H04N19/30, H04N19/70, H04N19/597, H04N19/463
CPC classification number: H04N19/597, H04N19/30, H04N19/463, H04N19/70
Abstract: A video decoder receives a value for a first syntax element representing whether a dependency type syntax element for a current layer is signaled, wherein the dependency type syntax element identifies a type of dependency of a current layer relative to a reference layer; and, in response to the value for the first syntax element indicating that the dependency type syntax element is not signaled, determines that the type of dependency of the current layer relative to the reference layer is a predetermined type and decodes a block of the current layer using inter-layer prediction conforming to the predetermined type.
-
Publication number: US20170272765A1
Publication date: 2017-09-21
Application number: US15612912
Filing date: 2017-06-02
Applicant: QUALCOMM Incorporated
Inventor: Xianglin Wang, Wei-Jung Chien, Marta Karczewicz, Ying Chen, Peisong Chen
IPC: H04N19/182, H04N19/61, H04N19/593, H04N19/105
CPC classification number: H04N19/182, H04N19/105, H04N19/593, H04N19/61
Abstract: A video coder performs a padding operation that processes a set of border pixels according to an order. The order starts at a bottom-left border pixel and proceeds through the border pixels sequentially to a top-right border pixel. When the padding operation processes an unavailable border pixel, the padding operation predicts a value of the unavailable border pixel based on a value of a border pixel previously processed by the padding operation. The video coder may generate an intra-predicted video block based on the border pixels.
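The padding order in this abstract can be illustrated with a short sketch. It assumes the border pixels are already laid out in a flat list from the bottom-left pixel to the top-right pixel, with `None` marking an unavailable pixel. The abstract does not say what happens when the very first pixel is unavailable; the forward search and the default value of 128 used here are assumptions for illustration.

```python
def pad_border(border, default=128):
    """Fill unavailable (None) border pixels in bottom-left to top-right order.

    Each unavailable pixel copies the value of the pixel processed just
    before it. Assumption: if the first pixel is unavailable, it takes the
    first available value further along, or `default` if none exists.
    """
    padded = list(border)
    if not padded:
        return padded
    if padded[0] is None:
        padded[0] = next((v for v in padded if v is not None), default)
    for i in range(1, len(padded)):
        if padded[i] is None:
            padded[i] = padded[i - 1]  # reuse the previously processed pixel
    return padded
```

With this sketch, `pad_border([None, None, 50, None, 70, None])` yields `[50, 50, 50, 50, 70, 70]`, after which every border pixel has a value that intra prediction can use.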
-
Publication number: US09756335B2
Publication date: 2017-09-05
Application number: US14318230
Filing date: 2014-06-27
Applicant: QUALCOMM Incorporated
Inventor: Jianle Chen, Ying Chen, Ye-Kui Wang, Krishnakanth Rapaka, Fnu Hendry
IPC: H04N19/70, H04N19/30, H04N19/187, H04N19/597, H04N19/105, H04N19/46, H04N19/196, H04N19/174
CPC classification number: H04N19/105, H04N19/174, H04N19/196, H04N19/30, H04N19/46, H04N19/597, H04N19/70
Abstract: A method of coding video data includes receiving one or more layers of video information. Each layer may include at least one picture. The method can include determining a number of active reference layer pictures associated with at least one picture of the one or more layers. The method can further include determining a number of direct reference layers associated with the at least one of the one or more layers. Based on the number of direct reference layers equaling the number of active reference layer pictures, the method can further include refraining from further signaling inter-layer reference picture information in any video slice associated with at least one of a video parameter set (VPS), a sequence parameter set (SPS), or a picture parameter set (PPS).