-
公开(公告)号:US20240119721A1
公开(公告)日:2024-04-11
申请号:US18476033
申请日:2023-09-27
Applicant: QUALCOMM Incorporated
Inventor: Dharma Raj KC , Venkata Ravi Kiran DAYANA , Meng-Lin WU , Venkateswara Rao CHERUKURI
CPC classification number: G06V10/82 , G06V10/7715 , G06V10/806
Abstract: Systems and techniques are described herein for processing data (e.g., image data) using convolution as a transformer (CAT) operations. The method includes receiving, at a convolution engine of a machine learning system, a first set of features, the first set of features being associated with an image and having a three-dimensional shape, applying, via the convolution engine, a depth-wise separable convolutional filter to the first set of features to generate a first output, applying, via the convolution engine, a pointwise convolutional filter to the first output to generate a second output based on global information from a spatial dimension and a channel dimension associated with the image, modifying the second output to the three-dimensional shape to generate a second set of features and combining the first set of features and the second set of features to generate an output set of features.