PROCESSING DATA USING CONVOLUTION AS A TRANSFORMER OPERATION

    公开(公告)号:US20240119721A1

    公开(公告)日:2024-04-11

    申请号:US18476033

    申请日:2023-09-27

    CPC classification number: G06V10/82 G06V10/7715 G06V10/806

    Abstract: Systems and techniques are described herein for processing data (e.g., image data) using convolution as a transformer (CAT) operations. The method includes receiving, at a convolution engine of a machine learning system, a first set of features, the first set of features being associated with an image and having a three-dimensional shape, applying, via the convolution engine, a depth-wise separable convolutional filter to the first set of features to generate a first output, applying, via the convolution engine, a pointwise convolutional filter to the first output to generate a second output based on global information from a spatial dimension and a channel dimension associated with the image, modifying the second output to the three-dimensional shape to generate a second set of features and combining the first set of features and the second set of features to generate an output set of features.

Patent Agency Ranking