-
公开(公告)号:US20240394936A1
公开(公告)日:2024-11-28
申请号:US18466747
申请日:2023-09-13
Applicant: QUALCOMM Incorporated
Inventor: Reza POURREZA , Roland MEMISEVIC , Apratim BHATTACHARYYA , Sunny Praful Kumar PANCHAL , Mingu LEE , Pulkit MADAN
IPC: G06T11/20 , G06N3/0464 , G06N3/084 , G06T11/60
Abstract: A processor-implemented method for image generation using an artificial neural network (ANN) includes receiving an input including one or more of an image or a text prompt. The ANN processes the input to determine one or more virtual brush strokes to generate an output image or one or more commands for controlling an image drawing application to generate the output image. A list of the one or more virtual brush strokes to generate the output image or the one or more commands for controlling the image drawing application to generate the output image. The one or more virtual brush strokes or commands may be executed to generate a sketch based on the input.
-
公开(公告)号:US20240386712A1
公开(公告)日:2024-11-21
申请号:US18500986
申请日:2023-11-02
Applicant: QUALCOMM Incorporated
Inventor: Apratim BHATTACHARYYA , Roland MEMISEVIC , Sunny Praful Kumar PANCHAL , Reza POURREZA , Mingu LEE , Pulkit MADAN
IPC: G06V10/82 , G06F40/10 , G06F40/284
Abstract: A processor-implemented method for generating grounded rationales for visual reasoning tasks includes receiving, by a first artificial neural network (ANN), an interleaved sequence of images and textual information. The first ANN extracts grid features of the images of the interleaved sequence of the images and the textual information to generate a representation of the interleaved sequence of the images and the textual information based on the grid features. A second ANN maps the grid features to a textual domain. The second ANN extracts visual information of the interleaved sequence of the images and the textual information based on the grid features in the textual domain. The second ANN determines a rationale based on the visual information. The visual information comprises one or more lower-level surrogate tasks.
-