-
公开(公告)号:US20250054322A1
公开(公告)日:2025-02-13
申请号:US18787616
申请日:2024-07-29
Applicant: Google LLC
Inventor: Keren Ye , Yicheng Zhu , Junjie Ke , Jiahui Yu , Leonidas John Guibas , Peyman Milanfar , Feng Yang
IPC: G06V20/70 , G06F40/279
Abstract: Systems and methods for attribute recognition can include obtaining an image and a text string. The text string can be processed with a language model to generate a set of candidate attributes based on sequence based prediction. The image and the candidate attributes can be processed with an image-text model to determine a likelihood that the respective candidate attribute is depicted in the image. The likelihood determination can then be utilized to determine a predicted attribute for the object of interest.