SYSTEM AND METHOD FOR PROBLEM INFERENCE BASED ON MULTI-MODAL GENERATIVE ARTIFICIAL INTELLIGENCE

    公开(公告)号:US20250139188A1

    公开(公告)日:2025-05-01

    申请号:US18889622

    申请日:2024-09-19

    Abstract: A method for problem inference based on multi-modal generative artificial intelligence includes receiving question information including an image and text, generating formal languages by parsing the image and text of the question information, respectively, based on a pre-constructed problem solving template, generating text-based intermediate inference information for the question information by inputting the generated formal language to a formal language inference unit, generating image-based inference information by inputting the text-based intermediate inference information, the text included in the question information (hereinafter referred to as “text question information”), and the image included in the question information (hereinafter referred to as “image question information”) to a multi-modal image generation model, and generating text-based inference information by inputting the text-based intermediate inference information, the image-based inference information, and the text question information to a multi-modal text generation model.

Patent Agency Ranking