MULTI-MODAL NATURAL LANGUAGE PROCESSING
    5.
    发明申请

    公开(公告)号:US20200251098A1

    公开(公告)日:2020-08-06

    申请号:US16723762

    申请日:2019-12-20

    Abstract: Multi-modal natural language processing systems are provided. Some systems are context-aware systems that use multi-modal data to improve the accuracy of natural language understanding as it is applied to spoken language input. Machine learning architectures are provided that jointly model spoken language input (“utterances”) and information displayed on a visual display (“on-screen information”). Such machine learning architectures can improve upon, and solve problems inherent in, existing spoken language understanding systems that operate in multi-modal contexts.

    Multi-modal natural language processing

    公开(公告)号:US10515625B1

    公开(公告)日:2019-12-24

    申请号:US15828174

    申请日:2017-11-30

    Abstract: Multi-modal natural language processing systems are provided. Some systems are context-aware systems that use multi-modal data to improve the accuracy of natural language understanding as it is applied to spoken language input. Machine learning architectures are provided that jointly model spoken language input (“utterances”) and information displayed on a visual display (“on-screen information”). Such machine learning architectures can improve upon, and solve problems inherent in, existing spoken language understanding systems that operate in multi-modal contexts.

    NATURAL LANGUAGE GENERATION
    7.
    发明申请

    公开(公告)号:US20250006196A1

    公开(公告)日:2025-01-02

    申请号:US18345455

    申请日:2023-06-30

    Abstract: Techniques for generating a prompt for a language model to determine an action responsive to a user input, are described. In some embodiments, the system receives a user input, determines one or more application programming interfaces (APIs) configured to perform actions that are relevant to the user input and exemplars representing examples of using the APIs with respect to user inputs similar to the current user input. The system further determines device states of devices that are determined to be related to the user input and also determines other contextual information (e.g., weather information, time of day, geographic location, etc.). The system generates a prompt including the user input, the APIs, the exemplars, the device states, and the other contextual information. A language model processes the prompt to determine an action responsive to the user input and the system causes performance of the action.

Patent Agency Ranking