LARGE LANGUAGE MODEL OUTPUT ENTAILMENT

    公开(公告)号:US20250094456A1

    公开(公告)日:2025-03-20

    申请号:US18887751

    申请日:2024-09-17

    Applicant: GOOGLE LLC

    Abstract: Implementations are described herein for identifying potentially false information in generative model output by performing entailment evaluation of generative model output. In various implementations, data indicative of a query may be processed to generate generative model output. Textual fragments may be extracted from the generative model output, and a subset of the textual fragments may be classified as being suitable for textual entailment analysis. Textual entailment analysis may be performed on each textual fragment of the subset, including formulating a search query based on the textual fragment, retrieving document(s) responsive to the search query, and processing the textual fragment and the document(s) using entailment machine learning model(s) to generate prediction(s) of whether the at least one document corroborates or contradicts the textual fragment. When natural language (NL) responsive to the query is rendered at a client device, annotation(s) may be rendered to express the prediction(s).

    GENERATING NEURAL NETWORK OUTPUTS USING INSERTION COMMANDS

    公开(公告)号:US20200372356A1

    公开(公告)日:2020-11-26

    申请号:US16883772

    申请日:2020-05-26

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.

    GENERATING NEURAL NETWORK OUTPUTS USING INSERTION COMMANDS

    公开(公告)号:US20240028893A1

    公开(公告)日:2024-01-25

    申请号:US18321696

    申请日:2023-05-22

    Applicant: Google LLC

    CPC classification number: G06N3/08 G06F40/237 G06N3/04 G06N3/084

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.

    Retrieval-augmented language model pre-training and fine-tuning

    公开(公告)号:US11003865B1

    公开(公告)日:2021-05-11

    申请号:US16879457

    申请日:2020-05-20

    Applicant: Google LLC

    Abstract: Systems and methods for pre-training and fine-tuning of neural-network-based language models are disclosed in which a neural-network-based textual knowledge retriever is trained along with the language model. In some examples, the knowledge retriever obtains documents from an unlabeled pre-training corpus, generates its own training tasks, and learns to retrieve documents relevant to those tasks. In some examples, the knowledge retriever is further refined using supervised open-QA questions. The framework of the present technology provides models that can intelligently retrieve helpful information from a large unlabeled corpus, rather than requiring all potentially relevant information to be stored implicitly in the parameters of the neural network. This framework may thus reduce the storage space and complexity of the neural network, and also enable the model to more effectively handle new tasks that may be different than those on which it was pre-trained.

    Generating neural network outputs using insertion commands

    公开(公告)号:US12086715B2

    公开(公告)日:2024-09-10

    申请号:US18321696

    申请日:2023-05-22

    Applicant: Google LLC

    CPC classification number: G06N3/08 G06F40/237 G06N3/04 G06N3/084

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.

    PERFORMING MACHINE LEARNING TASKS USING INSTRUCTION-TUNED NEURAL NETWORKS

    公开(公告)号:US20230205994A1

    公开(公告)日:2023-06-29

    申请号:US17561581

    申请日:2021-12-23

    Applicant: Google LLC

    CPC classification number: G06F40/284 G06F40/30 G06N3/10 G06N5/04

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing a machine learning task on an input to generate an output. In one aspect, one of the method includes receiving input data that describes an input of a machine learning task; receiving candidate output data that describes a set of candidate classification outputs of the machine learning task for the input; generating an input sequence that includes the input and the set of candidate classification outputs; processing the input sequence using a neural network to generate a network output that specifies a respective score for each candidate classification output in the set of candidate classification outputs; and generating an output of the machine learning task for the input, comprising selecting, as the output, a selected candidate classification output from the set of candidate classification outputs using the respective scores.

    Generating neural network outputs using insertion commands

    公开(公告)号:US11657277B2

    公开(公告)日:2023-05-23

    申请号:US16883772

    申请日:2020-05-26

    Applicant: Google LLC

    CPC classification number: G06N3/08 G06F40/237 G06N3/04 G06N3/084

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.

Patent Agency Ranking