Multimodal dialog state tracking and action prediction for assistant systems
Abstract:
In one embodiment, a method includes:
- receiving, from a client system associated with a user, a user request comprising a reference to a target object;
- accessing visual data from the client system, wherein the visual data comprises images portraying the target object and one or more additional objects, and wherein attribute information of the target object is recorded in a multimodal dialog state;
- resolving the reference to the target object based on the attribute information recorded in the multimodal dialog state;
- determining relational information between the target object and one or more of the additional objects portrayed in the visual data; and
- sending, to the client system, instructions for presenting a response to the user request, wherein the response comprises the attribute information and the determined relational information.
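The patent does not disclose an implementation, but the claimed steps can be sketched in ordinary code. The following is a minimal, hypothetical Python illustration: a multimodal dialog state records attribute information for objects seen in the visual data, a reference is resolved by matching mentioned attributes against the recorded ones, and relational information is derived from object positions. All class and function names (`TrackedObject`, `MultimodalDialogState`, `spatial_relation`) and the bounding-box representation are assumptions for illustration only.

```python
from dataclasses import dataclass, field

@dataclass
class TrackedObject:
    """An object portrayed in the visual data (hypothetical representation)."""
    object_id: str
    attributes: dict   # e.g. {"type": "mug", "color": "red"}
    bbox: tuple        # (x, y, width, height) in image coordinates

@dataclass
class MultimodalDialogState:
    """Records attribute information for objects, as in the claimed dialog state."""
    objects: dict = field(default_factory=dict)

    def record(self, obj: TrackedObject) -> None:
        self.objects[obj.object_id] = obj

    def resolve_reference(self, mention_attrs: dict):
        """Resolve a user's reference by best attribute match (illustrative scoring)."""
        best, best_score = None, 0
        for obj in self.objects.values():
            score = sum(1 for k, v in mention_attrs.items()
                        if obj.attributes.get(k) == v)
            if score > best_score:
                best, best_score = obj, score
        return best

def spatial_relation(a: TrackedObject, b: TrackedObject) -> str:
    """One simple kind of relational information: left/right by bbox center."""
    ax = a.bbox[0] + a.bbox[2] / 2
    bx = b.bbox[0] + b.bbox[2] / 2
    return "left of" if ax < bx else "right of"
```

For example, after recording a red mug and a set of keys from the images, a request such as "where is the red mug?" could be resolved to the mug object, and the response could combine its attributes with a relation like "left of the keys". Real systems would use learned visual grounding and coreference models rather than this exact matching.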