-
公开(公告)号:US20230128422A1
公开(公告)日:2023-04-27
申请号:US18050037
申请日:2022-10-26
Applicant: Meta Platforms, Inc.
Inventor: Mengxi Li , Aaron Jackson , Julien Philippe Gilbert Odent
IPC: G10L15/22 , G10L15/18 , G06F3/01 , H04R3/00 , G06F3/0482 , G06V20/20 , G06V20/62 , G06F40/58 , G06F3/16 , G10L15/24
Abstract: In one embodiment, a method includes receiving, by a XR display device, a gesture-based input from a first user of the XR display device, processing, using a gesture-detection model, the gesture-based input to identify a first gesture, receiving, by the XR display device, an audio input from the first user, where the audio input includes a first voice command, processing, using a natural-language model, the audio input to identify one or more intents or one or more slots associated with the first voice command, determining whether the identified first gesture matches the first voice command, and executing, responsive to the identified first gesture matching the first voice command and by the XR display device, a first task corresponding to the first voice command based on the identified first gesture and the identified one or more intents or one or more slots.