-
1.
Publication No.: US20240184814A1
Publication date: 2024-06-06
Application No.: US18173622
Filing date: 2023-02-23
Applicant: NVIDIA Corporation
Inventor: Shubhadeep Das, Sumit Kumar Bhattacharya, Oluwatobi Olabiyi
IPC: G06F16/332
CPC classification number: G06F16/3329
Abstract: In various examples, hybrid models for determining intents in conversational AI systems and applications are disclosed. Systems and methods are disclosed that use a machine learning model(s) and a data file(s) that associates requests (e.g., questions) with responses (e.g., answers) in order to generate final responses to requests. For instance, the machine learning model(s) may determine confidence scores that indicate similarities between the requests from the data file(s) and an input request represented by text data. The data file(s) is then used to determine, based on the confidence scores, one of the responses that is associated with one of the requests that is related to the input request. Additionally, the response may then be used to generate a final response to the input request.
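As an illustration only, the request-matching flow described in this abstract might be sketched as below. This is a minimal sketch under stated assumptions, not the patented method: the `FAQ` mapping stands in for the data file, the token-overlap `confidence` function stands in for the machine learning model, and the `threshold` value is hypothetical.

```python
from typing import Dict, Optional, Tuple

# Hypothetical stand-in for the data file that associates requests with responses.
FAQ: Dict[str, str] = {
    "what are your opening hours": "We are open from 9am to 5pm.",
    "where are you located": "We are at 1 Example Street.",
}


def confidence(a: str, b: str) -> float:
    """Toy similarity score (token Jaccard); a real system would use a learned model."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    if not tokens_a or not tokens_b:
        return 0.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def answer(input_request: str, threshold: float = 0.3) -> Optional[Tuple[str, float]]:
    """Return the stored response whose request best matches the input, or None."""
    # Score every stored request against the input request.
    scored = [(confidence(input_request, stored), stored) for stored in FAQ]
    best_score, best_request = max(scored)
    # Assumed policy: decline to answer when no stored request is similar enough.
    if best_score < threshold:
        return None
    return FAQ[best_request], best_score
```

Calling `answer("what are the opening hours")` selects the response stored for the opening-hours request, while an unrelated input such as `"tell me a joke"` falls below the assumed threshold and yields `None`.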
-
2.
Publication No.: US20230205797A1
Publication date: 2023-06-29
Application No.: US18173610
Filing date: 2023-02-23
Applicant: NVIDIA Corporation
Inventor: Shubhadeep Das, Sumit Kumar Bhattacharya, Oluwatobi Olabiyi
IPC: G06F16/332
CPC classification number: G06F16/3329
Abstract: In various examples, hybrid models for determining intents in conversational AI systems and applications are disclosed. Systems and methods are disclosed that use a machine learning model(s) and a data file(s) that associates intents with one another (e.g., using a tree-like structure) in order to determine a final intent associated with text. For example, the text may initially be processed using the machine learning model(s) (e.g., a first machine learning model) in order to determine a first intent associated with the text. The data file(s) may then be used to determine information (e.g., anchors) for one or more second intents (e.g., one or more sub-intents) that are related to the first intent. The text and the information may then be processed using the machine learning model(s) (e.g., a second machine learning model) to determine a second intent, from the one or more second intents, that is associated with the text.
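The two-stage intent resolution described in this abstract could be sketched roughly as follows. Everything here is an assumption for illustration: `INTENT_TREE` stands in for the tree-like data file, `TOP_LEVEL_KEYWORDS` and the keyword-overlap scoring stand in for the first and second machine learning models, and the intent names are invented.

```python
from typing import Dict, List, Tuple

# Hypothetical intent tree: top-level intent -> sub-intent -> anchor words.
INTENT_TREE: Dict[str, Dict[str, List[str]]] = {
    "order_food": {
        "order_pizza": ["pizza", "margherita"],
        "order_drink": ["cola", "water", "drink"],
    },
    "check_weather": {
        "weather_today": ["today", "now"],
        "weather_forecast": ["tomorrow", "week"],
    },
}

# Hypothetical keywords standing in for the first model's knowledge.
TOP_LEVEL_KEYWORDS: Dict[str, List[str]] = {
    "order_food": ["order", "eat", "hungry"],
    "check_weather": ["weather", "rain", "sunny"],
}


def overlap(text: str, words: List[str]) -> int:
    """Count how many of the given words occur as tokens in the text."""
    tokens = set(text.lower().split())
    return sum(1 for word in words if word in tokens)


def first_intent(text: str) -> str:
    """Stand-in for the first model: pick the top-level intent."""
    return max(TOP_LEVEL_KEYWORDS, key=lambda i: overlap(text, TOP_LEVEL_KEYWORDS[i]))


def second_intent(text: str, parent: str) -> str:
    """Stand-in for the second model: compare the text with each sub-intent's anchors."""
    anchors = INTENT_TREE[parent]  # information looked up from the data file
    return max(anchors, key=lambda sub: overlap(text, anchors[sub]))


def resolve(text: str) -> Tuple[str, str]:
    """Resolve a (first intent, second intent) pair for the text."""
    parent = first_intent(text)
    return parent, second_intent(text, parent)
```

For example, `resolve("I want to order a pizza")` first settles on `order_food` and then, using only the anchors stored under that intent, on `order_pizza`.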
-
3.
Publication No.: US20250124734A1
Publication date: 2025-04-17
Application No.: US18999826
Filing date: 2024-12-23
Applicant: Nvidia Corporation
Inventor: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
IPC: G06V40/10, G06F3/01, G06F16/632, G06T7/73, G06T15/06
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
-
4.
Publication No.: US12211308B2
Publication date: 2025-01-28
Application No.: US17462833
Filing date: 2021-08-31
Applicant: Nvidia Corporation
Inventor: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
IPC: G06V40/10, G06F3/01, G06F16/632, G06T7/73, G06T15/06
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
-
5.
Publication No.: US20240370690A1
Publication date: 2024-11-07
Application No.: US18309890
Filing date: 2023-05-01
Applicant: NVIDIA Corporation
Abstract: In various examples, query response generation using entity linking for conversational AI systems and applications is described herein. Systems and methods are disclosed that generate embeddings associated with entities that a dialogue system is trained to interpret. The systems and methods may then use the embeddings to interpret requests. For instance, when receiving a request, the systems and methods may generate at least an embedding for an entity included in the request and compare the embedding to the stored embeddings in order to determine that the entity from the request is related to one of the stored entities. The systems and methods may then use this relationship to generate the response to the query. This way, even if the entity is not an exact match to a stored entity, the systems and methods are still able to interpret the query from the user.
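The embedding comparison described in this abstract might look roughly like the sketch below. It is illustrative only: the character-trigram `embed` function is a toy stand-in for a learned embedding model, and the stored entity names are invented.

```python
import math
from collections import Counter
from typing import Dict, Tuple


def embed(text: str) -> Counter:
    """Toy embedding: counts of character trigrams (a real system would use a learned model)."""
    padded = f"  {text.lower()} "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical entities the dialogue system already knows, embedded ahead of time.
STORED: Dict[str, Counter] = {
    name: embed(name) for name in ["cheeseburger", "french fries", "milkshake"]
}


def link_entity(mention: str) -> Tuple[str, float]:
    """Return the stored entity closest to the mention and its similarity score."""
    query = embed(mention)
    best = max(STORED, key=lambda name: cosine(query, STORED[name]))
    return best, cosine(query, STORED[best])
```

With these assumptions, an inexact mention such as `"cheese burger"` still links to the stored entity `"cheeseburger"`, which is the behavior the abstract describes for non-exact matches.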
-
6.
Publication No.: US20240176808A1
Publication date: 2024-05-30
Application No.: US18172571
Filing date: 2023-02-22
Applicant: NVIDIA Corporation
Inventor: Shubhadeep Das, Sumit Kumar Bhattacharya, Oluwatobi Olabiyi
IPC: G06F16/33, G06F16/338
CPC classification number: G06F16/3344, G06F16/338
Abstract: In various examples, contextual data may be generated using structured and unstructured data for conversational AI systems and applications. Systems and methods are disclosed that use structured data (converted to unstructured form) and unstructured data, such as from a knowledge database(s), to generate contextual data. For instance, the contextual data may represent text (e.g., narratives), where a first portion of the text is generated using the structured data and a second portion of the text is generated using the unstructured data. The systems and methods may then use a neural network(s), such as a neural network(s) associated with a dialogue manager, to process input data representing a request (e.g., a query) and the contextual data in order to generate a response to the request. For instance, if the request includes a query for information associated with a topic, the neural network(s) may generate a response that includes the requested information.
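One way to picture the contextual-data step in this abstract is the sketch below. It assumes a simple sentence template for converting structured records to unstructured text; the function names, record fields, and the prompt layout passed to the downstream neural network are all hypothetical.

```python
from typing import Dict, List


def structured_to_text(record: Dict[str, str], subject_key: str = "name") -> str:
    """Convert one structured record into plain sentences (assumed template)."""
    subject = record[subject_key]
    facts = [
        f"The {field} of {subject} is {value}."
        for field, value in record.items()
        if field != subject_key
    ]
    return " ".join(facts)


def build_context(records: List[Dict[str, str]], passages: List[str]) -> str:
    """Join text derived from structured data with unstructured passages."""
    first_portion = " ".join(structured_to_text(r) for r in records)
    second_portion = " ".join(passages)
    return f"{first_portion} {second_portion}".strip()


def build_model_input(query: str, context: str) -> str:
    """Assumed input layout for the network that generates the response."""
    return f"Context: {context}\nQuestion: {query}"
```

For instance, a record `{"name": "Cafe Example", "opening time": "9am"}` combined with the passage `"Cafe Example is known for its espresso."` produces a single narrative whose first portion comes from the structured data and whose second portion comes from the unstructured data.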
-
7.
Publication No.: US11817117B2
Publication date: 2023-11-14
Application No.: US17162907
Filing date: 2021-01-29
Applicant: NVIDIA Corporation
Inventor: Utkarsh Vaidya, Ravindra Yeshwant Lokhande, Viraj Gangadhar Karandikar, Niranjan Rajendra Wartikar, Sumit Kumar Bhattacharya
CPC classification number: G10L25/78, G06N3/02, G10L25/30, G10L2025/786
Abstract: In various examples, end of speech (EOS) for an audio signal is determined based at least in part on a rate of speech for a speaker. For a segment of the audio signal, EOS is indicated based at least in part on an EOS threshold determined based at least in part on the rate of speech for the speaker.
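A rate-dependent EOS threshold of the kind described in this abstract could be sketched as below. The abstract does not give a formula, so the inverse scaling, the default values, and the function names are assumptions chosen only to illustrate the idea that faster speakers get a shorter silence threshold.

```python
def eos_threshold_ms(
    words_per_second: float,
    base_ms: float = 800.0,        # assumed threshold at the reference rate
    reference_rate: float = 2.5,   # assumed typical speaking rate
    min_ms: float = 300.0,         # assumed lower clamp
    max_ms: float = 1500.0,        # assumed upper clamp
) -> float:
    """Silence duration that signals end of speech, scaled by the speaker's rate."""
    if words_per_second <= 0:
        # No rate estimate yet: fall back to the most patient threshold.
        return max_ms
    scaled = base_ms * reference_rate / words_per_second
    return max(min_ms, min(max_ms, scaled))


def is_end_of_speech(trailing_silence_ms: float, words_per_second: float) -> bool:
    """Indicate EOS for a segment once its trailing silence reaches the threshold."""
    return trailing_silence_ms >= eos_threshold_ms(words_per_second)
```

Under these assumed defaults, 500 ms of trailing silence counts as end of speech for a speaker at 5 words per second but not for one at 2.5 words per second.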
-
8.
Publication No.: US20230064049A1
Publication date: 2023-03-02
Application No.: US17462833
Filing date: 2021-08-31
Applicant: Nvidia Corporation
Inventor: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
IPC: G06K9/00, G06T7/73, G06F3/01, G06F16/632, G06T15/06
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
-
9.
Publication No.: US20220246167A1
Publication date: 2022-08-04
Application No.: US17162907
Filing date: 2021-01-29
Applicant: NVIDIA Corporation
Inventor: Utkarsh Vaidya, Ravindra Yeshwant Lokhande, Viraj Gangadhar Karandikar, Niranjan Rajendra Wartikar, Sumit Kumar Bhattacharya
Abstract: In various examples, end of speech (EOS) for an audio signal is determined based at least in part on a rate of speech for a speaker. For a segment of the audio signal, EOS is indicated based at least in part on an EOS threshold determined based at least in part on the rate of speech for the speaker.