Patent search ap:("Nvidia Corporation") AND inv:"Sumit Kumar Bhattacharya" Page 1

1.

发明公开
DETERMINING INTENTS AND RESPONSES USING MACHINE LEARNING IN CONVERSATIONAL AI SYSTEMS AND APPLICATIONS 审中-公开

公开(公告)号：US20240184814A1

公开(公告)日：2024-06-06

申请号：US18173622

申请日：2023-02-23

Applicant: NVIDIA Corporation

Inventor： Shubhadeep Das , Sumit Kumar Bhattacharya , Oluwatobi Olabiyi

IPC: G06F16/332

CPC classification number: G06F16/3329

Abstract: In various examples, hybrid models for determining intents in conversational AI systems and applications are disclosed. Systems and methods are disclosed that use a machine learning model(s) and a data file(s) that associates requests (e.g., questions) with responses (e.g., answers) in order to generate final responses to requests. For instance, the machine learning model(s) may determine confidence scores that indicate similarities between the requests from the data file(s) and an input request represented by text data. The data file(s) is then used to determine, based on the confidence scores, one of the responses that is associated with one of the requests that is related to the input request. Additionally, the response may then used to generate a final response to the input request.

2.

发明公开
DETERMINING INTENTS AND RESPONSES USING MACHINE LEARNING IN CONVERSATIONAL AI SYSTEMS AND APPLICATIONS 审中-公开

公开(公告)号：US20230205797A1

公开(公告)日：2023-06-29

申请号：US18173610

申请日：2023-02-23

Applicant: NVIDIA Corporation

Inventor： Shubhadeep Das , Sumit Kumar Bhattacharya , Oluwatobi Olabiyi

IPC: G06F16/332

CPC classification number: G06F16/3329

Abstract: In various examples, hybrid models for determining intents in conversational AI systems and applications are disclosed. Systems and methods are disclosed that use a machine learning model(s) and a data file(s) that associates intents with one another (e.g., using a tree-like structure) in order to determine a final intent associated with text. For example, the text may initially be processed using the machine learning model(s) (e.g., a first machine learning model) in order to determine a first intent associated with the text. The data file(s) may then be used to determine information (e.g., anchors) for one or more second intents (e.g., one or more sub-intents) that are related to the first intent. The text and the information may then be processed using the machine learning model(s) (e.g., a second machine learning model) to determine a second intent, from the one or more second intents, that is associated with the text.

3.

发明申请
MULTI-MODAL SENSOR FUSION FOR CONTENT IDENTIFICATION IN APPLICATIONS OF HUMAN-MACHINE INTERFACES 有权

公开(公告)号：US20250124734A1

公开(公告)日：2025-04-17

申请号：US18999826

申请日：2024-12-23

Applicant: Nvidia Corporation

Inventor： Sakthivel Sivaraman , Nishant Puri , Yuzhuo Ren , Atousa Torabi , Shubhadeep Das , Niranjan Avadhanam , Sumit Kumar Bhattacharya , Jason Roche

IPC: G06V40/10 , G06F3/01 , G06F16/632 , G06T7/73 , G06T15/06

Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.

4.

发明授权
Multi-modal sensor fusion for content identification in applications of human-machine interfaces 有权

公开(公告)号：US12211308B2

公开(公告)日：2025-01-28

申请号：US17462833

申请日：2021-08-31

Applicant: Nvidia Corporation

Inventor： Sakthivel Sivaraman , Nishant Puri , Yuzhuo Ren , Atousa Torabi , Shubhadeep Das , Niranjan Avadhanam , Sumit Kumar Bhattacharya , Jason Roche

IPC: G06V40/10 , G06F3/01 , G06F16/632 , G06T7/73 , G06T15/06

Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.

5.

发明申请
ENTITY LINKING FOR RESPONSE GENERATION IN CONVERSATIONAL AI SYSTEMS AND APPLICATIONS 有权

公开(公告)号：US20240370690A1

公开(公告)日：2024-11-07

申请号：US18309890

申请日：2023-05-01

Applicant: NVIDIA Corporation

Inventor： Sagar Bogadi Manjunath , Shubhadeep Das , Sumit Kumar Bhattacharya , Oluwatobi Olabiyi

IPC: G06N3/006 , G06N3/045

Abstract: In various examples, query response generation using entity linking for conversational AI systems and applications is described herein. Systems and methods are disclosed that generate embeddings associated with entities that a dialogue system is trained to interpret. The systems and methods may then use the embeddings to interpret requests. For instance, when receiving a request, the systems and methods may generate at least an embedding for an entity included in the request and compare the embedding to the stored embeddings in order to determine that the entity from the request is related to one of the stored entities. The systems and methods may then use this relationship to generate the response to the query. This way, even if the entity is not an exact match to a stored entity, the systems and methods are still able to interpret the query from the user.

6.

发明公开
QUERY RESPONSE GENERATION USING STRUCTURED AND UNSTRUCTURED DATA FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS 审中-公开

公开(公告)号：US20240176808A1

公开(公告)日：2024-05-30

申请号：US18172571

申请日：2023-02-22

Applicant: NVIDIA Corporation

Inventor： Shubhadeep Das , Sumit Kumar Bhattacharya , Oluwatobi Olabiyi

IPC: G06F16/33 , G06F16/338

CPC classification number: G06F16/3344 , G06F16/338

Abstract: In various examples, contextual data may be generated using structured and unstructured data for conversational AI systems and applications. Systems and methods are disclosed that use structured data (converted to unstructured form) and unstructured data, such as from a knowledge database(s), to generate contextual data. For instance, the contextual data may represent text (e.g., narratives), where a first portion of the text is generated using the structured data and a second portion of the text is generated using the unstructured data. The systems and methods may then use a neural network(s), such as a neural network(s) associated with a dialogue manager, to process input data representing a request (e.g., a query) and the contextual data in order to generate a response to the request. For instance, if the request includes a query for information associated with a topic, the neural network(s) may generate a response that includes the requested information.

7.

发明授权
Speaker adaptive end of speech detection for conversational AI applications 有权

公开(公告)号：US11817117B2

公开(公告)日：2023-11-14

申请号：US17162907

申请日：2021-01-29

Applicant: NVIDIA Corporation

Inventor： Utkarsh Vaidya , Ravindra Yeshwant Lokhande , Viraj Gangadhar Karandikar , Niranjan Rajendra Wartikar , Sumit Kumar Bhattacharya

IPC: G10L25/78 , G10L25/30 , G06N3/02

CPC classification number: G10L25/78 , G06N3/02 , G10L25/30 , G10L2025/786

Abstract: In various examples, end of speech (EOS) for an audio signal is determined based at least in part on a rate of speech for a speaker. For a segment of the audio signal, EOS is indicated based at least in part on an EOS threshold determined based at least in part on the rate of speech for the speaker.

8.

发明申请
MULTI-MODAL SENSOR FUSION FOR CONTENT IDENTIFICATION IN APPLICATIONS OF HUMAN-MACHINE INTERFACES 有权

公开(公告)号：US20230064049A1

公开(公告)日：2023-03-02

申请号：US17462833

申请日：2021-08-31

Applicant: Nvidia Corporation

Inventor： Sakthivel Sivaraman , Nishant Puri , Yuzhuo Ren , Atousa Torabi , Shubhadeep Das , Niranjan Avadhanam , Sumit Kumar Bhattacharya , Jason Roche

IPC: G06K9/00 , G06T7/73 , G06F3/01 , G06F16/632 , G06T15/06

Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.

9.

发明申请
SPEAKER ADAPTIVE END OF SPEECH DETECTION FOR CONVERSATIONAL AI APPLICATIONS 有权

公开(公告)号：US20220246167A1

公开(公告)日：2022-08-04

申请号：US17162907

申请日：2021-01-29

Applicant: NVIDIA Corporation

Inventor： Utkarsh Vaidya , Ravindra Yeshwant Lokhande , Viraj Gangadhar Karandikar , Niranjan Rajendra Wartikar , Sumit Kumar Bhattacharya

IPC: G10L25/78 , G10L25/30 , G06N3/02

Abstract: In various examples, end of speech (EOS) for an audio signal is determined based at least in part on a rate of speech for a speaker. For a segment of the audio signal, EOS is indicated based at least in part on an EOS threshold determined based at least in part on the rate of speech for the speaker.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification