Patent search ap:("NVIDIA Corporation") AND inv:"Sumit Bhattacharya" Page 1

1.

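Granted invention patent
Dynamically preventing audio artifacts
Status: Active

Publication (announcement) number: US11995378B2

Publication (announcement) date: 2024-05-28

Application number: US18161326

Application date: 2023-01-30

Applicant: NVIDIA Corporation

Inventor: Utkarsh Vaidya , Sumit Bhattacharya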

IPC: G06F3/16 , G06N3/045 , G06N7/01

CPC classification number: G06F3/165 , G06F3/162 , G06N3/045 , G06N7/01

Abstract: The disclosure is directed to a process that can predict and prevent an audio artifact from occurring. The process can monitor the systems, processes, and execution threads on a larger system/device, such as a mobile or in-vehicle device. Using a learning algorithm, such as deep neural network (DNN), the information collected can generate a prediction of whether an audio artifact is likely to occur. The process can use a second learning algorithm, which also can be a DNN, to generate recommended system adjustments that can attempt to prevent the audio glitch from occurring. The recommendations can be for various systems and components on the device, such as changing the processing system frequency, the memory frequency, and the audio buffer size. After the audio artifact has been prevented, the system adjustments can be reversed fully or in steps to return the system to its state prior to the system adjustments.

2.

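Published invention application
USING A NATURAL LANGUAGE MODEL TO INTERFACE WITH A CLOSED DOMAIN SYSTEM
Status: Under examination (published)

Publication (announcement) number: US20230317067A1

Publication (announcement) date: 2023-10-05

Application number: US18329839

Application date: 2023-06-06

Applicant: NVIDIA Corporation

Inventor: Shubhadeep Das , Sumit Bhattacharya , Ratin Kumar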

IPC: G10L15/18 , G10L13/02 , G10L15/30 , G10L15/22

CPC classification number: G10L15/1815 , G10L13/02 , G10L15/30 , G10L15/22

Abstract: In various examples, systems and methods of the present disclosure combine open and closed dialog systems into an intelligent dialog management system. A text query may be processed by a natural language understanding model trained to associate the text query with a domain tag, intent classification, and/or input slots. Using the domain tag, the natural language understanding model may identify information in the text query corresponding to input slots needed for answering the text query. The text query and related information may then be passed to a dialog manager to direct the text query to the proper domain dialog system. Responses retrieved from the domain dialog system may be provided to the user via text output and/or via a text to speech component of the dialog management system.

3.

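Published invention application
END OF SPEECH DETECTION USING ONE OR MORE NEURAL NETWORKS
Status: Under examination (published)

Publication (announcement) number: US20230298579A1

Publication (announcement) date: 2023-09-21

Application number: US18202228

Application date: 2023-05-25

Applicant: NVIDIA Corporation

Inventor: Utkarsh Vaidya , Sumit Bhattacharya , Viraj Karandikar , Niranjan Wartikar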

IPC: G10L15/197 , G10L15/04 , G10L15/02 , G10L15/22 , G06N3/08 , G10L15/16

CPC classification number: G10L15/197 , G10L15/04 , G10L15/02 , G10L15/22 , G06N3/08 , G10L15/16 , G10L2015/223

Abstract: Apparatuses, systems, and techniques are presented to recognize speech in an audio signal. In particular, various embodiments can indicate an end of one or more speech segments based, at least in part, on one or more characters predicted to be within these one or more speech segments.

4.

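Published invention application
USING A NATURAL LANGUAGE MODEL TO INTERFACE WITH A CLOSED DOMAIN SYSTEM
Status: Under examination (published)

Publication (announcement) number: US20240363104A1

Publication (announcement) date: 2024-10-31

Application number: US18766466

Application date: 2024-07-08

Applicant: NVIDIA Corporation

Inventor: Shubhadeep Das , Sumit Bhattacharya , Ratin Kumar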

IPC: G10L15/18 , G10L13/02 , G10L15/22 , G10L15/30

CPC classification number: G10L15/1815 , G10L13/02 , G10L15/22 , G10L15/30

Abstract: In various examples, systems and methods of the present disclosure combine open and closed dialog systems into an intelligent dialog management system. A text query may be processed by a natural language understanding model trained to associate the text query with a domain tag, intent classification, and/or input slots. Using the domain tag, the natural language understanding model may identify information in the text query corresponding to input slots needed for answering the text query. The text query and related information may then be passed to a dialog manager to direct the text query to the proper domain dialog system. Responses retrieved from the domain dialog system may be provided to the user via text output and/or via a text to speech component of the dialog management system.

5.

Granted invention patent
Conversational AI platforms with closed domain and open domain dialog integration
Status: Active

Publication (announcement) number: US11769495B2

Publication (announcement) date: 2023-09-26

Application number: US18067217

Application date: 2022-12-16

Applicant: NVIDIA Corporation

Inventor: Shubhadeep Das , Sumit Bhattacharya , Ratin Kumar

IPC: G10L15/22 , G10L15/18 , G10L13/02 , G10L15/30

CPC classification number: G10L15/1815 , G10L13/02 , G10L15/22 , G10L15/30

Abstract: In various examples, systems and methods of the present disclosure combine open and closed dialog systems into an intelligent dialog management system. A text query may be processed by a natural language understanding model trained to associate the text query with a domain tag, intent classification, and/or input slots. Using the domain tag, the natural language understanding model may identify information in the text query corresponding to input slots needed for answering the text query. The text query and related information may then be passed to a dialog manager to direct the text query to the proper domain dialog system. Responses retrieved from the domain dialog system may be provided to the user via text output and/or via a text to speech component of the dialog management system.

6.

Invention application
SYSTEMS AND METHODS FOR PERFORMING COMMANDS IN A VEHICLE USING SPEECH AND IMAGE RECOGNITION
Status: Active

Publication (announcement) number: US20230095988A1

Publication (announcement) date: 2023-03-30

Application number: US18062163

Application date: 2022-12-06

Applicant: NVIDIA Corporation

Inventor: Sumit Bhattacharya , Jason Conrad Roche , Niranjan Avadhanam

IPC: B60R25/01 , B60R25/25 , B60R25/30 , G05B13/02 , G06F21/32 , G06N3/08 , G10L17/00 , G10L17/06 , G10L17/18 , G06V10/25 , G06V20/59

Abstract: Systems and methods are disclosed herein for implementation of a vehicle command operation system that may use multi-modal technology to authenticate an occupant of the vehicle to authorize a command and receive natural language commands for vehicular operations. The system may utilize sensors to receive data indicative of a voice command from an occupant of the vehicle. The system may receive second sensor data to aid in the determination of the corresponding vehicular operation in response to the received command. The system may retrieve authentication data for the occupants of the vehicle. The system authenticates the occupant to authorize a vehicular operation command using a neural network based on at least one of the first sensor data, the second sensor data, and the authentication data. Responsive to the authentication, the system may authorize the operation to be performed in the vehicle based on the vehicular operation command.

7.

Invention application
SYSTEMS AND METHODS FOR PEDESTRIAN CROSSING RISK ASSESSMENT AND DIRECTIONAL WARNING
Status: Active

Publication (announcement) number: US20220012988A1

Publication (announcement) date: 2022-01-13

Application number: US16922601

Application date: 2020-07-07

Applicant: NVIDIA Corporation

Inventor: Niranjan Avadhanam , Sumit Bhattacharya , Atousa Torabi , Jason Conrad Roche

IPC: G08B3/10 , G08G1/005 , G06N3/08

Abstract: Systems and methods are disclosed herein for a pedestrian crossing warning system that may use multi-modal technology to determine attributes of a person and provide a warning to the person in response to a calculated risk level to effect a reduction of the risk level. The system may utilize sensors to receive data indicative of a trajectory of a person external to the vehicle. Specific attributes of the person such as age or walking aids may be determined. Based on the trajectory data and the specific attributes, a risk level may be determined by the system using a machine learning model. The system may cause emission of a warning to the person in response to the risk level.

8.

Granted invention patent
Systems and methods for pedestrian crossing risk assessment and directional warning
Status: Active

Publication (announcement) number: US11682272B2

Publication (announcement) date: 2023-06-20

Application number: US16922601

Application date: 2020-07-07

Applicant: NVIDIA Corporation

Inventor: Niranjan Avadhanam , Sumit Bhattacharya , Atousa Torabi , Jason Conrad Roche

IPC: G08B3/10 , G06N3/08 , G08G1/005

CPC classification number: G08B3/10 , G06N3/08 , G08G1/005

Abstract: Systems and methods are disclosed herein for a pedestrian crossing warning system that may use multi-modal technology to determine attributes of a person and provide a warning to the person in response to a calculated risk level to effect a reduction of the risk level. The system may utilize sensors to receive data indicative of a trajectory of a person external to the vehicle. Specific attributes of the person such as age or walking aids may be determined. Based on the trajectory data and the specific attributes, a risk level may be determined by the system using a machine learning model. The system may cause emission of a warning to the person in response to the risk level.

9.

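Published invention application
DYNAMICALLY PREVENTING AUDIO ARTIFACTS
Status: Under examination (published)

Publication (announcement) number: US20230168857A1

Publication (announcement) date: 2023-06-01

Application number: US18161326

Application date: 2023-01-30

Applicant: NVIDIA Corporation

Inventor: Utkarsh Vaidya , Sumit Bhattacharya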

IPC: G06F3/16 , G06N3/045 , G06N7/01

CPC classification number: G06F3/165 , G06F3/162 , G06N3/045 , G06N7/01

Abstract: The disclosure is directed to a process that can predict and prevent an audio artifact from occurring. The process can monitor the systems, processes, and execution threads on a larger system/device, such as a mobile or in-vehicle device. Using a learning algorithm, such as deep neural network (DNN), the information collected can generate a prediction of whether an audio artifact is likely to occur. The process can use a second learning algorithm, which also can be a DNN, to generate recommended system adjustments that can attempt to prevent the audio glitch from occurring. The recommendations can be for various systems and components on the device, such as changing the processing system frequency, the memory frequency, and the audio buffer size. After the audio artifact has been prevented, the system adjustments can be reversed fully or in steps to return the system to its state prior to the system adjustments.

10.

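Invention application
DYNAMICALLY PREVENTING AUDIO ARTIFACTS
Status: Active

Publication (announcement) number: US20210103425A1

Publication (announcement) date: 2021-04-08

Application number: US17121373

Application date: 2020-12-14

Applicant: Nvidia Corporation

Inventor: Utkarsh Vaidya , Sumit Bhattacharya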

IPC: G06F3/16 , G06N7/00 , G06N3/04

Abstract: The disclosure is directed to a process that can predict and prevent an audio artifact from occurring. The process can monitor the systems, processes, and execution threads on a larger system/device, such as a mobile or in-vehicle device. Using a learning algorithm, such as deep neural network (DNN), the information collected can generate a prediction of whether an audio artifact is likely to occur. The process can use a second learning algorithm, which also can be a DNN, to generate recommended system adjustments that can attempt to prevent the audio glitch from occurring. The recommendations can be for various systems and components on the device, such as changing the processing system frequency, the memory frequency, and the audio buffer size. After the audio artifact has been prevented, the system adjustments can be reversed fully or in steps to return the system to its state prior to the system adjustments.
