-
公开(公告)号:US11995378B2
公开(公告)日:2024-05-28
申请号:US18161326
申请日:2023-01-30
Applicant: NVIDIA Corporation
Inventor: Utkarsh Vaidya , Sumit Bhattacharya
Abstract: The disclosure is directed to a process that can predict and prevent an audio artifact from occurring. The process can monitor the systems, processes, and execution threads on a larger system/device, such as a mobile or in-vehicle device. Using a learning algorithm, such as deep neural network (DNN), the information collected can generate a prediction of whether an audio artifact is likely to occur. The process can use a second learning algorithm, which also can be a DNN, to generate recommended system adjustments that can attempt to prevent the audio glitch from occurring. The recommendations can be for various systems and components on the device, such as changing the processing system frequency, the memory frequency, and the audio buffer size. After the audio artifact has been prevented, the system adjustments can be reversed fully or in steps to return the system to its state prior to the system adjustments.
-
公开(公告)号:US20230317067A1
公开(公告)日:2023-10-05
申请号:US18329839
申请日:2023-06-06
Applicant: NVIDIA Corporation
Inventor: Shubhadeep Das , Sumit Bhattacharya , Ratin Kumar
CPC classification number: G10L15/1815 , G10L13/02 , G10L15/30 , G10L15/22
Abstract: In various examples, systems and methods of the present disclosure combine open and closed dialog systems into an intelligent dialog management system. A text query may be processed by a natural language understanding model trained to associate the text query with a domain tag, intent classification, and/or input slots. Using the domain tag, the natural language understanding model may identify information in the text query corresponding to input slots needed for answering the text query. The text query and related information may then be passed to a dialog manager to direct the text query to the proper domain dialog system. Responses retrieved from the domain dialog system may be provided to the user via text output and/or via a text to speech component of the dialog management system.
-
公开(公告)号:US20230298579A1
公开(公告)日:2023-09-21
申请号:US18202228
申请日:2023-05-25
Applicant: NVIDIA Corporation
Inventor: Utkarsh Vaidya , Sumit Bhattacharya , Viraj Karandikar , Niranjan Wartikar
CPC classification number: G10L15/197 , G10L15/04 , G10L15/02 , G10L15/22 , G06N3/08 , G10L15/16 , G10L2015/223
Abstract: Apparatuses, systems, and techniques are presented to recognize speech in an audio signal. In particular, various embodiments can indicate an end of one or more speech segments based, at least in part, on one or more characters predicted to be within these one or more speech segments.
-
公开(公告)号:US20240363104A1
公开(公告)日:2024-10-31
申请号:US18766466
申请日:2024-07-08
Applicant: NVIDIA Corporation
Inventor: Shubhadeep Das , Sumit Bhattacharya , Ratin Kumar
CPC classification number: G10L15/1815 , G10L13/02 , G10L15/22 , G10L15/30
Abstract: In various examples, systems and methods of the present disclosure combine open and closed dialog systems into an intelligent dialog management system. A text query may be processed by a natural language understanding model trained to associate the text query with a domain tag, intent classification, and/or input slots. Using the domain tag, the natural language understanding model may identify information in the text query corresponding to input slots needed for answering the text query. The text query and related information may then be passed to a dialog manager to direct the text query to the proper domain dialog system. Responses retrieved from the domain dialog system may be provided to the user via text output and/or via a text to speech component of the dialog management system.
-
公开(公告)号:US11769495B2
公开(公告)日:2023-09-26
申请号:US18067217
申请日:2022-12-16
Applicant: NVIDIA Corporation
Inventor: Shubhadeep Das , Sumit Bhattacharya , Ratin Kumar
CPC classification number: G10L15/1815 , G10L13/02 , G10L15/22 , G10L15/30
Abstract: In various examples, systems and methods of the present disclosure combine open and closed dialog systems into an intelligent dialog management system. A text query may be processed by a natural language understanding model trained to associate the text query with a domain tag, intent classification, and/or input slots. Using the domain tag, the natural language understanding model may identify information in the text query corresponding to input slots needed for answering the text query. The text query and related information may then be passed to a dialog manager to direct the text query to the proper domain dialog system. Responses retrieved from the domain dialog system may be provided to the user via text output and/or via a text to speech component of the dialog management system.
-
6.
公开(公告)号:US20230095988A1
公开(公告)日:2023-03-30
申请号:US18062163
申请日:2022-12-06
Applicant: NVIDIA Corporation
Inventor: Sumit Bhattacharya , Jason Conrad Roche , Niranjan Avadhanam
IPC: B60R25/01 , B60R25/25 , B60R25/30 , G05B13/02 , G06F21/32 , G06N3/08 , G10L17/00 , G10L17/06 , G10L17/18 , G06V10/25 , G06V20/59
Abstract: Systems and methods are disclosed herein for implementation of a vehicle command operation system that may use multi-modal technology to authenticate an occupant of the vehicle to authorize a command and receive natural language commands for vehicular operations. The system may utilize sensors to receive data indicative of a voice command from an occupant of the vehicle. The system may receive second sensor data to aid in the determination of the corresponding vehicular operation in response to the received command. The system may retrieve authentication data for the occupants of the vehicle. The system authenticates the occupant to authorize a vehicular operation command using a neural network based on at least one of the first sensor data, the second sensor data, and the authentication data. Responsive to the authentication, the system may authorize the operation to be performed in the vehicle based on the vehicular operation command.
-
公开(公告)号:US20220012988A1
公开(公告)日:2022-01-13
申请号:US16922601
申请日:2020-07-07
Applicant: NVIDIA Corporation
Inventor: Niranjan Avadhanam , Sumit Bhattacharya , Atousa Torabi , Jason Conrad Roche
Abstract: Systems and methods are disclosed herein for a pedestrian crossing warning system that may use multi-modal technology to determine attributes of a person and provide a warning to the person in response to a calculated risk level to effect a reduction of the risk level. The system may utilize sensors to receive data indicative of a trajectory of a person external to the vehicle. Specific attributes of the person such as age or walking aids may be determined. Based on the trajectory data and the specific attributes, a risk level may be determined by the system using a machine learning model. The system may cause emission of a warning to the person in response to the risk level.
-
公开(公告)号:US11682272B2
公开(公告)日:2023-06-20
申请号:US16922601
申请日:2020-07-07
Applicant: NVIDIA Corporation
Inventor: Niranjan Avadhanam , Sumit Bhattacharya , Atousa Torabi , Jason Conrad Roche
Abstract: Systems and methods are disclosed herein for a pedestrian crossing warning system that may use multi-modal technology to determine attributes of a person and provide a warning to the person in response to a calculated risk level to effect a reduction of the risk level. The system may utilize sensors to receive data indicative of a trajectory of a person external to the vehicle. Specific attributes of the person such as age or walking aids may be determined. Based on the trajectory data and the specific attributes, a risk level may be determined by the system using a machine learning model. The system may cause emission of a warning to the person in response to the risk level.
-
公开(公告)号:US20230168857A1
公开(公告)日:2023-06-01
申请号:US18161326
申请日:2023-01-30
Applicant: NVIDIA Corporation
Inventor: Utkarsh Vaidya , Sumit Bhattacharya
Abstract: The disclosure is directed to a process that can predict and prevent an audio artifact from occurring. The process can monitor the systems, processes, and execution threads on a larger system/ device, such as a mobile or in-vehicle device. Using a learning algorithm, such as deep neural network (DNN), the information collected can generate a prediction of whether an audio artifact is likely to occur. The process can use a second learning algorithm, which also can be a DNN, to generate recommended system adjustments that can attempt to prevent the audio glitch from occurring. The recommendations can be for various systems and components on the device, such as changing the processing system frequency, the memory frequency, and the audio buffer size. After the audio artifact has been prevented, the system adjustments can be reversed fully or in steps to return the system to its state prior to the system adjustments.
-
公开(公告)号:US20210103425A1
公开(公告)日:2021-04-08
申请号:US17121373
申请日:2020-12-14
Applicant: Nvidia Corporation
Inventor: Utkarsh Vaidya , Sumit Bhattacharya
Abstract: The disclosure is directed to a process that can predict and prevent an audio artifact from occurring. The process can monitor the systems, processes, and execution threads on a larger system/device, such as a mobile or in-vehicle device. Using a learning algorithm, such as deep neural network (DNN), the information collected can generate a prediction of whether an audio artifact is likely to occur. The process can use a second learning algorithm, which also can be a DNN, to generate recommended system adjustments that can attempt to prevent the audio glitch from occurring. The recommendations can be for various systems and components on the device, such as changing the processing system frequency, the memory frequency, and the audio buffer size. After the audio artifact has been prevented, the system adjustments can be reversed fully or in steps to return the system to its state prior to the system adjustments.
-
-
-
-
-
-
-
-
-