-
公开(公告)号:US20210358490A1
公开(公告)日:2021-11-18
申请号:US16876433
申请日:2020-05-18
Applicant: Nvidia Corporation
Inventor: Utkarsh Vaidya , Sumit Bhattacharya , Viraj Karandikar , Niranjan Wartikar
Abstract: Apparatuses, systems, and techniques are presented to recognize speech in an audio signal. In particular, various embodiments can indicate an end of one or more speech segments based, at least in part, on one or more characters predicted to be within these one or more speech segments.
-
22.
公开(公告)号:US20210347328A1
公开(公告)日:2021-11-11
申请号:US16867395
申请日:2020-05-05
Applicant: NVIDIA Corporation
Inventor: Sumit Bhattacharya , Jason Roche , Niranjan Avadhanam
IPC: B60R25/01 , G05B13/02 , G06N3/08 , G06F21/32 , B60R25/25 , B60R25/30 , G10L17/00 , G10L17/06 , G10L17/18 , G06K9/00 , G06K9/32
Abstract: Systems and methods are disclosed herein for implementation of a vehicle command operation system that may use multi-modal technology to authenticate an occupant of the vehicle to authorize a command and receive natural language commands for vehicular operations. The system may utilize sensors to receive data indicative of a voice command from an occupant of the vehicle. The system may receive second sensor data to aid in the determination of the corresponding vehicular operation in response to the received command. The system may retrieve authentication data for the occupants of the vehicle. The system authenticates the occupant to authorize a vehicular operation command using a neural network based on at least one of the first sensor data, the second sensor data, and the authentication data. Responsive to the authentication, the system may authorize the operation to be performed in the vehicle based on the vehicular operation command.
-