-
1.
Publication No.: US20210347328A1
Publication Date: 2021-11-11
Application No.: US16867395
Filing Date: 2020-05-05
Applicant: NVIDIA Corporation
Inventor: Sumit Bhattacharya, Jason Roche, Niranjan Avadhanam
IPC: B60R25/01, G05B13/02, G06N3/08, G06F21/32, B60R25/25, B60R25/30, G10L17/00, G10L17/06, G10L17/18, G06K9/00, G06K9/32
Abstract: Systems and methods are disclosed herein for implementation of a vehicle command operation system that may use multi-modal technology to authenticate an occupant of the vehicle to authorize a command and receive natural language commands for vehicular operations. The system may utilize sensors to receive data indicative of a voice command from an occupant of the vehicle. The system may receive second sensor data to aid in the determination of the corresponding vehicular operation in response to the received command. The system may retrieve authentication data for the occupants of the vehicle. The system authenticates the occupant to authorize a vehicular operation command using a neural network based on at least one of the first sensor data, the second sensor data, and the authentication data. Responsive to the authentication, the system may authorize the operation to be performed in the vehicle based on the vehicular operation command.
-
2.
Publication No.: US20210334565A1
Publication Date: 2021-10-28
Application No.: US16859741
Filing Date: 2020-04-27
Applicant: NVIDIA Corporation
Inventor: Jason Roche, Niranjan Avadhanam
Abstract: In various examples, systems and methods are disclosed herein for a vehicle command operation system that may use technology across multiple modalities to cause vehicular operations to be performed in response to determining a focal point based on a gaze of an occupant. The system may utilize sensors to receive first data indicative of an eye gaze of an occupant of the vehicle. The system may utilize sensors to receive second data indicative of other data from the occupant. The system may then calculate a gaze vector based on the data indicative of the eye gaze of the occupant. The system may determine a focal point based on the gaze vector. In response to determining the focal point, the system causes an operation to be performed in the vehicle based on the second data.
-
3.
Publication No.: US20230064049A1
Publication Date: 2023-03-02
Application No.: US17462833
Filing Date: 2021-08-31
Applicant: Nvidia Corporation
Inventor: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
IPC: G06K9/00, G06T7/73, G06F3/01, G06F16/632, G06T15/06
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
-
4.
Publication No.: US20250124734A1
Publication Date: 2025-04-17
Application No.: US18999826
Filing Date: 2024-12-23
Applicant: Nvidia Corporation
Inventor: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
IPC: G06V40/10, G06F3/01, G06F16/632, G06T7/73, G06T15/06
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
-
5.
Publication No.: US12211308B2
Publication Date: 2025-01-28
Application No.: US17462833
Filing Date: 2021-08-31
Applicant: Nvidia Corporation
Inventor: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
IPC: G06V40/10, G06F3/01, G06F16/632, G06T7/73, G06T15/06
Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.