Patent search ap:("Nvidia Corporation") AND inv:"Jason Roche" Page 1

1.

发明申请
SYSTEMS AND METHODS FOR PERFORMING COMMANDS IN A VEHICLE USING SPEECH AND IMAGE RECOGNITION 有权

公开(公告)号：US20210347328A1

公开(公告)日：2021-11-11

申请号：US16867395

申请日：2020-05-05

Applicant: NVIDIA Corporation

Inventor： Sumit Bhattacharya , Jason Roche , Niranjan Avadhanam

IPC: B60R25/01 , G05B13/02 , G06N3/08 , G06F21/32 , B60R25/25 , B60R25/30 , G10L17/00 , G10L17/06 , G10L17/18 , G06K9/00 , G06K9/32

Abstract: Systems and methods are disclosed herein for implementation of a vehicle command operation system that may use multi-modal technology to authenticate an occupant of the vehicle to authorize a command and receive natural language commands for vehicular operations. The system may utilize sensors to receive data indicative of a voice command from an occupant of the vehicle. The system may receive second sensor data to aid in the determination of the corresponding vehicular operation in response to the received command. The system may retrieve authentication data for the occupants of the vehicle. The system authenticates the occupant to authorize a vehicular operation command using a neural network based on at least one of the first sensor data, the second sensor data, and the authentication data. Responsive to the authentication, the system may authorize the operation to be performed in the vehicle based on the vehicular operation command.

2.

发明申请
SYSTEMS AND METHODS FOR PERFORMING OPERATIONS IN A VEHICLE USING GAZE DETECTION 有权

公开(公告)号：US20210334565A1

公开(公告)日：2021-10-28

申请号：US16859741

申请日：2020-04-27

Applicant: NVIDIA Corporation

Inventor： Jason Roche , Niranjan Avadhanam

IPC: G06K9/00 , G06F3/01 , G06N20/00 , G06N3/02

Abstract: In various examples, systems and methods are disclosed herein for a vehicle command operation system that may use technology across multiple modalities to cause vehicular operations to be performed in response to determining a focal point based on a gaze of an occupant. The system may utilize sensors to receive first data indicative of an eye gaze of an occupant of the vehicle. The system may utilize sensors to receive second data indicative of other data from the occupant. The system may then calculate a gaze vector based on the data indicative of the eye gaze of the occupant. The system may determine a focal point based on the gaze vector. In response to determining the focal point, the system causes an operation to be performed in the vehicle based on the second data.

3.

发明申请
MULTI-MODAL SENSOR FUSION FOR CONTENT IDENTIFICATION IN APPLICATIONS OF HUMAN-MACHINE INTERFACES 有权

公开(公告)号：US20230064049A1

公开(公告)日：2023-03-02

申请号：US17462833

申请日：2021-08-31

Applicant: Nvidia Corporation

Inventor： Sakthivel Sivaraman , Nishant Puri , Yuzhuo Ren , Atousa Torabi , Shubhadeep Das , Niranjan Avadhanam , Sumit Kumar Bhattacharya , Jason Roche

IPC: G06K9/00 , G06T7/73 , G06F3/01 , G06F16/632 , G06T15/06

Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.

4.

发明申请
MULTI-MODAL SENSOR FUSION FOR CONTENT IDENTIFICATION IN APPLICATIONS OF HUMAN-MACHINE INTERFACES 有权

公开(公告)号：US20250124734A1

公开(公告)日：2025-04-17

申请号：US18999826

申请日：2024-12-23

Applicant: Nvidia Corporation

Inventor： Sakthivel Sivaraman , Nishant Puri , Yuzhuo Ren , Atousa Torabi , Shubhadeep Das , Niranjan Avadhanam , Sumit Kumar Bhattacharya , Jason Roche

IPC: G06V40/10 , G06F3/01 , G06F16/632 , G06T7/73 , G06T15/06

Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.

5.

发明授权
Multi-modal sensor fusion for content identification in applications of human-machine interfaces 有权

公开(公告)号：US12211308B2

公开(公告)日：2025-01-28

申请号：US17462833

申请日：2021-08-31

Applicant: Nvidia Corporation

Inventor： Sakthivel Sivaraman , Nishant Puri , Yuzhuo Ren , Atousa Torabi , Shubhadeep Das , Niranjan Avadhanam , Sumit Kumar Bhattacharya , Jason Roche

IPC: G06V40/10 , G06F3/01 , G06F16/632 , G06T7/73 , G06T15/06

Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.

Patent Agency Ranking