-
1.
公开(公告)号:US11934955B2
公开(公告)日:2024-03-19
申请号:US18051296
申请日:2022-10-31
申请人: NVIDIA Corporation
发明人: Nuri Murat Arar , Niranjan Avadhanam , Nishant Puri , Shagan Sah , Rajath Shetty , Sujay Yadawadkar , Pavlo Molchanov
IPC分类号: G06N3/08 , G06F18/21 , G06F18/214 , G06N20/00 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/94 , G06V20/59 , G06V20/64 , G06V40/16 , G06V40/18
CPC分类号: G06N3/08 , G06F18/214 , G06F18/2193 , G06N20/00 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/95 , G06V20/597 , G06V20/647 , G06V40/171 , G06V40/193
摘要: Systems and methods for more accurate and robust determination of subject characteristics from an image of the subject. One or more machine learning models receive as input an image of a subject, and output both facial landmarks and associated confidence values. Confidence values represent the degrees to which portions of the subject's face corresponding to those landmarks are occluded, i.e., the amount of uncertainty in the position of each landmark location. These landmark points and their associated confidence values, and/or associated information, may then be input to another set of one or more machine learning models which may output any facial analysis quantity or quantities, such as the subject's gaze direction, head pose, drowsiness state, cognitive load, or distraction state.
-
2.
公开(公告)号:US20240257539A1
公开(公告)日:2024-08-01
申请号:US18611462
申请日:2024-03-20
申请人: NVIDIA Corporation
发明人: Nuri Murat Arar , Niranjan Avadhanam , Yuzhuo Ren
CPC分类号: G06V20/597 , B60W30/0956 , B60W40/08 , B60W50/14 , B60W60/001 , G06V20/56 , B60W30/09 , B60W2540/22 , B60W2540/221 , B60W2540/223 , B60W2540/225
摘要: In various examples, estimated field of view or gaze information of a user may be projected external to a vehicle and compared to vehicle perception information corresponding to an environment outside of the vehicle. As a result, interior monitoring of a driver or occupant of the vehicle may be used to determine whether the driver or occupant has processed or seen certain object types, environmental conditions, or other information exterior to the vehicle. For a more holistic understanding of the state of the user, attentiveness and/or cognitive load of the user may be monitored to determine whether one or more actions should be taken. As a result, notifications, AEB system activations, and/or other actions may be determined based on a more complete state of the user as determined based on cognitive load, attentiveness, and/or a comparison between external perception of the vehicle and estimated perception of the user.
-
公开(公告)号:US20230244941A1
公开(公告)日:2023-08-03
申请号:US18298115
申请日:2023-04-10
申请人: NVIDIA Corporation
IPC分类号: G06N3/08 , G06N20/00 , G06V10/94 , G06V20/59 , G06V20/64 , G06V40/16 , G06V40/18 , G06F18/214 , G06F18/21 , G06V10/764 , G06V10/774 , G06V10/82
CPC分类号: G06N3/08 , G06F18/214 , G06F18/2193 , G06N20/00 , G06V10/82 , G06V10/95 , G06V10/764 , G06V10/774 , G06V20/597 , G06V20/647 , G06V40/171 , G06V40/193
摘要: Systems and methods for determining the gaze direction of a subject and projecting this gaze direction onto specific regions of an arbitrary three-dimensional geometry. In an exemplary embodiment, gaze direction may be determined by a regression-based machine learning model. The determined gaze direction is then projected onto a three-dimensional map or set of surfaces that may represent any desired object or system. Maps may represent any three-dimensional layout or geometry, whether actual or virtual. Gaze vectors can thus be used to determine the object of gaze within any environment. Systems can also readily and efficiently adapt for use in different environments by retrieving a different set of surfaces or regions for each environment.
-
公开(公告)号:US11340701B2
公开(公告)日:2022-05-24
申请号:US16902737
申请日:2020-06-16
申请人: NVIDIA Corporation
摘要: Machine learning systems and methods that learn glare, and thus determine gaze direction in a manner more resilient to the effects of glare on input images. The machine learning systems have an isolated representation of glare, e.g., information on the locations of glare points in an image, as an explicit input, in addition to the image itself. In this manner, the machine learning systems explicitly consider glare while making a determination of gaze direction, thus producing more accurate results for images containing glare.
-
公开(公告)号:US11841987B2
公开(公告)日:2023-12-12
申请号:US17751548
申请日:2022-05-23
申请人: NVIDIA Corporation
CPC分类号: G06F3/013 , G06V10/454 , G06V10/764 , G06V10/82 , G06V40/171 , G06V40/18 , G06V40/19
摘要: Machine learning systems and methods that learn glare, and thus determine gaze direction in a manner more resilient to the effects of glare on input images. The machine learning systems have an isolated representation of glare, e.g., information on the locations of glare points in an image, as an explicit input, in addition to the image itself. In this manner, the machine learning systems explicitly consider glare while making a determination of gaze direction, thus producing more accurate results for images containing glare.
-
6.
公开(公告)号:US20230078171A1
公开(公告)日:2023-03-16
申请号:US18051296
申请日:2022-10-31
申请人: NVIDIA Corporation
发明人: Nuri Murat Arar , Niranjan Avadhanam , Nishant Puri , Shagan Sah , Rajath Shetty , Sujay Yadawadkar , Pavlo Molchanov
摘要: Systems and methods for more accurate and robust determination of subject characteristics from an image of the subject. One or more machine learning models receive as input an image of a subject, and output both facial landmarks and associated confidence values. Confidence values represent the degrees to which portions of the subject's face corresponding to those landmarks are occluded, i.e., the amount of uncertainty in the position of each landmark location. These landmark points and their associated confidence values, and/or associated information, may then be input to another set of one or more machine learning models which may output any facial analysis quantity or quantities, such as the subject's gaze direction, head pose, drowsiness state, cognitive load, or distraction state.
-
7.
公开(公告)号:US20220300072A1
公开(公告)日:2022-09-22
申请号:US17206585
申请日:2021-03-19
申请人: NVIDIA Corporation
摘要: In various examples, systems and methods are disclosed that provide highly accurate gaze predictions that are specific to a particular user by generating and applying, in deployment, personalized calibration functions to outputs and/or layers of a machine learning model. The calibration functions corresponding to a specific user may operate on outputs (e.g., gaze predictions from a machine learning model) to provide updated values and gaze predictions. The calibration functions may also be applied one or more last layers of the machine learning model to operate on features identified by the model and provide values that are more accurate. The calibration functions may be generated using explicit calibration methods by instructing users to gaze at a number of identified ground truth locations within the interior of the vehicle. Once generated, the calibration functions may be modified or refined through implicit gaze calibration points and/or regions based on gaze saliency maps.
-
公开(公告)号:US11978266B2
公开(公告)日:2024-05-07
申请号:US17076690
申请日:2020-10-21
申请人: NVIDIA Corporation
发明人: Nuri Murat Arar , Niranjan Avadhanam , Yuzhuo Ren
CPC分类号: G06V20/597 , B60W30/0956 , B60W40/08 , B60W50/14 , B60W60/001 , G06V20/56 , B60W30/09 , B60W2540/22 , B60W2540/221 , B60W2540/223 , B60W2540/225
摘要: In various examples, estimated field of view or gaze information of a user may be projected external to a vehicle and compared to vehicle perception information corresponding to an environment outside of the vehicle. As a result, interior monitoring of a driver or occupant of the vehicle may be used to determine whether the driver or occupant has processed or seen certain object types, environmental conditions, or other information exterior to the vehicle. For a more holistic understanding of the state of the user, attentiveness and/or cognitive load of the user may be monitored to determine whether one or more actions should be taken. As a result, notifications, AEB system activations, and/or other actions may be determined based on a more complete state of the user as determined based on cognitive load, attentiveness, and/or a comparison between external perception of the vehicle and estimated perception of the user.
-
9.
公开(公告)号:US20240143072A1
公开(公告)日:2024-05-02
申请号:US18410801
申请日:2024-01-11
申请人: NVIDIA Corporation
CPC分类号: G06F3/013 , G06F18/2148 , G06F18/2178 , G06V10/462 , G06V20/597 , G06V40/165 , G06V40/171
摘要: In various examples, systems and methods are disclosed that provide highly accurate gaze predictions that are specific to a particular user by generating and applying, in deployment, personalized calibration functions to outputs and/or layers of a machine learning model. The calibration functions corresponding to a specific user may operate on outputs (e.g., gaze predictions from a machine learning model) to provide updated values and gaze predictions. The calibration functions may also be applied one or more last layers of the machine learning model to operate on features identified by the model and provide values that are more accurate. The calibration functions may be generated using explicit calibration methods by instructing users to gaze at a number of identified ground truth locations within the interior of the vehicle. Once generated, the calibration functions may be modified or refined through implicit gaze calibration points and/or regions based on gaze saliency maps.
-
公开(公告)号:US11657263B2
公开(公告)日:2023-05-23
申请号:US17005914
申请日:2020-08-28
申请人: NVIDIA Corporation
IPC分类号: G06K9/62 , G06F18/214 , G06N20/00 , G06V10/94 , G06V20/59 , G06V20/64 , G06V40/16 , G06V40/18 , G06F18/21
CPC分类号: G06F18/214 , G06F18/2193 , G06N20/00 , G06V10/95 , G06V20/597 , G06V20/647 , G06V40/171 , G06V40/193
摘要: Systems and methods for determining the gaze direction of a subject and projecting this gaze direction onto specific regions of an arbitrary three-dimensional geometry. In an exemplary embodiment, gaze direction may be determined by a regression-based machine learning model. The determined gaze direction is then projected onto a three-dimensional map or set of surfaces that may represent any desired object or system. Maps may represent any three-dimensional layout or geometry, whether actual or virtual. Gaze vectors can thus be used to determine the object of gaze within any environment. Systems can also readily and efficiently adapt for use in different environments by retrieving a different set of surfaces or regions for each environment.
-
-
-
-
-
-
-
-
-