-
公开(公告)号:US11772656B2
公开(公告)日:2023-10-03
申请号:US16923367
申请日:2020-07-08
Applicant: Ford Global Technologies, LLC
Inventor: Apurbaa Mallik , Kaushik Balakrishnan , Vijay Nagasamy , Praveen Narayanan , Sowndarya Sundar
CPC classification number: B60W40/02 , B60W60/005 , G06N20/00 , G06V10/751 , B60W2420/42 , B60W2555/20
Abstract: A system includes a computer including a processor and a memory, the memory storing instructions executable by the processor to generate a synthetic image by adjusting respective color values of one or more pixels of a reference image based on a specified meteorological optical range from a vehicle sensor to simulated fog, and input the synthetic image to a machine learning program to train the machine learning program to identify a meteorological optical range from the vehicle sensor to actual fog.
-
公开(公告)号:US11613249B2
公开(公告)日:2023-03-28
申请号:US15944563
申请日:2018-04-03
Applicant: Ford Global Technologies, LLC
Inventor: Kaushik Balakrishnan , Praveen Narayanan , Mohsen Lakehal-ayat
Abstract: A method for training an autonomous vehicle to reach a target location. The method includes detecting the state of an autonomous vehicle in a simulated environment, and using a neural network to navigate the vehicle from an initial location to a target destination. During the training phase, a second neural network may reward the first neural network for a desired action taken by the autonomous vehicle, and may penalize the first neural network for an undesired action taken by the autonomous vehicle. A corresponding system and computer program product are also disclosed and claimed herein.
-
3.
公开(公告)号:US20220005457A1
公开(公告)日:2022-01-06
申请号:US16919315
申请日:2020-07-02
Applicant: Ford Global Technologies, LLC
Inventor: Kaushik Balakrishnan , Praveen Narayanan , Francois Charette
IPC: G10L13/047 , G10L15/16
Abstract: An end-to-end deep-learning-based system that can solve both ASR and TTS problems jointly using unpaired text and audio samples is disclosed herein. An adversarially-trained approach is used to generate a more robust independent TTS neural network and an ASR neural network that can be deployed individually or simultaneously. The process for training the neural networks includes generating an audio sample from a text sample using the TTS neural network, then feeding the generated audio sample into the ASR neural network to regenerate the text. The difference between the regenerated text and the original text is used as a first loss for training the neural networks. A similar process is used for an audio sample. The difference between the regenerated audio and the original audio is used as a second loss. Text and audio discriminators are similarly used on the output of the neural network to generate additional losses for training.
-
4.
公开(公告)号:US20220214692A1
公开(公告)日:2022-07-07
申请号:US17141433
申请日:2021-01-05
Applicant: Ford Global Technologies, LLC
Inventor: Punarjay Chakravarty , Kaushik Balakrishnan , Shubham Shrivastava
Abstract: Present embodiments use deep reinforcement learning (DRL) algorithms and use one or more path planning approaches to create a path using a deep learning approach using a reinforcement learning algorithm, trained using traditional learning algorithms such as A-Star. The reinforcement learning algorithm takes in a forward-facing camera operative as part of a computer vision system for a robot, and utilizes training the algorithm to train the robot to traverse from point A to point B in an operating environment using a sequence of waypoints as a breadcrumb trail. The system trains the robot to learn the path section by section by the waypoints, which prevents requiring the robot to solve the entire path. At test/deploy time, A-star is not used, and the robot navigates the entire start to goal path without any intermediate waypoints
-
公开(公告)号:US10810754B2
公开(公告)日:2020-10-20
申请号:US15961498
申请日:2018-04-24
Applicant: Ford Global Technologies, LLC
Inventor: Punarjay Chakravarty , Kaushik Balakrishnan
IPC: G06T7/593 , G06T7/73 , H04N13/271 , H04N13/289 , H04N13/00
Abstract: The disclosure relates to systems, methods, and devices for determining a depth map of an environment based on a monocular image. A method for determining a depth map includes receiving a plurality of images from a monocular camera forming an image sequence. The method includes determining pose vector data for two successive images of the image sequence and providing the image sequence and the pose vector data to a generative adversarial network (GAN), wherein the GAN is trained using temporal constraints to generate a depth map for each image of the image sequence. The method includes generating a reconstructed image based on a depth map received from the GAN.
-
6.
公开(公告)号:US20230342512A1
公开(公告)日:2023-10-26
申请号:US17660127
申请日:2022-04-21
Applicant: Ford Global Technologies, LLC
Inventor: Kaushik Balakrishnan , Devesh Upadhyay , Herbert Alexander Morriss-Andrews , Ryan Joseph Madden , Suzhou Huang , Dimitar Petrov Filev
Abstract: Systems and methods for automotive shape design by combining computational fluid dynamics (CFD) and Generative Adversarial Network (GAN). CFD simulations may be performed to determine aerodynamic properties and identify a set of candidate vehicle outline shapes. Vehicle shape outlines may be provided as input to a generative adversarial network (GAN) that is trained to learn aesthetic preferences for vehicle attributes. The GAN may be used to determine, by based on the vehicle outline shape, a set of vehicle attributes. The GAN may be used to generate photo-realistic images with the vehicle shape outline and filling in additional aesthetic styles for the given outline, such as different colors, lighting, visual appearance, wheel design, aspect ratio, etc.
-
7.
公开(公告)号:US11574622B2
公开(公告)日:2023-02-07
申请号:US16919315
申请日:2020-07-02
Applicant: Ford Global Technologies, LLC
Inventor: Kaushik Balakrishnan , Praveen Narayanan , Francois Charette
IPC: G10L15/16 , G10L13/047
Abstract: An end-to-end deep-learning-based system that can solve both ASR and TTS problems jointly using unpaired text and audio samples is disclosed herein. An adversarially-trained approach is used to generate a more robust independent TTS neural network and an ASR neural network that can be deployed individually or simultaneously. The process for training the neural networks includes generating an audio sample from a text sample using the TTS neural network, then feeding the generated audio sample into the ASR neural network to regenerate the text. The difference between the regenerated text and the original text is used as a first loss for training the neural networks. A similar process is used for an audio sample. The difference between the regenerated audio and the original audio is used as a second loss. Text and audio discriminators are similarly used on the output of the neural network to generate additional losses for training.
-
8.
公开(公告)号:US20190325597A1
公开(公告)日:2019-10-24
申请号:US15961498
申请日:2018-04-24
Applicant: Ford Global Technologies, LLC
Inventor: Punarjay Chakravarty , Kaushik Balakrishnan
IPC: G06T7/593 , H04N13/271 , H04N13/289 , G06T7/73
Abstract: The disclosure relates to systems, methods, and devices for determining a depth map of an environment based on a monocular image. A method for determining a depth map includes receiving a plurality of images from a monocular camera forming an image sequence. The method includes determining pose vector data for two successive images of the image sequence and providing the image sequence and the pose vector data to a generative adversarial network (GAN), wherein the GAN is trained using temporal constraints to generate a depth map for each image of the image sequence. The method includes generating a reconstructed image based on a depth map received from the GAN.
-
公开(公告)号:US20240320505A1
公开(公告)日:2024-09-26
申请号:US18188024
申请日:2023-03-22
Applicant: Ford Global Technologies, LLC
Inventor: Kaushik Balakrishnan , Neeloy Chakraborty , Devesh Upadhyay
IPC: G06N3/092
CPC classification number: G06N3/092
Abstract: A computer that includes a processor and a memory, the memory including instructions executable by the processor to train an agent neural network to input a first state and output a first action, input the first action to an environment and determine a second state and a reward. Koopman model neural network can be trained based on the first state, the first action and the second state to determine a fake state. The agent neural network can be re-trained and the Koopman model neural network can be re-trained based on reinforcement learning including the first state, the first action, the second state, the fake state, and the reward.
-
公开(公告)号:US20230139013A1
公开(公告)日:2023-05-04
申请号:US17518623
申请日:2021-11-04
Applicant: Ford Global Technologies, LLC
Inventor: Kaushik Balakrishnan , Praveen Narayanan , Justin Miller , Devesh Upadhyay
Abstract: An image including a vehicle seat and a seatbelt webbing for the vehicle seat is obtained. The image is input to a neural network trained to, upon determining a presence of an occupant in the vehicle seat, output a physical state of the occupant and a seatbelt webbing state. Respective classifications for the physical state and the seatbelt webbing state are determined. The classifications are one of preferred or nonpreferred. A vehicle component is actuated based on the classification for at least one of the physical state of the occupant or the seatbelt webbing state being nonpreferred.
-
-
-
-
-
-
-
-
-