-
公开(公告)号:US11772656B2
公开(公告)日:2023-10-03
申请号:US16923367
申请日:2020-07-08
Applicant: Ford Global Technologies, LLC
Inventor: Apurbaa Mallik , Kaushik Balakrishnan , Vijay Nagasamy , Praveen Narayanan , Sowndarya Sundar
CPC classification number: B60W40/02 , B60W60/005 , G06N20/00 , G06V10/751 , B60W2420/42 , B60W2555/20
Abstract: A system includes a computer including a processor and a memory, the memory storing instructions executable by the processor to generate a synthetic image by adjusting respective color values of one or more pixels of a reference image based on a specified meteorological optical range from a vehicle sensor to simulated fog, and input the synthetic image to a machine learning program to train the machine learning program to identify a meteorological optical range from the vehicle sensor to actual fog.
-
公开(公告)号:US11613249B2
公开(公告)日:2023-03-28
申请号:US15944563
申请日:2018-04-03
Applicant: Ford Global Technologies, LLC
Inventor: Kaushik Balakrishnan , Praveen Narayanan , Mohsen Lakehal-ayat
Abstract: A method for training an autonomous vehicle to reach a target location. The method includes detecting the state of an autonomous vehicle in a simulated environment, and using a neural network to navigate the vehicle from an initial location to a target destination. During the training phase, a second neural network may reward the first neural network for a desired action taken by the autonomous vehicle, and may penalize the first neural network for an undesired action taken by the autonomous vehicle. A corresponding system and computer program product are also disclosed and claimed herein.
-
公开(公告)号:US11410667B2
公开(公告)日:2022-08-09
申请号:US16457150
申请日:2019-06-28
Applicant: Ford Global Technologies, LLC
Inventor: Punarjay Chakravarty , Lisa Scaria , Ryan Burke , Francois Charette , Praveen Narayanan
Abstract: A speech conversion system is described that includes a hierarchical encoder and a decoder. The system may comprise a processor and memory storing instructions executable by the processor. The instructions may comprise to: using a second recurrent neural network (RNN) (GRU1) and a first set of encoder vectors derived from a spectrogram as input to the second RNN, determine a second concatenated sequence; determine a second set of encoder vectors by doubling a stack height and halving a length of the second concatenated sequence; using the second set of encoder vectors, determine a third set of encoder vectors; and decode the third set of encoder vectors using an attention block.
-
公开(公告)号:US20210264284A1
公开(公告)日:2021-08-26
申请号:US16800950
申请日:2020-02-25
Applicant: Ford Global Technologies, LLC
Inventor: Shubh Gupta , Nikita Jaipuria , Praveen Narayanan , Vidya Nariyambut Murali
Abstract: The present disclosure discloses a system and a method. In an example implantation, the system and the method can generate, at a discriminator, a plurality of image patches from an image, determine a plurality of routing coefficients within a capsule network based on the plurality of image patches, generate a prediction indicating whether the image is synthetic or sourced from a real distribution based on the plurality of routing coefficients, and update one or more weights of a generator based on the prediction, wherein the generator is connected to the discriminator.
-
公开(公告)号:US20210082145A1
公开(公告)日:2021-03-18
申请号:US17103835
申请日:2020-11-24
Applicant: Ford Global Technologies, LLC
Inventor: Punarjay Chakravarty , Tom Roussel , Praveen Narayanan , Gaurav Pandey
IPC: G06T7/73
Abstract: Various examples of hybrid metric-topological camera-based localization are described. A single image sensor captures an input image of an environment. The input image is localized to one of a plurality of topological nodes of a hybrid simultaneous localization and mapping (SLAM) metric-topological map which describes the environment as the plurality of topological nodes at a plurality of discrete locations in the environment. A metric pose of the image sensor can be determined using a Perspective-n-Point (PnP) projection algorithm. A convolutional neural network (CNN) can be trained to localize the input image to one of the plurality of topological nodes and a direction of traversal through the environment.
-
公开(公告)号:US20240264276A1
公开(公告)日:2024-08-08
申请号:US18423694
申请日:2024-01-26
Inventor: Yunfei Long , Daniel Morris , Abhinav Kumar , Xiaoming Liu , Marcos Paul Gerardo Castro , Punarjay Chakravarty , Praveen Narayanan
IPC: G01S7/41 , G01S13/86 , G01S13/89 , G01S13/931
CPC classification number: G01S7/417 , G01S13/867 , G01S13/89 , G01S2013/9318 , G01S2013/93185 , G01S2013/9319
Abstract: A computer that includes a processor and a memory, the memory including instructions executable by the processor to generate radar data by projecting radar returns of objects within a scene onto an image plane of camera data of the scene based on extrinsic and intrinsic parameters of a camera and extrinsic parameters of a radar sensor to generate the radar data. The image data can be received at an image channel of an image/radar convolutional neural network (CNN) and receive the radar data at a radar channel of the image/radar CNN, wherein features are transferred from the image channel to the radar channel at multiple stages Image object features and image confidence scores can be determined by the image channel, and radar object features and radar confidences by the radar channel. The image object features can be combined with the radar object features using a weighted sum.
-
公开(公告)号:US11625856B2
公开(公告)日:2023-04-11
申请号:US17160259
申请日:2021-01-27
Applicant: Ford Global Technologies, LLC.
Inventor: Sarah Houts , Praveen Narayanan , Punarjay Chakravarty , Gaurav Pandey , Graham Mills , Tyler Reid
Abstract: Example localization systems and methods are described. In one implementation, a method receives a camera image from a vehicle camera and cleans the camera image using a VAE-GAN (variational autoencoder combined with a generative adversarial network) algorithm. The method further receives a vector map related to an area proximate the vehicle and generates a synthetic image based on the vector map. The method then localizes the vehicle based on the cleaned camera image and the synthetic image.
-
8.
公开(公告)号:US11574622B2
公开(公告)日:2023-02-07
申请号:US16919315
申请日:2020-07-02
Applicant: Ford Global Technologies, LLC
Inventor: Kaushik Balakrishnan , Praveen Narayanan , Francois Charette
IPC: G10L15/16 , G10L13/047
Abstract: An end-to-end deep-learning-based system that can solve both ASR and TTS problems jointly using unpaired text and audio samples is disclosed herein. An adversarially-trained approach is used to generate a more robust independent TTS neural network and an ASR neural network that can be deployed individually or simultaneously. The process for training the neural networks includes generating an audio sample from a text sample using the TTS neural network, then feeding the generated audio sample into the ASR neural network to regenerate the text. The difference between the regenerated text and the original text is used as a first loss for training the neural networks. A similar process is used for an audio sample. The difference between the regenerated audio and the original audio is used as a second loss. Text and audio discriminators are similarly used on the output of the neural network to generate additional losses for training.
-
公开(公告)号:US20230023347A1
公开(公告)日:2023-01-26
申请号:US17384010
申请日:2021-07-23
Inventor: Xiaoming Liu , Daniel Morris , Yunfei Long , Marcos Paul Gerardo Castro , Punarjay Chakravarty , Praveen Narayanan
Abstract: A computer includes a processor and a memory storing instructions executable by the processor to receive radar data including a radar pixel having a radial velocity from a radar; receive camera data including an image frame including camera pixels from a camera; map the radar pixel to the image frame; generate a region of the image frame surrounding the radar pixel; determine association scores for the respective camera pixels in the region; select a first camera pixel of the camera pixels from the region, the first camera pixel having a greatest association score of the association scores; and calculate a full velocity of the radar pixel using the radial velocity of the radar pixel and a first optical flow at the first camera pixel. The association scores indicate a likelihood that the respective camera pixels correspond to a same point in an environment as the radar pixel.
-
公开(公告)号:US10957317B2
公开(公告)日:2021-03-23
申请号:US16164355
申请日:2018-10-18
Applicant: Ford Global Technologies, LLC
Inventor: Lisa Scaria , Ryan Burke , Praveen Narayanan , Francois Charette
Abstract: A computing system can determine a vehicle command based on a received spoken language command and determined confidence levels. The computing system can operate a vehicle based on the vehicle command. The computing system can further determine the spoken language command by processing audio spectrum data corresponding to spoken natural language with an automatic speech recognition (ASR) system.
-
-
-
-
-
-
-
-
-