Abstract:
A speech conversion system is described that includes a hierarchical encoder and a decoder. The system may comprise a processor and memory storing instructions executable by the processor. The instructions may be executable to: using a second recurrent neural network (RNN) (GRU1) and a first set of encoder vectors derived from a spectrogram as input to the second RNN, determine a second concatenated sequence; determine a second set of encoder vectors by doubling a stack height and halving a length of the second concatenated sequence; using the second set of encoder vectors, determine a third set of encoder vectors; and decode the third set of encoder vectors using an attention block.
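The "doubling a stack height and halving a length" step resembles the time-reduction operation used in pyramidal encoders: adjacent pairs of time steps are stacked along the feature axis. A minimal sketch, assuming a sequence stored as a (time, features) array (the function name and frame handling are illustrative, not from the source):

```python
import numpy as np

def halve_and_stack(seq):
    """Stack adjacent time steps: (T, D) -> (T//2, 2*D).

    Halves the sequence length while doubling the per-step
    feature ("stack") size, as in a pyramidal encoder layer.
    """
    T, D = seq.shape
    if T % 2:                      # drop a trailing frame if T is odd
        seq = seq[:T - 1]
    return seq.reshape(-1, 2 * D)

x = np.zeros((8, 16))
y = halve_and_stack(x)
# y has 4 time steps of 32 features each
```

Applying the step repeatedly in successive encoder layers yields progressively shorter, wider sequences for the attention-based decoder to consume.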
Abstract:
A controller may be programmed to create a speech utterance set for speech recognition training. In response to receiving data representing a neutral utterance and parameter values defining signal noise, the controller generates data representing a Lombard-effect version of the neutral utterance using a transfer function that is associated with the parameter values and that defines the distortion, due to the signal noise, between neutral and Lombard-effect versions of the same utterance.
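As an illustration of applying such a transfer function, the sketch below maps a neutral-speech magnitude spectrum to a Lombard-style version by boosting mid-band energy as noise rises. The band limits, gain slope, and function names are assumptions for illustration; the source does not specify the transfer function's form:

```python
import numpy as np

def lombard_transform(neutral_spectrum, freqs, noise_level_db):
    """Toy transfer function: boost 1-3 kHz energy with noise level,
    mimicking the spectral tilt of Lombard speech."""
    in_band = (freqs > 1000.0) & (freqs < 3000.0)
    gain_db = noise_level_db * 0.3 * in_band   # illustrative gain law
    h = 10.0 ** (gain_db / 20.0)               # dB -> linear gain
    return neutral_spectrum * h

freqs = np.linspace(0.0, 8000.0, 257)
neutral = np.ones_like(freqs)
lombard = lombard_transform(neutral, freqs, noise_level_db=20.0)
# mid-band bins are amplified; out-of-band bins are unchanged
```

A training set could then pair each neutral utterance with transformed versions generated under several noise parameter settings.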
Abstract:
This disclosure generally relates to a system, apparatus, and method for achieving a vehicle state-based hands-free noise reduction feature. A noise reduction tool is provided that applies a noise reduction strategy to a sound input and uses machine learning to develop future noise reduction strategies, where the noise reduction strategies include analyzing vehicle operational state information and external information that are predicted to contribute to cabin noise and selecting noise-reducing pre-filter options based on the analysis. The machine learning may further be supplemented by off-line training to generate a speech quality performance measure for the sound input that may be referenced by the noise reduction tool for further noise reduction strategies.
Abstract:
Example natural speech data generation systems and methods are described. In one implementation, a natural speech data generator initiates a game between a first player and a second player and determines a scenario associated with the game. A first role is assigned to the first player and a second role is assigned to the second player. The natural speech data generator receives multiple natural speech utterances by the first player and the second player during the game.
Abstract:
An end-to-end deep-learning-based system that can solve both ASR and TTS problems jointly using unpaired text and audio samples is disclosed herein. An adversarially-trained approach is used to generate a more robust independent TTS neural network and an ASR neural network that can be deployed individually or simultaneously. The process for training the neural networks includes generating an audio sample from a text sample using the TTS neural network, then feeding the generated audio sample into the ASR neural network to regenerate the text. The difference between the regenerated text and the original text is used as a first loss for training the neural networks. A similar process is used for an audio sample. The difference between the regenerated audio and the original audio is used as a second loss. Text and audio discriminators are similarly used on the output of the neural network to generate additional losses for training.
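The two cycle losses described above (text → TTS → ASR → text, and audio → ASR → TTS → audio) can be sketched numerically. The linear maps below are stand-ins for the trained TTS and ASR networks, and the mean-squared-error losses are illustrative; the source does not specify the loss form:

```python
import numpy as np

rng = np.random.default_rng(0)
W_tts = rng.normal(size=(16, 32))   # stand-in "TTS network" weights
W_asr = rng.normal(size=(32, 16))   # stand-in "ASR network" weights

def tts(text_vec):    # text embedding -> audio features
    return text_vec @ W_tts

def asr(audio_vec):   # audio features -> text embedding
    return audio_vec @ W_asr

text = rng.normal(size=(4, 16))
audio = rng.normal(size=(4, 32))

# First loss: regenerate text from TTS-generated audio, compare to original.
text_cycle_loss = np.mean((asr(tts(text)) - text) ** 2)
# Second loss: regenerate audio from ASR-recognized text, compare to original.
audio_cycle_loss = np.mean((tts(asr(audio)) - audio) ** 2)
```

In the described system these cycle losses would be combined with the discriminator losses to train both networks jointly on unpaired samples.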
Abstract:
Systems, methods, and devices for speech transformation and generating synthetic speech using deep generative models are disclosed. A method of the disclosure includes receiving input audio data comprising a plurality of iterations of a speech utterance from a plurality of speakers. The method includes generating an input spectrogram based on the input audio data and transmitting the input spectrogram to a neural network configured to generate an output spectrogram. The method includes receiving the output spectrogram from the neural network and, based on the output spectrogram, generating synthetic audio data comprising the speech utterance.
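Generating the input spectrogram from raw audio typically uses a short-time Fourier transform. A minimal sketch, where the frame length, hop size, and window choice are assumptions rather than values from the source:

```python
import numpy as np

def spectrogram(audio, frame=256, hop=128):
    """Magnitude spectrogram via a windowed short-time Fourier transform.

    Returns an array of shape (n_frames, frame // 2 + 1).
    """
    window = np.hanning(frame)
    n_frames = 1 + (len(audio) - frame) // hop
    frames = np.stack([audio[i * hop:i * hop + frame] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1))

# One second of a 440 Hz tone at 16 kHz
audio = np.sin(2 * np.pi * 440.0 * np.arange(16000) / 16000.0)
spec = spectrogram(audio)
```

The resulting spectrogram would be the neural network's input; synthetic audio is recovered from the output spectrogram with an inverse transform such as phase reconstruction.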
Abstract:
An automatic speech recognition system for a vehicle includes a controller configured to select an acoustic model from a library of acoustic models based on ambient noise in a cabin of the vehicle and operating parameters of the vehicle. The controller is further configured to apply the selected acoustic model to noisy speech to improve recognition of the speech.
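Model selection from the library can be pictured as a lookup keyed on the measured noise and operating state. The thresholds, key names, and model identifiers below are hypothetical placeholders; the source does not enumerate the library:

```python
# Hypothetical acoustic-model library keyed by noise and driving regime
MODEL_LIBRARY = {
    ("low_noise", "city"):     "am_city_quiet",
    ("low_noise", "highway"):  "am_highway_quiet",
    ("high_noise", "city"):    "am_city_noisy",
    ("high_noise", "highway"): "am_highway_noisy",
}

def select_acoustic_model(noise_db, speed_kph):
    """Pick a model from the library using cabin noise level and
    a vehicle operating parameter (speed)."""
    noise = "high_noise" if noise_db > 65 else "low_noise"
    regime = "highway" if speed_kph > 80 else "city"
    return MODEL_LIBRARY[(noise, regime)]
```

The selected model is then applied to the noisy speech by the recognizer; richer versions might key on HVAC state, window position, or road surface as additional operating parameters.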
Abstract:
A vehicle is disclosed that includes systems for adjusting the transmittance of one or more windows of the vehicle. The vehicle may include a camera outputting images taken of an occupant within the vehicle. The vehicle may also include an artificial neural network running on computer hardware carried on-board the vehicle. The artificial neural network may be trained to classify the occupant of the vehicle using the images captured by the camera as input. The vehicle may further include a controller controlling transmittance of the one or more windows based on classifications made by the artificial neural network. For example, if the artificial neural network classifies the occupant as squinting or shading his or her eyes with a hand, the controller may reduce the transmittance of a windshield, side window, or some combination thereof.
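The controller logic in the example above can be sketched as a simple mapping from the network's classification to a transmittance adjustment. The class labels, step size, and floor value are assumptions for illustration:

```python
# Classifications (from the neural network) that indicate glare discomfort
GLARE_CLASSES = {"squinting", "shading_eyes"}

def update_transmittance(classification, current=1.0, step=0.2, floor=0.3):
    """Reduce window transmittance when the occupant appears to be
    squinting or shading their eyes; otherwise leave it unchanged."""
    if classification in GLARE_CLASSES:
        return max(floor, current - step)
    return current
```

A real controller would apply this per window (windshield, side windows, or a combination) and likely smooth the adjustment over successive camera frames.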
Abstract:
A system includes a head and torso simulation (HATS) system configured to play back pre-recorded audio commands while simulating a driver head location as an output location. The system also includes a vehicle speaker system and a processor configured to engage a vehicle heating, ventilation and air-conditioning (HVAC) system. The processor is also configured to play back audio commands through the HATS system while playing back pre-recorded vehicle environment noises through the speaker system. The processor is further configured to determine if the audio command, recorded by a vehicle microphone, is recognizable in the presence of the environment noises and HVAC noises. Also, the processor is configured to repeat the engagement, playback of commands and noises, and determination, recording the results of the determination for each command in a set of commands.
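The repeated engage/playback/determine cycle described above can be sketched as a test loop over the command set. The callables stand in for the HVAC control, the HATS-plus-speaker playback with microphone capture, and the recognizer; all names are illustrative:

```python
def run_recognition_tests(commands, recognize, play_and_capture, engage_hvac):
    """For each pre-recorded command: engage the HVAC, play the command
    over environment noise, capture it at the vehicle microphone, and
    record whether it was still recognized correctly."""
    results = {}
    for cmd in commands:
        engage_hvac()                    # introduce HVAC noise
        captured = play_and_capture(cmd) # HATS playback + mic recording
        results[cmd] = (recognize(captured) == cmd)
    return results

# Usage with trivial stubs (a perfect channel and recognizer)
cmds = ["call home", "play radio"]
res = run_recognition_tests(cmds,
                            recognize=lambda audio: audio,
                            play_and_capture=lambda cmd: cmd,
                            engage_hvac=lambda: None)
```

The recorded results per command give the pass/fail tally the processor accumulates across the full command set.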