Hierarchical encoder for speech conversion system

    公开(公告)号:US11410667B2

    公开(公告)日:2022-08-09

    申请号:US16457150

    申请日:2019-06-28

    Abstract: A speech conversion system is described that includes a hierarchical encoder and a decoder. The system may comprise a processor and memory storing instructions executable by the processor. The instructions may comprise to: using a second recurrent neural network (RNN) (GRU1) and a first set of encoder vectors derived from a spectrogram as input to the second RNN, determine a second concatenated sequence; determine a second set of encoder vectors by doubling a stack height and halving a length of the second concatenated sequence; using the second set of encoder vectors, determine a third set of encoder vectors; and decode the third set of encoder vectors using an attention block.

    Vehicle State-Based Hands-Free Phone Noise Reduction With Learning Capability
    3.
    发明申请
    Vehicle State-Based Hands-Free Phone Noise Reduction With Learning Capability 审中-公开
    基于车辆状态的免提电话降噪与学习能力

    公开(公告)号:US20160019890A1

    公开(公告)日:2016-01-21

    申请号:US14334622

    申请日:2014-07-17

    CPC classification number: G10L15/20 G10L21/0208 G10L25/30 G10L25/60

    Abstract: This disclosure generally relates to a system, apparatus, and method for achieving a vehicle state-based hands free noise reduction feature. A noise reduction tool is provided for applying a noise reduction strategy on a sound input that uses machine learning to develop future noise reduction strategies, where the noise reduction strategies include analyzing vehicle operational state information and external information that are predicted to contribute to cabin noise and selecting noise reducing pre-filter options based on the analysis. The machine learning may further be supplemented by off-line training to generate a speech quality performance measure for the sound input that may be referenced by the noise reduction tool for further noise reduction strategies.

    Abstract translation: 本公开总体上涉及用于实现基于车辆状态的免提降噪特征的系统,装置和方法。 提供了一种降噪工具,用于对使用机器学习的声音输入应用降噪策略来开发未来的降噪策略,其中降噪策略包括分析车辆操作状态信息和被预测为有助于驾驶室噪声的外部信息, 基于分析选择降噪预滤波器选项。 机器学习可以进一步由离线训练来补充,以产生用于进一步降噪策略的降噪工具可以参考的声音输入的语音质量性能测量。

    JOINT AUTOMATIC SPEECH RECOGNITION AND TEXT TO SPEECH CONVERSION USING ADVERSARIAL NEURAL NETWORKS

    公开(公告)号:US20220005457A1

    公开(公告)日:2022-01-06

    申请号:US16919315

    申请日:2020-07-02

    Abstract: An end-to-end deep-learning-based system that can solve both ASR and TTS problems jointly using unpaired text and audio samples is disclosed herein. An adversarially-trained approach is used to generate a more robust independent TTS neural network and an ASR neural network that can be deployed individually or simultaneously. The process for training the neural networks includes generating an audio sample from a text sample using the TTS neural network, then feeding the generated audio sample into the ASR neural network to regenerate the text. The difference between the regenerated text and the original text is used as a first loss for training the neural networks. A similar process is used for an audio sample. The difference between the regenerated audio and the original audio is used as a second loss. Text and audio discriminators are similarly used on the output of the neural network to generate additional losses for training.

    Vehicle-window-transmittance-control apparatus and method

    公开(公告)号:US10192125B2

    公开(公告)日:2019-01-29

    申请号:US15299211

    申请日:2016-10-20

    Abstract: A vehicle is disclosed that includes systems for adjusting the transmittance of one or more windows of the vehicle. The vehicle may include a camera outputting images taken of an occupant within the vehicle. The vehicle may also include an artificial neural network running on computer hardware carried on-board the vehicle. The artificial neural network may be trained to classify the occupant of the vehicle using the images captured by the camera as input. The vehicle may further include a controller controlling transmittance of the one or more windows based on classifications made by the artificial neural network. For example, if the artificial neural network classifies the occupant as squinting or shading his or her eyes with a hand, the controller may reduce the transmittance of a windshield, side window, or some combination thereof.

    Vehicle-Window-Transmittance-Control Apparatus And Method

    公开(公告)号:US20190026574A1

    公开(公告)日:2019-01-24

    申请号:US16105646

    申请日:2018-08-20

    Abstract: A vehicle is disclosed that includes systems for adjusting the transmittance of one or more windows of the vehicle. The vehicle may include a camera outputting images taken of an occupant within the vehicle. The vehicle may also include an artificial neural network running on computer hardware carried on-board the vehicle. The artificial neural network may be trained to classify the occupant of the vehicle using the images captured by the camera as input. The vehicle may further include a controller controlling transmittance of the one or more windows based on classifications made by the artificial neural network. For example, if the artificial neural network classifies the occupant as squinting or shading his or her eyes with a hand, the controller may reduce the transmittance of a windshield, side window, or some combination thereof

Patent Agency Ranking