发明公开
- 专利标题: EFFICIENT SPEECH TO SPIKES CONVERSION PIPELINE FOR A SPIKING NEURAL NETWORK
-
申请号: US18118619申请日: 2023-03-07
-
公开(公告)号: US20230290340A1公开(公告)日: 2023-09-14
- 发明人: Lavinia Andreea Danielescu , Kenneth Michael Stewart , Noah Gideon Pacik-Nelson , Timothy M. Shea
- 申请人: Accenture Global Solutions Limited
- 申请人地址: IE Dublin
- 专利权人: Accenture Global Solutions Limited
- 当前专利权人: Accenture Global Solutions Limited
- 当前专利权人地址: IE Dublin
- 主分类号: G10L15/16
- IPC分类号: G10L15/16 ; G10L21/013 ; G10L25/24
摘要:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for converting audio to spikes for input to a spiking neural network configured to recognize speech based on the spikes are described. In some aspects, a method includes obtaining audio data and generating frequency domain audio signals that represent the audio data by converting the audio data into a frequency domain. The frequency domain audio signals are mapped into a set of Mel-frequency bands to obtain Mel-scale frequency audio signals. A log transformation is performed on the Mel-scale frequency audio signals to obtain log-Mel signals. Spike input is generated for input to a spiking neural network (SNN) model by converting the log-Mel signals to the series of spikes. The spike input is provided as an input to the SNN model.
信息查询