EFFICIENT SPEECH TO SPIKES CONVERSION PIPELINE FOR A SPIKING NEURAL NETWORK

发明公开

US20230290340A1 EFFICIENT SPEECH TO SPIKES CONVERSION PIPELINE FOR A SPIKING NEURAL NETWORK 审中-公开

请登陆查看更多内容

专利标题： EFFICIENT SPEECH TO SPIKES CONVERSION PIPELINE FOR A SPIKING NEURAL NETWORK
申请号： US18118619

申请日： 2023-03-07
公开(公告)号： US20230290340A1

公开(公告)日： 2023-09-14
发明人: Lavinia Andreea Danielescu , Kenneth Michael Stewart , Noah Gideon Pacik-Nelson , Timothy M. Shea
申请人： Accenture Global Solutions Limited
申请人地址： IE Dublin
专利权人： Accenture Global Solutions Limited
当前专利权人： Accenture Global Solutions Limited
当前专利权人地址： IE Dublin
主分类号： G10L15/16
IPC分类号： G10L15/16 ; G10L21/013 ; G10L25/24

EFFICIENT SPEECH TO SPIKES CONVERSION PIPELINE FOR A SPIKING NEURAL NETWORK

摘要：

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for converting audio to spikes for input to a spiking neural network configured to recognize speech based on the spikes are described. In some aspects, a method includes obtaining audio data and generating frequency domain audio signals that represent the audio data by converting the audio data into a frequency domain. The frequency domain audio signals are mapped into a set of Mel-frequency bands to obtain Mel-scale frequency audio signals. A log transformation is performed on the Mel-scale frequency audio signals to obtain log-Mel signals. Spike input is generated for input to a spiking neural network (SNN) model by converting the log-Mel signals to the series of spikes. The spike input is provided as an input to the SNN model.

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/08	.语音分类或检索
G10L15/16	..利用人工神经网络