Patent search: ap:("Distributed Creation Inc.") AND inv:"Naveen Sasalu Rajashekharappa", page 1

1.

Patent application
METHOD AND SYSTEM FOR LEARNING AND USING LATENT-SPACE REPRESENTATIONS OF AUDIO SIGNALS FOR AUDIO CONTENT-BASED RETRIEVAL (status: in force)

Publication No.: US20220036915A1

Publication date: 2022-02-03

Application No.: US16942410

Filing date: 2020-07-29

Applicant: Distributed Creation Inc.

Inventors: Alejandro Koretzky, Naveen Sasalu Rajashekharappa

IPC classification: G10L25/54, G06K9/62, G06F3/16, G10L25/30, G10L21/12, G10L21/14, G06F16/65, G06N3/08

Abstract: A method and system are provided for extracting features from digital audio signals which exhibit variations in pitch, timbre, decay, reverberation, and other psychoacoustic attributes and learning, from the extracted features, an artificial neural network model for generating contextual latent-space representations of digital audio signals. A method and system are also provided for learning an artificial neural network model for generating consistent latent-space representations of digital audio signals in which the generated latent-space representations are comparable for the purposes of determining psychoacoustic similarity between digital audio signals. A method and system are also provided for extracting features from digital audio signals and learning, from the extracted features, an artificial neural network model for generating latent-space representations of digital audio signals which take care of selecting salient attributes of the signals that represent psychoacoustic differences between the signals.
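
The abstract above describes latent-space representations that are comparable for determining similarity between audio signals. As a rough illustration only, the sketch below ranks a few stored vectors against a query vector. Cosine similarity, the function names, the sample names, and the three-dimensional vectors are all assumptions made for this example; the abstract does not state which similarity measure or vector size the patent uses.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def rank_by_similarity(query, catalog):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in catalog.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical latent vectors; a real model would produce these from audio.
catalog = {
    "kick_01": [0.9, 0.1, 0.0],
    "snare_01": [0.1, 0.9, 0.2],
    "kick_02": [0.8, 0.2, 0.1],
}
query = [1.0, 0.0, 0.0]

ranking = rank_by_similarity(query, catalog)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In this toy data the two "kick" vectors point in nearly the same direction as the query, so they rank ahead of the "snare" vector.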

2.

Granted patent
Method and system for learning and using latent-space representations of audio signals for audio content-based retrieval (status: in force)

Publication No.: US12051439B2

Publication date: 2024-07-30

Application No.: US18142165

Filing date: 2023-05-02

Applicant: Distributed Creation Inc.

Inventors: Alejandro Koretzky, Naveen Sasalu Rajashekharappa

IPC classification: G10L25/54, G06F3/16, G06F16/65, G06F18/214, G06N3/08, G10L21/12, G10L21/14, G10L25/30

CPC classification: G10L25/54, G06F3/165, G06F16/65, G06F18/214, G06N3/08, G10L21/12, G10L21/14, G10L25/30

Abstract: A method and system are provided for extracting features from digital audio signals which exhibit variations in pitch, timbre, decay, reverberation, and other psychoacoustic attributes and learning, from the extracted features, an artificial neural network model for generating contextual latent-space representations of digital audio signals. A method and system are also provided for learning an artificial neural network model for generating consistent latent-space representations of digital audio signals in which the generated latent-space representations are comparable for the purposes of determining psychoacoustic similarity between digital audio signals. A method and system are also provided for extracting features from digital audio signals and learning, from the extracted features, an artificial neural network model for generating latent-space representations of digital audio signals which take care of selecting salient attributes of the signals that represent psychoacoustic differences between the signals.

3.

Granted patent
Method and system for learning and using latent-space representations of audio signals for audio content-based retrieval (status: in force)

Publication No.: US11670322B2

Publication date: 2023-06-06

Application No.: US16942410

Filing date: 2020-07-29

Applicant: Distributed Creation Inc.

Inventors: Alejandro Koretzky, Naveen Sasalu Rajashekharappa

IPC classification: G10L25/54, G06F16/65, G06F3/16, G06N3/08, G10L21/12, G10L21/14, G10L25/30, G06F18/214

CPC classification: G10L25/54, G06F3/165, G06F16/65, G06F18/214, G06N3/08, G10L21/12, G10L21/14, G10L25/30

Abstract: A method and system are provided for extracting features from digital audio signals which exhibit variations in pitch, timbre, decay, reverberation, and other psychoacoustic attributes and learning, from the extracted features, an artificial neural network model for generating contextual latent-space representations of digital audio signals. A method and system are also provided for learning an artificial neural network model for generating consistent latent-space representations of digital audio signals in which the generated latent-space representations are comparable for the purposes of determining psychoacoustic similarity between digital audio signals. A method and system are also provided for extracting features from digital audio signals and learning, from the extracted features, an artificial neural network model for generating latent-space representations of digital audio signals which take care of selecting salient attributes of the signals that represent psychoacoustic differences between the signals.
