- Patent title: AUDIO IDENTIFICATION BASED ON DATA STRUCTURE
- Application number: US18406840
- Filing date: 2024-01-08
- Publication number: US20240160665A1
- Publication date: 2024-05-16
- Inventors: Zafar Rafii, Prem Seetharaman
- Applicant: Gracenote, Inc.
- Applicant address: US CA Emeryville
- Patentee: Gracenote, Inc.
- Current patentee: Gracenote, Inc.
- Current patentee address: US CA Emeryville
- Main classification: G06F16/68
- IPC classifications: G06F16/68; G06F16/61; G06F17/14; G10L25/27; G10L25/51
Abstract:
Example systems and methods for audio identification based on data structure are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to execute a constant Q transform on query time slices of query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, sequentially order the two-dimensional Fourier transforms in a query data structure, and identify the query audio as a cover rendition of reference audio based on a comparison between the query data structure and a reference data structure associated with the reference audio.
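The steps named in the abstract can be illustrated with a small sketch. It is not the claimed implementation: the function names (`binarize`, `dft2_magnitude`, `build_structure`, `distance`), the per-slice median threshold, the window length and hop, and the Euclidean comparison are all assumptions chosen for illustration, since the abstract does not specify them. The constant Q transform itself is omitted; the input is assumed to be a list of already-computed constant Q magnitude slices (rows are time slices, columns are frequency bins). The sketch relies on one well-known property: the magnitude of a two-dimensional discrete Fourier transform does not change when its input is circularly shifted, so a pattern moved along the frequency axis yields the same values.

```python
import cmath
from statistics import median


def binarize(slices):
    """Set each bin to 1 if it exceeds its slice's median, else 0 (assumed rule)."""
    result = []
    for row in slices:
        threshold = median(row)
        result.append([1 if value > threshold else 0 for value in row])
    return result


def dft2_magnitude(window):
    """Magnitude of the 2D discrete Fourier transform of a small 2D window."""
    rows, cols = len(window), len(window[0])
    magnitudes = []
    for u in range(rows):
        out_row = []
        for v in range(cols):
            acc = 0j
            for r in range(rows):
                for c in range(cols):
                    angle = -2 * cmath.pi * (u * r / rows + v * c / cols)
                    acc += window[r][c] * cmath.exp(1j * angle)
            out_row.append(abs(acc))
        magnitudes.append(out_row)
    return magnitudes


def build_structure(binary_slices, window_len=4, hop=2):
    """Time-ordered list of 2D-DFT magnitudes, one per time window."""
    starts = range(0, len(binary_slices) - window_len + 1, hop)
    return [dft2_magnitude(binary_slices[s:s + window_len]) for s in starts]


def distance(struct_a, struct_b):
    """Average Euclidean distance between corresponding windows (assumed metric)."""
    pairs = list(zip(struct_a, struct_b))
    total = 0.0
    for win_a, win_b in pairs:
        squared = sum((x - y) ** 2
                      for row_a, row_b in zip(win_a, win_b)
                      for x, y in zip(row_a, row_b))
        total += squared ** 0.5
    return total / len(pairs)


# Toy data standing in for constant Q magnitudes: 8 time slices, 6 bins.
# Reference: one strong bin per slice, rising by one bin each slice.
reference = [[1.0 if f == t % 6 else 0.1 for f in range(6)] for t in range(8)]
# Query: the same pattern circularly shifted by two bins along frequency.
query = [row[-2:] + row[:-2] for row in reference]
# Unrelated pattern: the strong bin never moves.
other = [[1.0 if f == 0 else 0.1 for f in range(6)] for _ in range(8)]

ref_struct = build_structure(binarize(reference))
query_struct = build_structure(binarize(query))
other_struct = build_structure(binarize(other))

# The shifted query should sit far closer to the reference than the unrelated one.
print(distance(query_struct, ref_struct), distance(other_struct, ref_struct))
```

The pure-Python transform keeps the sketch dependency-free; for real audio, an FFT routine such as `numpy.fft.fft2` would be the practical choice, and a match decision would need a threshold or ranking over many reference structures, which the abstract leaves open.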
Published/granted documents:
- US12105754B2 Audio identification based on data structure, publication/grant date: 2024-10-01