发明授权
US09589283B2 Device, method, and medium for generating audio fingerprint and retrieving audio data
有权
用于生成音频指纹和检索音频数据的设备,方法和介质
- 专利标题: Device, method, and medium for generating audio fingerprint and retrieving audio data
- 专利标题(中): 用于生成音频指纹和检索音频数据的设备,方法和介质
-
申请号: US13742858申请日: 2013-01-16
-
公开(公告)号: US09589283B2公开(公告)日: 2017-03-07
- 发明人: Hyounggook Kim , Yuan Yuan She , Kiwan Eom , Xuan Zhu , Jiyeun Kim
- 申请人: Samsung Electronics Co., Ltd.
- 申请人地址: KR Suwon-si
- 专利权人: SAMSUNG ELECTRONICS CO., LTD.
- 当前专利权人: SAMSUNG ELECTRONICS CO., LTD.
- 当前专利权人地址: KR Suwon-si
- 代理机构: Sughrue Mion, PLLC
- 优先权: KR10-2005-0108070 20051111
- 主分类号: G10L17/00
- IPC分类号: G10L17/00 ; G06Q30/02 ; G10L25/48 ; G10L19/02
摘要:
Provided are device, method, and medium for generating an audio fingerprint and retrieving audio data. The device for generating an audio fingerprint includes: a coefficient extracting section partially decoding audio data in a compression area and extracting MDCT (Modified Discrete Cosine Transform) coefficients; a coefficient selecting section selecting an MDCT coefficient robust to noises from the extracted MDCT coefficients; a modulation spectrum generating section transforming the selected MDCT coefficient by the use of a Fourier transform method and generating a modulation spectrum; and a bit conversion section quantizing the generated modulation spectrum and generating an audio fingerprint. As a result, it is possible to accurately and rapidly retrieve the audio data recorded in a variety of environments. Since elements based on MP3 are used, it is possible to apply to MP3 applications in various manners. In addition, it is possible to apply to classification of audio data such as classification of music moods and classification of music genres and various other fields such as extraction of a specific event from moving images of sports.
公开/授权文献
信息查询