MUSICAL PIECE STRUCTURE ANALYSIS DEVICE AND MUSICAL PIECE STRUCTURE ANALYSIS METHOD

    公开(公告)号:US20230186877A1

    公开(公告)日:2023-06-15

    申请号:US18164575

    申请日:2023-02-04

    发明人: Kouhei SUMI

    IPC分类号: G10H1/00

    摘要: A musical piece structure analysis method includes acquiring an acoustic signal of a musical piece, extracting a first feature amount indicating changes in tone from the acoustic signal of the musical piece, extracting a second feature amount indicating changes in chords from the acoustic signal of the musical piece, outputting a first boundary likelihood indicating likelihood of a constituent boundary of the musical piece from the first feature amount using a first learning model, outputting a second boundary likelihood indicating likelihood of the constituent boundary of the musical piece from the second feature amount using a second learning model, identifying the constituent boundary of the musical piece by performing weighted synthesis of the first boundary likelihood and the second boundary likelihood, and dividing the acoustic signal of the musical piece into a plurality of sections at the constituent boundary that has been identified.

    Audio separation system and method
    3.
    发明授权
    Audio separation system and method 有权
    音频分离系统和方法

    公开(公告)号:US09093056B2

    公开(公告)日:2015-07-28

    申请号:US13612413

    申请日:2012-09-12

    摘要: A method includes determining a first spectrogram of the audio signal, defining a similarity matrix of the audio signal based on the first spectrogram and a transposed version of the first spectrogram, identifying two or more similar frames in the similarity matrix that are more similar to a designated frame than to one or more other frames in the similarity matrix, creating a repeating spectrogram model based on the two or more similar frames that are identified in the similarity matrix, and deriving a mask based on the repeating spectrogram model and the first spectrogram of the audio signal. The mask is representative of similarities between the repeating spectrogram model and the first spectrogram of the audio signal. The method also includes extracting a repeating structure from the audio signal by applying the mask to the audio signal.

    摘要翻译: 一种方法包括确定音频信号的第一频谱图,基于第一谱图定义音频信号的相似矩阵,以及识别第一谱图的转置版本,识别相似性矩阵中更相似的两个或多个相似帧 指定帧比相似矩阵中的一个或多个其他帧,基于在相似性矩阵中识别的两个或更多个相似帧创建重复频谱图模型,并且基于重复频谱图模型和第一频谱图 音频信号。 该掩模代表了重复频谱图模型和音频信号的第一个频谱图之间的相似性。 该方法还包括通过将音程应用于音频信号从音频信号中提取重复结构。

    METHOD FOR EXTRACTING REPRESENTATIVE SEGMENTS FROM MUSIC
    4.
    发明申请
    METHOD FOR EXTRACTING REPRESENTATIVE SEGMENTS FROM MUSIC 有权
    从音乐中提取代表部分的方法

    公开(公告)号:US20140338515A1

    公开(公告)日:2014-11-20

    申请号:US14362129

    申请日:2012-11-29

    申请人: Play My Tone Ltd.

    IPC分类号: G10H1/00

    摘要: A method for extracting the most representative segments of a musical composition, represented by an audio signal, according to which the audio signal is preprocessed by a set of preprocessors, each if which is adapted to identify a rhythmic pattern. The output of the preprocessors that provided the most periodic or rhythmical patterns in the musical composition selected and the musical composition is divided into bars with rhythmic patterns, while iteratively checking and scoring their quality and detecting a section that is a sequence of bars with score above a predetermined threshold. Checking and scoring is iteratively repeated until all sections are detected. Then similarity matrices between all bars that belong to the musical composition are constructed, based on MFCCs of the processed sound, chromograms and the rhythmic patterns. Then equivalent classes of similar sections are extracted along the musical composition. Substantial transitions between sections represented as blocks in the similarity matrices are collected and a representative segment is selected from each class with the highest number of sections.

    摘要翻译: 一种用于提取由音频信号表示的音乐作品的最具代表性的段的方法,根据该方法音频信号由一组预处理器进行预处理,每一个如果适合于识别节奏模式。 在选择的音乐作品中提供最周期或有节奏的图案的预处理器的输出和音乐作品被分成具有节奏模式的条,同时迭代地检查和评分其质量并且检测作为具有得分高于 预定阈值。 迭代重复检查和评分,直到检测到所有部分。 然后,基于处理声音的MFCC,色彩图和节奏模式,构建属于音乐作品的所有条之间的相似矩阵。 然后沿着音乐作品提取相似类别的类似部分。 收集在相似矩阵中表示为块的部分之间的实质转换,并从具有最高数目的部分的每个类中选择代表性的片段。

    Biometric-Music Interaction Methods and Systems
    5.
    发明申请
    Biometric-Music Interaction Methods and Systems 有权
    生物统计学 - 音乐交互方法与系统

    公开(公告)号:US20140074479A1

    公开(公告)日:2014-03-13

    申请号:US14021862

    申请日:2013-09-09

    申请人: BioBeats, Inc.

    IPC分类号: G10L25/48

    摘要: A system and method for the automatic, procedural generation of musical content in relation to biometric data. The systems and methods use a user's device, such as a cell phone to capture image data of a body part, and derive a biometric signal from the image data. The biometric signal includes biometric parameters, which are used by a music generation engine to generate music. The music generation can also be based on user-specific data and quality data related to the biometric detection process.

    摘要翻译: 一种与生物特征数据相关的自动程序性生成音乐内容的系统和方法。 系统和方法使用诸如手机的用户设备来捕获身体部位的图像数据,并从图像数据中导出生物特征信号。 生物测定信号包括由音乐生成引擎用于产生音乐的生物特征参数。 音乐生成还可以基于与生物特征检测过程相关的用户特定数据和质量数据。

    Method And An Apparatus For Deriving Information From An Audio Track And Determining Similarity Between Audio Tracks
    6.
    发明申请
    Method And An Apparatus For Deriving Information From An Audio Track And Determining Similarity Between Audio Tracks 审中-公开
    用于从音轨导出信息的方法和装置,并且确定音轨之间的相似性

    公开(公告)号:US20120237041A1

    公开(公告)日:2012-09-20

    申请号:US13384548

    申请日:2010-07-23

    申请人: Tim Pohle

    发明人: Tim Pohle

    IPC分类号: H04R29/00 G06F17/00

    摘要: A method of deriving information from an audio track, or a part thereof, wherein onsets or intensity/amplitude variations are detected as well as at which frequencies (timbral frequencies) or in which frequency bands these occur. Especially interesting is the frequency of such onsets. In this manner, the frequency of beats of a low frequency drum may be separated from that of onsets of a higher frequency drum or guitar of other instrument, and these frequencies provide important information about the track, such as genre, beat, etc. Naturally, parameters may be provided relating to the individual frequencies (frequency of onsets and frequency/tone of the sound of the onsets), or a fit thereto may be used to reduce the number of parameters. It is noted that the frequencies in which the onsets are determined may be tones or half tones in the relevant scale. As onsets of instruments normally are whole multiples of a basic frequency or beat, it has been found advantageous to represent the individual frequencies on a logarithmic scale so that such multiples of frequencies are equidistant and so that transposing to higher or lower beats is very easy.

    摘要翻译: 从音频轨道或其一部分导出信息的方法,其中检测到发射或强度/幅度变化,以及哪些频率(频率频率)或哪些频带发生。 特别有趣的是这种onsets的频率。 以这种方式,低频鼓的节奏频率可以与其他乐器的较高频率鼓或吉他的开头的频率分开,并且这些频率提供关于轨道的重要信息,例如类型,节拍等。自然地 可以提供与各个频率(开启的频率和开启的声音的频率/音调)相关的参数,或者可以使用与其相配合来减少参数的数量。 注意,确定开始的频率可以是相关尺度中的音调或半音。 由于仪器的通常通常是基本频率或节拍的整数倍,已经发现有利的是以对数标度表示各个频率,使得频率的这种倍数是等距的,因此转置到更高或更低的节拍是非常容易的。

    CELP Post-processing for Music Signals
    7.
    发明申请
    CELP Post-processing for Music Signals 有权
    CELP音乐信号后处理

    公开(公告)号:US20100070270A1

    公开(公告)日:2010-03-18

    申请号:US12559739

    申请日:2009-09-15

    申请人: Yang Gao

    发明人: Yang Gao

    IPC分类号: G10L11/04 H04L12/66 G10L19/00

    摘要: In one embodiment, a method of receiving a decoded audio signal that has a transmitted pitch lag is disclosed. The method includes estimating pitch correlations of possible short pitch lags that are smaller than a minimum pitch limitation and have an approximated multiple relationship with the transmitted pitch lag, checking if one of the pitch correlations of the possible short pitch lags is large enough compared to a pitch correlation estimated with the transmitted pitch lag, and selecting a short pitch lag as a corrected pitch lag if a corresponding pitch correlation is large enough. The postprocessing is performed using the corrected pitch lag. In another embodiment, when the existence of irregular harmonics or wrong pitch lag is detected, a coded-excited linear prediction (CELP) postfilter is made more aggressive.

    摘要翻译: 在一个实施例中,公开了一种接收具有传输音调滞后的解码音频信号的方法。 该方法包括估计小于最小音调限制的可能的短音调滞后的音调相关性,并且与所传输的音调滞后具有近似的多重关系,检查可能的短音调滞后的音调相关性之一是否足够大 如果相应的音调相关性足够大,则以传输的音调滞后估计音调相关性,并且选择短音调滞后作为校正的音调滞后。 后处理使用校正的音调滞后来执行。 在另一个实施例中,当检测到不规则谐波或错误音调滞后的存在时,编码激励线性预测(CELP)后置滤波器变得更具侵略性。

    Beat analysis of musical signals
    8.
    发明授权
    Beat analysis of musical signals 有权
    节奏分析音乐信号

    公开(公告)号:US07132595B2

    公开(公告)日:2006-11-07

    申请号:US11264326

    申请日:2005-11-01

    IPC分类号: G10H7/00

    摘要: A system that analyzes music to detect musical beats and to rectify beats that are out of sync with the actual beat phase of the music. The music analysis includes onset detection, tempo/meter estimation, and beat analysis, which includes the rectification of out-of-sync beats.

    摘要翻译: 一种分析音乐以检测音乐节奏并纠正与音乐的实际节奏不同步的节奏的系统。 音乐分析包括开始检测,速度/仪表估计和拍频分析,其中包括纠错不同步拍摄。

    Method for the synchronization of two digital data flows with identical content
    9.
    发明授权
    Method for the synchronization of two digital data flows with identical content 失效
    具有相同内容的两个数字数据流的同步方法

    公开(公告)号:US07043684B2

    公开(公告)日:2006-05-09

    申请号:US10502379

    申请日:2003-01-23

    申请人: Alexandre Joly

    发明人: Alexandre Joly

    IPC分类号: H03M13/00 G06F11/00

    摘要: The invention relates to a method of synchronizing two digital data streams with the same content, the method comprising the steps of:a) generating at given intervals for each of the two digital data streams S1 and S2 at least two characteristic numbers expressing at least one parameter characteristic of their content;b) generating from said numbers points D1 and D2 for each of the two streams S1 and S2 representing at least one of said characteristic parameters in a space of at least two dimensions, the points D1 corresponding to the stream S1 and the points D2 corresponding to the stream S2 that are situated in a time period T defining trajectories representative of the data streams S1 and S2 to be synchronized;c) shifting the time periods of duration T assigned to the digital data streams S1 and S2 relative to each other by calculating a criterion of superposition of said trajectories having an optimum value representing the required synchronization;d) choosing the shift between the time periods corresponding to said optimum value as a value representative of the synchronization.

    摘要翻译: 本发明涉及一种同步具有相同内容的两个数字数据流的方法,所述方法包括以下步骤:a)以给定间隔为两个数字数据流S 1和S 2中的每一个生成, 至少两个表示其内容的至少一个参数特征的特征数字; b)从两个流S 1和S 2中的每一个产生所述数字点D 1和D 2 2 >表示至少两个维度的空间中的所述特征参数中的至少一个,与流S 1对应的点D 1和对应于流S 1的点D 2 流S 2,其位于时间段T中,定义要同步的数据流S 1和S 2的轨迹; c)通过计算具有最佳值的所述轨迹的叠加标准来相对于彼此移动分配给数字数据流S 1和S 2的持续时间T的时间段 表示所需同步的值; d)选择对应于所述最佳值的时间段之间的偏移量作为表示同步的值。

    Method for music analysis
    10.
    发明申请
    Method for music analysis 失效
    音乐分析方法

    公开(公告)号:US20050217461A1

    公开(公告)日:2005-10-06

    申请号:US10823536

    申请日:2004-04-14

    申请人: Chun-Yi Wang

    发明人: Chun-Yi Wang

    摘要: A method for music analysis. The method includes the steps of acquiring a music soundtrack, re-sampling an audio stream of the music soundtrack so that the re-sampled audio stream is composed of blocks, applying FFT to each block, deriving a vector from each transformed block, wherein the vector components are energy summations of the block within different sub-bands, applying auto-correlation to each sequence composed of the vector components of all the blocks in the same sub-band using different tempo values, wherein, for each sequence, a largest correlation result is identified as a confidence value and the tempo value generating the largest correlation result is identified as an estimated tempo, and comparing the confidence values of all the sequences to identify the estimated tempo having the largest confidence value as a final estimated tempo.

    摘要翻译: 音乐分析方法。 该方法包括以下步骤:获取音乐声轨,对音乐声轨的音频流进行重新采样,使得重新采样的音频流由块组成,对每个块应用FFT,从每个变换块导出向量,其中, 向量分量是不同子带内的块的能量相加,使用不同的速度值对相同子带中的所有块的矢量分量组成的每个序列应用自相关,其中对于每个序列,最大相关 将结果识别为置信度值,并且将产生最大相关结果的速度值识别为估计速度,并且比较所有序列的置信度值,以将具有最大置信度值的估计速度识别为最终估计速度。