摘要:
A musical piece structure analysis method includes acquiring an acoustic signal of a musical piece, extracting a first feature amount indicating changes in tone from the acoustic signal of the musical piece, extracting a second feature amount indicating changes in chords from the acoustic signal of the musical piece, outputting a first boundary likelihood indicating likelihood of a constituent boundary of the musical piece from the first feature amount using a first learning model, outputting a second boundary likelihood indicating likelihood of the constituent boundary of the musical piece from the second feature amount using a second learning model, identifying the constituent boundary of the musical piece by performing weighted synthesis of the first boundary likelihood and the second boundary likelihood, and dividing the acoustic signal of the musical piece into a plurality of sections at the constituent boundary that has been identified.
摘要:
Embodiments of the present invention provide for the receipt of unprocessed audio. Musical information is retrieved or extracted from the same. This musical information may then be used to generate collaborative social co-creations of musical content, identify particular musical tastes, and search for content that corresponds to identified musical tastes.
摘要:
A method includes determining a first spectrogram of the audio signal, defining a similarity matrix of the audio signal based on the first spectrogram and a transposed version of the first spectrogram, identifying two or more similar frames in the similarity matrix that are more similar to a designated frame than to one or more other frames in the similarity matrix, creating a repeating spectrogram model based on the two or more similar frames that are identified in the similarity matrix, and deriving a mask based on the repeating spectrogram model and the first spectrogram of the audio signal. The mask is representative of similarities between the repeating spectrogram model and the first spectrogram of the audio signal. The method also includes extracting a repeating structure from the audio signal by applying the mask to the audio signal.
摘要:
A method for extracting the most representative segments of a musical composition, represented by an audio signal, according to which the audio signal is preprocessed by a set of preprocessors, each if which is adapted to identify a rhythmic pattern. The output of the preprocessors that provided the most periodic or rhythmical patterns in the musical composition selected and the musical composition is divided into bars with rhythmic patterns, while iteratively checking and scoring their quality and detecting a section that is a sequence of bars with score above a predetermined threshold. Checking and scoring is iteratively repeated until all sections are detected. Then similarity matrices between all bars that belong to the musical composition are constructed, based on MFCCs of the processed sound, chromograms and the rhythmic patterns. Then equivalent classes of similar sections are extracted along the musical composition. Substantial transitions between sections represented as blocks in the similarity matrices are collected and a representative segment is selected from each class with the highest number of sections.
摘要:
A system and method for the automatic, procedural generation of musical content in relation to biometric data. The systems and methods use a user's device, such as a cell phone to capture image data of a body part, and derive a biometric signal from the image data. The biometric signal includes biometric parameters, which are used by a music generation engine to generate music. The music generation can also be based on user-specific data and quality data related to the biometric detection process.
摘要:
A method of deriving information from an audio track, or a part thereof, wherein onsets or intensity/amplitude variations are detected as well as at which frequencies (timbral frequencies) or in which frequency bands these occur. Especially interesting is the frequency of such onsets. In this manner, the frequency of beats of a low frequency drum may be separated from that of onsets of a higher frequency drum or guitar of other instrument, and these frequencies provide important information about the track, such as genre, beat, etc. Naturally, parameters may be provided relating to the individual frequencies (frequency of onsets and frequency/tone of the sound of the onsets), or a fit thereto may be used to reduce the number of parameters. It is noted that the frequencies in which the onsets are determined may be tones or half tones in the relevant scale. As onsets of instruments normally are whole multiples of a basic frequency or beat, it has been found advantageous to represent the individual frequencies on a logarithmic scale so that such multiples of frequencies are equidistant and so that transposing to higher or lower beats is very easy.
摘要:
In one embodiment, a method of receiving a decoded audio signal that has a transmitted pitch lag is disclosed. The method includes estimating pitch correlations of possible short pitch lags that are smaller than a minimum pitch limitation and have an approximated multiple relationship with the transmitted pitch lag, checking if one of the pitch correlations of the possible short pitch lags is large enough compared to a pitch correlation estimated with the transmitted pitch lag, and selecting a short pitch lag as a corrected pitch lag if a corresponding pitch correlation is large enough. The postprocessing is performed using the corrected pitch lag. In another embodiment, when the existence of irregular harmonics or wrong pitch lag is detected, a coded-excited linear prediction (CELP) postfilter is made more aggressive.
摘要:
A system that analyzes music to detect musical beats and to rectify beats that are out of sync with the actual beat phase of the music. The music analysis includes onset detection, tempo/meter estimation, and beat analysis, which includes the rectification of out-of-sync beats.
摘要:
The invention relates to a method of synchronizing two digital data streams with the same content, the method comprising the steps of:a) generating at given intervals for each of the two digital data streams S1 and S2 at least two characteristic numbers expressing at least one parameter characteristic of their content;b) generating from said numbers points D1 and D2 for each of the two streams S1 and S2 representing at least one of said characteristic parameters in a space of at least two dimensions, the points D1 corresponding to the stream S1 and the points D2 corresponding to the stream S2 that are situated in a time period T defining trajectories representative of the data streams S1 and S2 to be synchronized;c) shifting the time periods of duration T assigned to the digital data streams S1 and S2 relative to each other by calculating a criterion of superposition of said trajectories having an optimum value representing the required synchronization;d) choosing the shift between the time periods corresponding to said optimum value as a value representative of the synchronization.
摘要:
A method for music analysis. The method includes the steps of acquiring a music soundtrack, re-sampling an audio stream of the music soundtrack so that the re-sampled audio stream is composed of blocks, applying FFT to each block, deriving a vector from each transformed block, wherein the vector components are energy summations of the block within different sub-bands, applying auto-correlation to each sequence composed of the vector components of all the blocks in the same sub-band using different tempo values, wherein, for each sequence, a largest correlation result is identified as a confidence value and the tempo value generating the largest correlation result is identified as an estimated tempo, and comparing the confidence values of all the sequences to identify the estimated tempo having the largest confidence value as a final estimated tempo.