Abstract:
Described herein is a computerized method for filtering a digital audio file to generate an output audio file that, on playback, induces optimal health and cognitive ability in a listener. The method includes the steps of identifying a plurality of target frequencies that lie within an octave, identifying a plurality of mid-point frequencies, each situated midway between two adjacent target frequencies, applying to the digital audio file a peaking filter centered on each of the mid-point frequencies so that attenuation is greatest at the mid-point frequencies, and generating the output audio file.
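The mid-point and peaking-filter steps can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say whether a mid-point is arithmetic or geometric (the arithmetic mean is used here), and the target frequencies, sample rate, gain, and Q are hypothetical values. The coefficient formulas are the widely used Audio EQ Cookbook peaking-EQ biquad, with a negative gain so the filter attenuates at its center.

```python
import cmath
import math

def midpoints(targets):
    """Arithmetic mid-point between each pair of adjacent target frequencies (assumed reading)."""
    ordered = sorted(targets)
    return [(lo + hi) / 2.0 for lo, hi in zip(ordered, ordered[1:])]

def peaking_coeffs(f0, fs, gain_db, q):
    """Peaking-EQ biquad coefficients (Audio EQ Cookbook form), normalized so a[0] == 1."""
    amp = 10.0 ** (gain_db / 40.0)
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    num = [1.0 + alpha * amp, -2.0 * math.cos(w0), 1.0 - alpha * amp]
    den = [1.0 + alpha / amp, -2.0 * math.cos(w0), 1.0 - alpha / amp]
    return [v / den[0] for v in num], [v / den[0] for v in den]

def gain_db_at(b, a, f, fs):
    """Magnitude response of the biquad at frequency f, in dB."""
    z1 = cmath.exp(-2j * math.pi * f / fs)  # z^-1 on the unit circle
    h = (b[0] + b[1] * z1 + b[2] * z1 * z1) / (a[0] + a[1] * z1 + a[2] * z1 * z1)
    return 20.0 * math.log10(abs(h))

targets = [440.0, 550.0, 660.0]  # hypothetical target frequencies within one octave
mids = midpoints(targets)        # one mid-point per adjacent pair
# One attenuating peaking filter per mid-point; -12 dB and Q = 8 are hypothetical.
filters = [peaking_coeffs(f, 48000.0, -12.0, 8.0) for f in mids]
```

Each filter reaches its full attenuation exactly at its mid-point frequency and attenuates less toward the neighboring target frequencies, which is the behavior the abstract describes.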
Abstract:
A method for converting speech using phonetic posteriorgrams (PPGs). A target speech is obtained and a PPG is generated based on acoustic features of the target speech. Generating the PPG may include using a speaker-independent automatic speech recognition (SI-ASR) system for equalizing different speakers. The PPG includes a set of values corresponding to a range of times and a range of phonetic classes, the phonetic classes corresponding to senones. A mapping between the PPG and one or more segments of the target speech is generated. A source speech is obtained, and the source speech is converted into a converted speech based on the PPG and the mapping.
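As a rough illustration of the data structure, a PPG can be held as a matrix with one row per time frame and one column per phonetic class, each row being a posterior distribution. The sketch below assumes the recognizer emits per-frame class scores that are normalized with a softmax; the scores, the class count, and the function names are hypothetical and stand in for the output of an SI-ASR system.

```python
import math

def softmax(scores):
    """Turn raw class scores into posterior probabilities that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def posteriorgram(frame_scores):
    """Build a time-by-class matrix: one posterior distribution per frame."""
    return [softmax(scores) for scores in frame_scores]

# Hypothetical scores for 4 frames over 3 phonetic classes.
frame_scores = [
    [2.0, 0.5, 0.1],
    [1.0, 1.0, 1.0],
    [0.2, 3.0, 0.3],
    [0.0, 0.1, 2.5],
]
ppg = posteriorgram(frame_scores)
# Most likely class index for each frame (ties resolve to the lowest index).
best_class_per_frame = [row.index(max(row)) for row in ppg]
```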
Abstract:
A voice signal may be adjusted to mask traits such as the gender of a speaker by separating source and filter components of a voice signal using cepstral analysis, adjusting the components based on pitch and formant parameters, and synthesizing a modified signal. Features are disclosed to support real-time voice masking in a computer network by limiting computational complexity and reducing delays in processing and transmission while maintaining signal quality.
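The source/filter separation step can be sketched with a real cepstrum and a simple lifter. This is a minimal illustration, not the disclosed implementation: it assumes a short frame, a naive DFT for self-containment, and a hypothetical quefrency cutoff. The low-quefrency part approximates the filter (spectral envelope, formants) and the high-quefrency part approximates the source (pitch excitation). The later adjustment and synthesis steps are not shown.

```python
import cmath
import math

def dft(x):
    """Naive discrete Fourier transform (adequate for short frames)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(spec):
    """Naive inverse discrete Fourier transform."""
    n = len(spec)
    return [sum(spec[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]

def real_cepstrum(frame):
    """Inverse DFT of the log magnitude spectrum."""
    log_mag = [math.log(abs(v) + 1e-12) for v in dft(frame)]  # small offset avoids log(0)
    return [c.real for c in idft(log_mag)]

def split_cepstrum(cep, cutoff):
    """Split into low-quefrency (filter) and high-quefrency (source) parts."""
    n = len(cep)
    low = [c if (q < cutoff or q > n - cutoff) else 0.0 for q, c in enumerate(cep)]
    high = [c - l for c, l in zip(cep, low)]
    return low, high

# Hypothetical 64-sample frame made of two sinusoids.
frame = [math.sin(2 * math.pi * 4 * t / 64) + 0.3 * math.sin(2 * math.pi * 12 * t / 64)
         for t in range(64)]
cep = real_cepstrum(frame)
low, high = split_cepstrum(cep, cutoff=8)  # cutoff of 8 is a hypothetical choice
```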
Abstract:
In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from either (i) an active mode, in which the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, or (ii) an inactive mode, in which the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and, based on control information, the pitch filter can be selectively operated in either the active mode or the inactive mode while operating in the coding mode.
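The mode switch can be pictured with a small sketch. It assumes, purely for illustration, that the filtering information is a pitch lag and a gain and that the filter is a one-tap recursive comb filter; the abstract does not specify either, and the function and parameter names are hypothetical.

```python
def pitch_filter(samples, active, lag, gain):
    """Apply a one-tap recursive comb filter when active; pass through when inactive."""
    if not active:
        return list(samples)  # inactive mode: the filter is bypassed
    out = []
    for n, x in enumerate(samples):
        past = out[n - lag] if n >= lag else 0.0  # output delayed by one pitch lag
        out.append(x + gain * past)
    return out

impulse = [1.0, 0.0, 0.0, 0.0]
# The 'active' flag stands in for the control information selecting the mode.
bypassed = pitch_filter(impulse, active=False, lag=2, gain=0.5)
filtered = pitch_filter(impulse, active=True, lag=2, gain=0.5)
```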
Abstract:
Disclosed are a method, an apparatus, and a non-transitory computer-readable storage medium configured to examine an input audio signal to determine an estimated pitch period associated with a detection criterion, encode the filtered audio signal to generate a compressed audio bit stream, decode the compressed audio bit stream to generate a decoded audio signal, and adaptively filter the decoded audio signal to produce an output audio signal. Adaptively filtering the decoded audio signal comprises filtering each of a plurality of segments of the decoded audio signal in a manner that depends on the estimated pitch period associated with that segment, and a pitch-based post-filter operates to shape noise components located between harmonics of the input audio signal.
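Per-segment pitch-period estimation might look like the sketch below. It assumes that the detection criterion is the peak of a normalized autocorrelation over a lag range, which the abstract does not state; the segment length, lag bounds, and test signal are hypothetical.

```python
import math

def estimate_pitch_period(segment, min_lag, max_lag):
    """Return the lag (in samples) with the highest mean autocorrelation."""
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        terms = len(segment) - lag
        score = sum(segment[n] * segment[n - lag] for n in range(lag, len(segment))) / terms
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag

def periods_per_segment(signal, segment_len, min_lag, max_lag):
    """Estimate one pitch period for each consecutive full-length segment."""
    return [estimate_pitch_period(signal[i:i + segment_len], min_lag, max_lag)
            for i in range(0, len(signal) - segment_len + 1, segment_len)]

# Hypothetical input: a sinusoid with a 20-sample period, two segments of 200 samples.
signal = [math.sin(2 * math.pi * n / 20) for n in range(400)]
periods = periods_per_segment(signal, segment_len=200, min_lag=5, max_lag=30)
```

A post-filter could then use each segment's period to place its notches between the harmonics of that segment.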
Abstract:
Multi-user portable electronic devices for improving reading ability and/or comprehension for a plurality of subjects are provided. The multi-user portable electronic devices may include a pitch shifter circuit configured to generate frequency altered auditory speech feedback (FAF) signals corresponding to respective auditory speech signals received from respective active microphones, and to transmit the respective FAF signals to the plurality of subjects while one or more of the plurality of subjects are respectively reading aloud, to improve the plurality of subjects' reading ability and/or comprehension. The multi-user portable electronic devices may also include a switch configured to activate the microphones selectively, serially. Related methods and systems are also described.
Abstract:
An electronic device and a corresponding method for analyzing and playing sound signals are provided. The electronic device includes a microphone, a processor, and a speaker. The microphone receives a sound and generates a sound signal according to the sound. The processor is coupled to the microphone for analyzing the sound signal to obtain an analysis parameter, determining a dynamic range parameter according to the analysis parameter, and adjusting the sound signal according to the dynamic range parameter. The speaker is coupled to the processor for playing the adjusted sound signal.
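The analyze, determine, and adjust sequence can be sketched as follows. The abstract does not define the analysis parameter or the dynamic range parameter, so this sketch assumes an RMS level as the analysis parameter and a compression ratio as the dynamic range parameter; the threshold, the ratio rule, and all names are hypothetical.

```python
import math

def rms(samples):
    """Analysis parameter (assumed): root-mean-square level of the signal."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def choose_ratio(level, loud_level=0.5):
    """Dynamic range parameter (hypothetical rule): 4:1 for loud input, 2:1 otherwise."""
    return 4.0 if level >= loud_level else 2.0

def compress(samples, threshold, ratio):
    """Reduce the part of each sample magnitude that exceeds the threshold."""
    out = []
    for s in samples:
        mag = abs(s)
        if mag > threshold:
            mag = threshold + (mag - threshold) / ratio
        out.append(math.copysign(mag, s))
    return out

signal = [0.1, -0.9, 0.8, -0.2]  # hypothetical microphone samples
level = rms(signal)
ratio = choose_ratio(level)
adjusted = compress(signal, threshold=0.5, ratio=ratio)
```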
Abstract:
Systems, including methods and apparatus, are disclosed for applying audio effects to a non-ambient signal, based at least in part on information received in an ambient audio signal. Exemplary effects that can be applied using the present teachings include generation of harmony notes, pitch-correction of melody notes, and tempo-based effects that rely on beat detection.
Abstract:
The present disclosure is directed towards an audio conferencing method. Some embodiments may include receiving, at a first mixing device, an audio signal from a first user associated with an audio conference. Embodiments may further include processing the audio signal at the first mixing device to generate a processed audio signal and transmitting the processed audio signal to a second mixing device, wherein the first mixing device and the second mixing device are distributed over a network in a cascaded configuration. Embodiments may also include receiving, at the second mixing device, a third audio signal from a second user associated with the audio conference and processing the third audio signal at the second mixing device to generate a second processed audio signal.
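The cascaded arrangement can be pictured with a toy sketch. It assumes that each device's processing is a simple gain and that the second device sums the forwarded signal with its own processed signal; neither detail is stated in the abstract, and the signals and function names are hypothetical.

```python
def process(signal, gain):
    """Hypothetical per-device processing: scale every sample by a gain."""
    return [gain * s for s in signal]

def mix(*signals):
    """Sum equal-length signals sample by sample."""
    return [sum(frame) for frame in zip(*signals)]

user1 = [0.2, 0.4, -0.2]  # audio received at the first mixing device
user2 = [0.1, -0.1, 0.3]  # audio received at the second mixing device

# First device: process locally, then forward the result downstream.
forwarded = process(user1, gain=0.5)

# Second device: process its own input and combine it with the forwarded signal.
second_processed = process(user2, gain=0.5)
downstream_mix = mix(forwarded, second_processed)
```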