Abstract:
Methods and apparatus for processing media signals. In one embodiment, a data processing device processes fixed and variable rate data using a first and second processing unit. The processing comprises real-time processing of audio/video signals by a graphics processing unit (GPU) and/or central processing unit (CPU). The processing units process data efficiently by establishing one processor as always processing variable rate data, and using one or more schemes for determining processor will process fixed rate data. A shared memory enables the processors to communicate with one another in order to determine which will process the fixed rate data. In one scheme for determining which of the processors will process the fixed rate data the second processor need merely be unlocked. In another embodiment, the second processor must be unlocked and immediately available.
Abstract:
A video conferencing system is described that includes a near-end and a far-end system. The near-end system records both audio and video of one or more users proximate to the near-end system. This recorded audio and video is transmitted to the far-end system through the data connection. The video stream and/or one or more settings of the recording camera are analyzed to determine the amount of a video frame occupied by the recorded user(s). The video conferencing system may directly analyze the video frames themselves and/or a zoom setting of the recording camera to determine a ratio or percentage of the video frame occupied by the recorded user(s). By analyzing video frames associated with an audio stream, the video conferencing system may drive a speaker array of the far-end system to more accurately reproduce sound content based on the position of the recorded user in a video frame.
Abstract:
A loudspeaker that measures the impulse response of a listening area is described. The loudspeaker may output sounds corresponding to a segment of an audio signal. The sounds are sensed by a listening device proximate to a listener and transmitted to the loudspeaker. The loudspeaker includes an adaptive filter that estimates the impulse response of the listening area based on the signal segment. An error unit analyzes the estimated impulse response together with the sensed audio signal received from the listening device to determine the accuracy of the estimate. New estimates may be generated by the adaptive filter until an accuracy level is achieved for the signal segment. A processor may utilize one or more estimated impulse responses corresponding to various signal segments that cover a defined frequency spectrum for adjusting the audio signal to compensate for the impulse response of the listening area. Other embodiments are also described.
Abstract:
In one embodiment, a process for suppressing reverberation begins with a device of a user obtaining a reverberant speech signal from a voice of the user. The device determines a first estimated reverberation component of the reverberant speech signal. The device generates a first de-reverberated output signal with a first reverberation suppression based on the reverberant speech signal and the first estimated reverberation component. Then, the device generates a second improved reverberation component using the first de-reverberated output signal. The device generates a second de-reverberated output signal with a second reverberation suppression based on the reverberant speech signal and the second improved reverberation component.