Abstract:
Stereophonic sound is reproduced by acquiring image depth information indicating a distance between at least one object in an image signal and a reference location; acquiring, based on the image depth information, sound depth information indicating a distance between at least one sound object in a sound signal and a reference location; and providing sound perspective to the at least one sound object based on the sound depth information.
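
The three steps named in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the source: the function names, the 0 to 255 depth-map range, the convention that a larger value means farther from the reference location, the linear mapping from image depth to sound depth, and the use of a plain amplitude gain as the "sound perspective". An actual implementation might instead adjust delay, reverberation, or head-related filtering.

```python
from statistics import mean

def image_depth(depth_map, region):
    """Step 1 (hypothetical): mean depth of an image object.

    `region` is (row0, row1, col0, col1), half-open, marking the object's pixels.
    """
    r0, r1, c0, c1 = region
    return mean(depth_map[r][c] for r in range(r0, r1) for c in range(c0, c1))

def sound_depth(img_depth, max_depth=255):
    """Step 2 (assumed linear): map image depth to a sound depth in [0, 1]."""
    return min(max(img_depth / max_depth, 0.0), 1.0)

def apply_perspective(samples, s_depth, min_gain=0.25):
    """Step 3 (assumed gain-only): attenuate a sound object by its depth.

    Depth 0 (at the reference location) keeps gain 1.0;
    depth 1 (farthest) is scaled down to `min_gain`.
    """
    gain = 1.0 - (1.0 - min_gain) * s_depth
    return [x * gain for x in samples]

# Toy 2x2 depth map: the top row is at the reference location, the bottom row is farthest.
depth_map = [[0, 0], [255, 255]]
far_object = (1, 2, 0, 2)  # bottom row only

d_img = image_depth(depth_map, far_object)
d_snd = sound_depth(d_img)
out = apply_perspective([1.0, -0.5], d_snd)
```

In this toy case the object occupies only the farthest pixels, so its sound depth is 1.0 and its samples are scaled by `min_gain`.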