-
公开(公告)号:US20220223167A1
公开(公告)日:2022-07-14
申请号:US17609314
申请日:2020-03-25
Applicant: SONY GROUP CORPORATION
Inventor: AKIRA TAKAHASHI , KAZUYA TATEISHI , YUICHIRO KOYAMA , HIROAKI OGAWA , CHIE KAMADA , NORIKO TOTSUKA , EMIRO TSUNOO , YUKI TAKEDA , YOSHINORI MAEDA , KAN KURODA , AKIRA FUKUI , HIDEAKI WATANABE
IPC: G10L21/0232 , G10L15/30 , G10L15/22 , H04R3/00 , H04R1/40
Abstract: A device and a method capable of performing audio recognition based on clear user speech by removing an external apparatus output sound from audio input through the audio input unit are realized. The device includes a user spoken voice extraction unit that extracts a user spoken voice from a microphone input sound. The user spoken voice extraction unit analyzes a sound source direction of an input sound, determines whether the input sound includes an external apparatus output sound on the basis of sound source directions of external apparatus output sounds recorded in a database, and removes a sound signal corresponding to a feature amount, for example, a frequency characteristic of the external apparatus output sound recorded in the database, from the input sound to extract a user spoken voice from which the external apparatus output sound has been removed upon determining that the input sound includes the external apparatus output sound.