-
Publication number: US20170308164A1
Publication date: 2017-10-26
Application number: US15645365
Filing date: 2017-07-10
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Jongwon Shin , Erik Visser
IPC: G06F3/01 , H04N7/15 , H04M3/56 , G10L25/48 , G04G21/00 , G10L21/0216 , G06F1/16 , G01S3/808 , H04R1/40 , H04R3/00 , H04R29/00 , H04S7/00 , G10L17/00
CPC classification number: G06F3/013 , G01S3/8083 , G04G21/00 , G06F1/1613 , G06F3/011 , G10L17/00 , G10L25/48 , G10L2021/02166 , H04M3/568 , H04N7/15 , H04R1/406 , H04R3/005 , H04R29/005 , H04S7/304
Abstract: Disclosed is an application interface that takes into account the user's gaze direction relative to who is speaking in an interactive multi-participant environment where audio-based contextual information and/or visual-based semantic information is being presented. Among these various implementations, two different types of microphone array devices (MADs) may be used. The first type of MAD is a steerable microphone array (a.k.a. a steerable array) which is worn by a user in a known orientation with regard to the user's eyes, and wherein multiple users may each wear a steerable array. The second type of MAD is a fixed-location microphone array (a.k.a. a fixed array) which is placed in the same acoustic space as the users (one or more of which are using steerable arrays).
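The abstract does not say how gaze direction is compared with the speaker's position. As a minimal illustrative sketch, assume the steerable array yields the wearer's gaze bearing in degrees and the fixed array yields a bearing for each active speaker in the same reference frame. The function names, the dictionary layout, and the 15-degree tolerance are all hypothetical and are not taken from the patent.

```python
def angular_difference(a_deg, b_deg):
    """Smallest absolute difference between two bearings, in degrees (0 to 180)."""
    diff = (a_deg - b_deg) % 360.0
    return min(diff, 360.0 - diff)


def gazed_speaker(gaze_deg, speaker_bearings, tolerance_deg=15.0):
    """Return the id of the active speaker nearest the gaze bearing.

    speaker_bearings maps a speaker id to a bearing in degrees (assumed to
    come from the fixed array). Returns None when no speaker lies within
    tolerance_deg of the gaze bearing (an assumed, illustrative threshold).
    """
    best_id, best_diff = None, tolerance_deg
    for speaker_id, bearing in speaker_bearings.items():
        diff = angular_difference(gaze_deg, bearing)
        if diff <= best_diff:
            best_id, best_diff = speaker_id, diff
    return best_id
```

Taking the modulo before comparing keeps bearings on either side of 0 degrees (for example 355 and 5) close together, which a plain subtraction would not.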
-
Publication number: US09319510B2
Publication date: 2016-04-19
Application number: US13768946
Filing date: 2013-02-15
Applicant: Qualcomm Incorporated
Inventor: Lae-Hoon Kim , Sang-Uk Ryu , Jongwon Shin
IPC: H04M1/76 , H04M3/22 , H04M9/08 , G10L21/038 , G10L17/00
CPC classification number: H04M3/22 , G10L17/00 , G10L21/038 , H04M9/08
Abstract: A personalized (i.e., speaker-derivable) bandwidth extension is provided in which the model used for bandwidth extension is personalized (e.g., tailored) to each specific user. A training phase is performed to generate a bandwidth extension model that is personalized to a user. The model may be subsequently used in a bandwidth extension phase during a phone call involving the user. The bandwidth extension phase, using the personalized bandwidth extension model, will be activated when a higher band (e.g., wideband) is not available and the call is taking place on a lower band (e.g., narrowband).
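The activation condition in the last sentence of the abstract can be sketched as a small decision function. This is only an illustration under stated assumptions: the `CallState` type, the band labels, and the `trained_models` lookup are hypothetical names, and the abstract does not describe how trained models are stored or retrieved.

```python
from dataclasses import dataclass


@dataclass
class CallState:
    wideband_available: bool  # whether a higher band can be used for this call
    active_band: str          # "narrowband" or "wideband" (assumed labels)


def select_bwe_model(call, user_id, trained_models):
    """Return the user's personalized model if bandwidth extension should run.

    Bandwidth extension is selected only when wideband is unavailable and
    the call is on narrowband. Returns None otherwise, or when no model
    has been trained for this user.
    """
    if call.wideband_available or call.active_band != "narrowband":
        return None
    return trained_models.get(user_id)
```

For example, `select_bwe_model(CallState(False, "narrowband"), "user-1", {"user-1": model})` returns `model`, while any call with wideband available returns `None`.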
-
Publication number: US20130300648A1
Publication date: 2013-11-14
Application number: US13674789
Filing date: 2012-11-12
Applicant: QUALCOMM INCORPORATED
Inventor: Lae-Hoon Kim , Jongwon Shin , Erik Visser
IPC: G06F3/01
CPC classification number: G06F3/013 , G01S3/8083 , G04G21/00 , G06F1/1613 , G06F3/011 , G10L17/00 , G10L25/48 , G10L2021/02166 , H04M3/568 , H04N7/15 , H04R1/406 , H04R3/005 , H04R29/005 , H04S7/304
Abstract: Disclosed is an application interface that takes into account the user's gaze direction relative to who is speaking in an interactive multi-participant environment where audio-based contextual information and/or visual-based semantic information is being presented. Among these various implementations, two different types of microphone array devices (MADs) may be used. The first type of MAD is a steerable microphone array (a.k.a. a steerable array) which is worn by a user in a known orientation with regard to the user's eyes, and wherein multiple users may each wear a steerable array. The second type of MAD is a fixed-location microphone array (a.k.a. a fixed array) which is placed in the same acoustic space as the users (one or more of which are using steerable arrays).
-
Publication number: US20130282369A1
Publication date: 2013-10-24
Application number: US13827894
Filing date: 2013-03-14
Applicant: QUALCOMM INCORPORATED
Inventor: Erik Visser , Lae-Hoon Kim , Jongwon Shin , Yinyi Guo , Sang-Uk Ryu , Andre Gustavo P. Schevciw
IPC: G10L21/0208
CPC classification number: G10L21/0208 , G10L15/20 , G10L21/0316 , G10L25/93 , G10L2021/02165
Abstract: A method for signal level matching by an electronic device is described. The method includes capturing a plurality of audio signals from a plurality of microphones. The method also includes determining a difference signal based on an inter-microphone subtraction. The difference signal includes multiple harmonics. The method also includes determining whether a harmonicity of the difference signal exceeds a harmonicity threshold. The method also includes preserving the harmonics to determine an envelope. The method further applies the envelope to a noise-suppressed signal.
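The steps listed in the abstract can be illustrated with a rough sketch that works on per-bin magnitude spectra for a single frame. Everything beyond the step order is an assumption made for illustration: the two-microphone setup, a pitch estimate supplied as `f0_bin`, harmonicity measured as the fraction of energy at multiples of `f0_bin`, a piecewise-linear envelope through the harmonic peaks, and "applying" the envelope by raising bins of the noise-suppressed signal up to it. The abstract specifies none of these details.

```python
def inter_mic_difference(primary_mag, secondary_mag):
    """Per-bin magnitude difference between two microphones, floored at zero."""
    return [max(p - s, 0.0) for p, s in zip(primary_mag, secondary_mag)]


def harmonicity(diff_mag, f0_bin):
    """Fraction of energy in bins that are integer multiples of f0_bin."""
    total = sum(m * m for m in diff_mag)
    if total == 0.0 or f0_bin <= 0:
        return 0.0
    harmonic = sum(diff_mag[k] ** 2 for k in range(f0_bin, len(diff_mag), f0_bin))
    return harmonic / total


def harmonic_envelope(diff_mag, f0_bin):
    """Piecewise-linear envelope through the harmonic peaks of diff_mag."""
    peaks = list(range(f0_bin, len(diff_mag), f0_bin))
    env = [0.0] * len(diff_mag)
    for k in peaks:
        env[k] = diff_mag[k]
    for left, right in zip(peaks, peaks[1:]):
        for k in range(left, right + 1):
            t = (k - left) / (right - left)
            env[k] = (1 - t) * diff_mag[left] + t * diff_mag[right]
    return env


def level_match(primary_mag, secondary_mag, suppressed_mag, f0_bin, threshold=0.5):
    """Restore level in a noise-suppressed frame when the difference is harmonic."""
    diff = inter_mic_difference(primary_mag, secondary_mag)
    if harmonicity(diff, f0_bin) <= threshold:
        return list(suppressed_mag)  # not harmonic enough: leave the frame as is
    env = harmonic_envelope(diff, f0_bin)
    # Assumed way of applying the envelope: lift each bin up to it.
    return [max(s, e) for s, e in zip(suppressed_mag, env)]
```

The harmonicity check acts as a gate, so the envelope is only applied to frames where the inter-microphone difference looks like voiced speech.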
-