Beamforming for a wearable computer

    公开(公告)号:US10863270B1

    公开(公告)日:2020-12-08

    申请号:US16361808

    申请日:2019-03-22

    Abstract: A wearable computer is configured to use beamforming techniques to isolate a user's speech from extraneous audio signals occurring within a physical environment. A microphone array of the wearable computer may generate audio signal data from an utterance from a user's mouth. A motion sensor(s) of the wearable computer may generate motion data from movement of the wearable computer. This motion data may be used to determine a direction vector pointing from the wearable computer to the user's mouth, and a beampattern may be defined that has a beampattern direction in substantial alignment with the determined direction vector to focus the microphone array on the user's mouth for speech isolation.

    Speech model retrieval in distributed speech recognition systems
    2.
    发明授权
    Speech model retrieval in distributed speech recognition systems 有权
    分布式语音识别系统中的语音模型检索

    公开(公告)号:US09190057B2

    公开(公告)日:2015-11-17

    申请号:US13712891

    申请日:2012-12-12

    CPC classification number: G10L15/32 G10L15/22 G10L15/30

    Abstract: Features are disclosed for managing the use of speech recognition models and data in automated speech recognition systems. Models and data may be retrieved asynchronously and used as they are received or after an utterance is initially processed with more general or different models. Once received, the models and statistics can be cached. Statistics needed to update models and data may also be retrieved asynchronously so that it may be used to update the models and data as it becomes available. The updated models and data may be immediately used to re-process an utterance, or saved for use in processing subsequently received utterances. User interactions with the automated speech recognition system may be tracked in order to predict when a user is likely to utilize the system. Models and data may be pre-cached based on such predictions.

    Abstract translation: 公开了用于管理语音识别模型和自动语音识别系统中的数据的使用的特征。 可以异步检索模型和数据,并在收到文字或使用更为一般或不同的模型对话语进行初始处理之后进行使用。 一旦收到,模型和统计信息可以被缓存。 还可以异步检索更新模型和数据所需的统计数据,以便可以在模型和数据可用时更新模型和数据。 可以立即使用更新的模型和数据来重新处理话语,或者保存用于处理随后接收的话语。 可以跟踪与自动语音识别系统的用户交互,以便预测用户什么时候可能利用该系统。 基于这样的预测,模型和数据可以被预先缓存。

    Beamforming for a wearable computer

    公开(公告)号:US10244313B1

    公开(公告)日:2019-03-26

    申请号:US15247670

    申请日:2016-08-25

    Abstract: A wearable computer is configured to use beamforming techniques to isolate a user's speech from extraneous audio signals occurring within a physical environment. A microphone array of the wearable computer may generate audio signal data from an utterance from a user's mouth. A motion sensor(s) of the wearable computer may generate motion data from movement of the wearable computer. This motion data may be used to determine a direction vector pointing from the wearable computer to the user's mouth, and a beampattern may be defined that has a beampattern direction in substantial alignment with the determined direction vector to focus the microphone array on the user's mouth for speech isolation.

    Speech model retrieval in distributed speech recognition systems

    公开(公告)号:US10152973B2

    公开(公告)日:2018-12-11

    申请号:US14942551

    申请日:2015-11-16

    Abstract: Features are disclosed for managing the use of speech recognition models and data in automated speech recognition systems. Models and data may be retrieved asynchronously and used as they are received or after an utterance is initially processed with more general or different models. Once received, the models and statistics can be cached. Statistics needed to update models and data may also be retrieved asynchronously so that it may be used to update the models and data as it becomes available. The updated models and data may be immediately used to re-process an utterance, or saved for use in processing subsequently received utterances. User interactions with the automated speech recognition system may be tracked in order to predict when a user is likely to utilize the system. Models and data may be pre-cached based on such predictions.

    Beam forming for a wearable computer
    5.
    发明授权
    Beam forming for a wearable computer 有权
    用于可穿戴式计算机的梁形成

    公开(公告)号:US09432768B1

    公开(公告)日:2016-08-30

    申请号:US14229406

    申请日:2014-03-28

    CPC classification number: H04R3/005 H04R2201/023 H04R2499/11

    Abstract: A wearable computer is configured to use beamforming techniques to isolate a user's speech from extraneous audio signals occurring within a physical environment. A microphone array of the wearable computer may generate audio signal data from an utterance from a user's mouth. A motion sensor(s) of the wearable computer may generate motion data from movement of the wearable computer. This motion data may be used to determine a direction vector pointing from the wearable computer to the user's mouth, and a beampattern may be defined that has a beampattern direction in substantial alignment with the determined direction vector to focus the microphone array on the user's mouth for speech isolation.

    Abstract translation: 可穿戴式计算机被配置为使用波束形成技术来将用户的语音与在物理环境中发生的外来音频信号隔离开来。 可穿戴式计算机的麦克风阵列可以从用户口的话语产生音频信号数据。 可穿戴计算机的运动传感器可以从可穿戴计算机的运动产生运动数据。 该运动数据可以用于确定从可佩戴计算机指向用户嘴部的方向矢量,并且可以限定具有与所确定的方向矢量基本对准的波纹图案方向的波纹图案,以将麦克风阵列聚焦在用户的嘴上 言语隔离。

    SPEECH MODEL RETRIEVAL IN DISTRIBUTED SPEECH RECOGNITION SYSTEMS
    6.
    发明申请
    SPEECH MODEL RETRIEVAL IN DISTRIBUTED SPEECH RECOGNITION SYSTEMS 审中-公开
    分布式语音识别系统中的语音模型检索

    公开(公告)号:US20160071519A1

    公开(公告)日:2016-03-10

    申请号:US14942551

    申请日:2015-11-16

    CPC classification number: G10L15/32 G10L15/22 G10L15/30

    Abstract: Features are disclosed for managing the use of speech recognition models and data in automated speech recognition systems. Models and data may be retrieved asynchronously and used as they are received or after an utterance is initially processed with more general or different models. Once received, the models and statistics can be cached. Statistics needed to update models and data may also be retrieved asynchronously so that it may be used to update the models and data as it becomes available. The updated models and data may be immediately used to re-process an utterance, or saved for use in processing subsequently received utterances. User interactions with the automated speech recognition system may be tracked in order to predict when a user is likely to utilize the system. Models and data may be pre-cached based on such predictions.

    Abstract translation: 公开了用于管理语音识别模型和自动语音识别系统中的数据的使用的特征。 可以异步检索模型和数据,并在收到文字或使用更为一般或不同的模型对话语进行初始处理之后进行使用。 一旦收到,模型和统计信息可以被缓存。 还可以异步检索更新模型和数据所需的统计数据,以便可以在模型和数据可用时更新模型和数据。 可以立即使用更新的模型和数据来重新处理话语,或者保存用于处理随后接收的话语。 可以跟踪与自动语音识别系统的用户交互,以便预测用户什么时候可能利用该系统。 基于这样的预测,模型和数据可以被预先缓存。

    Selective speech recognition scoring using articulatory features
    7.
    发明授权
    Selective speech recognition scoring using articulatory features 有权
    使用发音功能的选择性语音识别评分

    公开(公告)号:US09355636B1

    公开(公告)日:2016-05-31

    申请号:US14027828

    申请日:2013-09-16

    CPC classification number: G10L15/142 G10L15/14 G10L25/48 G10L25/93

    Abstract: Features are provided for selectively scoring portions of user utterances based at least on articulatory features of the portions. One or more articulatory features of a portion of a user utterance can be determined. Acoustic models or subsets of individual acoustic model components (e.g., Gaussians or Gaussian mixture models) can be selected based on the articulatory features of the portion. The portion can then be scored using a selected acoustic model or subset of acoustic model components. The process may be repeated for the multiple portions of the utterance, and speech recognition results can be generated from the scored portions.

    Abstract translation: 提供了特征,用于至少基于部分的关节特征来选择性地评分用户话语的部分。 可以确定用户话语的一部分的一个或多个发音特征。 可以基于该部分的关节特征来选择单个声学模型分量(例如,高斯混合模型或高斯混合模型)的声学模型或子集。 然后可以使用选定的声学模型或声学模型组件的子集对该部分进行评分。 可以对话语的多个部分重复该过程,并且可以从刻痕部分产生语音识别结果。

    SPEECH MODEL RETRIEVAL IN DISTRIBUTED SPEECH RECOGNITION SYSTEMS
    8.
    发明申请
    SPEECH MODEL RETRIEVAL IN DISTRIBUTED SPEECH RECOGNITION SYSTEMS 有权
    分布式语音识别系统中的语音模型检索

    公开(公告)号:US20140163977A1

    公开(公告)日:2014-06-12

    申请号:US13712891

    申请日:2012-12-12

    CPC classification number: G10L15/32 G10L15/22 G10L15/30

    Abstract: Features are disclosed for managing the use of speech recognition models and data in automated speech recognition systems. Models and data may be retrieved asynchronously and used as they are received or after an utterance is initially processed with more general or different models. Once received, the models and statistics can be cached. Statistics needed to update models and data may also be retrieved asynchronously so that it may be used to update the models and data as it becomes available. The updated models and data may be immediately used to re-process an utterance, or saved for use in processing subsequently received utterances. User interactions with the automated speech recognition system may be tracked in order to predict when a user is likely to utilize the system. Models and data may be pre-cached based on such predictions.

    Abstract translation: 公开了用于管理语音识别模型和自动语音识别系统中的数据的使用的特征。 可以异步检索模型和数据,并在收到文字或使用更为一般或不同的模型对话语进行初始处理之后进行使用。 一旦收到,模型和统计信息可以被缓存。 还可以异步检索更新模型和数据所需的统计数据,以便可以在模型和数据可用时更新模型和数据。 可以立即使用更新的模型和数据来重新处理话语,或者保存用于处理随后接收的话语。 可以跟踪与自动语音识别系统的用户交互,以便预测用户什么时候可能利用该系统。 基于这样的预测,模型和数据可以被预先缓存。

Patent Agency Ranking