Disambiguation of contact information using historical and context data
    1.
    发明授权
    Disambiguation of contact information using historical and context data 有权
    使用历史和上下文数据消除联系信息

    公开(公告)号:US08688450B2

    公开(公告)日:2014-04-01

    申请号:US13545744

    申请日:2012-07-10

    IPC分类号: G10L15/06 G10L15/00

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for disambiguating contact information are described. A method includes determining, for each of multiple communications that were initiated by a user of a mobile device, a time when the communication was initiated or received; determining, for each of multiple contacts associated with the user, a probability associated with the contact based at least on the times when the communications were initiated or received; weighting a contact disambiguation grammar according to the probabilities; and processing audio data using the contact disambiguation grammar to select a particular contact.

    摘要翻译: 描述了包括在计算机存储介质上编码的计算机程序用于消除联系信息的方法,系统和装置。 一种方法包括:确定由移动设备的用户发起的多个通信中的每一个,通信被启动或接收的时间; 至少基于通信被启动或接收的时间,确定与用户相关联的多个联系人中的每一个与联系人相关联的概率; 根据概率加权联系消歧语法; 并使用联系消歧语法来处理音频数据,以选择特定的联系人。

    Speech and noise models for speech recognition
    2.
    发明授权
    Speech and noise models for speech recognition 有权
    用于语音识别的语音和噪声模型

    公开(公告)号:US08666740B2

    公开(公告)日:2014-03-04

    申请号:US13530614

    申请日:2012-06-22

    IPC分类号: G10L15/20

    CPC分类号: G10L15/20 G10L21/0208

    摘要: An audio signal generated by a device based on audio input from a user may be received. The audio signal may include at least a user audio portion that corresponds to one or more user utterances recorded by the device. A user speech model associated with the user may be accessed and a determination may be made background audio in the audio signal is below a defined threshold. In response to determining that the background audio in the audio signal is below the defined threshold, the accessed user speech model may be adapted based on the audio signal to generate an adapted user speech model that models speech characteristics of the user. Noise compensation may be performed on the received audio signal using the adapted user speech model to generate a filtered audio signal with reduced background audio compared to the received audio signal.

    摘要翻译: 可以接收由基于来自用户的音频输入的设备生成的音频信号。 音频信号可以包括至少一个对应于由该设备记录的一个或多个用户话语的用户音频部分。 可以访问与用户相关联的用户语音模型,并且可以确定音频信号中的背景音频低于定义的阈值。 响应于确定音频信号中的背景音频低于定义的阈值,可以基于音频信号来调整所访问的用户语音模型,以生成对用户的语音特征进行建模的适配的用户语音模型。 可以使用适配的用户语音模型对所接收的音频信号执行噪声补偿,以生成与接收的音频信号相比具有降低的背景音频的滤波音频信号。

    Position and orientation determination for a mobile computing device
    3.
    发明授权
    Position and orientation determination for a mobile computing device 有权
    移动计算设备的位置和方向确定

    公开(公告)号:US08648799B1

    公开(公告)日:2014-02-11

    申请号:US13249364

    申请日:2011-09-30

    申请人: Matthew I. Lloyd

    发明人: Matthew I. Lloyd

    IPC分类号: G09G5/00

    CPC分类号: G06F3/017 G06F3/0346

    摘要: For multiple times in a time period, multiple data points can be received from an accelerometer and from a magnetometer that are included in a mobile computing device. For each of the data points, an orientation and a position of the mobile computing device can be determined based on an acceleration output and a magnetometer output that corresponds to the particular time. A trajectory is determined that represents movement of the mobile computing device during the time period based on the determined orientations and positions of the mobile computing device at the multiple times. Information that characterizes the trajectory is compared to stored information that characterizes a set of one or more base trajectories. Based on the comparison, an operation of the mobile computing device is identified that is associated with a trajectory included in the set of one or more base trajectories.

    摘要翻译: 在一段时间内多次,可以从加速度计和包括在移动计算设备中的磁力计接收多个数据点。 对于每个数据点,可以基于对应于特定时间的加速度输出和磁力计输出来确定移动计算设备的方向和位置。 基于所确定的多个移动计算设备的方向和位置,确定表示在该时间段期间的移动计算设备的移动的轨迹。 将表征轨迹的信息与表征一个或多个基本轨迹的集合的存储信息进行比较。 基于该比较,识别与一个或多个基本轨迹的集合中包括的轨迹相关联的移动计算设备的操作。

    GEOTAGGED ENVIRONMENTAL AUDIO FOR ENHANCED SPEECH RECOGNITION ACCURACY
    4.
    发明申请
    GEOTAGGED ENVIRONMENTAL AUDIO FOR ENHANCED SPEECH RECOGNITION ACCURACY 有权
    GEOTAGGED环境音频用于增强语音识别精度

    公开(公告)号:US20120296643A1

    公开(公告)日:2012-11-22

    申请号:US13564636

    申请日:2012-08-01

    IPC分类号: G10L21/02

    CPC分类号: G10L21/0208 G10L15/20

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving an audio signal that corresponds to an utterance recorded by a mobile device, determining a geographic location associated with the mobile device, identifying a set of geotagged audio signals that correspond to environmental audio associated with the geographic location, weighting each geotagged audio signal of the set of geotagged audio signals based on metadata associated with the respective geotagged audio signal, and using the set of weighted geotagged audio signals to perform noise compensation on the audio signal that corresponds to the utterance.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于增强语音识别精度。 在一个方面,一种方法包括接收对应于由移动设备记录的话语的音频信号,确定与移动设备相关联的地理位置,识别与地理位置相关联的环境音频对应的一组地理标记音频信号, 基于与相应的地理标记音频信号相关联的元数据,对所述一组地理标记音频信号的每个地理标记音频信号进行加权,并且使用该组加权的地理标记音频信号对对应于话语的音频信号执行噪声补偿。

    Content item location arrangement
    5.
    发明授权
    Content item location arrangement 有权
    内容项目位置安排

    公开(公告)号:US08311875B1

    公开(公告)日:2012-11-13

    申请号:US11928840

    申请日:2007-10-30

    申请人: Matthew I. Lloyd

    发明人: Matthew I. Lloyd

    IPC分类号: G06Q10/00

    CPC分类号: G06Q30/0241

    摘要: One or more content items associated with a content property are identified, each of the one or more content items associated with one or more performance measures. A rank score is determined for each of the one or more content items. One or more locations are identified for display proximate to the one or more content items based on the rank score for each of the one or more content items, and one or more other content items are provided for display in each of the one or more content item locations.

    摘要翻译: 识别与内容属性相关联的一个或多个内容项,与一个或多个性能测量相关联的一个或多个内容项中的每一个。 确定一个或多个内容项目中的每一个的等级分数。 基于一个或多个内容项目中的每一个的等级分数,识别一个或多个位置用于显示在一个或多个内容项目附近的显示,并且提供一个或多个其他内容项目以显示在所述一个或多个内容 项目位置。

    Speech recognition using dock context
    6.
    发明授权
    Speech recognition using dock context 有权
    使用设备对接语境进行语音识别

    公开(公告)号:US08296142B2

    公开(公告)日:2012-10-23

    申请号:US13040553

    申请日:2011-03-04

    IPC分类号: G10L15/18 G10L15/00

    摘要: Methods, systems, and apparatuses, including computer programs encoded on a computer storage medium, for performing speech recognition using dock context. In one aspect, a method includes accessing audio data that includes encoded speech. Information that indicates a docking context of a client device is accessed, the docking context being associated with the audio data. A plurality of language models is identified. At least one of the plurality of language models is selected based on the docking context. Speech recognition is performed on the audio data using the selected language model to identify a transcription for a portion of the audio data.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用码头上下文进行语音识别。 一方面,一种方法包括访问包括已编码语音的音频数据。 访问指示客户端设备的对接上下文的信息,对接上下文与音频数据相关联。 识别出多种语言模型。 基于对接上下文选择多个语言模型中的至少一个。 使用所选择的语言模型对音频数据执行语音识别,以识别音频数据的一部分的转录。

    Geotagged and weighted environmental audio for enhanced speech recognition accuracy
    7.
    发明授权
    Geotagged and weighted environmental audio for enhanced speech recognition accuracy 有权
    地理标记和加权环境音频,以提高语音识别精度

    公开(公告)号:US08175872B2

    公开(公告)日:2012-05-08

    申请号:US13250843

    申请日:2011-09-30

    IPC分类号: G10L21/02 G10L15/00

    CPC分类号: G10L21/0208 G10L15/20

    摘要: Enhancing noisy speech recognition accuracy by receiving geotagged audio signals that correspond to environmental audio recorded by multiple mobile devices in multiple geographic locations, receiving an audio signal that corresponds to an utterance recorded by a particular mobile device, determining a particular geographic location associated with the particular mobile device, selecting a subset of geotagged audio signals and weighting each geotagged audio signal of the subset based on whether the respective audio signal was manually uploaded or automatically updated, generating a noise model for the particular geographic location using the subset of weighted geotagged audio signals, where noise compensation is performed on the audio signal that corresponds to the utterance using the noise model that has been generated for the particular geographic location.

    摘要翻译: 通过接收与多个地理位置中的多个移动设备记录的环境音频相对应的地理标记音频信号来增强噪声语音识别精度,接收对应于由特定移动设备记录的话语的音频信号,确定与该特定移动设备相关联的特定地理位置 移动设备,基于是否手动上传或自动更新相应的音频信号,选择地理标记的音频信号的子集并对该子集的每个地理标记音频信号进行加权,使用加权的地理标记音频信号的子集生成特定地理位置的噪声模型 使用对特定地理位置生成的噪声模型对与发音对应的音频信号执行噪声补偿。

    PROGRESSIVE ENCODING OF AUDIO
    8.
    发明申请

    公开(公告)号:US20120084089A1

    公开(公告)日:2012-04-05

    申请号:US13250576

    申请日:2011-09-30

    IPC分类号: G10L19/00

    摘要: The present disclosure includes processing a signal to generate a first sub-set of data, transmitting the first sub-set of data for generation of a reconstructed audio signal, the reconstructed audio signal having a fidelity relative to the signal, processing the signal to generate a second sub-set of data and a third sub-set of data, the second sub-set of data defining a second portion of the signal and comprising data that is different than data of the first sub-set of data, and the third sub-set of data defining a third portion of the signal and comprising data that is different than data of the first and second sub-sets of data, comparing a priority of the second sub-set of data to a priority of the third sub-set of data, and transmitting one of the second sub-set of data and the third sub-set of data over the network for improving the fidelity.

    PROGRESSIVE ENCODING OF AUDIO
    9.
    发明申请
    PROGRESSIVE ENCODING OF AUDIO 有权
    音频编码

    公开(公告)号:US20120083910A1

    公开(公告)日:2012-04-05

    申请号:US12895258

    申请日:2010-09-30

    IPC分类号: G06F17/00

    摘要: The present disclosure includes processing a signal to generate a first sub-set of data, transmitting the first sub-set of data for generation of a reconstructed audio signal, the reconstructed audio signal having a fidelity relative to the signal, processing the signal to generate a second sub-set of data and a third sub-set of data, the second sub-set of data defining a second portion of the signal and comprising data that is different than data of the first sub-set of data, and the third sub-set of data defining a third portion of the signal and comprising data that is different than data of the first and second sub-sets of data, comparing a priority of the second sub-set of data to a priority of the third sub-set of data, and transmitting one of the second sub-set of data and the third sub-set of data over the network for improving the fidelity.

    摘要翻译: 本公开包括处理信号以产生第一数据子集,发送用于生成重构音频信号的第一数据子集,重构音频信号相对于该信号具有保真度,处理该信号以产生 第二子集数据和第三子数据集合,所述第二子数据集定义所述信号的第二部分,并且包括与所述第一数据子集的数据不同的数据,以及所述第三子集 定义信号的第三部分的数据子集包括与第一和第二数据子集的数据不同的数据,将第二子数据集的优先级与第三子集的优先级进行比较, 并且通过网络发送数据的第二子集和第三子数据集中的一个,以提高保真度。

    DISAMBIGUATION OF CONTACT INFORMATION USING HISTORICAL DATA
    10.
    发明申请
    DISAMBIGUATION OF CONTACT INFORMATION USING HISTORICAL DATA 有权
    使用历史数据分析联系信息

    公开(公告)号:US20110288868A1

    公开(公告)日:2011-11-24

    申请号:US12782862

    申请日:2010-05-19

    IPC分类号: G10L15/04 G06N5/02 H04W4/12

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for disambiguating contact information. A method includes receiving an audio signal, generating an affinity score based on a frequency with which a user has previously communicated with a contact associated with an item of contact information, and further based on a recency of one or more past interactions between the user and the contact associated with the item of contact information, inferring a probability that the user intends to initiate a communication using the item of contact information based on the affinity score generated for the item of contact information, and generating a communication initiation grammar.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于消除联系信息的歧义。 一种方法包括接收音频信号,基于用户先前已经与与联系人信息项相关联的联系人进行通信的频率生成亲和度分数,并且还基于用户和 所述联系人与所述联系人信息项相关联,基于为所述联系人信息项生成的所述亲和度得出推断所述用户意图使用所述联系人信息项发起通信的概率,以及生成通信开始语法。