Intelligent call routing and call supervision method for call centers
    1.
    发明申请
    Intelligent call routing and call supervision method for call centers 有权
    呼叫中心的智能呼叫路由和呼叫监控方法

    公开(公告)号:US20050286705A1

    公开(公告)日:2005-12-29

    申请号:US10869810

    申请日:2004-06-16

    CPC classification number: H04M3/5232 H04M2203/2061

    Abstract: A call routing and supervising system includes an input receiving customer speech from a remote location, and a voice characteristics extractor extracting voice characteristics from the customer speech, such as language/dialect/accent, age group, gender, and eigendimension coordinates. A customer service representative selector selects one or more customer service representatives based on profiles of the customer service representatives respective of customers having voice characteristics similar to the extracted voice characteristics. In other aspects, a call monitor automatically analyzes dialogue between the customer and the customer service representative, such as detected interruptions, tracked dialogue turns, and recognized key phrases indicating frustration, polity, and/or resolution characteristics of dialogue. The call monitor records performance of the customer service representative respective of customers having the voice characteristics. Automatic call rerouting and/or real-time instruction of call center personnel can also be accomplished based on analysis results.

    Abstract translation: 呼叫路由和监督系统包括从远程位置接收客户语音的输入和从客户语音提取语音特征的语音特征提取器,例如语言/方言/口音,年龄组,性别和特征坐标。 客户服务代表选择器基于具有与所提取的语音特征相似的语音特征的客户的客户服务代表的简档来选择一个或多个客户服务代表。 在其他方面,呼叫监视器自动分析客户和客户服务代表之间的对话,诸如检测到的中断,跟踪的对话转弯以及表示对话的挫折,政治和/或解析特征的识别的关键短语。 呼叫监视器记录具有语音特征的客户的客户服务代表的性能。 呼叫中心人员的自动呼叫转接和/或实时指令也可以基于分析结果完成。

    Automated hotel attendant using speech recognition
    2.
    发明授权
    Automated hotel attendant using speech recognition 失效
    自动化酒店服务员使用语音识别

    公开(公告)号:US06314165B1

    公开(公告)日:2001-11-06

    申请号:US09070399

    申请日:1998-04-30

    CPC classification number: G10L15/26

    Abstract: An automated hotel attendant is provided for coordinating room-to-room calling over a telephone switching system that supports multiple telephone extensions. A hotel registration system receives and stores the spelled names of hotel guests as well as assigns each guest an associated telephone extension. A lexicon training system is connected to the hotel registration system for generating pronunciations for each spelled name by converting the characters that spell those names into word-phoneme data. This word-phoneme data is in turn stored in a lexicon that is used by a speech recognition system. In particular, a phoneticizer in conjunction with a Hidden Markov Model (HMM) based model trainer serves as the basis for the lexicon training system, such that one or several HMM models associated with each guest name are stored in the lexicon. An automated attendant is coupled to the speech recognition system for converting a spoken name of a hotel guest entered from one of the telephone extensions into a predefined hotel guest name that can be used to retrieve an assigned telephone extension from the hotel registration system. Next, the automated attendant causes the telephone switching system to call the requested telephone extension in response to the entry of the spoken name from one of the telephone extensions.

    Abstract translation: 提供一个自动化酒店服务人员通过支持多个电话分机的电话交换系统协调房间到房间的通话。 酒店注册系统接收并存储酒店客人的拼写名称,并为每个客人分配相关的电话分机。 词典训练系统连接到酒店注册系统,用于通过将拼写这些名称的字符转换为字音素数据来产生每个拼写名称的发音。 该字音素数据又被存储在由语音识别系统使用的词典中。 特别地,与基于隐马尔可夫模型(HMM)的模型训练器相结合的语音提供者用作词典训练系统的基础,使得与每个来宾姓名相关联的一个或多个HMM模型被存储在词典中。 自动话务员被耦合到语音识别系统,用于将从一个电话分机输入的酒店客人的口语名称转换成可用于从酒店注册系统检索分配的电话分机的预定义酒店客人姓名。 接下来,自动话务员使得电话交换系统响应于从电话分机之一输入口语名称来呼叫所请求的电话分机。

    Method and apparatus using decision trees to generate and score multiple
pronunciations for a spelled word
    3.
    发明授权
    Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word 失效
    使用决策树生成和评分拼写单词的多个发音的方法和设备

    公开(公告)号:US6016471A

    公开(公告)日:2000-01-18

    申请号:US67764

    申请日:1998-04-29

    CPC classification number: G10L13/08

    Abstract: The mixed decision tree includes a network of yes-no questions about adjacent letters in a spelled word sequence and also about adjacent phonemes in the phoneme sequence corresponding to the spelled word sequence. Leaf nodes of the mixed decision tree provide information about which phonetic transcriptions are most probable. Using the mixed trees, scores are developed for each of a plurality of possible pronunciations, and these scores can be used to select the best pronunciation as well as to rank pronunciations in order of probability. The pronunciations generated by the system can be used in speech synthesis and speech recognition applications as well as lexicography applications.

    Abstract translation: 混合决策树包括关于拼写字序列中的相邻字母的是 - 否问题的网络,并且还包括与拼写单词序列相对应的音素序列中的相邻音素。 混合决策树的叶节点提供了哪些语音转录最有可能的信息。 使用混合树,为多个可能的发音中的每一个开发分数,并且这些分数可以用于选择最佳发音以及按概率的排序排列发音。 系统生成的发音可用于语音合成和语音识别应用以及词典应用。

    Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue
    4.
    发明授权
    Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue 有权
    使用意义提取和对话的手持设备中面向目标的语音翻译方法

    公开(公告)号:US06233561B1

    公开(公告)日:2001-05-15

    申请号:US09290628

    申请日:1999-04-12

    CPC classification number: G10L15/1822 G10L15/1815

    Abstract: A computer-implemented method and apparatus is provided for processing a spoken request from a user. A speech recognizer converts the spoken request into a digital format. A frame data structure associates semantic components of the digitized spoken request with predetermined slots. The slots are indicative of data which are used to achieve a predetermined goal. A speech understanding module which is connected to the speech recognizer and to the frame data structure determines semantic components of the spoken request. The slots are populated based upon the determined semantic components. A dialog manager which is connected to the speech understanding module may determine at least one slot which is unpopulated based upon the determined semantic components and in a preferred embodiment may provide confirmation of the populated slots. A computer generated-request is formulated in order for the user to provide data related to the unpopulated slot. The method and apparatus are well-suited (but not limited) to use in a hand-held speech translation device.

    Abstract translation: 提供了一种用于处理来自用户的口头请求的计算机实现的方法和装置。 语音识别器将口头请求转换为数字格式。 帧数据结构将数字化语音请求的语义分量与预定时隙相关联。 这些时隙指示用于实现预定目标的数据。 连接到语音识别器和帧数据结构的语音理解模块确定语音请求的语义分量。 基于确定的语义分量来填充时隙。 连接到语音理解模块的对话管理器可以基于所确定的语义组件来确定未填充的至少一个时隙,并且在优选实施例中可以提供填充时隙的确认。 制定计算机生成请求以便用户提供与未填充槽相关的数据。 该方法和装置非常适合(但不限于)在手持语音翻译装置中使用。

    Method for generating spelling-to-pronunciation decision tree
    5.
    发明授权
    Method for generating spelling-to-pronunciation decision tree 失效
    拼写到发音决策树的方法

    公开(公告)号:US06230131B1

    公开(公告)日:2001-05-08

    申请号:US09069308

    申请日:1998-04-29

    CPC classification number: G10L13/08

    Abstract: Decision trees are used to store a series of yes-no questions that can be used to convert spelled-word letter sequences into pronunciations. Letter-only trees, having internal nodes populated with questions about letters in the input sequence, generate one or more pronunciations based on probability data stored in the leaf nodes of the tree. The pronunciations may then be improved by processing them using mixed trees which are populated with questions about letters in the sequence and also questions about phonemes associated with those letters. The mixed tree screens out pronunciations that would not occur in natural speech, thereby greatly improving the results of the letter-to-pronunciation transformation.

    Abstract translation: 决策树用于存储可用于将拼写字母序列转换为发音的一系列“是”的问题。 仅有信息树,内部节点填充有关输入序列中的字母的问题,根据存储在树的叶节点中的概率数据生成一个或多个发音。 然后可以通过使用填充有序列中的字母的问题的混合树以及与这些字母相关的音素的问题来处理它们来发音。 混合树屏蔽了自然语言中不会发生的发音,从而大大提高了字母到发音转换的结果。

    Speaker authentication by fusion of voiceprint match attempt results with additional information
    6.
    发明授权
    Speaker authentication by fusion of voiceprint match attempt results with additional information 有权
    通过融合声纹的扬声器认证与尝试结果匹配附加信息

    公开(公告)号:US07240007B2

    公开(公告)日:2007-07-03

    申请号:US10392549

    申请日:2003-03-20

    CPC classification number: G10L15/24

    Abstract: A speaker authentication system includes a data fuser operable to fuse voiceprint match attempt results with additional information to assist in authenticating a speaker providing audio input. In other aspects, the system includes a data store of speaker voiceprints and a voiceprint matching module adapted to receive an audio input and operable to attempt to assist in authenticating a speaker by matching the audio input to at least one of the speaker voiceprints. The voiceprint matching module adjusts a confidence of voiceprint match attempt results by at least one of: (a) a number of utterance repetitions upon which a matching speaker voiceprint has been trained; or (b) a passage of time since a training occurrence associated with a matching speaker voiceprint.

    Abstract translation: 扬声器认证系统包括数据定影器,其可操作以将声纹匹配尝试结果与附加信息融合,以帮助认证提供音频输入的扬声器。 在其他方面,该系统包括扬声器声纹的数据存储器和声纹匹配模块,其适于接收音频输入并且可操作以通过将音频输入与扬声器声纹中的至少一个相匹配来尝试辅助认证扬声器。 声纹匹配模块通过以下至少一个来调整声纹匹配尝试结果的置信度:(a)已经训练了匹配的说话者声纹的多个话语重复; 或(b)与匹配的说话者声纹相关联的训练发生之后的时间段。

    Technique for developing discriminative sound units for speech recognition and allophone modeling
    7.
    发明授权
    Technique for developing discriminative sound units for speech recognition and allophone modeling 有权
    用于发展用于语音识别和异音素建模的辨别声音单元的技术

    公开(公告)号:US06711541B1

    公开(公告)日:2004-03-23

    申请号:US09390434

    申请日:1999-09-07

    CPC classification number: G10L15/063 G10L2015/025

    Abstract: A set of models is developed to represent sound units and these models are then used with the incorrect sound units to determine which generate high likelihood scores. The models generating high likelihood scores for the incorrect sound units represent those that are more likely to be confused. The resulting confusability data may then be used in generating more discriminative speech models and in subsequent pruning of the acoustic decision tree. The confusability data may also be used to develop confusability predictors used for rejection during search and in developing continuous speech recognition models that are optimized to minimize confusability.

    Abstract translation: 开发了一组模型来表示声音单元,然后将这些模型与不正确的声音单元一起使用以确定哪个产生高似然分数。 为不正确声音单位产生高似然分数的模型代表更可能被混淆的那些。 所产生的可混淆性数据然后可以用于产生更具歧视性的语音模型以及随后的声学决策树的修剪。 可混淆性数据还可用于开发用于搜索期间拒绝的混淆性预测变量,并开发出经过优化以最小化混淆性的连续语音识别模型。

    Voice activated controller for recording and retrieving audio/video programs
    8.
    发明授权
    Voice activated controller for recording and retrieving audio/video programs 有权
    语音激活控制器,用于记录和检索音频/视频节目

    公开(公告)号:US06643620B1

    公开(公告)日:2003-11-04

    申请号:US09270262

    申请日:1999-03-15

    Abstract: The system includes a database of program records representing A/V programs which are available for recording. The system also includes an A/V recording device for receiving a recording command and recording the A/V program. A speech recognizer is provided for receiving the spoken request and translating the spoken request into a text stream having a plurality of words. A natural language processor receives the text stream and processes the words for resolving a semantic content of the spoken request. The natural language processor places the meaning of the words into a task frame having a plurality of key word slots. A dialogue system analyzes the task frame for determining if a sufficient number of key word slots have been filled and prompts the user for additional information for filling empty slots. The dialogue system searches the database of program records using the key words placed within the task frame for selecting the A/V program and generating the recording command for use by the A/V recording device.

    Abstract translation: 该系统包括表示可用于记录的A / V节目的节目记录的数据库。 该系统还包括用于接收记录命令并记录A / V程序的A / V记录装置。 语音识别器被提供用于接收口头请求并将口头请求转换成具有多个单词的文本流。 自然语言处理器接收文本流并处理用于解析语音请求的语义内容的单词。 自然语言处理器将单词的含义置于具有多个关键字时隙的任务帧中。 对话系统分析任务框以确定是否已经填充了足够数量的关键字槽,并提示用户填充空槽的附加信息。 对话系统使用放置在任务帧内的关键词来搜索节目记录的数据库,用于选择A / V节目并产生由A / V记录装置使用的记录命令。

    Intelligent call routing and call supervision method for call centers
    9.
    发明授权
    Intelligent call routing and call supervision method for call centers 有权
    呼叫中心的智能呼叫路由和呼叫监控方法

    公开(公告)号:US07844045B2

    公开(公告)日:2010-11-30

    申请号:US10869810

    申请日:2004-06-16

    CPC classification number: H04M3/5232 H04M2203/2061

    Abstract: A call routing and supervising system includes an input receiving customer speech from a remote location, and a voice characteristics extractor extracting voice characteristics from the customer speech, such as language/dialect/accent, age group, gender, and eigendimension coordinates. A customer service representative selector selects one or more customer service representatives based on profiles of the customer service representatives respective of customers having voice characteristics similar to the extracted voice characteristics. In other aspects, a call monitor automatically analyzes dialogue between the customer and the customer service representative, such as detected interruptions, tracked dialogue turns, and recognized key phrases indicating frustration, polity, and/or resolution characteristics of dialogue. The call monitor records performance of the customer service representative respective of customers having the voice characteristics. Automatic call rerouting and/or real-time instruction of call center personnel can also be accomplished based on analysis results.

    Abstract translation: 呼叫路由和监督系统包括从远程位置接收客户语音的输入和从客户语音提取语音特征的语音特征提取器,例如语言/方言/口音,年龄组,性别和特征坐标。 客户服务代表选择器基于具有与所提取的语音特征相似的语音特征的客户的客户服务代表的简档来选择一个或多个客户服务代表。 在其他方面,呼叫监视器自动分析客户和客户服务代表之间的对话,诸如检测到的中断,跟踪的对话转弯以及表示对话的挫折,政治和/或解析特征的识别的关键短语。 呼叫监视器记录具有语音特征的客户的客户服务代表的性能。 呼叫中心人员的自动呼叫转接和/或实时指令也可以基于分析结果完成。

    Constraint-based speech recognition system and method

    公开(公告)号:US07124085B2

    公开(公告)日:2006-10-17

    申请号:US10022023

    申请日:2001-12-13

    CPC classification number: G10L15/24

    Abstract: A constraint-based speech recognition system for use with a form-filling application employed over a telephone system is disclosed. The system comprises an input signal, wherein the input signal includes both speech input and non-speech input of a type generated by a user via a manually operated device. The system further comprises a constraint module operable to access an information database containing information suitable for use with speech recognition, and to generate candidate information based on the non-speech input and the information database, wherein the candidate information corresponds to a portion of the information. The system further comprises a speech recognition module operable to recognize speech based on the speech input and the candidate information. In an exemplary embodiment, the manually operated device is a touch-tone telephone keypad, and the information database is a lexicon encoded according to classes defined by the keys of the keypad.

Patent Agency Ranking