-
公开(公告)号:US20230142892A1
公开(公告)日:2023-05-11
申请号:US18093972
申请日:2023-01-06
Applicant: Microsoft Technology Licensing, LLC
Inventor: Paul CROOK , Vasiliy RADOSTEV , Omar Zia KHAN , Vipul AGARWAL , Ruhi SARIKAYA , Marius Alexandru MARIN , Alexandre ROCHETTE , Jean-Philippe ROBICHAUD
CPC classification number: G10L15/22 , G06F9/54 , G06F40/35 , G06F9/4881 , G10L15/18 , G10L15/222 , G10L2015/223
Abstract: Conversational understanding systems allow users to conversationally interface with a computing device. In examples, a query may be received that includes a request for execution of a task. A data exchange task definition may be accessed. The data exchange task definition assists a conversational understanding system in managing task state tracking for information needed for task execution. Using the data exchange task definition, a per-turn policy for interacting with the user computing device is generated based on the state of a dialogue with a computing device and an evaluation of a process flow chart provided by a task owner resource. The task owner resource may be independent from the conversational understanding system. A response to the query may be generated and output based on the per-turn policy. In examples, the per-turn policy is used to generate one or more responses during a dialogue with a user via a computing device.
-
公开(公告)号:US11646031B2
公开(公告)日:2023-05-09
申请号:US16956913
申请日:2018-11-26
Applicant: VOLKSWAGEN AKTIENGESELLSCHAFT
Inventor: Rüdiger Woike
Abstract: A method, a device, and a computer-readable storage medium having instructions for processing a speech input. A speech input from a user is received and preprocessed for at least one of two or more available speech-processing services. The preprocessed speech inputs are transferred to one or more of the available speech-processing services.
-
公开(公告)号:US20190198008A1
公开(公告)日:2019-06-27
申请号:US15854309
申请日:2017-12-26
Applicant: International Business Machines Corporation
Inventor: Shang Qing GUO , Jonathan LENCHNER
CPC classification number: G10L13/02 , B25J9/0003 , B25J11/0005 , B25J13/003 , G10L15/18 , G10L15/22 , G10L15/26 , G10L25/84 , G10L2015/088 , G10L2021/02087
Abstract: A system, a computer program product, and method for controlling synthesized speech output on a voice-controlled device. The voice-controlled device recognized that speech input is being received. The voice-controlled device outputs synthesized speech based on the speech input. While outputting synthesized speech based on the audio is captured. The voice-controlled device recognized the audio input as speech and pausing the outputting of synthesized speech. Otherwise, in response to the captured audio not being recognized as speech and above a settable background noise threshold, pausing the outputting of synthesized speech. The paused output of speech based on the synthesized speech input is resumed after the pausing of the output of synthesized speech being within a settable pause timeframe.
-
公开(公告)号:US20190035391A1
公开(公告)日:2019-01-31
申请号:US15661991
申请日:2017-07-27
Applicant: Intel Corporation
Inventor: Lavinia A. Danielescu , Shawn C. Nikkila , Robert J. Firby , Beth Ann Hockey
CPC classification number: G10L15/22 , G10L15/02 , G10L15/18 , G10L15/32 , G10L25/90 , G10L2015/088 , G10L2015/223
Abstract: Apparatuses, methods and storage medium associated with a spoken dialogue system are disclosed herein. In embodiments, an apparatus for natural machine conversing with a user may comprise a listening component to detect a keyword that denotes start of a conversation; a dialogue engine to converse with the user during the conversation; and a controller to selectively activate or cause to be activated one of the listening component or the dialogue component, and to pass control to the activated listening component or the activated dialogue engine, based at least in part on a state of the conversation. Other embodiments may be disclosed or claimed.
-
75.
公开(公告)号:US10073953B2
公开(公告)日:2018-09-11
申请号:US15632737
申请日:2017-06-26
Applicant: WORLD AWARD ACADEMY , WORLD AWARD FOUNDATION , AMOBILEPAY, INC.
Inventor: Zhou Tian Xing
IPC: G06Q20/36 , G06Q20/32 , G06Q20/40 , G06K9/00 , G06F3/01 , G06F19/00 , G07C9/00 , G06K7/10 , G06F21/32 , G06F21/62 , G06N99/00 , A61B10/00 , A61B5/00 , A61B5/02 , A61B5/0205 , A61B5/0402 , A61B5/11 , A61B5/1172 , A61B5/145 , G10L15/22 , G06F3/16 , G06F3/041 , G16H40/63 , G16H50/20 , G16H10/60 , G06F1/16 , G06F21/35 , G06Q20/20 , G06Q20/34 , G06Q20/38 , G06Q30/02 , G06Q30/06 , G10L15/18 , H04B1/3827 , H04M1/725 , H04W4/04 , G06Q20/10 , H04L29/08 , A61B5/01 , A61B5/021 , A61B5/024 , A61B5/08 , G06F3/0488
CPC classification number: G16H20/10 , A61B5/0024 , A61B5/01 , A61B5/02055 , A61B5/021 , A61B5/02433 , A61B5/0402 , A61B5/0816 , A61B5/1112 , A61B5/1117 , A61B5/1118 , A61B5/1172 , A61B5/14507 , A61B5/14532 , A61B5/14546 , A61B5/4866 , A61B5/6804 , A61B5/681 , A61B5/7405 , A61B5/7425 , A61B5/7455 , A61B5/746 , A61B5/7495 , A61B10/0051 , A61B2560/0252 , A61B2562/0219 , A61B2562/028 , G06F1/163 , G06F3/016 , G06F3/017 , G06F3/04883 , G06F19/3456 , G06F21/32 , G06F21/35 , G06F21/6245 , G06K7/10762 , G06K9/00087 , G06K9/00335 , G06K9/00892 , G06K2007/10534 , G06K2009/00939 , G06K2209/01 , G06N20/00 , G06Q20/204 , G06Q20/32 , G06Q20/322 , G06Q20/3221 , G06Q20/3224 , G06Q20/3229 , G06Q20/327 , G06Q20/3274 , G06Q20/3276 , G06Q20/3278 , G06Q20/351 , G06Q20/36 , G06Q20/401 , G06Q20/4012 , G06Q20/40145 , G06Q30/0226 , G06Q30/0267 , G06Q30/06 , G06Q30/0601 , G06Q30/0641 , G06Q2220/00 , G07C9/00309 , G10L15/18 , G10L15/22 , G10L2015/223 , G16H10/60 , G16H40/63 , G16H50/20 , H04B1/385 , H04B2001/3861 , H04L67/04 , H04M1/72527 , H04M1/7253 , H04W4/021 , H04W4/029 , H04W4/33
Abstract: Provided are a wearable personal digital device and related methods. The wearable personal digital device may comprise a processor, a display, biometric sensors, activity tracking sensors, a memory unit, a communication circuit, a housing, an input unit, a projector, a timepiece unit, a haptic touch control actuator, and a band. The processor may be operable to receive data from an external device, provide a notification to a user based on the data, receive a user input, and perform a command selected based on the user input. The communication circuit may be communicatively coupled to the processor and operable to connect to a wireless network and communicate with the external device. The housing may be adapted to enclose the components of the wearable personal digital device. The band may be adapted to attach to the housing and secure the wearable personal digital device on a user body.
-
公开(公告)号:US20180121062A1
公开(公告)日:2018-05-03
申请号:US15857160
申请日:2017-12-28
Applicant: Next IT Corporation
Inventor: Ian Beaver , Fred Brown , Casey Gossard
CPC classification number: G06F3/04842 , G06F17/2785 , G06F17/279 , G10L15/18 , G10L15/22 , G10L15/30 , G10L25/51 , G10L25/63 , G10L25/66
Abstract: This disclosure describes techniques and architectures for evaluating conversations. In some instances, conversations with users, virtual assistants, and others may be analyzed to identify potential risks within a language model that is employed by the virtual assistants and other entities. The potential risks may be evaluated by administrators, users, systems, and others to identify potential issues with the language model that need to be addressed. This may allow the language model to be improved and enhance user experience with the virtual assistants and others that employ the language model.
-
公开(公告)号:US09961442B2
公开(公告)日:2018-05-01
申请号:US14850747
申请日:2015-09-10
Applicant: Zero Labs, Inc.
Inventor: Rajesh Pradhan , Amit Pradhan
IPC: G10L15/00 , H04R3/00 , G10L21/0208 , G10L15/18 , G10L21/0216
CPC classification number: H04R3/005 , G10L15/18 , G10L21/0208 , G10L2021/02166
Abstract: The invention provides a computer system for interacting with a user. A set of concepts initially forms a target set of concepts. An input module receives a language input from the user. An analysis system executes a plurality of narrowing cycles until a concept packet having at least one concept has been identified. Each narrowing cycle includes identifying at least one portion of the language and determining a subset of concepts from the target set of concepts to form a new target subset. An action item identifier identifies an action item from the action items based on the concept packet. An action executer that executes an action based on the action item that has been identified.
-
公开(公告)号:US20180114526A1
公开(公告)日:2018-04-26
申请号:US15837777
申请日:2017-12-11
Applicant: Mattersight Corporation
Inventor: Roger WARFORD , Christopher DANSON , Jennifer KUHN
IPC: G10L15/18 , G10L17/06 , G10L21/0272 , G10L25/63 , G06Q30/00
CPC classification number: G10L15/18 , G06Q30/01 , G10L15/1822 , G10L15/26 , G10L17/06 , G10L21/0272 , G10L25/48 , G10L25/63 , H04L12/1831 , H04M3/42221 , H04M3/5175 , H04M2201/40
Abstract: The methods, apparatus, non-transitory computer readable media, and systems described herein include recording a mono recording of a software and a customer inquiry communication using a microphone to interpret and respond to the customer inquiry communication, wherein the mono recording is unseparated and includes customer voice data and audio data generated by the software agent, separately and concurrently recording the software agent audio data in an agent recording, and subtracting agent audio data from the unseparated mono recording to provide a separated recording including only customer voice data.
-
公开(公告)号:US09953649B2
公开(公告)日:2018-04-24
申请号:US15430952
申请日:2017-02-13
Applicant: VoiceBox Technologies Corporation
Inventor: Larry Baldwin , Chris Weider
CPC classification number: G10L15/22 , G06Q30/02 , G06Q30/0241 , G06Q30/0261 , G06Q30/0273 , G10L15/18 , G10L15/1815 , G10L15/24 , G10L17/22 , G10L2015/223 , G10L2015/227
Abstract: A system and method for processing multi-modal device interactions in a natural language voice services environment may be provided. In particular, one or more multi-modal device interactions may be received in a natural language voice services environment that includes one or more electronic devices. The multi-modal device interactions may include a non-voice interaction with at least one of the electronic devices or an application associated therewith, and may further include a natural language utterance relating to the non-voice interaction. Context relating to the non-voice interaction and the natural language utterance may be extracted and combined to determine an intent of the multi-modal device interaction, and a request may then be routed to one or more of the electronic devices based on the determined intent of the multi-modal device interaction.
-
公开(公告)号:US20180108343A1
公开(公告)日:2018-04-19
申请号:US15294234
申请日:2016-10-14
Applicant: SoundHound, Inc.
Inventor: Mark Stevans , Monika Almudafar-Depeyrot , Keyvan Mohajer
CPC classification number: G10L13/08 , G10L13/043 , G10L15/18 , G10L15/22 , G10L15/30 , G10L2015/025 , G10L2015/088 , G10L2015/223
Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.
-
-
-
-
-
-
-
-
-