-
71.
公开(公告)号:US20180166076A1
公开(公告)日:2018-06-14
申请号:US15834030
申请日:2017-12-06
发明人: SEIYA HIGUCHI , YUJI KUNITAKE , YUSAKU OTA , RYOUTA MIYAZAKI
CPC分类号: G10L15/22 , B25J9/0003 , B25J11/0005 , B25J13/003 , G06F3/167 , G06F16/00 , G06F16/3329 , G06K9/00281 , G06K9/00288 , G06K9/00664 , G06K9/6292 , G10L13/043 , G10L15/1815 , G10L15/24 , G10L15/26 , G10L25/21 , G10L25/48 , G10L2015/088 , G10L2015/223 , Y10S901/01 , Y10S901/46
摘要: A topic providing device includes a candidate topic extractor, a provided topic determiner, a voice synthesizer, and a speaker. When a determination is made that a parent and child are conversing and that there is a need to provide a new topic to the parent and child, based on a conversation history database and a child activity database storing at least one activity name indicating an activity the child was engaged in for a first predetermined period of time, the candidate topic extractor extracts at least one candidate topic that corresponds to the at least one activity name in the child activity database and does not correspond to an activity name included in text data recorded in a first database. From the at least one candidate topic, the provided topic determiner selects one topic to provide to the parent and the child. The voice synthesizer generates voice data containing the one topic. The speaker outputs the voice data.
-
公开(公告)号:US20180166070A1
公开(公告)日:2018-06-14
申请号:US15894302
申请日:2018-02-12
IPC分类号: G10L15/065 , G10L15/06 , G10L15/07 , G10L15/24 , G10L21/007 , G10L15/30
CPC分类号: G10L15/065 , G10L15/06 , G10L15/07 , G10L15/24 , G10L15/30 , G10L21/007
摘要: A system and method of updating automatic speech recognition parameters on a mobile device are disclosed. The method comprises storing user account-specific adaptation data associated with ASR on a computing device associated with a wireless network, generating new ASR adaptation parameters based on transmitted information from the mobile device when a communication channel between the computing device and the mobile device becomes available and transmitting the new ASR adaptation data to the mobile device when a communication channel between the computing device and the mobile device becomes available. The new ASR adaptation data on the mobile device more accurately recognizes user utterances.
-
公开(公告)号:US20180074661A1
公开(公告)日:2018-03-15
申请号:US15265522
申请日:2016-09-14
发明人: Xu Fang Zhao , Gaurav Talwar
IPC分类号: G06F3/0482 , G06F3/0481 , G06F3/16 , G06F3/01 , G06F3/0488 , G10L15/24 , G10L15/08 , G06F17/30
CPC分类号: G06F3/0482 , G06F3/012 , G06F3/017 , G06F3/04817 , G06F3/04883 , G06F3/167 , G06F16/51 , G06F2203/011 , G10L13/08 , G10L15/08 , G10L15/18 , G10L15/24 , G10L15/26 , G10L2013/083 , H04L51/08
摘要: A system and method of identifying and generating preferred emojis includes: detecting at a wireless device a plurality of selected emoji; determining the frequency with which each emoji is selected; identifying a defined number of emojis from the plurality of selected emojis based on the frequency with which each emoji is selected; and creating a frequently-used emoji library for the identified emojis.
-
公开(公告)号:US09905240B2
公开(公告)日:2018-02-27
申请号:US14886714
申请日:2015-10-19
申请人: Audimax LLC
发明人: Harry Levitt
IPC分类号: G10L21/003 , G10L21/02 , G10L21/057 , G10L15/24
CPC分类号: G10L21/0388 , G10L15/24 , G10L21/003 , G10L21/013 , G10L21/02 , G10L21/0205 , G10L21/057
摘要: Systems, methods, and devices for intelligent speech recognition and processing are disclosed. According to one embodiment, a method for improving intelligibility of a speech signal may include (1) at least one processor receiving an incoming speech signal comprising a plurality of sound elements; (2) the at least one processor recognizing a sound element in the incoming speech signal to improve the intelligibility thereof; (3) the at least one processor processing the sound element by at least one of modifying and replacing the sound element; and (4) the at least one processor outputting the processed speech signal comprising the processed sound element.
-
公开(公告)号:US09899025B2
公开(公告)日:2018-02-20
申请号:US14790142
申请日:2015-07-02
IPC分类号: G10L15/00 , G05B15/00 , H04R25/00 , G06K9/62 , G10L15/25 , G10L15/07 , G06N3/00 , G06K9/00 , G06F3/01 , G10L15/24 , G10L15/22
CPC分类号: G10L15/25 , G06F3/016 , G06K9/00281 , G06N3/008 , G10L15/07 , G10L15/24 , G10L2015/227 , H04R25/407
摘要: Non-acoustic data from a vicinity of speech input is obtained. A subject speaker is identified as the source of the speech input from the obtained non-acoustic data by detecting mouth motion on one or more faces segmented from the non-acoustic data by comparing a first pixel intensity associated at a first time with a second pixel intensity at a second time, and selecting a face corresponding to the subject speaker from the one or more faces in response to a determination that a number of significantly changed pixels between the first pixel intensity and the second pixel intensity exceeds a threshold. A demographic is assigned to the subject speaker based on an analysis of one or more non-acoustic attributes of the subject speaker extracted from the non-acoustic data. The speech input is processed using a speech recognition system adjusted using a model selected based on the demographic.
-
公开(公告)号:US09892732B1
公开(公告)日:2018-02-13
申请号:US15236094
申请日:2016-08-12
申请人: PAYPAL, INC.
发明人: Cheng Tian , Srivathsan Narasimhan
CPC分类号: G10L15/24 , G01S3/80 , G01S5/02 , G01S5/18 , G06F21/32 , G06F2221/2111 , G10L15/00 , G10L17/06 , G10L17/22 , H05K999/99
摘要: Systems and methods for providing location based voice recognition include receiving, through a first microphone, an audio signal from a first user that includes an audio command requesting a service that requires user authorization before access to at least a portion of the service is granted. The user authorization is based on voice recognition (e.g., voice authentication and/or voice identification) of the audio signal. The source location of the audio signal is determined and a user location of the first user is determined. If the source location of the audio signal correlates with the user location, voice recognition on the audio signal may be performed. The first user may be authorized to access the service based on the voice recognition performed on the audio signal.
-
公开(公告)号:US09892728B2
公开(公告)日:2018-02-13
申请号:US15071645
申请日:2016-03-16
IPC分类号: G10L15/00 , G10L15/065 , G10L15/07 , G10L15/30 , G10L15/06 , G10L15/24 , G10L21/007
CPC分类号: G10L15/065 , G10L15/06 , G10L15/07 , G10L15/24 , G10L15/30 , G10L21/007
摘要: A system and method of updating automatic speech recognition parameters on a mobile device are disclosed. The method comprises storing user account-specific adaptation data associated with ASR on a computing device associated with a wireless network, generating new ASR adaptation parameters based on transmitted information from the mobile device when a communication channel between the computing device and the mobile device becomes available and transmitting the new ASR adaptation data to the mobile device when a communication channel between the computing device and the mobile device becomes available. The new ASR adaptation data on the mobile device more accurately recognizes user utterances.
-
公开(公告)号:US20180018965A1
公开(公告)日:2018-01-18
申请号:US15646446
申请日:2017-07-11
申请人: Bose Corporation
发明人: Michael J. Daley
CPC分类号: G10L15/22 , G06F3/017 , G06F3/165 , G06F3/167 , G06F2203/0381 , G10L15/24 , G10L15/30 , G10L2015/223
摘要: A system includes a microphone providing input to a voice user interface (VUI), a motion sensor providing input to a gesture-based user interface (GBI), an audio output device, and a processor in communication with the VUI, the GBI, and the audio output device. The processor detects a predetermined gesture input to the GBI, and in response to the detection, decreases the volume of audio being output by the audio output device and activates the VUI to listen for a command. A system includes an audio output device for providing audible output from a virtual personal assistant (VPA), a motion sensor input to a gesture-based user interface (GBI), and a processor in communication with the VPA and the GBI. The processor, upon receiving an input from the GBI after the audio output device provided output from the VPA, forwards the input received from the GBI to the VPA.
-
公开(公告)号:US09786273B2
公开(公告)日:2017-10-10
申请号:US14080665
申请日:2013-11-14
摘要: The present invention provides a speech recognition system combined with one or more alternate input modalities to ensure efficient and accurate text input. The speech recognition system achieves less than perfect accuracy due to limited processing power, environmental noise, and/or natural variations in speaking style. The alternate input modalities use disambiguation or recognition engines to compensate for reduced keyboards, sloppy input, and/or natural variations in writing style. The ambiguity remaining in the speech recognition process is mostly orthogonal to the ambiguity inherent in the alternate input modality, such that the combination of the two modalities resolves the recognition errors efficiently and accurately. The invention is especially well suited for mobile devices with limited space for keyboards or touch-screen input.
-
公开(公告)号:US20170263254A1
公开(公告)日:2017-09-14
申请号:US15066434
申请日:2016-03-10
申请人: Intel IP Corporation
CPC分类号: G10L15/28 , G10L15/22 , G10L15/24 , G10L2015/223 , G10L2015/226
摘要: A voice command device (VCD) has privacy protection. The VCD comprises a processor, first and second input devices, at least one data line to couple the first and second input devices to the processor, a power supply, and a sensor power line to couple the first and second input devices to the power supply. The VCD also comprises a manually operated mechanical switch on the sensor power line, to divide the sensor power line into a first leg comprising the power supply and a second leg comprising the input devices. The VCD also comprises an active sensor indicator light on the second leg of the sensor power line. The indicator light is configured to indicate whether the input devices are operational, based on a power level of the second leg of the sensor power line. Other embodiments are described and claimed.
-
-
-
-
-
-
-
-
-