Patent search: ipc:G10L15/24, page 8

71.

Invention application
VOICE INTERACTION DEVICE, VOICE INTERACTION METHOD, VOICE INTERACTION PROGRAM, AND ROBOT (under examination, published)

Publication No.: US20180166076A1

Publication date: 2018-06-14

Application No.: US15834030

Filing date: 2017-12-06

Applicant: Panasonic Intellectual Property Management Co., Ltd.

Inventors: SEIYA HIGUCHI , YUJI KUNITAKE , YUSAKU OTA , RYOUTA MIYAZAKI

IPC classification: G10L15/22 , G10L15/18 , G10L15/24 , G10L13/04 , G10L25/48 , G06K9/00 , B25J13/00

CPC classification: G10L15/22 , B25J9/0003 , B25J11/0005 , B25J13/003 , G06F3/167 , G06F16/00 , G06F16/3329 , G06K9/00281 , G06K9/00288 , G06K9/00664 , G06K9/6292 , G10L13/043 , G10L15/1815 , G10L15/24 , G10L15/26 , G10L25/21 , G10L25/48 , G10L2015/088 , G10L2015/223 , Y10S901/01 , Y10S901/46

Abstract: A topic providing device includes a candidate topic extractor, a provided topic determiner, a voice synthesizer, and a speaker. When a determination is made that a parent and child are conversing and that there is a need to provide a new topic to the parent and child, based on a conversation history database and a child activity database storing at least one activity name indicating an activity the child was engaged in for a first predetermined period of time, the candidate topic extractor extracts at least one candidate topic that corresponds to the at least one activity name in the child activity database and does not correspond to an activity name included in text data recorded in a first database. From the at least one candidate topic, the provided topic determiner selects one topic to provide to the parent and the child. The voice synthesizer generates voice data containing the one topic. The speaker outputs the voice data.

72.

Invention application
System and Method for Mobile Automatic Speech Recognition (under examination, published)

Publication No.: US20180166070A1

Publication date: 2018-06-14

Application No.: US15894302

Filing date: 2018-02-12

Applicant: Nuance Communications, Inc.

Inventors: Sarangarajan PARTHASARATHY , Richard Cameron ROSE

IPC classification: G10L15/065 , G10L15/06 , G10L15/07 , G10L15/24 , G10L21/007 , G10L15/30

CPC classification: G10L15/065 , G10L15/06 , G10L15/07 , G10L15/24 , G10L15/30 , G10L21/007

Abstract: A system and method of updating automatic speech recognition parameters on a mobile device are disclosed. The method comprises storing user account-specific adaptation data associated with ASR on a computing device associated with a wireless network, generating new ASR adaptation parameters based on transmitted information from the mobile device when a communication channel between the computing device and the mobile device becomes available and transmitting the new ASR adaptation data to the mobile device when a communication channel between the computing device and the mobile device becomes available. The new ASR adaptation data on the mobile device more accurately recognizes user utterances.

73.

Invention application
PREFERRED EMOJI IDENTIFICATION AND GENERATION (under examination, published)

Publication No.: US20180074661A1

Publication date: 2018-03-15

Application No.: US15265522

Filing date: 2016-09-14

Applicant: GM Global Technology Operations LLC

Inventors: Xu Fang Zhao , Gaurav Talwar

IPC classification: G06F3/0482 , G06F3/0481 , G06F3/16 , G06F3/01 , G06F3/0488 , G10L15/24 , G10L15/08 , G06F17/30

CPC classification: G06F3/0482 , G06F3/012 , G06F3/017 , G06F3/04817 , G06F3/04883 , G06F3/167 , G06F16/51 , G06F2203/011 , G10L13/08 , G10L15/08 , G10L15/18 , G10L15/24 , G10L15/26 , G10L2013/083 , H04L51/08

Abstract: A system and method of identifying and generating preferred emojis includes: detecting at a wireless device a plurality of selected emoji; determining the frequency with which each emoji is selected; identifying a defined number of emojis from the plurality of selected emojis based on the frequency with which each emoji is selected; and creating a frequently-used emoji library for the identified emojis.
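
The frequency-based selection in this abstract can be illustrated with a minimal sketch. It is not taken from the patent: the function name `build_frequent_emoji_library` and the parameter `library_size` are hypothetical, and the sketch assumes the "defined number" simply means the top N emojis by selection count.

```python
from collections import Counter


def build_frequent_emoji_library(selected_emojis, library_size):
    """Return the `library_size` most frequently selected emojis, most frequent first.

    Assumption: the library is just the top-N emojis by raw selection count.
    """
    counts = Counter(selected_emojis)  # frequency with which each emoji was selected
    return [emoji for emoji, _ in counts.most_common(library_size)]


if __name__ == "__main__":
    selections = ["😀", "👍", "😀", "🚗", "👍", "😀"]
    print(build_frequent_emoji_library(selections, 2))
```

With these sample selections, the first emoji was picked three times and the second twice, so those two would form the library.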

74.

Granted invention patent
Systems, methods, and devices for intelligent speech recognition and processing (in force)

Publication No.: US09905240B2

Publication date: 2018-02-27

Application No.: US14886714

Filing date: 2015-10-19

Applicant: Audimax LLC

Inventor: Harry Levitt

IPC classification: G10L21/003 , G10L21/02 , G10L21/057 , G10L15/24

CPC classification: G10L21/0388 , G10L15/24 , G10L21/003 , G10L21/013 , G10L21/02 , G10L21/0205 , G10L21/057

Abstract: Systems, methods, and devices for intelligent speech recognition and processing are disclosed. According to one embodiment, a method for improving intelligibility of a speech signal may include (1) at least one processor receiving an incoming speech signal comprising a plurality of sound elements; (2) the at least one processor recognizing a sound element in the incoming speech signal to improve the intelligibility thereof; (3) the at least one processor processing the sound element by at least one of modifying and replacing the sound element; and (4) the at least one processor outputting the processed speech signal comprising the processed sound element.

75.

Granted invention patent
Speech recognition system adaptation based on non-acoustic attributes and face selection based on mouth motion using pixel intensities (in force)

Publication No.: US09899025B2

Publication date: 2018-02-20

Application No.: US14790142

Filing date: 2015-07-02

Applicant: International Business Machines Corporation

Inventors: Jonathan H. Connell, II , Etienne Marcheret

IPC classification: G10L15/00 , G05B15/00 , H04R25/00 , G06K9/62 , G10L15/25 , G10L15/07 , G06N3/00 , G06K9/00 , G06F3/01 , G10L15/24 , G10L15/22

CPC classification: G10L15/25 , G06F3/016 , G06K9/00281 , G06N3/008 , G10L15/07 , G10L15/24 , G10L2015/227 , H04R25/407

Abstract: Non-acoustic data from a vicinity of speech input is obtained. A subject speaker is identified as the source of the speech input from the obtained non-acoustic data by detecting mouth motion on one or more faces segmented from the non-acoustic data by comparing a first pixel intensity associated at a first time with a second pixel intensity at a second time, and selecting a face corresponding to the subject speaker from the one or more faces in response to a determination that a number of significantly changed pixels between the first pixel intensity and the second pixel intensity exceeds a threshold. A demographic is assigned to the subject speaker based on an analysis of one or more non-acoustic attributes of the subject speaker extracted from the non-acoustic data. The speech input is processed using a speech recognition system adjusted using a model selected based on the demographic.
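
The mouth-motion test in this abstract (count the pixels that changed significantly between two times, then compare the count with a threshold) might be sketched as follows. This is an illustration under stated assumptions, not the patented method: the names `count_changed_pixels` and `select_speaking_face`, the parameters `min_delta` and `threshold`, and the tie-breaking rule (pick the face with the most changed pixels) are all hypothetical, and mouth regions are assumed to be already segmented into equal-sized grayscale grids.

```python
def count_changed_pixels(region_t1, region_t2, min_delta):
    """Count pixels whose intensity differs by more than `min_delta` between two times."""
    return sum(
        1
        for row1, row2 in zip(region_t1, region_t2)
        for p1, p2 in zip(row1, row2)
        if abs(p1 - p2) > min_delta
    )


def select_speaking_face(mouth_regions, min_delta=20, threshold=2):
    """Pick the face whose mouth region changed the most, if it exceeds `threshold`.

    `mouth_regions` maps a face id to (region at time 1, region at time 2),
    where each region is a list of rows of grayscale intensities.
    Returns None when no face has more than `threshold` changed pixels.
    Assumption: among qualifying faces, the one with the most changes wins.
    """
    best_face, best_count = None, threshold
    for face_id, (region_t1, region_t2) in mouth_regions.items():
        changed = count_changed_pixels(region_t1, region_t2, min_delta)
        if changed > best_count:
            best_face, best_count = face_id, changed
    return best_face


if __name__ == "__main__":
    regions = {
        "face_a": ([[10, 10], [10, 10]], [[10, 12], [11, 10]]),  # nearly static mouth
        "face_b": ([[10, 10], [10, 10]], [[90, 80], [70, 10]]),  # moving mouth
    }
    print(select_speaking_face(regions))
```

In the sample data, `face_a` has no pixel changing by more than 20 while `face_b` has three, so `face_b` is selected with the default threshold of 2.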

76.

Granted invention patent
Location based voice recognition system (in force)

Publication No.: US09892732B1

Publication date: 2018-02-13

Application No.: US15236094

Filing date: 2016-08-12

Applicant: PAYPAL, INC.

Inventors: Cheng Tian , Srivathsan Narasimhan

IPC classification: G10L15/00 , G10L15/24 , G10L17/22 , G10L17/06

CPC classification: G10L15/24 , G01S3/80 , G01S5/02 , G01S5/18 , G06F21/32 , G06F2221/2111 , G10L15/00 , G10L17/06 , G10L17/22 , H05K999/99

Abstract: Systems and methods for providing location based voice recognition include receiving, through a first microphone, an audio signal from a first user that includes an audio command requesting a service that requires user authorization before access to at least a portion of the service is granted. The user authorization is based on voice recognition (e.g., voice authentication and/or voice identification) of the audio signal. The source location of the audio signal is determined and a user location of the first user is determined. If the source location of the audio signal correlates with the user location, voice recognition on the audio signal may be performed. The first user may be authorized to access the service based on the voice recognition performed on the audio signal.

77.

Granted invention patent
System and method for mobile automatic speech recognition (in force)

Publication No.: US09892728B2

Publication date: 2018-02-13

Application No.: US15071645

Filing date: 2016-03-16

Applicant: Nuance Communications, Inc.

Inventors: Sarangarajan Parthasarathy , Richard Cameron Rose

IPC classification: G10L15/00 , G10L15/065 , G10L15/07 , G10L15/30 , G10L15/06 , G10L15/24 , G10L21/007

CPC classification: G10L15/065 , G10L15/06 , G10L15/07 , G10L15/24 , G10L15/30 , G10L21/007

Abstract: A system and method of updating automatic speech recognition parameters on a mobile device are disclosed. The method comprises storing user account-specific adaptation data associated with ASR on a computing device associated with a wireless network, generating new ASR adaptation parameters based on transmitted information from the mobile device when a communication channel between the computing device and the mobile device becomes available and transmitting the new ASR adaptation data to the mobile device when a communication channel between the computing device and the mobile device becomes available. The new ASR adaptation data on the mobile device more accurately recognizes user utterances.

78.

Invention application
Combining Gesture and Voice User Interfaces (under examination, published)

Publication No.: US20180018965A1

Publication date: 2018-01-18

Application No.: US15646446

Filing date: 2017-07-11

Applicant: Bose Corporation

Inventor: Michael J. Daley

IPC classification: G10L15/22 , G06F3/16 , G06F3/01 , G10L15/24 , G10L15/30

CPC classification: G10L15/22 , G06F3/017 , G06F3/165 , G06F3/167 , G06F2203/0381 , G10L15/24 , G10L15/30 , G10L2015/223

Abstract: A system includes a microphone providing input to a voice user interface (VUI), a motion sensor providing input to a gesture-based user interface (GBI), an audio output device, and a processor in communication with the VUI, the GBI, and the audio output device. The processor detects a predetermined gesture input to the GBI, and in response to the detection, decreases the volume of audio being output by the audio output device and activates the VUI to listen for a command. A system includes an audio output device for providing audible output from a virtual personal assistant (VPA), a motion sensor input to a gesture-based user interface (GBI), and a processor in communication with the VPA and the GBI. The processor, upon receiving an input from the GBI after the audio output device provided output from the VPA, forwards the input received from the GBI to the VPA.

79.

Granted invention patent
Multimodal disambiguation of speech recognition (in force)

Publication No.: US09786273B2

Publication date: 2017-10-10

Application No.: US14080665

Filing date: 2013-11-14

Applicant: Nuance Communications, Inc.

Inventors: Michael R. Longé , Richard Eyraud , Keith C. Hullfish

IPC classification: G10L15/18 , G10L15/24 , G10L15/32

CPC classification: G10L15/18 , G10L15/24 , G10L15/32

Abstract: The present invention provides a speech recognition system combined with one or more alternate input modalities to ensure efficient and accurate text input. The speech recognition system achieves less than perfect accuracy due to limited processing power, environmental noise, and/or natural variations in speaking style. The alternate input modalities use disambiguation or recognition engines to compensate for reduced keyboards, sloppy input, and/or natural variations in writing style. The ambiguity remaining in the speech recognition process is mostly orthogonal to the ambiguity inherent in the alternate input modality, such that the combination of the two modalities resolves the recognition errors efficiently and accurately. The invention is especially well suited for mobile devices with limited space for keyboards or touch-screen input.
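
The idea that two mostly orthogonal ambiguities cancel each other can be illustrated with a toy example. The sketch below is an assumption-laden illustration, not the patented system: it picks a 12-key phone keypad as the reduced keyboard, assumes the recognizer yields a scored list of word hypotheses, and uses hypothetical names (`KEYPAD`, `matches_keys`, `disambiguate`).

```python
# Letters available on each key of a conventional 12-key phone keypad.
KEYPAD = {
    "2": "abc", "3": "def", "4": "ghi", "5": "jkl",
    "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz",
}
LETTER_TO_KEY = {ch: key for key, letters in KEYPAD.items() for ch in letters}


def matches_keys(word, key_sequence):
    """True if typing `word` on the keypad would produce exactly `key_sequence`."""
    return len(word) == len(key_sequence) and all(
        LETTER_TO_KEY.get(ch) == key for ch, key in zip(word.lower(), key_sequence)
    )


def disambiguate(speech_hypotheses, key_sequence):
    """Return the best-scoring speech hypothesis consistent with the key presses.

    `speech_hypotheses` is a list of (word, score) pairs; higher scores are better.
    Returns None if no hypothesis is consistent with the keys.
    """
    consistent = [(w, s) for w, s in speech_hypotheses if matches_keys(w, key_sequence)]
    if not consistent:
        return None
    return max(consistent, key=lambda pair: pair[1])[0]


if __name__ == "__main__":
    hypotheses = [("could", 0.5), ("good", 0.3), ("hood", 0.2)]
    print(disambiguate(hypotheses, "4663"))
```

Here the recognizer alone would prefer "could", and the key sequence 4663 alone cannot separate "good" from "hood"; combining the two leaves "good" as the best candidate that is consistent with both inputs.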

80.

Invention application
MALWARE-PROOF PRIVACY INDICATOR (under examination, published)

Publication No.: US20170263254A1

Publication date: 2017-09-14

Application No.: US15066434

Filing date: 2016-03-10

Applicant: Intel IP Corporation

Inventors: PRASHANT DEWAN , UTTAM K. SENGUPTA , SATISH KUMAR L. BHRUGUMALLA , MANDAR S. JOSHI

IPC classification: G10L15/28 , G10L15/24 , G10L15/22

CPC classification: G10L15/28 , G10L15/22 , G10L15/24 , G10L2015/223 , G10L2015/226

Abstract: A voice command device (VCD) has privacy protection. The VCD comprises a processor, first and second input devices, at least one data line to couple the first and second input devices to the processor, a power supply, and a sensor power line to couple the first and second input devices to the power supply. The VCD also comprises a manually operated mechanical switch on the sensor power line, to divide the sensor power line into a first leg comprising the power supply and a second leg comprising the input devices. The VCD also comprises an active sensor indicator light on the second leg of the sensor power line. The indicator light is configured to indicate whether the input devices are operational, based on a power level of the second leg of the sensor power line. Other embodiments are described and claimed.
