专利检索 ipc:"G10L15/14" 第 1 页

1.

发明授权
Using machine learning to locate mobile device 有权

公开(公告)号：US12051438B1

公开(公告)日：2024-07-30

申请号：US17214399

申请日：2021-03-26

申请人： T-Mobile USA, Inc.

发明人： Yasmin Karimli , Ryan Cyrus Khamneian , Jie Hui , Antoine T. Tran

IPC分类号： G10L25/51 , G06N7/01 , G06N20/00 , G10L15/14 , H04M3/22 , H04W4/029

CPC分类号： G10L25/51 , G06N7/01 , G06N20/00 , G10L15/14 , H04M3/2218 , H04W4/029

摘要： Described herein are techniques, devices, and systems for training a machine learning model(s) and/or artificial intelligence algorithm(s) to determine where a mobile device (and, hence, a user of the mobile device) is located based on audio data associated with the mobile device and/or contextual data associated with the mobile device. The machine learning techniques may be used to determine contextual information about users, such as determining that a particular location is likely to be a user's home, office, or the like, based on movement patterns exhibited in the data associated with a user's mobile device. Once trained, the machine learning model(s) is usable to classify a mobile device as having been located at one of multiple candidate locations, such as indoors or outdoors, at a particular time. The described techniques can improve the accuracy of determining a mobile device's location, among other technical benefits.

2.

发明授权
Method and apparatus for recognizing speech, electronic device and storage medium 有权

公开(公告)号：US12033615B2

公开(公告)日：2024-07-09

申请号：US17499129

申请日：2021-10-12

申请人： BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

发明人： Yinlou Zhao , Liao Zhang , Zhengxiang Jiang

IPC分类号： G10L15/00 , G10L15/14 , G10L15/16 , G10L15/26

CPC分类号： G10L15/005 , G10L15/142 , G10L15/16 , G10L15/26

摘要： The disclosure provides a method and an apparatus for recognizing a speech, an electronic device and a storage medium. A speech to be recognized is obtained. An acoustic feature of the speech to be recognized and a language feature of the speech to be recognized are obtained. The speech to be recognized is input to a pronunciation difference statistics to generate a differential pronunciation pair corresponding to the speech to be recognized. The text information of the speech to be recognized is generated based on the differential pronunciation pair, the acoustic feature and the language feature.

3.

发明公开
KEY PHRASE SPOTTING 审中-公开

公开(公告)号：US20240221750A1

公开(公告)日：2024-07-04

申请号：US18610233

申请日：2024-03-19

申请人： Google LLC

发明人： Wei Li , Rohit Prakash Prabhavalkar , Kanury Kanishka Rao , Yanzhang He , Ian C. McGraw , Anton Bakhtin

IPC分类号： G10L15/22 , G10L15/02 , G10L15/06 , G10L15/08 , G10L15/14 , G10L15/18 , G10L19/00

CPC分类号： G10L15/22 , G10L15/02 , G10L15/063 , G10L15/18 , G10L19/00 , G10L2015/025 , G10L2015/088 , G10L15/142 , G10L2015/223

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for detecting utterances of a key phrase in an audio signal. One of the methods includes receiving, by a key phrase spotting system, an audio signal encoding one or more utterances; while continuing to receive the audio signal, generating, by the key phrase spotting system, an attention output using an attention mechanism that is configured to compute the attention output based on a series of encodings generated by an encoder comprising one or more neural network layers; generating, by the key phrase spotting system and using attention output, output that indicates whether the audio signal likely encodes the key phrase; and providing, by the key phrase spotting system, the output that indicates whether the audio signal likely encodes the key phrase.

4.

发明公开
METHOD AND SYSTEM FOR INTELLIGENT INTERPRETATION OF INFORMATION TO AUTONOMOUSLY DESIGN IN-CLASS/HYBRID/REMOTE ASSESSMENT 审中-公开

公开(公告)号：US20240221524A1

公开(公告)日：2024-07-04

申请号：US18091334

申请日：2022-12-29

申请人： SUFIAN MUNIR INC.

发明人： Zahid Nisar , FARHAN HASSAN , SUFIAN MUNIR

IPC分类号： G09B7/00 , G06F40/253 , G06F40/279 , G06F40/35 , G06F40/40 , G10L15/14 , G10L15/18 , G10L15/187 , G10L15/19 , G10L15/22 , G10L15/30

CPC分类号： G09B7/00 , G06F40/253 , G06F40/279 , G06F40/35 , G06F40/40 , G10L15/14 , G10L15/1815 , G10L15/187 , G10L15/19 , G10L15/22 , G10L15/30 , G10L2015/088

摘要： The embodiments herein disclose a method and system for intelligent interpretation of information to autonomously design in-class/hybrid/remote assessment. In an embodiment disclosed herein, involves picking up audio of the presenter from an audio input device such as microphone during an interaction. The interaction includes both either audio or video interaction. Further, the embodiment herein, involves extracting the key information present in the captured audio interaction and then use the extracted key information to intelligently generate assessments such as quiz questions, multiple-choice questions, and mathematical questions.

5.

发明公开
PROACTIVE COMMAND FRAMEWORK 审中-公开

公开(公告)号：US20240153505A1

公开(公告)日：2024-05-09

申请号：US18490029

申请日：2023-10-19

申请人： Amazon Technologies, Inc.

发明人： Anjishnu Kumar , Xing Fan , Arpit Gupta , Ruhi Sarikaya

IPC分类号： G10L15/22 , G06F40/30 , G06N5/022 , G10L13/00 , G10L15/14 , G10L15/18 , G10L17/00

CPC分类号： G10L15/22 , G06F40/30 , G06N5/022 , G10L13/00 , G10L15/14 , G10L15/1815 , G10L17/00 , G06F40/295 , G10L2015/223

摘要： Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system to execute the second command.

6.

发明授权
Systems and methods for improving content discovery in response to a voice query using a recognition rate which depends on detected trigger terms 有权

公开(公告)号：US11961507B2

公开(公告)日：2024-04-16

申请号：US18116501

申请日：2023-03-02

申请人： Rovi Guides, Inc.

发明人： Jeffry Copps Robert Jose , Sindhuja Chonat Sri

IPC分类号： G10L15/02 , G06F16/432 , G06F16/438 , G06F40/279 , G10L15/14 , G10L15/26 , H04M3/51

CPC分类号： G10L15/02 , G06F16/433 , G06F16/438 , G06F40/279 , G10L15/26 , G10L15/14 , H04M3/5116

摘要： A transcription of a query for content discovery is generated, and a context of the query is identified, as well as a first plurality of candidate entities to which the query refers. A search is performed based on the context of the query and the first plurality of candidate entities, and results are generated for output. A transcription of a second voice query is then generated, and it is determined whether the second transcription includes a trigger term indicating a corrective query. If so, the context of the first query is retrieved. A second term of the second query similar to a term of the first query is identified, and a second plurality of candidate entities to which the second term refers is determined. A second search is performed based on the second plurality of candidates and the context, and new search results are generated for output.

7.

发明授权
Biasing voice correction suggestions 有权

公开(公告)号：US11881207B2

公开(公告)日：2024-01-23

申请号：US17656214

申请日：2022-03-23

申请人： Google LLC

发明人： Evgeny A. Cherepanov , Jakob Nicolaus Foerster , Vikram Sridar , Ishai Rabinovitz , Omer Tabach

IPC分类号： G10L15/22 , G10L15/14 , G10L13/00 , G10L15/01 , G10L15/187 , G10L15/26

CPC分类号： G10L15/01 , G10L15/187 , G10L15/22 , G10L15/26 , G10L2015/221

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the method includes receiving a voice input from a user device; generating a recognition output; receiving a user selection of one or more terms in the recognition output; receiving a user input of one or more letters replacing the user selected one or more terms; determining suggested correction candidates based in part on the user input and the voice input; and providing one or more suggested correction candidates to the user device as suggested corrected recognition outputs.

8.

发明授权
Determining input for speech processing engine 有权

公开(公告)号：US11854550B2

公开(公告)日：2023-12-26

申请号：US18148221

申请日：2022-12-29

申请人： Magic Leap, Inc.

发明人： Anthony Robert Sheeder , Colby Nelson Leider

IPC分类号： G10L15/22 , G10L15/30 , G10L15/14 , G06F3/01 , G10L15/25

CPC分类号： G10L15/22 , G06F3/013 , G10L15/14 , G10L15/25 , G10L15/30 , G10L2015/223 , G10L2015/227

摘要： A method of presenting a signal to a speech processing engine is disclosed. According to an example of the method, an audio signal is received via a microphone. A portion of the audio signal is identified, and a probability is determined that the portion comprises speech directed by a user of the speech processing engine as input to the speech processing engine. In accordance with a determination that the probability exceeds a threshold, the portion of the audio signal is presented as input to the speech processing engine. In accordance with a determination that the probability does not exceed the threshold, the portion of the audio signal is not presented as input to the speech processing engine.

9.

发明授权
Interaction data and processing natural language inputs 有权

公开(公告)号：US11837229B1

公开(公告)日：2023-12-05

申请号：US17363387

申请日：2021-06-30

申请人： Amazon Technologies, Inc.

发明人： Xing Fan , Saurabh Gupta , Chenlei Guo , Eunah Cho

IPC分类号： G06F16/33 , G10L15/22 , G06N5/02 , G10L15/14 , G06F16/36

CPC分类号： G10L15/22 , G06N5/02 , G10L15/144 , G06F16/3338 , G06F16/367 , G10L2015/223

摘要： Techniques for determining and using interaction affinity data are described. Interaction affinity data may indicate a latent affinity between information corresponding to an interaction, such as, intents, entities, device type from which a user input is received, domain, etc. A system may use the interaction affinity data to determine an alternative input representation for a spoken input to cause output of a desired response to the spoken input. The system may also use the interaction affinity data to recommend an action to a user.

10.

发明公开
SYSTEMS AND METHODS FOR SMART DIALOGUE COMMUNICATION 审中-公开

公开(公告)号：US20230370549A1

公开(公告)日：2023-11-16

申请号：US18359075

申请日：2023-07-26

申请人： HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.

发明人： Ming CHEN

IPC分类号： H04M3/51 , G06N3/08 , G06F40/35 , G10L15/08 , G06F16/9032 , H04M3/54 , H04M3/436 , G10L15/26 , G06F16/683 , H04M3/527 , H04M3/533 , G10L25/63 , H04W4/16 , G06Q10/10 , H04W4/12 , G10L15/14 , G10L15/16

CPC分类号： H04M3/5166 , G06N3/08 , G06F40/35 , G10L15/08 , G06F16/90332 , H04M3/541 , H04M3/4365 , G10L15/26 , G06F16/685 , H04M3/527 , H04M3/5335 , H04M3/5141 , G10L25/63 , H04W4/16 , G06Q10/10 , H04W4/12 , H04M3/53341 , G10L2015/088 , G10L15/142 , G10L15/16

摘要： Systems and methods for smart dialogue communication are provided. A method may include receiving, from a responder terminal device, a dialogue request configured to request a smart dialogue communication, wherein the dialogue request is associated with an incoming call request that is initiated by a requester via a requester terminal device and satisfies a smart dialogue condition determined by the responder terminal device; performing the smart dialogue communication with the requester terminal device associated with the requester; recording voice information associated with the smart dialogue communication; converting the voice information into the text information; and transmitting the text information to the responder terminal device.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类