-
Publication No.: US12051438B1
Publication Date: 2024-07-30
Application No.: US17214399
Filing Date: 2021-03-26
Applicant: T-Mobile USA, Inc.
Inventors: Yasmin Karimli, Ryan Cyrus Khamneian, Jie Hui, Antoine T. Tran
Abstract: Described herein are techniques, devices, and systems for training a machine learning model(s) and/or artificial intelligence algorithm(s) to determine where a mobile device (and, hence, a user of the mobile device) is located based on audio data associated with the mobile device and/or contextual data associated with the mobile device. The machine learning techniques may be used to determine contextual information about users, such as determining that a particular location is likely to be a user's home, office, or the like, based on movement patterns exhibited in the data associated with a user's mobile device. Once trained, the machine learning model(s) is usable to classify a mobile device as having been located at one of multiple candidate locations, such as indoors or outdoors, at a particular time. The described techniques can improve the accuracy of determining a mobile device's location, among other technical benefits.
-
Publication No.: US12033615B2
Publication Date: 2024-07-09
Application No.: US17499129
Filing Date: 2021-10-12
Inventors: Yinlou Zhao, Liao Zhang, Zhengxiang Jiang
CPC Classification: G10L15/005, G10L15/142, G10L15/16, G10L15/26
Abstract: The disclosure provides a method and an apparatus for recognizing a speech, an electronic device and a storage medium. A speech to be recognized is obtained. An acoustic feature of the speech to be recognized and a language feature of the speech to be recognized are obtained. The speech to be recognized is input to a pronunciation difference statistics to generate a differential pronunciation pair corresponding to the speech to be recognized. The text information of the speech to be recognized is generated based on the differential pronunciation pair, the acoustic feature and the language feature.
-
Publication No.: US20240221750A1
Publication Date: 2024-07-04
Application No.: US18610233
Filing Date: 2024-03-19
Applicant: Google LLC
Inventors: Wei Li, Rohit Prakash Prabhavalkar, Kanury Kanishka Rao, Yanzhang He, Ian C. McGraw, Anton Bakhtin
CPC Classification: G10L15/22, G10L15/02, G10L15/063, G10L15/18, G10L19/00, G10L2015/025, G10L2015/088, G10L15/142, G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for detecting utterances of a key phrase in an audio signal. One of the methods includes receiving, by a key phrase spotting system, an audio signal encoding one or more utterances; while continuing to receive the audio signal, generating, by the key phrase spotting system, an attention output using an attention mechanism that is configured to compute the attention output based on a series of encodings generated by an encoder comprising one or more neural network layers; generating, by the key phrase spotting system and using the attention output, output that indicates whether the audio signal likely encodes the key phrase; and providing, by the key phrase spotting system, the output that indicates whether the audio signal likely encodes the key phrase.
-
Publication No.: US20240221524A1
Publication Date: 2024-07-04
Application No.: US18091334
Filing Date: 2022-12-29
Applicant: SUFIAN MUNIR INC.
Inventors: Zahid Nisar, FARHAN HASSAN, SUFIAN MUNIR
IPC Classification: G09B7/00, G06F40/253, G06F40/279, G06F40/35, G06F40/40, G10L15/14, G10L15/18, G10L15/187, G10L15/19, G10L15/22, G10L15/30
CPC Classification: G09B7/00, G06F40/253, G06F40/279, G06F40/35, G06F40/40, G10L15/14, G10L15/1815, G10L15/187, G10L15/19, G10L15/22, G10L15/30, G10L2015/088
Abstract: The embodiments herein disclose a method and system for intelligent interpretation of information to autonomously design in-class/hybrid/remote assessment. An embodiment disclosed herein involves picking up audio of the presenter from an audio input device, such as a microphone, during an interaction. The interaction may be an audio interaction, a video interaction, or both. Further, the embodiment involves extracting the key information present in the captured audio interaction and then using the extracted key information to intelligently generate assessments such as quiz questions, multiple-choice questions, and mathematical questions.
-
Publication No.: US20240153505A1
Publication Date: 2024-05-09
Application No.: US18490029
Filing Date: 2023-10-19
Inventors: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
CPC Classification: G10L15/22, G06F40/30, G06N5/022, G10L13/00, G10L15/14, G10L15/1815, G10L17/00, G06F40/295, G10L2015/223
Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system is to execute the second command.
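Illustrative sketch (not part of the patent record): the intent-pair analysis described in the abstract above could be pictured as a lookup over (first intent, following intent) probabilities. The intent names, the probability values, and the `min_probability` cutoff are all hypothetical and chosen only for illustration; the abstract does not specify how the probabilities are stored or thresholded.

```python
from typing import Dict, Optional, Tuple

# Hypothetical table: probability that the second intent of a pair is
# invoked after the first. All values are made up for illustration.
INTENT_PAIR_PROBABILITY: Dict[Tuple[str, str], float] = {
    ("PlayMusic", "SetVolume"): 0.6,
    ("PlayMusic", "SetTimer"): 0.1,
    ("GetWeather", "GetTraffic"): 0.4,
}


def predict_next_intent(
    current_intent: str,
    pair_probability: Dict[Tuple[str, str], float],
    min_probability: float = 0.3,  # assumed cutoff, not from the abstract
) -> Optional[str]:
    """Return the most likely follow-up intent, or None if none is likely enough."""
    # Keep only the pairs whose first intent matches the current one.
    candidates = {
        following: p
        for (first, following), p in pair_probability.items()
        if first == current_intent
    }
    if not candidates:
        return None
    best = max(candidates, key=candidates.get)
    return best if candidates[best] >= min_probability else None
```

A caller would output the content for the current command and, when `predict_next_intent` returns an intent, a prompt asking whether to execute it.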
-
Publication No.: US11961507B2
Publication Date: 2024-04-16
Application No.: US18116501
Filing Date: 2023-03-02
Applicant: Rovi Guides, Inc.
IPC Classification: G10L15/02, G06F16/432, G06F16/438, G06F40/279, G10L15/14, G10L15/26, H04M3/51
CPC Classification: G10L15/02, G06F16/433, G06F16/438, G06F40/279, G10L15/26, G10L15/14, H04M3/5116
Abstract: A transcription of a query for content discovery is generated, and a context of the query is identified, as well as a first plurality of candidate entities to which the query refers. A search is performed based on the context of the query and the first plurality of candidate entities, and results are generated for output. A transcription of a second voice query is then generated, and it is determined whether the second transcription includes a trigger term indicating a corrective query. If so, the context of the first query is retrieved. A second term of the second query similar to a term of the first query is identified, and a second plurality of candidate entities to which the second term refers is determined. A second search is performed based on the second plurality of candidates and the context, and new search results are generated for output.
-
Publication No.: US11881207B2
Publication Date: 2024-01-23
Application No.: US17656214
Filing Date: 2022-03-23
Applicant: Google LLC
Inventors: Evgeny A. Cherepanov, Jakob Nicolaus Foerster, Vikram Sridar, Ishai Rabinovitz, Omer Tabach
CPC Classification: G10L15/01, G10L15/187, G10L15/22, G10L15/26, G10L2015/221
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a voice input from a user device; generating a recognition output; receiving a user selection of one or more terms in the recognition output; receiving a user input of one or more letters replacing the user selected one or more terms; determining suggested correction candidates based in part on the user input and the voice input; and providing one or more suggested correction candidates to the user device as suggested corrected recognition outputs.
-
Publication No.: US11854550B2
Publication Date: 2023-12-26
Application No.: US18148221
Filing Date: 2022-12-29
Applicant: Magic Leap, Inc.
CPC Classification: G10L15/22, G06F3/013, G10L15/14, G10L15/25, G10L15/30, G10L2015/223, G10L2015/227
Abstract: A method of presenting a signal to a speech processing engine is disclosed. According to an example of the method, an audio signal is received via a microphone. A portion of the audio signal is identified, and a probability is determined that the portion comprises speech directed by a user of the speech processing engine as input to the speech processing engine. In accordance with a determination that the probability exceeds a threshold, the portion of the audio signal is presented as input to the speech processing engine. In accordance with a determination that the probability does not exceed the threshold, the portion of the audio signal is not presented as input to the speech processing engine.
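Illustrative sketch (not part of the patent record): the threshold gating described in the abstract above can be pictured as a small filter placed in front of the speech processing engine. The function names, the default threshold of 0.5, and the toy amplitude-based scorer are assumptions made for illustration; the abstract does not say how the probability is computed.

```python
from typing import Callable, Optional, Sequence

Portion = Sequence[float]  # audio samples of one identified portion


def gate_portion(
    portion: Portion,
    directed_speech_probability: Callable[[Portion], float],
    threshold: float = 0.5,  # assumed value, not from the abstract
) -> Optional[Portion]:
    """Return the portion if it should reach the engine, otherwise None."""
    probability = directed_speech_probability(portion)
    # Forward only when the probability strictly exceeds the threshold.
    if probability > threshold:
        return portion
    return None


def toy_probability(portion: Portion) -> float:
    """Stand-in scorer: mean absolute amplitude clipped to 1.0.

    A real system would use a trained classifier; this is a placeholder.
    """
    if not portion:
        return 0.0
    return min(1.0, sum(abs(s) for s in portion) / len(portion))
```

With this sketch, `gate_portion(samples, toy_probability)` returns the samples when the score exceeds the threshold and `None` otherwise, so the caller simply skips portions that come back as `None`.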
-
Publication No.: US11837229B1
Publication Date: 2023-12-05
Application No.: US17363387
Filing Date: 2021-06-30
Inventors: Xing Fan, Saurabh Gupta, Chenlei Guo, Eunah Cho
CPC Classification: G10L15/22, G06N5/02, G10L15/144, G06F16/3338, G06F16/367, G10L2015/223
Abstract: Techniques for determining and using interaction affinity data are described. Interaction affinity data may indicate a latent affinity between information corresponding to an interaction, such as intents, entities, device type from which a user input is received, domain, etc. A system may use the interaction affinity data to determine an alternative input representation for a spoken input to cause output of a desired response to the spoken input. The system may also use the interaction affinity data to recommend an action to a user.
-
Publication No.: US20230370549A1
Publication Date: 2023-11-16
Application No.: US18359075
Filing Date: 2023-07-26
Inventors: Ming CHEN
IPC Classification: H04M3/51, G06N3/08, G06F40/35, G10L15/08, G06F16/9032, H04M3/54, H04M3/436, G10L15/26, G06F16/683, H04M3/527, H04M3/533, G10L25/63, H04W4/16, G06Q10/10, H04W4/12, G10L15/14, G10L15/16
CPC Classification: H04M3/5166, G06N3/08, G06F40/35, G10L15/08, G06F16/90332, H04M3/541, H04M3/4365, G10L15/26, G06F16/685, H04M3/527, H04M3/5335, H04M3/5141, G10L25/63, H04W4/16, G06Q10/10, H04W4/12, H04M3/53341, G10L2015/088, G10L15/142, G10L15/16
Abstract: Systems and methods for smart dialogue communication are provided. A method may include receiving, from a responder terminal device, a dialogue request configured to request a smart dialogue communication, wherein the dialogue request is associated with an incoming call request that is initiated by a requester via a requester terminal device and satisfies a smart dialogue condition determined by the responder terminal device; performing the smart dialogue communication with the requester terminal device associated with the requester; recording voice information associated with the smart dialogue communication; converting the voice information into text information; and transmitting the text information to the responder terminal device.