专利检索 cpc:"G10L15/063" 第 2 页

11.

发明授权
Method and apparatus for decentralized supervised learning in NLP applications 有权

公开(公告)号：US12112129B2

公开(公告)日：2024-10-08

申请号：US17527167

申请日：2021-11-16

申请人： Fujitsu Limited

发明人： Nuria Garcia Santa , Kendrick Cetina

IPC分类号： G10L15/16 , G06F18/214 , G06F40/169 , G06F40/226 , G06N3/04 , G10L15/06 , G10L15/07 , G10L15/18 , G06F40/279 , G06F40/295 , G10L15/183

CPC分类号： G06F40/226 , G06F18/214 , G06F40/169 , G06N3/04 , G10L15/063 , G10L15/075 , G10L15/16 , G10L15/18 , G06F40/279 , G06F40/295 , G10L2015/0635 , G10L15/1822 , G10L15/183

摘要： A method of training a neural network as a natural language processing, NLP, model, comprises: inputting annotated training data to first architecture portions of the neural network, the first architecture portions being executed respectively in a plurality of distributed client computing devices in communication with a server computing device, the training data being derived from text data private to the client computing device in which the first architecture portion is executed, the server computing device having no access to any of the private text data; deriving from the training data, using the first architecture portions, weight matrices of numeric weights which are decoupled from the private text data; concatenating the weight matrices, in a second architecture portion of the neural network executed in the server computing device, to obtain a single concatenated weight matrix; and training, on the second architecture portion, the NLP model using the concatenated weight matrix.

12.

发明公开
MEDICAL ASSESSMENT BASED ON VOICE 审中-公开

公开(公告)号：US20240331882A1

公开(公告)日：2024-10-03

申请号：US18742032

申请日：2024-06-13

申请人： Canary Speech, LLC

发明人： JANGWON KIM , NAMHEE KWON , HENRY O’CONNELL , PHILLIP WALSTAD , KEVIN SHENGBIN YANG

IPC分类号： G16H80/00 , A61B5/00 , A61B5/11 , G06F111/10 , G06N3/08 , G06N7/01 , G06N20/10 , G10L15/02 , G10L15/06 , G10L15/22 , G10L25/66 , G16H10/20 , G16H40/67 , G16H50/20 , G16H50/50

CPC分类号： G16H80/00 , A61B5/1123 , A61B5/4088 , A61B5/4803 , A61B5/7267 , G06N3/08 , G06N7/01 , G06N20/10 , G10L25/66 , G16H10/20 , G16H40/67 , G16H50/20 , G16H50/50 , G06F2111/10 , G10L15/02 , G10L15/063 , G10L15/22

摘要： Apparatuses, systems, methods, and computer program products are disclosed for medical assessment based on voice. A query module is configured to audibly question a user from an electronic display screen and/or a speaker of a computing device with one or more open ended questions. A response module is configured to receive a conversational verbal response of a user from a microphone of a computing device in response to one or more open ended questions. A detection module is configured to provide a machine learning assessment for a user of a medical condition based on a machine learning analysis of a received conversational verbal response of the user.

13.

发明公开
METHOD AND SYSTEM FOR CONVERSATION TRANSCRIPTION WITH METADATA 审中-公开

公开(公告)号：US20240331702A1

公开(公告)日：2024-10-03

申请号：US18743562

申请日：2024-06-14

申请人： SoundHound AI IP, LLC.

发明人： Kiersten L. BRADLEY , Ethan COEYTAUX , Ziming YIN

IPC分类号： G10L15/26 , G06F40/134 , G06F40/166 , G06F40/284 , G10L15/02 , G10L15/06 , G10L15/07

CPC分类号： G10L15/26 , G06F40/134 , G06F40/166 , G06F40/284 , G10L15/02 , G10L15/063 , G10L15/07 , G10L2015/0631

摘要： Methods and systems for enabling an efficient review of meeting content via a metadata-enriched, speaker-attributed transcript are disclosed. By incorporating speaker diarization and other metadata, the system can provide a structured and effective way to review and/or edit the transcript. One type of metadata can be image or video data to represent the meeting content. Furthermore, the present subject matter utilizes a multimodal diarization model to identify and label different speakers. The system can synchronize various sources of data, e.g., audio channel data, voice feature vectors, acoustic beamforming, image identification, and extrinsic data, to implement speaker diarization.

14.

发明公开
RELEVANT CONTEXT DETERMINATION 审中-公开

公开(公告)号：US20240331686A1

公开(公告)日：2024-10-03

申请号：US18739466

申请日：2024-06-11

申请人： Amazon Technologies, Inc.

发明人： Kai Wei , Thanh Dac Tran , Grant Strimel

IPC分类号： G10L15/18 , G06N3/08 , G10L15/06 , G10L15/16 , G10L15/22 , G10L15/28

CPC分类号： G10L15/1815 , G06N3/08 , G10L15/063 , G10L15/16 , G10L15/22 , G10L15/28 , G10L2015/228

摘要： Techniques for determining and storing relevant context information for a user input, such as a spoken input, are described. In some embodiments, context information is determined to be relevant on an audio frame basis. Context scores for different types of context data (e.g., prior dialog turn data, user profile data, device information, etc.) are determined for individual audio frames corresponding to a spoken input. Based on the corresponding context scores, the most relevant context is stored in a local context cache. The local context cache is updated as subsequent audio frames, of the user input, are processed. The data stored in the context cache is provided to downstream components to perform tasks such as ASR, NLU and SLU.

15.

发明授权
Cross-class ontology integration for language modeling 有权

公开(公告)号：US12087282B2

公开(公告)日：2024-09-10

申请号：US18489212

申请日：2023-10-18

申请人： Sorcero, Inc.

发明人： Walter Bender , Martin Abente Lahaye , Christopher Liu

IPC分类号： G06F17/00 , G06F9/451 , G06F9/54 , G06F16/22 , G06F16/248 , G06F16/31 , G06F16/33 , G06F16/332 , G06F16/34 , G06F16/36 , G06F16/9032 , G06F40/20 , G06F40/289 , G06F40/30 , G06F40/40 , G06N3/04 , G06N20/00 , G10L15/06 , G10L15/16 , G10L15/197 , G16H10/60 , G16H40/20 , G16H70/20

CPC分类号： G10L15/063 , G06F9/451 , G06F9/547 , G06F16/2237 , G06F16/248 , G06F16/328 , G06F16/3323 , G06F16/3329 , G06F16/3338 , G06F16/3344 , G06F16/3347 , G06F16/345 , G06F16/367 , G06F16/90332 , G06F40/20 , G06F40/289 , G06F40/30 , G06F40/40 , G06N3/04 , G06N20/00 , G10L15/16 , G10L15/197 , G16H10/60 , G16H40/20 , G16H70/20

摘要： Provided is a method including obtaining a set of ontologies mapping n-grams onto concepts to which the n-grams refer in different respective domains of knowledge. The method includes receiving an update associating a first n-gram with a first concept and receiving information by which the update is associated with a given domain of knowledge. The method includes selecting a subset of ontologies by determining that the update in the given domain of knowledge is applicable to respective domains of knowledge of the subset of ontologies and that the first concept has a specified type of relationship to a subset of concepts to which other n-grams are mapped in the subset of ontologies. The method also includes storing, in response to the determination, associations between the first n-gram and the subset of concepts in at least some of the subset of ontologies in memory of the computer system.

16.

发明公开
MACHINE LEARNING MODEL UPDATING 审中-公开

公开(公告)号：US20240296838A1

公开(公告)日：2024-09-05

申请号：US18643372

申请日：2024-04-23

申请人： Amazon Technologies, Inc.

发明人： Anil K. Ramakrishna , Rahul Gupta , Yuval Merhav , Zefei Li , Heather Brooke Spetalnick

IPC分类号： G10L15/18 , G06N20/00 , G10L15/06 , G10L25/27

CPC分类号： G10L15/1815 , G06N20/00 , G10L15/063 , G10L25/27

摘要： Techniques for updating a machine learning (ML) model are described. A device or system may receive input data corresponding to a natural or non-natural language (e.g., gesture) input. Using a first ML model, the device or system may determine the input data corresponds to a data category of a plurality of data categories. Based on the data category, the device or system may select a ML training type from among a plurality of ML training types. Using the input data, the device or system may perform the selected ML training type with respect to a runtime ML model to generate an updated ML model.

17.

发明公开
METHOD AND APPARATUS FOR GENERATING DATA TO TRAIN MODELS FOR ENTITY RECOGNITION FROM CONVERSATIONS 审中-公开

公开(公告)号：US20240296836A1

公开(公告)日：2024-09-05

申请号：US18116302

申请日：2023-03-02

申请人： Uniphore Technologies, Inc.

发明人： Gideon Hollander

IPC分类号： G10L15/08 , G10L15/06

CPC分类号： G10L15/083 , G10L15/063 , G10L2015/0631 , G10L2015/0638

摘要： In a method and apparatus for generating training data to train models for entity recognition from conversations, the method includes identifying a first text from a first data element on a first graphical user interface (GUI), on which a first action is performed by a first agent, the first data element corresponding to an entity type, wherein the first action comprises at least one of typing, clicking, highlighting, hovering or reading, matching the first text to a first transcribed text within a transcription of a first conversation between the first agent and a first customer, where the first transcribed text corresponds to a time proximate to the time the first action, and determining at least a portion of the first transcribed text as an automatically generated training data (AGTD) for the entity.

18.

发明公开
UNSUPERVISED FEDERATED LEARNING OF MACHINE LEARNING MODEL LAYERS 审中-公开

公开(公告)号：US20240296834A1

公开(公告)日：2024-09-05

申请号：US18659940

申请日：2024-05-09

申请人： GOOGLE LLC

发明人： Françoise Beaufays , Khe Chai Sim , Johan Schalkwyk

IPC分类号： G10L15/06 , G10L15/187 , G10L15/22 , G10L15/30

CPC分类号： G10L15/063 , G10L15/187 , G10L15/22 , G10L15/30 , G10L2015/0635

摘要： Implementations disclosed herein are directed to unsupervised federated training of global machine learning (“ML”) model layers that, after the federated training, can be combined with additional layer(s), thereby resulting in a combined ML model. Processor(s) can: detect audio data that captures a spoken utterance of a user of a client device; process, using a local ML model, the audio data to generate predicted output(s); generate, using unsupervised learning locally at the client device, a gradient based on the predicted output(s); transmit the gradient to a remote system; update weight(s) of the global ML model layers based on the gradient; subsequent to updating the weight(s), train, using supervised learning remotely at the remote system, a combined ML model that includes the updated global ML model layers and additional layer(s); transmit the combined ML model to the client device; and use the combined ML model to make prediction(s) at the client device.

19.

发明公开
WEARABLE SILENT SPEECH DEVICE, SYSTEMS, AND METHODS FOR ADJUSTING A MACHINE LEARNING MODEL 审中-公开

公开(公告)号：US20240296833A1

公开(公告)日：2024-09-05

申请号：US18648138

申请日：2024-04-26

申请人： Sahaj Garg , Anthony Leonardo , Tanay Kothari

发明人： Sahaj Garg , Anthony Leonardo , Tanay Kothari

IPC分类号： G10L15/06 , A61B5/389 , G10L15/24 , G10L25/78

CPC分类号： G10L15/063 , A61B5/389 , G10L15/24 , G10L25/78

摘要： The present disclosure relates to methods and systems for adjusting a silent speech machine learning model for use with a wearable silent speech device. In some embodiments, a method may include recording speech signals from a user, using a first sensor and a second sensor of a wearable silent speech device. The method may include providing for a silent speech machine learning model for use with the wearable silent speech device, determining whether the silent speech machine learning model is to be adjusted, and in response to determining the silent speech machine learning model is to be adjusted, adjusting the silent speech machine learning model based on at least the speech signals recorded using the first sensor and the second sensor.

20.

发明公开
METHOD AND APPARATUS FOR GENERATING DATA TO TRAIN MODELS FOR PREDICTING INTENT FROM CONVERSATIONS 审中-公开

公开(公告)号：US20240296831A1

公开(公告)日：2024-09-05

申请号：US18116301

申请日：2023-03-02

申请人： Uniphore Technologies, Inc.

发明人： Gideon Hollander

IPC分类号： G10L15/06 , G10L15/18 , G10L15/22 , H04M3/42 , H04M3/523

CPC分类号： G10L15/063 , G10L15/1815 , G10L15/22 , H04M3/42221 , H04M3/5232

摘要： In a method and apparatus for generating training data to train models for predicting intent from conversations, the method includes identifying, from a cluster of calls having a single intent, at least two calls, each having a sequence comprising a first action followed by a second action. The method identifies at least one first portion of the first transcribed text overlapping with the sequence in a first call, and at least one second portion of the second transcribed text overlapping with the sequence, and combining the at least one first portion, the at least one second portion and the intent to generate an automatically generated training data (AGTD) for the intent.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类