-
公开(公告)号:US12112129B2
公开(公告)日:2024-10-08
申请号:US17527167
申请日:2021-11-16
申请人: Fujitsu Limited
IPC分类号: G10L15/16 , G06F18/214 , G06F40/169 , G06F40/226 , G06N3/04 , G10L15/06 , G10L15/07 , G10L15/18 , G06F40/279 , G06F40/295 , G10L15/183
CPC分类号: G06F40/226 , G06F18/214 , G06F40/169 , G06N3/04 , G10L15/063 , G10L15/075 , G10L15/16 , G10L15/18 , G06F40/279 , G06F40/295 , G10L2015/0635 , G10L15/1822 , G10L15/183
摘要: A method of training a neural network as a natural language processing, NLP, model, comprises: inputting annotated training data to first architecture portions of the neural network, the first architecture portions being executed respectively in a plurality of distributed client computing devices in communication with a server computing device, the training data being derived from text data private to the client computing device in which the first architecture portion is executed, the server computing device having no access to any of the private text data; deriving from the training data, using the first architecture portions, weight matrices of numeric weights which are decoupled from the private text data; concatenating the weight matrices, in a second architecture portion of the neural network executed in the server computing device, to obtain a single concatenated weight matrix; and training, on the second architecture portion, the NLP model using the concatenated weight matrix.
-
公开(公告)号:US20240331882A1
公开(公告)日:2024-10-03
申请号:US18742032
申请日:2024-06-13
申请人: Canary Speech, LLC
IPC分类号: G16H80/00 , A61B5/00 , A61B5/11 , G06F111/10 , G06N3/08 , G06N7/01 , G06N20/10 , G10L15/02 , G10L15/06 , G10L15/22 , G10L25/66 , G16H10/20 , G16H40/67 , G16H50/20 , G16H50/50
CPC分类号: G16H80/00 , A61B5/1123 , A61B5/4088 , A61B5/4803 , A61B5/7267 , G06N3/08 , G06N7/01 , G06N20/10 , G10L25/66 , G16H10/20 , G16H40/67 , G16H50/20 , G16H50/50 , G06F2111/10 , G10L15/02 , G10L15/063 , G10L15/22
摘要: Apparatuses, systems, methods, and computer program products are disclosed for medical assessment based on voice. A query module is configured to audibly question a user from an electronic display screen and/or a speaker of a computing device with one or more open ended questions. A response module is configured to receive a conversational verbal response of a user from a microphone of a computing device in response to one or more open ended questions. A detection module is configured to provide a machine learning assessment for a user of a medical condition based on a machine learning analysis of a received conversational verbal response of the user.
-
公开(公告)号:US20240331702A1
公开(公告)日:2024-10-03
申请号:US18743562
申请日:2024-06-14
发明人: Kiersten L. BRADLEY , Ethan COEYTAUX , Ziming YIN
IPC分类号: G10L15/26 , G06F40/134 , G06F40/166 , G06F40/284 , G10L15/02 , G10L15/06 , G10L15/07
CPC分类号: G10L15/26 , G06F40/134 , G06F40/166 , G06F40/284 , G10L15/02 , G10L15/063 , G10L15/07 , G10L2015/0631
摘要: Methods and systems for enabling an efficient review of meeting content via a metadata-enriched, speaker-attributed transcript are disclosed. By incorporating speaker diarization and other metadata, the system can provide a structured and effective way to review and/or edit the transcript. One type of metadata can be image or video data to represent the meeting content. Furthermore, the present subject matter utilizes a multimodal diarization model to identify and label different speakers. The system can synchronize various sources of data, e.g., audio channel data, voice feature vectors, acoustic beamforming, image identification, and extrinsic data, to implement speaker diarization.
-
公开(公告)号:US20240331686A1
公开(公告)日:2024-10-03
申请号:US18739466
申请日:2024-06-11
发明人: Kai Wei , Thanh Dac Tran , Grant Strimel
CPC分类号: G10L15/1815 , G06N3/08 , G10L15/063 , G10L15/16 , G10L15/22 , G10L15/28 , G10L2015/228
摘要: Techniques for determining and storing relevant context information for a user input, such as a spoken input, are described. In some embodiments, context information is determined to be relevant on an audio frame basis. Context scores for different types of context data (e.g., prior dialog turn data, user profile data, device information, etc.) are determined for individual audio frames corresponding to a spoken input. Based on the corresponding context scores, the most relevant context is stored in a local context cache. The local context cache is updated as subsequent audio frames, of the user input, are processed. The data stored in the context cache is provided to downstream components to perform tasks such as ASR, NLU and SLU.
-
公开(公告)号:US12087282B2
公开(公告)日:2024-09-10
申请号:US18489212
申请日:2023-10-18
申请人: Sorcero, Inc.
IPC分类号: G06F17/00 , G06F9/451 , G06F9/54 , G06F16/22 , G06F16/248 , G06F16/31 , G06F16/33 , G06F16/332 , G06F16/34 , G06F16/36 , G06F16/9032 , G06F40/20 , G06F40/289 , G06F40/30 , G06F40/40 , G06N3/04 , G06N20/00 , G10L15/06 , G10L15/16 , G10L15/197 , G16H10/60 , G16H40/20 , G16H70/20
CPC分类号: G10L15/063 , G06F9/451 , G06F9/547 , G06F16/2237 , G06F16/248 , G06F16/328 , G06F16/3323 , G06F16/3329 , G06F16/3338 , G06F16/3344 , G06F16/3347 , G06F16/345 , G06F16/367 , G06F16/90332 , G06F40/20 , G06F40/289 , G06F40/30 , G06F40/40 , G06N3/04 , G06N20/00 , G10L15/16 , G10L15/197 , G16H10/60 , G16H40/20 , G16H70/20
摘要: Provided is a method including obtaining a set of ontologies mapping n-grams onto concepts to which the n-grams refer in different respective domains of knowledge. The method includes receiving an update associating a first n-gram with a first concept and receiving information by which the update is associated with a given domain of knowledge. The method includes selecting a subset of ontologies by determining that the update in the given domain of knowledge is applicable to respective domains of knowledge of the subset of ontologies and that the first concept has a specified type of relationship to a subset of concepts to which other n-grams are mapped in the subset of ontologies. The method also includes storing, in response to the determination, associations between the first n-gram and the subset of concepts in at least some of the subset of ontologies in memory of the computer system.
-
公开(公告)号:US20240296838A1
公开(公告)日:2024-09-05
申请号:US18643372
申请日:2024-04-23
CPC分类号: G10L15/1815 , G06N20/00 , G10L15/063 , G10L25/27
摘要: Techniques for updating a machine learning (ML) model are described. A device or system may receive input data corresponding to a natural or non-natural language (e.g., gesture) input. Using a first ML model, the device or system may determine the input data corresponds to a data category of a plurality of data categories. Based on the data category, the device or system may select a ML training type from among a plurality of ML training types. Using the input data, the device or system may perform the selected ML training type with respect to a runtime ML model to generate an updated ML model.
-
17.
公开(公告)号:US20240296836A1
公开(公告)日:2024-09-05
申请号:US18116302
申请日:2023-03-02
发明人: Gideon Hollander
CPC分类号: G10L15/083 , G10L15/063 , G10L2015/0631 , G10L2015/0638
摘要: In a method and apparatus for generating training data to train models for entity recognition from conversations, the method includes identifying a first text from a first data element on a first graphical user interface (GUI), on which a first action is performed by a first agent, the first data element corresponding to an entity type, wherein the first action comprises at least one of typing, clicking, highlighting, hovering or reading, matching the first text to a first transcribed text within a transcription of a first conversation between the first agent and a first customer, where the first transcribed text corresponds to a time proximate to the time the first action, and determining at least a portion of the first transcribed text as an automatically generated training data (AGTD) for the entity.
-
公开(公告)号:US20240296834A1
公开(公告)日:2024-09-05
申请号:US18659940
申请日:2024-05-09
申请人: GOOGLE LLC
IPC分类号: G10L15/06 , G10L15/187 , G10L15/22 , G10L15/30
CPC分类号: G10L15/063 , G10L15/187 , G10L15/22 , G10L15/30 , G10L2015/0635
摘要: Implementations disclosed herein are directed to unsupervised federated training of global machine learning (“ML”) model layers that, after the federated training, can be combined with additional layer(s), thereby resulting in a combined ML model. Processor(s) can: detect audio data that captures a spoken utterance of a user of a client device; process, using a local ML model, the audio data to generate predicted output(s); generate, using unsupervised learning locally at the client device, a gradient based on the predicted output(s); transmit the gradient to a remote system; update weight(s) of the global ML model layers based on the gradient; subsequent to updating the weight(s), train, using supervised learning remotely at the remote system, a combined ML model that includes the updated global ML model layers and additional layer(s); transmit the combined ML model to the client device; and use the combined ML model to make prediction(s) at the client device.
-
19.
公开(公告)号:US20240296833A1
公开(公告)日:2024-09-05
申请号:US18648138
申请日:2024-04-26
申请人: Sahaj Garg , Anthony Leonardo , Tanay Kothari
发明人: Sahaj Garg , Anthony Leonardo , Tanay Kothari
CPC分类号: G10L15/063 , A61B5/389 , G10L15/24 , G10L25/78
摘要: The present disclosure relates to methods and systems for adjusting a silent speech machine learning model for use with a wearable silent speech device. In some embodiments, a method may include recording speech signals from a user, using a first sensor and a second sensor of a wearable silent speech device. The method may include providing for a silent speech machine learning model for use with the wearable silent speech device, determining whether the silent speech machine learning model is to be adjusted, and in response to determining the silent speech machine learning model is to be adjusted, adjusting the silent speech machine learning model based on at least the speech signals recorded using the first sensor and the second sensor.
-
20.
公开(公告)号:US20240296831A1
公开(公告)日:2024-09-05
申请号:US18116301
申请日:2023-03-02
发明人: Gideon Hollander
CPC分类号: G10L15/063 , G10L15/1815 , G10L15/22 , H04M3/42221 , H04M3/5232
摘要: In a method and apparatus for generating training data to train models for predicting intent from conversations, the method includes identifying, from a cluster of calls having a single intent, at least two calls, each having a sequence comprising a first action followed by a second action. The method identifies at least one first portion of the first transcribed text overlapping with the sequence in a first call, and at least one second portion of the second transcribed text overlapping with the sequence, and combining the at least one first portion, the at least one second portion and the intent to generate an automatically generated training data (AGTD) for the intent.
-
-
-
-
-
-
-
-
-