专利检索 ipc:"G10L15/26" 第 1 页

1.

发明授权
Contextual auto-completion for assistant systems 有权

公开(公告)号：US12131522B2

公开(公告)日：2024-10-29

申请号：US17077316

申请日：2020-10-22

申请人： Meta Platforms, Inc.

发明人： Jiedan Zhu , Fuchun Peng , Benoit F. Dumoulin , Xiaohu Liu , Rajen Subba , Mohsen Agsen , Michael Robert Hanson

IPC分类号： G06F16/338 , G06F3/01 , G06F3/16 , G06F7/14 , G06F9/451 , G06F16/176 , G06F16/22 , G06F16/23 , G06F16/242 , G06F16/2455 , G06F16/2457 , G06F16/248 , G06F16/33 , G06F16/332 , G06F16/903 , G06F16/9032 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/00 , G06V10/764 , G06V10/82 , G06V20/10 , G06V40/20 , G10L15/02 , G10L15/06 , G10L15/07 , G10L15/16 , G10L15/18 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/28 , H04L41/00 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/50 , H04L67/5651 , H04L67/75 , H04W12/08 , G10L13/00 , G10L13/04 , H04L51/046 , H04L67/10 , H04L67/53

CPC分类号： G06V10/82 , G06F3/011 , G06F3/013 , G06F3/017 , G06F3/167 , G06F7/14 , G06F9/453 , G06F16/176 , G06F16/2255 , G06F16/2365 , G06F16/243 , G06F16/24552 , G06F16/24575 , G06F16/24578 , G06F16/248 , G06F16/3323 , G06F16/3329 , G06F16/3344 , G06F16/338 , G06F16/90332 , G06F16/90335 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/01 , G06V10/764 , G06V20/10 , G06V40/28 , G10L15/02 , G10L15/063 , G10L15/07 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/2816 , H04L41/20 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/535 , H04L67/5651 , H04L67/75 , H04W12/08 , G06F2216/13 , G10L13/00 , G10L13/04 , G10L2015/223 , G10L2015/225 , H04L51/046 , H04L67/10 , H04L67/53

摘要： In one embodiment, a method includes receiving a first user input from a first user, wherein the first user input comprises a partial request, presenting one or more suggested intent auto-completions corresponding to the partial request, receiving a selection by the first user of a first suggested intent auto-completion of the suggested intent auto-completions and a second user input, presenting one or more suggested slot auto-completions corresponding to one or more candidate slot-hypotheses corresponding to the second user input, respectively, wherein each of the candidate slot-hypotheses comprise a slot-suggestion, and wherein each suggested slot auto-completion comprises the second user input and the corresponding candidate slot-hypothesis, receiving a selection by the first user of a first suggested slot auto-completion of the suggested slot auto-completions, and presenting execution results of one or more tasks corresponding to the first suggested intent auto-completion and the first suggested slot auto-completion.

2.

发明授权
Computer implemented method for the automated analysis or use of data 有权

公开(公告)号：US12131127B2

公开(公告)日：2024-10-29

申请号：US18088588

申请日：2022-12-25

申请人： UNLIKELY ARTIFICIAL INTELLIGENCE LIMITED

发明人： William Tunstall-Pedoe , Finlay Curran , Harry Roscoe , Robert Heywood

IPC分类号： G06N5/02 , G06F16/242 , G06F16/31 , G06F16/332 , G06F16/951 , G06F40/123 , G06F40/126 , G06F40/20 , G06F40/205 , G06F40/211 , G06F40/226 , G06F40/242 , G06F40/279 , G06F40/30 , G06F40/35 , G06F40/45 , G06F40/47 , G06F40/58 , G06N3/0442 , G06N3/0455 , G06N3/0499 , G06N3/08 , G06Q10/1053 , G06Q30/0251 , G06Q30/0601 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/26 , G10L25/63 , G16H10/60 , H04L51/02 , G06N3/091 , G10L15/08

CPC分类号： G06F40/35 , G06F16/243 , G06F16/322 , G06F16/3329 , G06F16/951 , G06F40/123 , G06F40/126 , G06F40/20 , G06F40/205 , G06F40/211 , G06F40/226 , G06F40/242 , G06F40/279 , G06F40/30 , G06F40/45 , G06F40/47 , G06F40/58 , G06N3/0442 , G06N3/0455 , G06N3/0499 , G06N3/08 , G06N5/02 , G06Q10/1053 , G06Q30/0255 , G06Q30/0257 , G06Q30/0631 , G10L15/16 , G10L15/1815 , G10L15/22 , G10L15/26 , G10L25/63 , G16H10/60 , H04L51/02 , G06N3/091 , G10L2015/088

摘要： A computer implemented method for automated analysis or use of data, comprising: (a) storing in a non-transitory computer-readable medium a structured, machine-readable representation of data that conforms to a machine-readable language, wherein the data relates to social media postings; (b) automatically processing structured, machine-readable representation of data to determine if the social media postings are compliant with requirements preventing abusive or illegal social media postings.

3.

发明公开
SPATIAL AUDIO CONVERSATIONAL ANALYSIS FOR ENHANCED CONVERSATION DISCOVERY 审中-公开

公开(公告)号：US20240355331A1

公开(公告)日：2024-10-24

申请号：US18760626

申请日：2024-07-01

申请人： MICROSOFT TECHNOLOGY LICENSING, LLC

发明人： Spencer G. FOWERS , David Anthony TITTSWORTH , Amber Dawn HOAK

IPC分类号： G10L15/26 , G06F3/16 , H04M3/56

CPC分类号： G10L15/26 , G06F3/165 , H04M3/568

摘要： Systems and methods for providing enhanced teleconferencing. An example method includes receiving audio streams from a plurality of client devices of participants of a teleconference; converting the audio streams for a first conversation within the teleconference into first text; converting the audio streams for a second conversation within the teleconference into a second text; analyzing the first text to identify one or more topics being discussed in the first conversation; analyzing the second text to identify one or more topics being discussed in the second conversation; and presenting, in a teleconference user interface, at least one of the one or more topics being discussed in the first conversation or the one or more topics being discussed in the second conversation.

4.

发明公开
SYSTEM AND METHOD FOR TRANSCRIBING AUDIBLE INFORMATION 审中-公开

公开(公告)号：US20240355329A1

公开(公告)日：2024-10-24

申请号：US18138707

申请日：2023-04-24

申请人： Logitech Europe S.A.

发明人： Nicolas CHAUVIN , Yan CHETELAT , Anna Maria PUCHALSKA , Evan Patrick KELLY , Curtis Devin BROWN , Robin Antero Olof PIISPANEN , Madelene Rae STANLEY

IPC分类号： G10L15/26 , G06F40/166

CPC分类号： G10L15/26 , G06F40/166

摘要： Embodiments herein include a processing system and method for transcribing audible information, including converting audible information or data received from a user into alphanumeric data. The alphanumeric data can be processed allowing a user to provide input that improves of the accuracy of the alphanumeric data, facilitates the transfer of the alphanumeric data to other electronic devices by a computer or other electronic device, and/or improves the communicative or expressive properties of the alphanumeric data in an electronic communication that is provided to one or more users. In some embodiments, the transcribed and/or translated text can be automatically formatted by a program for use in a software application. More specifically embodiments of the present application disclose a system and program that can embellish transcribed text to alert and provide suggestions for the correction of potentially inaccurate transcribed and/or translated text, and/or provide potential emojis to add into or replace text.

5.

发明公开
SYSTEM AND METHOD FOR HYBRID GENERATION OF TEXT FROM AUDIO 审中-公开

公开(公告)号：US20240355328A1

公开(公告)日：2024-10-24

申请号：US18138295

申请日：2023-04-24

申请人： Verbit, Inc.

发明人： Maksym SARANA , Ariel COHEN , Irit OFER

IPC分类号： G10L15/26 , G10L15/02 , G10L15/22

CPC分类号： G10L15/26 , G10L15/02 , G10L15/22

摘要： A method, system and computer program product for transcribing audio signals, the method comprising: obtaining a source audio signal; obtaining meta data associated with the audio signal; analyzing the meta data; extracting acoustic features from the source audio signal; determining a difficulty level assessment of transcribing the audio signal, based at least on the meta data and acoustic features; selecting based on the level of transcription difficulty a first transcription option; and providing a related audio signal which is related to the source audio signal to the first transcription option over a communication channel, to obtain a transcription of the related audio signal.

6.

发明授权
Media presentation device with voice command feature 有权

公开(公告)号：US12126872B2

公开(公告)日：2024-10-22

申请号：US18181461

申请日：2023-03-09

申请人： The Nielsen Company (US), LLC

发明人： John R. Burbank , Kurt Roman Thielen

IPC分类号： H04N21/478 , G10L15/26 , H04N21/2389 , H04N21/658

CPC分类号： H04N21/47815 , G10L15/26 , H04N21/23892 , H04N21/6581

摘要： A media presentation device determines a voice command associated with media content presented by the media presentation device. The media presentation device then listens for and detects utterance of the determined voice command during presentation of the media content, and the media presentation device responds to the detected utterance by performing an action that facilitates user purchase of the good or service associated with the media content segment.

7.

发明授权
Personalized gesture recognition for user interaction with assistant systems 有权

公开(公告)号：US12125272B2

公开(公告)日：2024-10-22

申请号：US18449525

申请日：2023-08-14

申请人： Meta Platforms Technologies, LLC

发明人： Paul Anthony Crook , Francislav P. Penov , Rajen Subba , Xiaohu Liu

IPC分类号： G06V10/82 , G06F3/01 , G06F3/16 , G06F7/14 , G06F9/451 , G06F16/176 , G06F16/22 , G06F16/23 , G06F16/242 , G06F16/2455 , G06F16/2457 , G06F16/248 , G06F16/33 , G06F16/332 , G06F16/338 , G06F16/903 , G06F16/9032 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/00 , G06V10/764 , G06V20/10 , G06V40/20 , G10L15/02 , G10L15/06 , G10L15/07 , G10L15/16 , G10L15/18 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/28 , H04L41/00 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/50 , H04L67/5651 , H04L67/75 , H04W12/08 , G10L13/00 , G10L13/04 , H04L51/046 , H04L67/10 , H04L67/53

CPC分类号： G06V10/82 , G06F3/011 , G06F3/013 , G06F3/017 , G06F3/167 , G06F7/14 , G06F9/453 , G06F16/176 , G06F16/2255 , G06F16/2365 , G06F16/243 , G06F16/24552 , G06F16/24575 , G06F16/24578 , G06F16/248 , G06F16/3323 , G06F16/3329 , G06F16/3344 , G06F16/338 , G06F16/90332 , G06F16/90335 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/01 , G06V10/764 , G06V20/10 , G06V40/28 , G10L15/02 , G10L15/063 , G10L15/07 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/2816 , H04L41/20 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/535 , H04L67/5651 , H04L67/75 , H04W12/08 , G06F2216/13 , G10L13/00 , G10L13/04 , G10L2015/223 , G10L2015/225 , H04L51/046 , H04L67/10 , H04L67/53

摘要： In one embodiment, a method includes receiving a user request from a first user from a client system associated with a first user, wherein the user request comprise a gesture-input from the first user and a speech-input from the first user, determining an intent corresponding to the user request based on the gesture-input by a personalized gesture-classification model associated with the first user, executing one or more tasks based on the determined intent and the speech-input, and sending instructions for presenting execution results of the one or more tasks to the client system responsive the user request.

8.

发明公开
REAL-TIME INTERACTIVE VOICE CONVERSATION STATE MANAGEMENT IN LARGE LANGUAGE MODELS 审中-公开

公开(公告)号：US20240347058A1

公开(公告)日：2024-10-17

申请号：US18634800

申请日：2024-04-12

申请人： Animato, Inc.

发明人： Francesco Rossi , Nicholas Peretti

IPC分类号： G10L15/22 , G10L13/08 , G10L15/26

CPC分类号： G10L15/22 , G10L13/08 , G10L15/26

摘要： A method or system for managing interruptions during oral interactions between users and Large Language Models (LLMs). Initially, a user's spoken input is received and converted to text, which forms a prompt for the LLM. Upon generating a text response by the LLM, the text response is then converted back into speech and played to the user. If the user interrupts while the response is being played, the playback stops, and the interruption is captured as a new spoken input. This interruption is used to generate a new prompt for the LLM. Subsequently, the LLM generates a second text response based on the interruption, which is converted to speech and played back to the user. This process ensures that user interruptions are effectively managed, allowing for a more dynamic and interactive conversation with the LLM and enhancing the user's experience by adapting the conversation flow to real-time inputs.

9.

发明授权
Determining multilingual content in responses to a query 有权

公开(公告)号：US12118981B2

公开(公告)日：2024-10-15

申请号：US17475897

申请日：2021-09-15

申请人： GOOGLE LLC

发明人： Wangqing Yuan , Bryan Christopher Horling , David Kogan

IPC分类号： G10L15/00 , G10L13/08 , G10L15/22 , G10L15/26

CPC分类号： G10L13/086 , G10L15/22 , G10L2015/223 , G10L2015/225

摘要： Implementations relate to determining multilingual content to render at an interface in response to a user submitted query. Those implementations further relate to determining a first language response and a second language response to a query that is submitted to an automated assistant. Some of those implementations relate to determining multilingual content that includes a response to the query in both the first and second languages. Other implementations relate to determining multilingual content that includes a query suggestion in the first language and a query suggestion in a second language. Some of those implementations relate to pre-fetching results for the query suggestions prior to rendering the multilingual content.

10.

发明授权
Systems and methods for generating synthesized speech responses to voice inputs indicative of a user in a hurry 有权

公开(公告)号：US12118978B2

公开(公告)日：2024-10-15

申请号：US18387211

申请日：2023-11-06

申请人： ROVI GUIDES, INC.

发明人： Ankur Aher , Jeffry Copps Robert Jose

IPC分类号： G10L13/06 , G10L13/00 , G10L13/02 , G10L13/033 , G10L13/08 , G10L15/00 , G10L15/10 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/26 , G10L25/63

CPC分类号： G10L13/0335 , G10L25/63 , G10L13/00 , G10L13/02 , G10L13/06 , G10L13/08 , G10L15/00 , G10L15/10 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/26

摘要： The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类