-
Publication No.: US12131522B2
Publication date: 2024-10-29
Application No.: US17077316
Filing date: 2020-10-22
Applicant: Meta Platforms, Inc.
Inventors: Jiedan Zhu, Fuchun Peng, Benoit F. Dumoulin, Xiaohu Liu, Rajen Subba, Mohsen Agsen, Michael Robert Hanson
IPC classification: G06F16/338, G06F3/01, G06F3/16, G06F7/14, G06F9/451, G06F16/176, G06F16/22, G06F16/23, G06F16/242, G06F16/2455, G06F16/2457, G06F16/248, G06F16/33, G06F16/332, G06F16/903, G06F16/9032, G06F16/9038, G06F16/904, G06F16/951, G06F16/9535, G06F18/2411, G06F40/205, G06F40/295, G06F40/30, G06F40/40, G06N3/006, G06N3/08, G06N7/01, G06N20/00, G06Q50/00, G06V10/764, G06V10/82, G06V20/10, G06V40/20, G10L15/02, G10L15/06, G10L15/07, G10L15/16, G10L15/18, G10L15/183, G10L15/187, G10L15/22, G10L15/26, G10L17/06, G10L17/22, H04L5/02, H04L12/28, H04L41/00, H04L41/22, H04L43/0882, H04L43/0894, H04L51/02, H04L51/18, H04L51/216, H04L51/52, H04L67/306, H04L67/50, H04L67/5651, H04L67/75, H04W12/08, G10L13/00, G10L13/04, H04L51/046, H04L67/10, H04L67/53
CPC classification: G06V10/82, G06F3/011, G06F3/013, G06F3/017, G06F3/167, G06F7/14, G06F9/453, G06F16/176, G06F16/2255, G06F16/2365, G06F16/243, G06F16/24552, G06F16/24575, G06F16/24578, G06F16/248, G06F16/3323, G06F16/3329, G06F16/3344, G06F16/338, G06F16/90332, G06F16/90335, G06F16/9038, G06F16/904, G06F16/951, G06F16/9535, G06F18/2411, G06F40/205, G06F40/295, G06F40/30, G06F40/40, G06N3/006, G06N3/08, G06N7/01, G06N20/00, G06Q50/01, G06V10/764, G06V20/10, G06V40/28, G10L15/02, G10L15/063, G10L15/07, G10L15/16, G10L15/1815, G10L15/1822, G10L15/183, G10L15/187, G10L15/22, G10L15/26, G10L17/06, G10L17/22, H04L5/02, H04L12/2816, H04L41/20, H04L41/22, H04L43/0882, H04L43/0894, H04L51/02, H04L51/18, H04L51/216, H04L51/52, H04L67/306, H04L67/535, H04L67/5651, H04L67/75, H04W12/08, G06F2216/13, G10L13/00, G10L13/04, G10L2015/223, G10L2015/225, H04L51/046, H04L67/10, H04L67/53
Abstract: In one embodiment, a method includes receiving a first user input from a first user, wherein the first user input comprises a partial request, presenting one or more suggested intent auto-completions corresponding to the partial request, receiving a selection by the first user of a first suggested intent auto-completion of the suggested intent auto-completions and a second user input, presenting one or more suggested slot auto-completions corresponding to one or more candidate slot-hypotheses corresponding to the second user input, respectively, wherein each of the candidate slot-hypotheses comprises a slot-suggestion, and wherein each suggested slot auto-completion comprises the second user input and the corresponding candidate slot-hypothesis, receiving a selection by the first user of a first suggested slot auto-completion of the suggested slot auto-completions, and presenting execution results of one or more tasks corresponding to the first suggested intent auto-completion and the first suggested slot auto-completion.
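The two-stage flow in this abstract (intent suggestions first, then slot suggestions for the chosen intent) can be sketched roughly as follows. This is only an illustration under assumptions: the `CATALOG` contents, the function names, and the simple prefix matching are all hypothetical and are not taken from the patent, which does not specify how candidates are generated.

```python
# Hypothetical catalog: intent -> slot name -> known slot values.
CATALOG = {
    "play music": {"artist": ["Adele", "Aretha Franklin"], "genre": ["ambient", "jazz"]},
    "play podcast": {"show": ["Radiolab"]},
    "call": {"contact": ["Alice", "Adam"]},
}


def suggest_intents(partial_request: str) -> list[str]:
    """Return intents whose name starts with the partial request."""
    prefix = partial_request.strip().lower()
    return [intent for intent in CATALOG if intent.startswith(prefix)]


def suggest_slots(intent: str, second_input: str) -> list[dict]:
    """Return slot auto-completions for the selected intent.

    Each entry pairs the second user input with one candidate
    slot hypothesis (slot name plus suggested value).
    """
    prefix = second_input.strip().lower()
    suggestions = []
    for slot, values in CATALOG[intent].items():
        for value in values:
            if value.lower().startswith(prefix):
                suggestions.append(
                    {"input": second_input, "slot": slot, "suggestion": value}
                )
    return suggestions


if __name__ == "__main__":
    print(suggest_intents("pl"))
    print(suggest_slots("play music", "a"))
```

A production assistant would presumably rank candidates with a learned model rather than prefix matching; the sketch only shows how the second stage is conditioned on the intent selected in the first.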
-
Publication No.: US12131127B2
Publication date: 2024-10-29
Application No.: US18088588
Filing date: 2022-12-25
IPC classification: G06N5/02, G06F16/242, G06F16/31, G06F16/332, G06F16/951, G06F40/123, G06F40/126, G06F40/20, G06F40/205, G06F40/211, G06F40/226, G06F40/242, G06F40/279, G06F40/30, G06F40/35, G06F40/45, G06F40/47, G06F40/58, G06N3/0442, G06N3/0455, G06N3/0499, G06N3/08, G06Q10/1053, G06Q30/0251, G06Q30/0601, G10L15/16, G10L15/18, G10L15/22, G10L15/26, G10L25/63, G16H10/60, H04L51/02, G06N3/091, G10L15/08
CPC classification: G06F40/35, G06F16/243, G06F16/322, G06F16/3329, G06F16/951, G06F40/123, G06F40/126, G06F40/20, G06F40/205, G06F40/211, G06F40/226, G06F40/242, G06F40/279, G06F40/30, G06F40/45, G06F40/47, G06F40/58, G06N3/0442, G06N3/0455, G06N3/0499, G06N3/08, G06N5/02, G06Q10/1053, G06Q30/0255, G06Q30/0257, G06Q30/0631, G10L15/16, G10L15/1815, G10L15/22, G10L15/26, G10L25/63, G16H10/60, H04L51/02, G06N3/091, G10L2015/088
Abstract: A computer implemented method for automated analysis or use of data, comprising: (a) storing in a non-transitory computer-readable medium a structured, machine-readable representation of data that conforms to a machine-readable language, wherein the data relates to social media postings; (b) automatically processing the structured, machine-readable representation of data to determine if the social media postings are compliant with requirements preventing abusive or illegal social media postings.
-
Publication No.: US20240355331A1
Publication date: 2024-10-24
Application No.: US18760626
Filing date: 2024-07-01
Abstract: Systems and methods for providing enhanced teleconferencing. An example method includes receiving audio streams from a plurality of client devices of participants of a teleconference; converting the audio streams for a first conversation within the teleconference into first text; converting the audio streams for a second conversation within the teleconference into second text; analyzing the first text to identify one or more topics being discussed in the first conversation; analyzing the second text to identify one or more topics being discussed in the second conversation; and presenting, in a teleconference user interface, at least one of the one or more topics being discussed in the first conversation or the one or more topics being discussed in the second conversation.
-
Publication No.: US20240355329A1
Publication date: 2024-10-24
Application No.: US18138707
Filing date: 2023-04-24
Applicant: Logitech Europe S.A.
Inventors: Nicolas CHAUVIN, Yan CHETELAT, Anna Maria PUCHALSKA, Evan Patrick KELLY, Curtis Devin BROWN, Robin Antero Olof PIISPANEN, Madelene Rae STANLEY
IPC classification: G10L15/26, G06F40/166
CPC classification: G10L15/26, G06F40/166
Abstract: Embodiments herein include a processing system and method for transcribing audible information, including converting audible information or data received from a user into alphanumeric data. The alphanumeric data can be processed allowing a user to provide input that improves the accuracy of the alphanumeric data, facilitates the transfer of the alphanumeric data to other electronic devices by a computer or other electronic device, and/or improves the communicative or expressive properties of the alphanumeric data in an electronic communication that is provided to one or more users. In some embodiments, the transcribed and/or translated text can be automatically formatted by a program for use in a software application. More specifically, embodiments of the present application disclose a system and program that can embellish transcribed text to alert and provide suggestions for the correction of potentially inaccurate transcribed and/or translated text, and/or provide potential emojis to add into or replace text.
-
Publication No.: US20240355328A1
Publication date: 2024-10-24
Application No.: US18138295
Filing date: 2023-04-24
Applicant: Verbit, Inc.
Inventors: Maksym SARANA, Ariel COHEN, Irit OFER
Abstract: A method, system and computer program product for transcribing audio signals, the method comprising: obtaining a source audio signal; obtaining metadata associated with the audio signal; analyzing the metadata; extracting acoustic features from the source audio signal; determining a difficulty level assessment of transcribing the audio signal, based at least on the metadata and acoustic features; selecting, based on the level of transcription difficulty, a first transcription option; and providing a related audio signal which is related to the source audio signal to the first transcription option over a communication channel, to obtain a transcription of the related audio signal.
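As a rough illustration of the routing step in this abstract (score the difficulty from metadata and acoustic features, then pick a transcription option), a minimal sketch might look like this. Everything concrete here is an assumption for illustration: the feature names (`snr_db`, `speech_rate_wps`), the point weights, the thresholds, and the three option names do not come from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class AudioJob:
    # Hypothetical metadata, e.g. {"speakers": 3, "domain": "legal"}.
    metadata: dict = field(default_factory=dict)
    # Hypothetical acoustic features extracted from the source audio.
    snr_db: float = 30.0           # signal-to-noise ratio
    speech_rate_wps: float = 2.0   # words per second


def difficulty_points(job: AudioJob) -> int:
    """Combine metadata and acoustic features into a 0-10 difficulty score."""
    points = 0
    if job.snr_db < 15.0:                                   # noisy recording
        points += 4
    if job.speech_rate_wps > 3.0:                           # fast speech
        points += 2
    if job.metadata.get("speakers", 1) > 2:                 # many speakers
        points += 2
    if job.metadata.get("domain") in {"legal", "medical"}:  # jargon-heavy
        points += 2
    return points


def select_transcription_option(job: AudioJob) -> str:
    """Map the difficulty score to one of three illustrative options."""
    points = difficulty_points(job)
    if points < 3:
        return "automatic_asr"
    if points < 7:
        return "asr_with_human_review"
    return "human_transcriber"


if __name__ == "__main__":
    noisy_meeting = AudioJob({"speakers": 3}, snr_db=10.0)
    print(select_transcription_option(noisy_meeting))
```

The rule-based score stands in for whatever assessment the claimed method uses; a learned classifier could replace `difficulty_points` without changing the selection step.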
-
Publication No.: US12126872B2
Publication date: 2024-10-22
Application No.: US18181461
Filing date: 2023-03-09
IPC classification: H04N21/478, G10L15/26, H04N21/2389, H04N21/658
CPC classification: H04N21/47815, G10L15/26, H04N21/23892, H04N21/6581
Abstract: A media presentation device determines a voice command associated with media content presented by the media presentation device. The media presentation device then listens for and detects utterance of the determined voice command during presentation of the media content, and the media presentation device responds to the detected utterance by performing an action that facilitates user purchase of a good or service associated with the media content.
-
Publication No.: US12125272B2
Publication date: 2024-10-22
Application No.: US18449525
Filing date: 2023-08-14
IPC classification: G06V10/82, G06F3/01, G06F3/16, G06F7/14, G06F9/451, G06F16/176, G06F16/22, G06F16/23, G06F16/242, G06F16/2455, G06F16/2457, G06F16/248, G06F16/33, G06F16/332, G06F16/338, G06F16/903, G06F16/9032, G06F16/9038, G06F16/904, G06F16/951, G06F16/9535, G06F18/2411, G06F40/205, G06F40/295, G06F40/30, G06F40/40, G06N3/006, G06N3/08, G06N7/01, G06N20/00, G06Q50/00, G06V10/764, G06V20/10, G06V40/20, G10L15/02, G10L15/06, G10L15/07, G10L15/16, G10L15/18, G10L15/183, G10L15/187, G10L15/22, G10L15/26, G10L17/06, G10L17/22, H04L5/02, H04L12/28, H04L41/00, H04L41/22, H04L43/0882, H04L43/0894, H04L51/02, H04L51/18, H04L51/216, H04L51/52, H04L67/306, H04L67/50, H04L67/5651, H04L67/75, H04W12/08, G10L13/00, G10L13/04, H04L51/046, H04L67/10, H04L67/53
CPC classification: G06V10/82, G06F3/011, G06F3/013, G06F3/017, G06F3/167, G06F7/14, G06F9/453, G06F16/176, G06F16/2255, G06F16/2365, G06F16/243, G06F16/24552, G06F16/24575, G06F16/24578, G06F16/248, G06F16/3323, G06F16/3329, G06F16/3344, G06F16/338, G06F16/90332, G06F16/90335, G06F16/9038, G06F16/904, G06F16/951, G06F16/9535, G06F18/2411, G06F40/205, G06F40/295, G06F40/30, G06F40/40, G06N3/006, G06N3/08, G06N7/01, G06N20/00, G06Q50/01, G06V10/764, G06V20/10, G06V40/28, G10L15/02, G10L15/063, G10L15/07, G10L15/16, G10L15/1815, G10L15/1822, G10L15/183, G10L15/187, G10L15/22, G10L15/26, G10L17/06, G10L17/22, H04L5/02, H04L12/2816, H04L41/20, H04L41/22, H04L43/0882, H04L43/0894, H04L51/02, H04L51/18, H04L51/216, H04L51/52, H04L67/306, H04L67/535, H04L67/5651, H04L67/75, H04W12/08, G06F2216/13, G10L13/00, G10L13/04, G10L2015/223, G10L2015/225, H04L51/046, H04L67/10, H04L67/53
Abstract: In one embodiment, a method includes receiving, from a client system associated with a first user, a user request from the first user, wherein the user request comprises a gesture-input from the first user and a speech-input from the first user, determining an intent corresponding to the user request based on the gesture-input by a personalized gesture-classification model associated with the first user, executing one or more tasks based on the determined intent and the speech-input, and sending instructions for presenting execution results of the one or more tasks to the client system responsive to the user request.
-
Publication No.: US20240347058A1
Publication date: 2024-10-17
Application No.: US18634800
Filing date: 2024-04-12
Applicant: Animato, Inc.
Inventors: Francesco Rossi, Nicholas Peretti
Abstract: A method or system for managing interruptions during oral interactions between users and Large Language Models (LLMs). Initially, a user's spoken input is received and converted to text, which forms a prompt for the LLM. Once the LLM generates a text response, that response is converted back into speech and played to the user. If the user interrupts while the response is being played, the playback stops, and the interruption is captured as a new spoken input. This interruption is used to generate a new prompt for the LLM. Subsequently, the LLM generates a second text response based on the interruption, which is converted to speech and played back to the user. This process ensures that user interruptions are effectively managed, allowing for a more dynamic and interactive conversation with the LLM and enhancing the user's experience by adapting the conversation flow to real-time inputs.
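The interruption loop described in this abstract can be sketched as below. This is a simplified model under stated assumptions, not the patented implementation: speech-to-text and text-to-speech are left out, so plain strings stand in for audio, playback is simulated word by word, and `llm` and `poll_interrupt` are hypothetical callables supplied by the caller.

```python
from typing import Callable, Optional


def play_response(
    chunks: list[str], poll_interrupt: Callable[[], Optional[str]]
) -> tuple[list[str], Optional[str]]:
    """Play chunks in order; stop as soon as the user interrupts.

    Returns the chunks actually played and the interruption text (or None).
    """
    played = []
    for chunk in chunks:
        interruption = poll_interrupt()  # check before each chunk is played
        if interruption is not None:
            return played, interruption
        played.append(chunk)
    return played, None


def run_turns(
    first_input: str,
    llm: Callable[[str], str],
    poll_interrupt: Callable[[], Optional[str]],
) -> list[tuple[str, str]]:
    """Run the conversation until a response plays without interruption.

    Each interruption becomes the prompt for the next LLM call.
    """
    transcript = []
    prompt: Optional[str] = first_input
    while prompt is not None:
        response = llm(prompt)
        played, interruption = play_response(response.split(), poll_interrupt)
        transcript.append((prompt, " ".join(played)))
        prompt = interruption
    return transcript


if __name__ == "__main__":
    # Scripted events: the user interrupts before the third word is played.
    events = iter([None, None, "shorter please"])
    turns = run_turns(
        "hello",
        llm=lambda p: f"answer to {p} in several words",
        poll_interrupt=lambda: next(events, None),
    )
    for prompt, spoken in turns:
        print(f"{prompt!r} -> {spoken!r}")
```

In this sketch the new prompt is the interruption text alone; a real system would likely also pass the conversation history and the part of the response already heard.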
-
Publication No.: US12118981B2
Publication date: 2024-10-15
Application No.: US17475897
Filing date: 2021-09-15
Applicant: GOOGLE LLC
CPC classification: G10L13/086, G10L15/22, G10L2015/223, G10L2015/225
Abstract: Implementations relate to determining multilingual content to render at an interface in response to a user-submitted query. Those implementations further relate to determining a first language response and a second language response to a query that is submitted to an automated assistant. Some of those implementations relate to determining multilingual content that includes a response to the query in both the first and second languages. Other implementations relate to determining multilingual content that includes a query suggestion in the first language and a query suggestion in a second language. Some of those implementations relate to pre-fetching results for the query suggestions prior to rendering the multilingual content.
-
Publication No.: US12118978B2
Publication date: 2024-10-15
Application No.: US18387211
Filing date: 2023-11-06
Applicant: ROVI GUIDES, INC.
IPC classification: G10L13/06, G10L13/00, G10L13/02, G10L13/033, G10L13/08, G10L15/00, G10L15/10, G10L15/16, G10L15/18, G10L15/22, G10L15/26, G10L25/63
CPC classification: G10L13/0335, G10L25/63, G10L13/00, G10L13/02, G10L13/06, G10L13/08, G10L15/00, G10L15/10, G10L15/16, G10L15/18, G10L15/22, G10L15/26
Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response by determining prosodic characteristics based on the response and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes an answer to the call of the voice input that is more natural, more relevant, or both. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.