-
21.
公开(公告)号:US20240321277A1
公开(公告)日:2024-09-26
申请号:US18677629
申请日:2024-05-29
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Krishna Sapkota , Behshad Behzadi , Julia Proskurnia , Jacopo Sannazzaro Natta , Justin Lu , Magali Boizot-Roche , Marius Sajgalik , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari
CPC classification number: G10L15/26 , G10L15/22 , G10L2015/223
Abstract: Implementations described herein relate to an application and/or automated assistant that can identify arrangement operations to perform for arranging text during speech-to-text operations—without a user having to expressly identify the arrangement operations. In some instances, a user that is dictating a document (e.g., an email, a text message, etc.) can provide a spoken utterance to an application in order to incorporate textual content. However, in some of these instances, certain corresponding arrangements are needed for the textual content in the document. The textual content that is derived from the spoken utterance can be arranged by the application based on an intent, vocalization features, and/or contextual features associated with the spoken utterance and/or a type of the application associated with the document, without the user expressly identifying the corresponding arrangements. In this way, the application can infer content arrangement operations from a spoken utterance that only specifies the textual content.
-
公开(公告)号:US20240078374A1
公开(公告)日:2024-03-07
申请号:US17957489
申请日:2022-09-30
Applicant: GOOGLE LLC
Inventor: Ajay Gokhale , Jiawei Chen , Alvin Abdagic , Adrien Olczak , Alessandro Agostini , Alexander Robertson , Cristian Udrescu , Jackie Xiang , Jennifer Daniel , Keqian Yan , Mehek Sharma , Nicolo D'Ercole , Yang Lu , Dror Ayalon
IPC: G06F40/166 , G06F3/0482 , G06F3/0488 , G06F40/279 , G10L15/197 , G10L15/22 , G10L25/63
CPC classification number: G06F40/166 , G06F3/0482 , G06F3/0488 , G06F40/279 , G10L15/197 , G10L15/22 , G10L25/63 , G10L2015/223
Abstract: Implementations described herein relate to causing emoji(s) that are associated with a given emotion class expressed by a spoken utterance to be visually rendered for presentation to a user at a display of a client device of the user. Processor(s) of the client device may receive audio data that captures the spoken utterance, process the audio data to generate textual data that is predicted to correspond to the spoken utterance, and cause a transcription of the textual data to be visually rendered for presentation to the user via the display. Further, the processor(s) may determine, based on processing the textual data, whether the spoken utterance expresses a given emotion class. In response to determining that the spoken utterance expresses the given emotion class, the processor(s) may cause emoji(s) that are stored in association with the given emotion class to be visually rendered for presentation to the user via the display.
-
23.
公开(公告)号:US20240029728A1
公开(公告)日:2024-01-25
申请号:US17902560
申请日:2022-09-02
Applicant: GOOGLE LLC
Inventor: Nicolo D'Ercole , Shumin Zhai , Swante Scholz , Mehek Sharma , Adrien Olczak , Akshay Kannan , Alvin Abdagic , Julia Proskurnia , Viesturs Zarins
IPC: G10L15/22 , G10L15/08 , G06F16/683
CPC classification number: G10L15/22 , G10L15/08 , G06F16/685
Abstract: Implementations described herein generally relate to generating a modification selectable element that may be provided for presentation to a user in a smart dictation session with an automated assistant. The modification selectable element may, when selected, cause a transcription, that includes textual data generated based on processing audio data that captures a spoken utterance and that is automatically arranged, to be modified. The transcription may be automatically arranged to include spacing, punctuation, capitalization, indentations, paragraph breaks, and/or other arrangement operations that are not specified by the user in providing the spoken utterance. Accordingly, a subsequent selection of the modification selectable element may cause these automatic arrangement operation(s), and/or the textual data locationally proximate to these automatic arrangement operation(s), to be modified. Implementations described herein also relate to generating the transcription and/or the modification selectable element on behalf of a third-party software application.
-
公开(公告)号:US20220108696A1
公开(公告)日:2022-04-07
申请号:US17552887
申请日:2021-12-16
Applicant: Google LLC
Inventor: Andrea Terwisscha van Scheltinga , Nicolo D'Ercole , Zaheed Sabur , Bibo Xu , Megan Knight , Alvin Abdagic , Jan Lamecki , Bo Zhang
Abstract: Determining whether, upon cessation of a second automated assistant session that interrupted and supplanted a prior first automated assistant session: (1) to automatically resume the prior first automated assistant session, or (2) to transition to an alternative automated assistant state in which the prior first session is not automatically resumed. Implementations further relate to selectively causing, based on the determining and upon cessation of the second automated assistant session, either the automatic resumption of the prior first automated assistant session that was interrupted, or the transition to the state in which the first session is not automatically resumed.
-
-
-