-
公开(公告)号:US20240420680A1
公开(公告)日:2024-12-19
申请号:US18337168
申请日:2023-06-19
Applicant: GOOGLE LLC
Inventor: Te I , Chris Kau , Jeffrey Robert Pitman , Robert Eric Genter , Qi Ge , Wolfgang Macherey , Dirk Ryan Padfield , Naveen Arivazhagan , Colin Cherry
Abstract: Implementations relate to a multimodal translation application that can provide an abridged version of a translation through an audio interface of a computing device, while simultaneously providing a verbatim textual translation at a display interface of the computing device. The application can provide these different versions of the translation in certain circumstances when, for example, the rate of speech of a person speaking to a user is relatively high compared to a preferred rate of speech of the user. For example, a comparison between phonemes of an original language speech and a translated language speech can be performed to determine whether the ratio satisfies a threshold for providing an audible abridged translation. A determination to provide the abridged translation can additionally or alternatively be based on a determined language of the speaker.
-
公开(公告)号:US11562152B2
公开(公告)日:2023-01-24
申请号:US17030093
申请日:2020-09-23
Applicant: Google LLC
Inventor: Naveen Arivazhagan , Colin Andrew Cherry , Wolfgang Macherey , Te I , George Foster , Pallavi N Baljekar
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer-storage media, for re-translation for simultaneous, spoken-language machine translation. In some implementations, a stream of audio data comprising speech in a first language is received. A transcription for the speech in the stream of audio data is generated using an automated speech recognizer through a series of updates. A translation of the transcription into a second language is generated using a machine translation module. The translation is generated with translation iterations that translate increasing amounts of the transcription, including re-translating previously portions of the transcription. A series of translation updates are provided to a client device based on the translation iterations.
-
公开(公告)号:US20220092274A1
公开(公告)日:2022-03-24
申请号:US17030093
申请日:2020-09-23
Applicant: Google LLC
Inventor: Naveen Arivazhagan , Colin Andrew Cherry , Wolfgang Macherey , Te I , George Foster , Pallavi N. Baljekar
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer-storage media, for re-translation for simultaneous, spoken-language machine translation. In some implementations, a stream of audio data comprising speech in a first language is received. A transcription for the speech in the stream of audio data is generated using an automated speech recognizer through a series of updates. A translation of the transcription into a second language is generated using a machine translation module. The translation is generated with translation iterations that translate increasing amounts of the transcription, including re-translating previously portions of the transcription. A series of translation updates are provided to a client device based on the translation iterations.
-
-