-
公开(公告)号:US12020681B2
公开(公告)日:2024-06-25
申请号:US17487114
申请日:2021-09-28
申请人: Google LLC
发明人: Benedict Davies , Guillaume Boniface , Jack Whyte , Jakub Adamek , Simon Tokumine , Alessio Macri , Matthias Quasthoff
摘要: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.
-
公开(公告)号:US20220013104A1
公开(公告)日:2022-01-13
申请号:US17487114
申请日:2021-09-28
申请人: Google LLC
发明人: Benedict Davies , Guillaume Boniface-Chang , Jack Whyte , Jakub Adamek , Simon Tokumine , Alessio Macri , Matthias Quasthoff
摘要: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.
-
公开(公告)号:US11145288B2
公开(公告)日:2021-10-12
申请号:US16418492
申请日:2019-05-21
申请人: Google LLC
发明人: Benedict Davies , Guillaume Boniface , Jack Whyte , Jakub Adamek , Simon Tokumine , Alessio Macri , Matthias Quasthoff
摘要: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.
-
公开(公告)号:US20240127792A1
公开(公告)日:2024-04-18
申请号:US18535279
申请日:2023-12-11
申请人: Google LLC
IPC分类号: G10L13/08 , G06F3/0485 , G10L13/02
CPC分类号: G10L13/08 , G06F3/0485 , G10L13/02
摘要: An audio playback system that provides intuitive audio playback of textual content responsive to user input actions, such as scrolling portions of textual content on a display. Playback of audio (e.g., text-to-speech audio) that includes textual content can begin based on a portion of textual content being positioned by a user input at a certain position on a device display. As one example, a user can simply scroll through a webpage or other content item to cause a text-to-speech system to perform audio playback of textual content displayed in one or more playback section(s) of the device's viewport (e.g., rather than requiring the user to perform additional tapping or gesturing to specifically select a certain portion of textual content).
-
公开(公告)号:US20200035218A1
公开(公告)日:2020-01-30
申请号:US16418492
申请日:2019-05-21
申请人: Google LLC
发明人: Benedict Davies , Guillaume Boniface , Jack Whyte , Jakub Adamek , Simon Tokumine , Alessio Macri , Matthias Quasthoff
摘要: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.
-
公开(公告)号:US20240304173A1
公开(公告)日:2024-09-12
申请号:US18665861
申请日:2024-05-16
申请人: Google LLC
发明人: Benedict Davies , Guillaume Boniface , Jack Whyte , Jakub Adamek , Simon Tokumine , Alessio Macri , Matthias Quasthoff
摘要: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.
-
公开(公告)号:US11887581B2
公开(公告)日:2024-01-30
申请号:US17052046
申请日:2019-11-14
IPC分类号: G06F3/04883 , G10L13/08 , G06F3/0485 , G10L13/02
CPC分类号: G10L13/08 , G06F3/0485 , G10L13/02
摘要: An audio playback system that provides intuitive audio playback of textual content responsive to user input actions, such as scrolling portions of textual content on a display. Playback of audio (e.g., text-to-speech audio) that includes textual content can begin based on a portion of textual content being positioned by a user input at a certain position on a device display. As one example, a user can simply scroll through a webpage or other content item to cause a text-to-speech system to perform audio playback of textual content displayed in one or more playback section(s) of the device's viewport (e.g., rather than requiring the user to perform additional tapping or gesturing to specifically select a certain portion of textual content).
-
公开(公告)号:US20230094174A1
公开(公告)日:2023-03-30
申请号:US17052046
申请日:2019-11-14
IPC分类号: G10L13/08 , G10L13/02 , G06F3/0485
摘要: An audio playback system that provides intuitive audio playback of textual content responsive to user input actions, such as scrolling portions of textual content on a display. Playback of audio (e.g., text-to-speech audio) that includes textual content can begin based on a portion of textual content being positioned by a user input at a certain position on a device display. As one example, a user can simply scroll through a webpage or other content item to cause a text-to-speech system to perform audio playback of textual content displayed in one or more playback section(s) of the device's viewport (e.g., rather than requiring the user to perform additional tapping or gesturing to specifically select a certain portion of textual content).
-
-
-
-
-
-
-