Patent search: ap:("Google LLC") AND inv:"Jack Whyte" (page 1)

1.

Invention publication
Systems and Methods for a Text-To-Speech Interface (Status: pending, published)

Publication number: US20240304173A1

Publication date: 2024-09-12

Application number: US18665861

Filing date: 2024-05-16

Applicant: Google LLC

Inventors: Benedict Davies, Guillaume Boniface, Jack Whyte, Jakub Adamek, Simon Tokumine, Alessio Macri, Matthias Quasthoff

IPC classification: G10L13/00, G06F3/16, G06F40/30

CPC classification: G10L13/00, G06F3/167, G06F40/30

Abstract: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.
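
The flow described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the document is already split into blocks with known vertical extents, that the playback position is a vertical offset, and that a structural feature only affects the pause after a block. The names `Block`, `content_at`, and `to_speech_plan`, and the pause values, are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Block:
    kind: str    # structural feature, e.g. "heading" or "paragraph"
    text: str
    top: int     # vertical extent of the block in the rendered document
    bottom: int


def content_at(blocks, playback_position):
    """Return the blocks from the one displayed at the playback position onward."""
    for i, block in enumerate(blocks):
        if block.top <= playback_position < block.bottom:
            return blocks[i:]
    return []


def to_speech_plan(blocks):
    """Pair each block's text with a pause (in ms) chosen from its structure."""
    pauses = {"heading": 600, "paragraph": 300}  # assumed values, for illustration
    return [(b.text, pauses.get(b.kind, 300)) for b in blocks]


# Hypothetical document layout.
document = [
    Block("heading", "Introduction", 0, 40),
    Block("paragraph", "First paragraph.", 40, 120),
    Block("heading", "Details", 120, 160),
]

plan = to_speech_plan(content_at(document, playback_position=50))
print(plan)
```

A real system would hand each text and pause pair to a speech synthesizer; the sketch stops at the plan so that it stays self-contained.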

2.

Invention grant
Systems and methods for a text-to-speech interface (Status: in force)

Publication number: US11145288B2

Publication date: 2021-10-12

Application number: US16418492

Filing date: 2019-05-21

Applicant: Google LLC

Inventors: Benedict Davies, Guillaume Boniface, Jack Whyte, Jakub Adamek, Simon Tokumine, Alessio Macri, Matthias Quasthoff

IPC classification: G10L13/00, G10L13/08, G06F3/16, G06F40/30

Abstract: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.

3.

Invention grant
Providing a personal assistant module with a selectively-traversable state machine (Status: in force)

Publication number: US10694344B2

Publication date: 2020-06-23

Application number: US16585054

Filing date: 2019-09-27

Applicant: Google LLC

Inventors: Adrian Diaconu, Jing Qian, Jack Whyte, Ning Ma, Shir Yehoshua

IPC classification: G06F9/54, G10L13/00, G10L15/26, G10L15/22, H04W88/02, H04W4/16, H04W76/10

Abstract: Techniques are described herein for leveraging an online semantic processor to generate a finite state machine to be delivered to and implemented on a resource-constrained device. In various implementations, a user request may be received by a personal assistant module. Data indicative of the user request may be uploaded, e.g., by the personal assistant module, to an online semantic processor. Data may be received, e.g., by the personal assistant module, from the online semantic processor. The received data may represent a state machine having a plurality of candidate states of the personal assistant module. Each candidate state may be potentially responsive to the request. Resources local to the resource-constrained device may be analyzed to ascertain signal(s) unavailable to the online semantic processor. The personal assistant module may then transition to a given state of the plurality of candidate states. The given state may be selected based on the signal(s).
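
The selection step in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the online semantic processor returns candidate states as plain records that each list the local signals they require, and that the device takes the first candidate whose requirements match. The names `CandidateState` and `select_state`, the signal keys, and the fallback state are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class CandidateState:
    """One candidate state received from the (hypothetical) online processor."""
    name: str
    # Local signals this state requires, e.g. {"has_contact": True}.
    required_signals: dict = field(default_factory=dict)


def select_state(candidates, local_signals, fallback="ask_user"):
    """Return the name of the first candidate whose required signals all
    match the signals observed on the device, else the fallback name."""
    for state in candidates:
        if all(local_signals.get(k) == v for k, v in state.required_signals.items()):
            return state.name
    return fallback


# Hypothetical candidate states for a request such as "call Bob".
candidates = [
    CandidateState("dial_contact", {"has_contact": True, "has_phone_app": True}),
    CandidateState("open_contact_editor", {"has_contact": False}),
]

# Signals visible only on the device (assumed for illustration).
local_signals = {"has_contact": True, "has_phone_app": True}
chosen = select_state(candidates, local_signals)
print(chosen)
```

The point of the design is that the server enumerates the possible responses once, and the device resolves the choice using information the server cannot see, without a second round trip.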

4.

Invention application
Systems and Methods for a Text-To-Speech Interface (Status: pending, published)

Publication number: US20200035218A1

Publication date: 2020-01-30

Application number: US16418492

Filing date: 2019-05-21

Applicant: Google LLC

Inventors: Benedict Davies, Guillaume Boniface, Jack Whyte, Jakub Adamek, Simon Tokumine, Alessio Macri, Matthias Quasthoff

IPC classification: G10L13/04, G06F17/27, G06F3/16

Abstract: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.

5.

Invention application
Providing a Personal Assistant Module with a Selectively-Traversable State Machine (Status: pending, published)

Publication number: US20200029182A1

Publication date: 2020-01-23

Application number: US16585054

Filing date: 2019-09-27

Applicant: Google LLC

Inventors: Adrian Diaconu, Jing Qian, Jack Whyte, Ning Ma, Shir Yehoshua

IPC classification: H04W4/16, G06F9/54, H04W76/10, G10L13/00, G10L15/26

Abstract: Techniques are described herein for leveraging an online semantic processor to generate a finite state machine to be delivered to and implemented on a resource-constrained device. In various implementations, a user request may be received by a personal assistant module. Data indicative of the user request may be uploaded, e.g., by the personal assistant module, to an online semantic processor. Data may be received, e.g., by the personal assistant module, from the online semantic processor. The received data may represent a state machine having a plurality of candidate states of the personal assistant module. Each candidate state may be potentially responsive to the request. Resources local to the resource-constrained device may be analyzed to ascertain signal(s) unavailable to the online semantic processor. The personal assistant module may then transition to a given state of the plurality of candidate states. The given state may be selected based on the signal(s).

6.

Invention grant
Systems and methods for a text-to-speech interface (Status: in force)

Publication number: US12020681B2

Publication date: 2024-06-25

Application number: US17487114

Filing date: 2021-09-28

Applicant: Google LLC

Inventors: Benedict Davies, Guillaume Boniface, Jack Whyte, Jakub Adamek, Simon Tokumine, Alessio Macri, Matthias Quasthoff

IPC classification: G10L13/00, G06F3/16, G06F40/30

CPC classification: G10L13/00, G06F3/167, G06F40/30

Abstract: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.

7.

Invention application
Systems and Methods for a Text-To-Speech Interface (Status: in force)

Publication number: US20220013104A1

Publication date: 2022-01-13

Application number: US17487114

Filing date: 2021-09-28

Applicant: Google LLC

Inventors: Benedict Davies, Guillaume Boniface-Chang, Jack Whyte, Jakub Adamek, Simon Tokumine, Alessio Macri, Matthias Quasthoff

IPC classification: G10L13/00, G06F3/16, G06F40/30

Abstract: A computing system and related techniques for selecting content to be automatically converted to speech and provided as an audio signal are provided. A text-to-speech request associated with a first document can be received that includes data associated with a playback position of a selector associated with a text-to-speech interface overlaid on the first document. First content associated with the first document can be determined based at least in part on the playback position, the first content including content that is displayed in the user interface at the playback position. The first document can be analyzed to identify one or more structural features associated with the first content. Speech data can be generated based on the first content and the one or more structural features.
