-
公开(公告)号:US20220245382A1
公开(公告)日:2022-08-04
申请号:US17165135
申请日:2021-02-02
Applicant: Google LLC
Inventor: Sneha Ashok , Huize Shi , Andreina Reyna
IPC: G06K9/00 , G06N20/00 , G06K9/46 , G10L13/033 , G06F40/242 , G06F3/01 , G06F3/16 , G10H1/00
Abstract: Methods, systems, devices, and tangible non-transitory computer readable media for generating assistive indications are provided. The disclosed technology can include accessing image data that includes at least one image. Character data can be generated based at least in part on the image data and one or more optical character recognition operations. Further, the character data can include one or more characters associated with the at least one image. One or more characters that are associated with one or more recognized words and the one or more characters that are associated with one or more unrecognized words can be determined based on the character data. One or more auditory indications including a synthetic voice reciting the one or more recognized words and the one or more unrecognized words can be generated. Furthermore, the synthetic voice can recite each of the one or more unrecognized words one character at a time.
-
公开(公告)号:US11769323B2
公开(公告)日:2023-09-26
申请号:US17165135
申请日:2021-02-02
Applicant: Google LLC
Inventor: Sneha Ashok , Huize Shi , Andreina Reyna
IPC: G06V30/148 , G06V20/20 , G06N20/00 , G06F40/242 , G06F3/01 , G06F3/16 , G10H1/00 , G10L13/033 , G06V10/40 , G06V30/414
CPC classification number: G06V20/20 , G06F3/016 , G06F3/165 , G06F40/242 , G06N20/00 , G06V10/40 , G06V30/153 , G06V30/414 , G10H1/0008 , G10L13/033
Abstract: Methods, systems, devices, and tangible non-transitory computer readable media for generating assistive indications are provided. The disclosed technology can include accessing image data that includes at least one image. Character data can be generated based at least in part on the image data and one or more optical character recognition operations. Further, the character data can include one or more characters associated with the at least one image. One or more characters that are associated with one or more recognized words and the one or more characters that are associated with one or more unrecognized words can be determined based on the character data. One or more auditory indications including a synthetic voice reciting the one or more recognized words and the one or more unrecognized words can be generated. Furthermore, the synthetic voice can recite each of the one or more unrecognized words one character at a time.
-