-
公开(公告)号:US11562588B2
公开(公告)日:2023-01-24
申请号:US16830950
申请日:2020-03-26
IPC分类号: G06V30/412 , G06K9/62 , G06F3/04817 , G06V30/10
摘要: Interfaces and systems are provided for harvesting ground truth from forms to be used in training models based on key-value pairings in the forms and to later use the trained models to identify related key-value pairings in new forms. Initially, forms are identified and clustered to identify a subset of forms to label with the key-value pairings. Users provide input to identify keys to use in labeling and then select/highlight text from forms that are presented concurrently with the keys in order to associate the highlighted text with the key(s) as the corresponding key-value pairing(s). After labeling the forms with the key-value pairings, the key-value pairing data is used as ground truth for training a model to independently identify the key-value pairing(s) in new forms. Once trained, the model is used to identify the key-value pairing(s) in new forms.
-
公开(公告)号:US20180330272A1
公开(公告)日:2018-11-15
申请号:US15616655
申请日:2017-06-07
发明人: Yuxiao Hu , Lei Zhang , Christopher Buehler , Cha Zhang , Anna Roth , Cornelia Carapcea
IPC分类号: G06N99/00
CPC分类号: G06N99/005
摘要: A method includes obtaining a first classifier trained on a first dataset having a first dataset class, the first classifier having a plurality of first parameters, obtaining a second dataset having a second dataset class, loading the first parameters into a second classifier, merging a subset of the first dataset class and the second dataset class into a merged class, and training the second classifier using the merged class.
-
公开(公告)号:US09980040B2
公开(公告)日:2018-05-22
申请号:US15441793
申请日:2017-02-24
发明人: Oliver Arthur Whyte , Ross Cutler , Avronil Bhattacharjee , Adarsh Prakash Murthy Kowdle , Adam Kirk , Stanley T. Birchfield , Cha Zhang
CPC分类号: H04R1/406 , G01S3/80 , G06T7/75 , G06T2207/30196 , H04M3/567 , H04M3/568 , H04M2203/509 , H04M2242/30 , H04N5/23219 , H04N5/23296 , H04N7/142 , H04N7/147 , H04N7/15 , H04R3/005 , H04R29/005 , H04R2430/20
摘要: Various examples related to determining a location of an active participant are provided. In one example, image data of a room from an image capture device is received. First audio data from a first microphone array at the image capture device is received. Second audio data from a second microphone array spaced from the image capture device is received. Using a three dimensional model, a location of the second microphone array is determined. Using the first audio data, second audio data, location of the second microphone array, and an angular orientation of the second microphone array, an estimated location of the active participant is determined.
-
公开(公告)号:US12051256B2
公开(公告)日:2024-07-30
申请号:US17473623
申请日:2021-09-13
发明人: Yijuan Lu , Lynsey Liu , Andrei A. Gaivoronski , Yu Cheng , Dinei Afonso Ferreira Florencio , Cha Zhang , John Richard Corring
IPC分类号: G06K9/00 , G06F18/214 , G06F40/284 , G06N20/00 , G06V30/146 , G06V30/19 , G06V30/414 , G06V30/416
CPC分类号: G06V30/414 , G06F18/214 , G06F40/284 , G06N20/00 , G06V30/147 , G06V30/19067 , G06V30/416
摘要: The disclosure herein describes providing signature data of an input document. Text data of the input document is obtained (e.g., OCR data generated from image data) and a first set of signature fields are identified using signature key-value pairs of the text data. A first subset of signed signature fields and a first subset of unsigned signature fields are determined based on mapping to a set of predicted values. A second set of signature fields are determined using a region prediction model applied to image data of the input document. Region images associated with the first subset of unsigned signature fields and with second set of signature fields are obtained and a second set of signed signature fields and a second set of unsigned signature fields are determined using a signature recognition model. Signature output data is provided including signed signature fields and/or unsigned signature fields.
-
公开(公告)号:US20210133438A1
公开(公告)日:2021-05-06
申请号:US16830950
申请日:2020-03-26
摘要: Interfaces and systems are provided for harvesting ground truth from forms to be used in training models based on key-value pairings in the forms and to later use the trained models to identify related key-value pairings in new forms. Initially, forms are identified and clustered to identify a subset of forms to label with the key-value pairings. Users provide input to identify keys to use in labeling and then select/highlight text from forms that are presented concurrently with the keys in order to associate the highlighted text with the key(s) as the corresponding key-value pairing(s). After labeling the forms with the key-value pairings, the key-value pairing data is used as ground truth for training a model to independently identify the key-value pairing(s) in new forms. Once trained, the model is used to identify the key-value pairing(s) in new forms.
-
公开(公告)号:US20180096195A1
公开(公告)日:2018-04-05
申请号:US14952447
申请日:2015-11-25
CPC分类号: G06K9/00234 , G06K9/4642 , G06K9/6857 , G06T7/20
摘要: Examples are disclosed herein that relate to face detection. One example provides a computing device comprising a logic subsystem and a storage subsystem holding instructions executable by the logic subsystem to receive an image, apply a tile array to the image, the tile array comprising a plurality of tiles, and perform face detection on at least a subset of the tiles, where determining whether or not to perform face detection on a given tile is based on a likelihood that the tile includes at least a portion of a human face.
-
公开(公告)号:US09756284B2
公开(公告)日:2017-09-05
申请号:US14817021
申请日:2015-08-03
发明人: Cha Zhang , Timothy A. Large , Zhengyou Zhang , Ruigang Yang
摘要: The described implementations relate to enhancement images, such as in videoconferencing scenarios. One system includes a poriferous display screen having generally opposing front and back surfaces. This system also includes a camera positioned proximate to the back surface to capture an image through the poriferous display screen.
-
公开(公告)号:US09621795B1
公开(公告)日:2017-04-11
申请号:US14991847
申请日:2016-01-08
发明人: Oliver Arthur Whyte , Ross Cutler , Avronil Bhattacharjee , Adarsh Prakash Murthy Kowdle , Adam Kirk , Stanley T. Birchfield , Cha Zhang
CPC分类号: H04R1/406 , G01S3/80 , G06T7/75 , G06T2207/30196 , H04M3/567 , H04M3/568 , H04M2203/509 , H04M2242/30 , H04N5/23219 , H04N5/23296 , H04N7/142 , H04N7/147 , H04N7/15 , H04R3/005 , H04R29/005 , H04R2430/20
摘要: Various examples related to determining a location of an active speaker are provided. In one example, image data of a room from an image capture device is received and a three dimensional model is generated. First audio data from a first microphone array at the image capture device is received. Second audio data from a second microphone array laterally spaced from the image capture device is received. Using the three dimensional model, a location of the second microphone array with respect to the image capture device is determined. Using the audio data and the location and angular orientation of the second microphone array, an estimated location of the active speaker is determined. Using the estimated location, a setting for the image capture device is determined and outputted to highlight the active speaker.
-
公开(公告)号:US20160234221A1
公开(公告)日:2016-08-11
申请号:US14875298
申请日:2015-10-05
发明人: Sasa Junuzovic , Yinpeng Chen , Cha Zhang , Dinei Florencio , Zhengyou Zhang , Alastair Wolman
CPC分类号: H04L63/102 , H04L63/08 , H04L63/083 , H04L63/18 , H04M3/567 , H04W12/06 , H04W12/08 , H04W48/10
摘要: Techniques for automatically connecting to a service controller are described herein. In one example, a service controller device includes a processor and a computer-readable memory storage device storing executable instructions that cause the processor to broadcast at least one of an access credential, connection information or an access credential hash embedded in an audio signal. The processor can also authenticate a client device based on a transmission of at least one of the connection information, the access credential, or the access credential hash from the client device to the client connector and transmit data to the client device in response to authenticating the client device.
摘要翻译: 本文描述了用于自动连接到服务控制器的技术。 在一个示例中,服务控制器设备包括处理器和存储可执行指令的计算机可读存储器设备,该可执行指令使处理器广播嵌入在音频信号中的访问凭证,连接信息或访问凭证散列中的至少一个。 处理器还可以基于从客户端设备到客户机连接器的连接信息,访问凭证或访问凭证散列中的至少一个的传输来认证客户端设备,并且响应于认证该客户端设备向客户端设备发送数据 客户端设备。
-
公开(公告)号:US11093740B2
公开(公告)日:2021-08-17
申请号:US16185737
申请日:2018-11-09
摘要: The disclosed technology is generally directed to optical character recognition for forms. In one example of the technology, optical character recognition is performed on a plurality of forms. The forms of the plurality of forms include at least one type of form. Anchors are determined for the forms, including corresponding anchors for each type of form of the plurality of forms. Feature rules are determined, including corresponding feature rules for each type of form of the plurality of forms. Features and labels are determined for each form of the plurality of forms. A training model is generated based on a ground truth that includes a plurality of key-value pairs corresponding to the plurality of forms, and further based on the determined features and labels for the plurality of forms.
-
-
-
-
-
-
-
-
-