专利检索 ap:("Microsoft Technology Licensing, LLC.") AND inv:"Cha Zhang" 第 1 页

1.

发明授权
Enhanced supervised form understanding 有权

公开(公告)号：US11562588B2

公开(公告)日：2023-01-24

申请号：US16830950

申请日：2020-03-26

申请人： MICROSOFT TECHNOLOGY LICENSING, LLC

发明人： Dinei Afonso Ferreira Florencio , Yu-Yun Dai , Cha Zhang , Shih Chia Wang

IPC分类号： G06V30/412 , G06K9/62 , G06F3/04817 , G06V30/10

摘要： Interfaces and systems are provided for harvesting ground truth from forms to be used in training models based on key-value pairings in the forms and to later use the trained models to identify related key-value pairings in new forms. Initially, forms are identified and clustered to identify a subset of forms to label with the key-value pairings. Users provide input to identify keys to use in labeling and then select/highlight text from forms that are presented concurrently with the keys in order to associate the highlighted text with the key(s) as the corresponding key-value pairing(s). After labeling the forms with the key-value pairings, the key-value pairing data is used as ground truth for training a model to independently identify the key-value pairing(s) in new forms. Once trained, the model is used to identify the key-value pairing(s) in new forms.

2.

发明申请
Method of Adding Classes to Classifier 审中-公开

公开(公告)号：US20180330272A1

公开(公告)日：2018-11-15

申请号：US15616655

申请日：2017-06-07

申请人： Microsoft Technology Licensing, LLC

发明人： Yuxiao Hu , Lei Zhang , Christopher Buehler , Cha Zhang , Anna Roth , Cornelia Carapcea

IPC分类号： G06N99/00

CPC分类号： G06N99/005

摘要： A method includes obtaining a first classifier trained on a first dataset having a first dataset class, the first classifier having a plurality of first parameters, obtaining a second dataset having a second dataset class, loading the first parameters into a second classifier, merging a subset of the first dataset class and the second dataset class into a merged class, and training the second classifier using the merged class.

3.

发明授权
Active speaker location detection 有权

公开(公告)号：US09980040B2

公开(公告)日：2018-05-22

申请号：US15441793

申请日：2017-02-24

申请人： Microsoft Technology Licensing, LLC

发明人： Oliver Arthur Whyte , Ross Cutler , Avronil Bhattacharjee , Adarsh Prakash Murthy Kowdle , Adam Kirk , Stanley T. Birchfield , Cha Zhang

IPC分类号： H04N7/15 , H04R1/40 , H04N7/14

CPC分类号： H04R1/406 , G01S3/80 , G06T7/75 , G06T2207/30196 , H04M3/567 , H04M3/568 , H04M2203/509 , H04M2242/30 , H04N5/23219 , H04N5/23296 , H04N7/142 , H04N7/147 , H04N7/15 , H04R3/005 , H04R29/005 , H04R2430/20

摘要： Various examples related to determining a location of an active participant are provided. In one example, image data of a room from an image capture device is received. First audio data from a first microphone array at the image capture device is received. Second audio data from a second microphone array spaced from the image capture device is received. Using a three dimensional model, a location of the second microphone array is determined. Using the first audio data, second audio data, location of the second microphone array, and an angular orientation of the second microphone array, an estimated location of the active participant is determined.

4.

发明授权
Entry detection and recognition for custom forms 有权

公开(公告)号：US12051256B2

公开(公告)日：2024-07-30

申请号：US17473623

申请日：2021-09-13

申请人： Microsoft Technology Licensing, LLC

发明人： Yijuan Lu , Lynsey Liu , Andrei A. Gaivoronski , Yu Cheng , Dinei Afonso Ferreira Florencio , Cha Zhang , John Richard Corring

IPC分类号： G06K9/00 , G06F18/214 , G06F40/284 , G06N20/00 , G06V30/146 , G06V30/19 , G06V30/414 , G06V30/416

CPC分类号： G06V30/414 , G06F18/214 , G06F40/284 , G06N20/00 , G06V30/147 , G06V30/19067 , G06V30/416

摘要： The disclosure herein describes providing signature data of an input document. Text data of the input document is obtained (e.g., OCR data generated from image data) and a first set of signature fields are identified using signature key-value pairs of the text data. A first subset of signed signature fields and a first subset of unsigned signature fields are determined based on mapping to a set of predicted values. A second set of signature fields are determined using a region prediction model applied to image data of the input document. Region images associated with the first subset of unsigned signature fields and with second set of signature fields are obtained and a second set of signed signature fields and a second set of unsigned signature fields are determined using a signature recognition model. Signature output data is provided including signed signature fields and/or unsigned signature fields.

5.

发明申请
ENHANCED SUPERVISED FORM UNDERSTANDING 有权

公开(公告)号：US20210133438A1

公开(公告)日：2021-05-06

申请号：US16830950

申请日：2020-03-26

申请人： MICROSOFT TECHNOLOGY LICENSING, LLC

发明人： Dinei Afonso Ferreira Florencio , Yu-Yun Dai , Cha Zhang , Shih Chia Wang

IPC分类号： G06K9/00 , G06K9/62

摘要： Interfaces and systems are provided for harvesting ground truth from forms to be used in training models based on key-value pairings in the forms and to later use the trained models to identify related key-value pairings in new forms. Initially, forms are identified and clustered to identify a subset of forms to label with the key-value pairings. Users provide input to identify keys to use in labeling and then select/highlight text from forms that are presented concurrently with the keys in order to associate the highlighted text with the key(s) as the corresponding key-value pairing(s). After labeling the forms with the key-value pairings, the key-value pairing data is used as ground truth for training a model to independently identify the key-value pairing(s) in new forms. Once trained, the model is used to identify the key-value pairing(s) in new forms.

6.

发明申请
PROBABILISTIC FACE DETECTION 审中-公开

公开(公告)号：US20180096195A1

公开(公告)日：2018-04-05

申请号：US14952447

申请日：2015-11-25

申请人： Microsoft Technology Licensing, LLC

发明人： Cristian Canton Ferrer , Stanley T. Birchfield , Adam Kirk , Cha Zhang

IPC分类号： G06K9/00 , G06K9/46 , G06K9/68 , G06T7/20

CPC分类号： G06K9/00234 , G06K9/4642 , G06K9/6857 , G06T7/20

摘要： Examples are disclosed herein that relate to face detection. One example provides a computing device comprising a logic subsystem and a storage subsystem holding instructions executable by the logic subsystem to receive an image, apply a tile array to the image, the tile array comprising a plurality of tiles, and perform face detection on at least a subset of the tiles, where determining whether or not to perform face detection on a given tile is based on a likelihood that the tile includes at least a portion of a human face.

7.

发明授权
Imaging through a display screen 有权

公开(公告)号：US09756284B2

公开(公告)日：2017-09-05

申请号：US14817021

申请日：2015-08-03

申请人： Microsoft Technology Licensing, LLC

发明人： Cha Zhang , Timothy A. Large , Zhengyou Zhang , Ruigang Yang

IPC分类号： H04N7/14 , H04N7/15

CPC分类号： H04N7/144 , H04N7/155

摘要： The described implementations relate to enhancement images, such as in videoconferencing scenarios. One system includes a poriferous display screen having generally opposing front and back surfaces. This system also includes a camera positioned proximate to the back surface to capture an image through the poriferous display screen.

8.

发明授权
Active speaker location detection 有权

公开(公告)号：US09621795B1

公开(公告)日：2017-04-11

申请号：US14991847

申请日：2016-01-08

申请人： Microsoft Technology Licensing, LLC

发明人： Oliver Arthur Whyte , Ross Cutler , Avronil Bhattacharjee , Adarsh Prakash Murthy Kowdle , Adam Kirk , Stanley T. Birchfield , Cha Zhang

IPC分类号： H04N7/15 , H04N5/232 , H04R3/00 , H04R29/00 , G06T7/00 , H04N7/14

CPC分类号： H04R1/406 , G01S3/80 , G06T7/75 , G06T2207/30196 , H04M3/567 , H04M3/568 , H04M2203/509 , H04M2242/30 , H04N5/23219 , H04N5/23296 , H04N7/142 , H04N7/147 , H04N7/15 , H04R3/005 , H04R29/005 , H04R2430/20

摘要： Various examples related to determining a location of an active speaker are provided. In one example, image data of a room from an image capture device is received and a three dimensional model is generated. First audio data from a first microphone array at the image capture device is received. Second audio data from a second microphone array laterally spaced from the image capture device is received. Using the three dimensional model, a location of the second microphone array with respect to the image capture device is determined. Using the audio data and the location and angular orientation of the second microphone array, an estimated location of the active speaker is determined. Using the estimated location, a setting for the image capture device is determined and outputted to highlight the active speaker.

9.

发明申请
AUDIO BASED DISCOVERY AND CONNECTION TO A SERVICE CONTROLLER 有权
标题翻译：基于音频的发现和连接到服务控制器

公开(公告)号：US20160234221A1

公开(公告)日：2016-08-11

申请号：US14875298

申请日：2015-10-05

申请人： Microsoft Technology Licensing, LLC.

发明人： Sasa Junuzovic , Yinpeng Chen , Cha Zhang , Dinei Florencio , Zhengyou Zhang , Alastair Wolman

IPC分类号： H04L29/06 , H04W12/08 , H04W12/06

CPC分类号： H04L63/102 , H04L63/08 , H04L63/083 , H04L63/18 , H04M3/567 , H04W12/06 , H04W12/08 , H04W48/10

摘要： Techniques for automatically connecting to a service controller are described herein. In one example, a service controller device includes a processor and a computer-readable memory storage device storing executable instructions that cause the processor to broadcast at least one of an access credential, connection information or an access credential hash embedded in an audio signal. The processor can also authenticate a client device based on a transmission of at least one of the connection information, the access credential, or the access credential hash from the client device to the client connector and transmit data to the client device in response to authenticating the client device.

摘要翻译： 本文描述了用于自动连接到服务控制器的技术。在一个示例中，服务控制器设备包括处理器和存储可执行指令的计算机可读存储器设备，该可执行指令使处理器广播嵌入在音频信号中的访问凭证，连接信息或访问凭证散列中的至少一个。处理器还可以基于从客户端设备到客户机连接器的连接信息，访问凭证或访问凭证散列中的至少一个的传输来认证客户端设备，并且响应于认证该客户端设备向客户端设备发送数据客户端设备。

10.

发明授权
Supervised OCR training for custom forms 有权

公开(公告)号：US11093740B2

公开(公告)日：2021-08-17

申请号：US16185737

申请日：2018-11-09

申请人： Microsoft Technology Licensing, LLC

发明人： Dinei Afonso Ferreira Florencio , Cha Zhang , Gil Moshe Nahmias , Yu-Yun Dai

IPC分类号： G06K9/00 , G06N20/00 , G06K9/62

摘要： The disclosed technology is generally directed to optical character recognition for forms. In one example of the technology, optical character recognition is performed on a plurality of forms. The forms of the plurality of forms include at least one type of form. Anchors are determined for the forms, including corresponding anchors for each type of form of the plurality of forms. Feature rules are determined, including corresponding feature rules for each type of form of the plurality of forms. Features and labels are determined for each form of the plurality of forms. A training model is generated based on a ground truth that includes a plurality of key-value pairs corresponding to the plurality of forms, and further based on the determined features and labels for the plurality of forms.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类