- 专利标题: Aligning input image data with model input data to generate image annotations
-
申请号: US17133493申请日: 2020-12-23
-
公开(公告)号: US11514648B2公开(公告)日: 2022-11-29
- 发明人: Harpreet Singh Sawhney , Ning Xu , Amol Ashok Ambardekar , Moses Obadeji Olafenwa
- 申请人: MICROSOFT TECHNOLOGY LICENSING, LLC
- 申请人地址: US WA Redmond
- 专利权人: MICROSOFT TECHNOLOGY LICENSING, LLC
- 当前专利权人: MICROSOFT TECHNOLOGY LICENSING, LLC
- 当前专利权人地址: US WA Redmond
- 代理机构: Newport IP, LLC
- 代理商 Jacob P. Rohwer
- 主分类号: G06T19/00
- IPC分类号: G06T19/00 ; G06T7/70 ; G06N3/04 ; G06N3/08 ; G06T17/00
摘要:
An image data annotation system automatically annotates a physical object within individual images frames of an image sequence with relevant object annotations based on a three-dimensional (3D) model of the physical object. Annotating the individual image frames with object annotations includes updating individual image frames within image input data to generate annotated image data that is suitable for reliably training a DNN object detection architecture. Exemplary object annotations that the image data annotation system can automatically apply to individual image frames include, inter alia, object pose, image pose, object masks, 3D bounding boxes composited over the physical object, 2D bounding boxes composited over the physical object, and/or depth map information. Annotating the individual image frames may be accomplished by aligning the 3D model of the physical object with a multi-view reconstruction of the physical object that is generated by inputting an image sequence into a Structure-from-Motion and/or Multi-view Stereo pipeline.
公开/授权文献
信息查询