-
Publication number: US12045279B2
Publication date: 2024-07-23
Application number: US17538880
Application date: 2021-11-30
Applicant: Microsoft Technology Licensing, LLC
Inventor: Ji Li, Adit Krishnan, Amit Srivastava, Han Hu, Qi Dai, Yixuan Wei, Yue Cao
CPC classification number: G06F16/5866, G06F16/51, G06F16/56, G06N20/00
Abstract: A system and method for retrieving one or more visual assets includes receiving a search query for the one or more visual assets, the search query including textual data; encoding the textual data into one or more text embedding representations via a trained text representation machine-learning (ML) model; transmitting the one or more text embedding representations to a matching and selection unit; providing visual embedding representations of one or more visual assets to the matching and selection unit; comparing, by the matching and selection unit, the one or more text embedding representations to the visual embedding representations to identify one or more visual asset search results; and providing the one or more visual asset search results for display.
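Illustrative sketch (not taken from the patent): the comparison step in the abstract above can be pictured as a nearest-neighbor lookup over embeddings. The abstract does not name a similarity measure, so cosine similarity is an assumption here, and `match_and_select`, `visual_index`, and the three-dimensional toy vectors are hypothetical stand-ins for the outputs of the trained text and visual models.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def match_and_select(text_embedding, visual_embeddings, top_k=2):
    """Hypothetical matching and selection unit.

    Scores every visual asset against the query embedding and returns
    the top_k (asset_id, score) pairs, best match first.
    """
    scored = [
        (asset_id, cosine_similarity(text_embedding, embedding))
        for asset_id, embedding in visual_embeddings.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy embeddings; a real system would obtain these from trained models.
visual_index = {
    "beach.png": [0.9, 0.1, 0.0],
    "city.png": [0.1, 0.9, 0.2],
    "forest.png": [0.0, 0.2, 0.9],
}
query_embedding = [0.8, 0.2, 0.1]  # stand-in for an encoded text query

results = match_and_select(query_embedding, visual_index)
```

With these toy vectors the query lies closest to `beach.png`, so that asset is ranked first. A production system would likely replace the linear scan with an approximate nearest-neighbor index.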
-
Publication number: US20240135576A1
Publication date: 2024-04-25
Application number: US18274679
Application date: 2022-02-08
Applicant: Microsoft Technology Licensing, LLC
Inventor: Zheng Zhang, Han Hu, Yue Cao, Xin TONG, Ze Liu
CPC classification number: G06T7/73, G06T7/62, G06T7/90, G06T17/00, G06V10/44, G06V20/64, G06V2201/07
Abstract: According to implementations of the subject matter described herein, a solution is proposed for three-dimensional (3D) object detection. In this solution, feature representations of a plurality of points are extracted from point cloud data related to a 3D object. Initial feature representations of a set of candidate 3D objects are determined based on the feature representations of the plurality of points. Based on the feature representations of the plurality of points and the initial feature representations of the set of candidate 3D objects, a detection result for the 3D object is generated by determining self-correlations between the set of candidate 3D objects and cross-correlations between the plurality of points and the set of candidate 3D objects. In this way, without grouping points into candidate 3D objects, the 3D object in a 3D scene can be localized and recognized based on the self-correlations and cross-correlations.
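Illustrative sketch (not taken from the patent): one common way to compute correlations of the kind the abstract above describes is scaled dot-product attention, with self-attention among candidate features and cross-attention from candidates to point features. Whether the patent uses exactly this formulation is an assumption; `attend`, `refine_candidates`, and the residual additions are hypothetical choices made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries, keys, values):
    """Scaled dot-product attention: each query gets a weighted sum of values."""
    dim = len(keys[0])
    outputs = []
    for q in queries:
        scores = [
            sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in keys
        ]
        weights = softmax(scores)
        outputs.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return outputs


def refine_candidates(candidate_feats, point_feats):
    """Hypothetical refinement step for candidate 3D object features.

    1. Self-correlation: candidates attend to each other.
    2. Cross-correlation: candidates attend to all point features,
       so no explicit grouping of points into objects is needed.
    """
    self_out = attend(candidate_feats, candidate_feats, candidate_feats)
    cands = [
        [c + s for c, s in zip(feat, out)]  # residual connection (assumed)
        for feat, out in zip(candidate_feats, self_out)
    ]
    cross_out = attend(cands, point_feats, point_feats)
    return [
        [c + x for c, x in zip(feat, out)]
        for feat, out in zip(cands, cross_out)
    ]


# Toy 2-D features; real ones would come from a point cloud backbone.
points = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [0.9, 0.1]]
candidates = [[1.0, 0.0], [0.0, 1.0]]
refined = refine_candidates(candidates, points)
```

The refined features would then feed a prediction head that outputs a box and class per candidate; that head is outside the scope of this sketch.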
-
Publication number: US11961261B2
Publication date: 2024-04-16
Application number: US17514836
Application date: 2021-10-29
Applicant: Microsoft Technology Licensing, LLC
CPC classification number: G06T7/97, G06N20/20, G06T3/40, G06T7/11, G06T2207/20081, G06T2207/20132
Abstract: A scheme for modifying an image is disclosed, which includes receiving a source image having a first image configuration; determining a second image configuration for a target image; providing the received source image to an AI engine trained to identify, based on a set of rules related to visual features, candidate regions from the source image; generating regional proposal images based on the candidate regions, respectively; determining, based on prior aesthetical evaluation data, an aesthetical value of each regional proposal image; selecting, based on the determined aesthetical value of each regional proposal image, one of the regional proposal images as the target image; extracting, from the AI engine, the target image; and causing the target image to be displayed via a display of a user device.
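Illustrative sketch (not taken from the patent): the propose, score, and select flow in the abstract above can be pictured as sliding a crop window of the target aspect ratio across the source image and keeping the highest-scoring window. The scoring function below is a hypothetical placeholder that rewards crops centered on a given salient point; it stands in for the aesthetical value derived from prior evaluation data, which the abstract does not specify. All function names are assumptions.

```python
def crop_size_for_aspect(src_w, src_h, target_aspect):
    """Largest (w, h) crop with the target width/height ratio that fits."""
    if src_w / src_h > target_aspect:
        h = src_h
        w = int(round(h * target_aspect))
    else:
        w = src_w
        h = int(round(w / target_aspect))
    return w, h


def generate_proposals(src_w, src_h, target_aspect, steps=5):
    """Slide the crop window across the source; returns (x, y, w, h) boxes."""
    w, h = crop_size_for_aspect(src_w, src_h, target_aspect)
    proposals = []
    for i in range(steps):
        t = i / (steps - 1) if steps > 1 else 0.5
        box = (int(round(t * (src_w - w))), int(round(t * (src_h - h))), w, h)
        if box not in proposals:  # skip duplicates when there is no slack
            proposals.append(box)
    return proposals


def make_center_scorer(salient_x, salient_y):
    """Placeholder score: higher when the crop center is near a salient point."""
    def score(box):
        x, y, w, h = box
        cx, cy = x + w / 2, y + h / 2
        return -((cx - salient_x) ** 2 + (cy - salient_y) ** 2)
    return score


def select_target(proposals, score_fn):
    """Pick the proposal with the highest score."""
    return max(proposals, key=score_fn)


# Example: 1600x900 source, square target, salient subject toward the right.
proposals = generate_proposals(1600, 900, target_aspect=1.0)
best = select_target(proposals, make_center_scorer(1200, 450))
```

In this example the rightmost window is chosen because its center is nearest the assumed salient point. Swapping `make_center_scorer` for a learned scoring model would not change the surrounding flow.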
-