Image classification pipeline
    13.
    发明授权

    公开(公告)号:US10891514B2

    公开(公告)日:2021-01-12

    申请号:US16222905

    申请日:2018-12-17

    Abstract: The present disclosure relates to processing operations configured for an image recognition pipeline that is used to tailor real-time management of image recognition processing for technical scenarios across a plurality of different applications/services. Image recognition processing is optimized at run-time to ensure that latency requirements are met so that image recognition processing results are returned in a timely manner that aids task execution in an application-specific instances. An image recognition pipeline may manage a plurality of image recognition models that comprise a combination of image analysis service (IAS) models and deep learning models. A scheduler of the image recognition pipeline optimizes image recognition processing by selecting at least: a subset of the image recognition models for image recognition processing and a device configuration for execution of the subset of image recognition models, in order to return image recognition results within a threshold time period that satisfies application-specific execution.

    Method and system of retrieving assets from personalized asset libraries

    公开(公告)号:US12242491B2

    公开(公告)日:2025-03-04

    申请号:US17716653

    申请日:2022-04-08

    Abstract: A system and method and for retrieving assets from a personalized asset library includes receiving a search query for searching for assets in one or more asset libraries, the one or more asset libraries including a personalized asset library; encoding the search query into embedding representations via a trained query representation machine-learning (ML) model; comparing, via a matching unit, the query embedding representations to a plurality of asset representations, each of the plurality of asset representations being a representation of one of the plurality of candidate assets; identifying, based on the comparison, at least one of the plurality of the candidate assets as a search result for the search query; and providing the identified plurality of candidate assets for display as the search result. The plurality of asset representations for the one or more assets in the personalized content library are generated automatically without human labeling.

    MULTILINGUAL SUPPORT FOR NATURAL LANGUAGE PROCESSING APPLICATIONS

    公开(公告)号:US20230274096A1

    公开(公告)日:2023-08-31

    申请号:US17681250

    申请日:2022-02-25

    CPC classification number: G06F40/49 G06F40/284 G06F40/242 G06F40/253 G06N20/00

    Abstract: A data processing system implements obtaining textual content in a first language from a first client device and segmenting the textual content into a plurality of first tokens. The system also implements translating the first tokens from the first language to a second language using a bilingual dictionary, extracting features information from the second tokens to create a features vector, providing the feature vector to a first natural language processing model trained to analyze textual input in the second language and to output contextual information indicating one or more topics or subject matter of the first textual content, and providing the contextual information to a first machine learning model configured to analyze the contextual information and to identify one or more content items predicted to be relevant to the contextual information. The system further implements providing the information identifying the one or more content items to the first client device.

    Context based visual enhancement suggestion

    公开(公告)号:US11657209B2

    公开(公告)日:2023-05-23

    申请号:US17232503

    申请日:2021-04-16

    Inventor: Ji Li

    Abstract: For generating visual enhancement suggestions for source content, a system performs storing, in a data storage, a plurality of context data sets, each context data set including a set of visual enhancements and a context for selecting the set of visual enhancements; receiving the source content including source content data and source attribute data; providing, to an artificial intelligence (AI) engine, the received source content, the AI engine configured to select, based on the source content and the context data sets, a first set of visual enhancements and apply the selected first set of visual enhancements to the source content to generate a first visual enhancement suggestion for the source content; extracting, from the AI engine, the first visual enhancement suggestion; and causing the first visual enhancement suggestion to be displayed via a display of a user device.

    METHOD AND SYSTEM OF USING DOMAIN SPECIFIC KNOWLEDGE IN RETRIEVING MULTIMODAL ASSETS

    公开(公告)号:US20240248901A1

    公开(公告)日:2024-07-25

    申请号:US18158121

    申请日:2023-01-23

    CPC classification number: G06F16/24578 G06F16/24556 G06F16/248

    Abstract: A system for retrieving multimodal assets using domain-specific knowledge includes receiving a search query for searching for multimodal assets; encoding the search query into a first query representation via a first trained query representation machine-learning (ML) model and a second query representation via a second trained query representation ML model; comparing the first query representation to a plurality of multimodal representations to calculate a first similarity score, each of the plurality of multimodal representations being a representation of one of the plurality of candidate multimodal assets; comparing the second query representation to a plurality of domain-specific representations to calculate a second similarity score, the domain-specific representations being representations of domain-specific data associated with one or more of the plurality of the multimodal representations; calculating a third similarity score based on keyword matching between the domain-specific data and the one or more search terms in the search query; aggregating the first, second and third similarity scores to calculate a total similarity score for each of the plurality of candidate multimodal assets; ranking the plurality of candidate multimodal assets based on the total similarity scores to identify search results for the search query; and providing the identified candidate multimodal assets for display as the search results.

Patent Agency Ranking