Cross-media knowledge semantic representation method and apparatus

    公开(公告)号:US12106589B2

    公开(公告)日:2024-10-01

    申请号:US18491818

    申请日:2023-10-23

    Applicant: ZHEJIANG LAB

    Inventor: Feng Lin Yunhe Pan

    CPC classification number: G06V20/70 G06F40/30 G16H30/40 G06V2201/03

    Abstract: A cross-media knowledge semantic representation method and apparatus. The method comprises: performing data acquisition according to a preset semantic description; inputting data information of a topological structure acquired by the data acquisition into a preset stack of an automat corresponding to the semantic description, the finite state set is used for indicating states included in the automat, and the input vocabulary list is used for indicating vocabularies included in the automat; mapping the data information by the automat to obtain key frames corresponding respectively to substructures and/or branches of a target object acquired by the data acquisition; and generating a visual semantic representation of the topological structure according to the key frames corresponding respectively to the substructures and/or branches of the target object acquired by the data acquisition, such that cross-media knowledge alignment is realized.

    Apparatus and method for generating training data

    公开(公告)号:US12105771B2

    公开(公告)日:2024-10-01

    申请号:US17455513

    申请日:2021-11-18

    Abstract: Disclosed are an apparatus and method for generating training data. An apparatus for generating training data according to an embodiment includes a reading result acquirer that acquires one or more oral endoscopic images and clinical information related to each of the one or more oral endoscopic images, provides the one or more oral endoscopic images and the clinical information to a user terminal of each of a plurality of preset medical staffs, and receives a reading result for each of the one or more oral endoscopic images from the user terminal of each of the plurality of medical staffs, and a labeling unit that selects one or more labeling target images from among the one or more oral endoscopic images based on the reading result received from the user terminal of each of the plurality of medical staffs and determines one or more labels for each of the one or more labeling target images based on the reading result.

Patent Agency Ranking