MULTIMODALITY-BASED IMAGE TAGGING APPARATUS AND METHOD
    1.
    发明申请
    MULTIMODALITY-BASED IMAGE TAGGING APPARATUS AND METHOD 有权
    基于多尺度的图像标记设备和方法

    公开(公告)号:US20140379730A1

    公开(公告)日:2014-12-25

    申请号:US14307687

    申请日:2014-06-18

    Inventor: Xi LIU Rujie LIU

    Abstract: Embodiments provide a multimodality-based image tagging apparatus and a method for the same. The image tagging apparatus includes: a score generating unit configured to generate, for an inquiry image, multiple groups of first scores about all tags in an tagging dictionary by using a training image and multiple modalities of an image; a late-fusion unit configured to fuse the obtained multiple groups of scores to obtain final scores about all the tags; and a tag selecting unit configured to select one or more tag(s) with relatively large tag scores as tag(s) of the inquiry image according to the final scores about all the tags. With the embodiments, multiple modalities may be effectively fused, and a more robust and accurate image tagging result may be obtained.

    Abstract translation: 实施例提供了一种基于多模态的图像标记装置及其方法。 图像标记装置包括:分数生成单元,被配置为通过使用训练图像和图像的多个模式,为查询图像生成关于标签词典中的所有标签的多组第一分数; 后融合单元,被配置为熔合所获得的多组分数以获得关于所有标签的最终分数; 以及标签选择单元,被配置为根据关于所有标签的最终分数,选择具有较大标签分数的一个或多个标签作为查询图像的标签。 利用实施例,可以有效地融合多种模态,并且可以获得更鲁棒和准确的图像标记结果。

Patent Agency Ranking