Identifying entities in semi-structured content

    公开(公告)号:US10353905B2

    公开(公告)日:2019-07-16

    申请号:US14695996

    申请日:2015-04-24

    Abstract: Identifying entities in semi-structured content is described. A system assigns a corresponding entity type based on a corresponding entity type score for each token in a sequence of tokens in semi-structured content, based on multiple entity types, wherein each token is a corresponding character set. The system assigns a corresponding boundary type based on a corresponding boundary type score for each token in the sequence of tokens, based on a begin boundary type or a continue boundary type. The system identifies an entity based on a corresponding entity type score and a corresponding boundary type for each token in the sequence of tokens. The system outputs the sequence of tokens as an identified set of entities based on the identified entity.

    IDENTIFYING ENTITIES IN SEMI-STRUCTURED CONTENT
    2.
    发明申请
    IDENTIFYING ENTITIES IN SEMI-STRUCTURED CONTENT 审中-公开
    识别半结构化内容中的实体

    公开(公告)号:US20160314123A1

    公开(公告)日:2016-10-27

    申请号:US14695996

    申请日:2015-04-24

    Abstract: Identifying entities in semi-structured content is described. A system assigns a corresponding entity type based on a corresponding entity type score for each token in a sequence of tokens in semi-structured content, based on multiple entity types, wherein each token is a corresponding character set. The system assigns a corresponding boundary type based on a corresponding boundary type score for each token in the sequence of tokens, based on a begin boundary type or a continue boundary type. The system identifies an entity based on a corresponding entity type score and a corresponding boundary type for each token in the sequence of tokens. The system outputs the sequence of tokens as an identified set of entities based on the identified entity.

    Abstract translation: 描述半结构化内容中识别实体。 系统基于多个实体类型,在半结构化内容中的令牌序列中基于每个令牌的相应实体类型分数分配对应的实体类型,其中每个令牌是对应的字符集。 基于开始边界类型或继续边界类型,系统基于令牌序列中的每个令牌的相应边界类型分数来分配相应的边界类型。 该系统基于相应的实体类型分数和令牌序列中的每个令牌的对应边界类型来识别实体。 该系统基于所识别的实体将令牌序列作为确定的一组实体输出。

Patent Agency Ranking