COMPARING DOCUMENT CONTENTS USING A CONSTRUCTED TOPIC MODEL
    1.
    发明申请
    COMPARING DOCUMENT CONTENTS USING A CONSTRUCTED TOPIC MODEL 审中-公开
    使用构建的主题模型比较文档内容

    公开(公告)号:US20150310096A1

    公开(公告)日:2015-10-29

    申请号:US14695688

    申请日:2015-04-24

    IPC分类号: G06F17/30

    CPC分类号: G06F16/367

    摘要: Comparing document contents is provided. An ontological concept is extracted from a text snippet of a corpus document. One or more feature vectors are constructed that include associative information that describes an ontology that includes the focused concept. A topic model is trained using the one or more feature vectors. First and second topic sets are respectively extracted from first and second documents using the topic model. One or more topics from the first topic set are matched, using the topic model, with one or more topics from the second topic set to construct a matched topic set. Semantic analyses are respectively performed on first and second text snippet sets, wherein the first and second text snippet sets are chosen based, at least in part, on the matched topic set. Text snippets are matched based, at least in part, on the first and second semantic analyses.

    摘要翻译: 提供比较文件内容。 从语料库文档的文本片段中提取本体论概念。 构造一个或多个特征向量,其包括描述包括聚焦概念的本体的关联信息。 使用一个或多个特征向量训练主题模型。 使用主题模型分别从第一和第二文档提取第一和第二主题集。 使用主题模型与来自第二主题集的一个或多个主题来匹配来自第一主题集的一个或多个主题以构建匹配的主题集。 分别对第一和第二文本片段集执行语义分析,其中,至少部分地基于匹配的主题集来选择第一和第二文本片段集合。 文本片段至少部分地基于第一和第二语义分析进行匹配。

    ENRICHING WEBSITE CONTENT
    2.
    发明申请
    ENRICHING WEBSITE CONTENT 有权
    丰富的网站内容

    公开(公告)号:US20140040723A1

    公开(公告)日:2014-02-06

    申请号:US13947197

    申请日:2013-07-22

    IPC分类号: G06F17/22

    摘要: A method for enriching contents of a website includes obtaining a corpus from the current website and other websites, and extracting object features from the corpus, wherein the corpus comprises specifications of the object and user reviews about the object; according to the corpus, constructing multi-dimensional vectors for the extracted features; for a specified feature, making similarity comparison of its multi-dimensional vector and multi-dimensional vectors of other extracted features; determining features with similarities higher than a predetermined threshold as the same features, and reinforcing the current website with features different from that of the object on the current website and their corresponding attributes.

    摘要翻译: 一种丰富网站内容的方法,包括从当前网站和其他网站获取语料库,并从语料库中提取对象特征,其中语料库包括对象的规范和关于对象的用户评论; 根据语料库,为提取的特征构建多维向量; 对于特定的特征,对其多维向量和其他提取特征的多维向量进行相似性比较; 确定具有高于预定阈值的相似性的特征作为相同特征,并且利用与当前网站上的对象的特征不同的特征及其相应属性来加强当前网站。

    Pushing specific content to a predetermined webpage
    4.
    发明授权
    Pushing specific content to a predetermined webpage 有权
    将特定内容推送到预定网页

    公开(公告)号:US09230035B2

    公开(公告)日:2016-01-05

    申请号:US14012085

    申请日:2013-08-28

    IPC分类号: G06F7/00 G06F17/30

    摘要: A method and an apparatus for pushing specific content for a predetermined webpage, and a website server. The method for pushing specific content for text content on a predetermined webpage comprises: subjecting text content on a predetermined webpage to emotional analysis; determining a matching degree between a result of the emotional analysis and an emotion expressed by specific content to be pushed; and responding to that the matching degree determined above satisfies a predetermined condition, combining a part of the text content with the specific content to be pushed, thereby forming content to be pushed specific for users. By using the technology of the present invention, user can be avoided from feeling disgust for content to be pushed and accuracy of push can be enhanced.

    摘要翻译: 一种用于推送预定网页的特定内容的方法和装置,以及网站服务器。 在预定网页上推送文本内容的特定内容的方法包括:对预定网页上的文本内容进行情感分析; 确定情感分析的结果与被推送的特定内容所表达的情感之间的匹配程度; 并且响应于以上确定的匹配度满足预定条件,将一部分文本内容与要推送的特定内容相结合,从而形成要针对用户推送的内容。 通过使用本发明的技术,可以避免用户对被推送的内容感到厌恶,并且能够提高推送精度。

    METHOD AND APPARATUS FOR PERFORMING EXTENDED SEARCH
    5.
    发明申请
    METHOD AND APPARATUS FOR PERFORMING EXTENDED SEARCH 审中-公开
    用于执行扩展搜索的方法和装置

    公开(公告)号:US20150269174A1

    公开(公告)日:2015-09-24

    申请号:US14730905

    申请日:2015-06-04

    IPC分类号: G06F17/30

    摘要: A method and apparatus for performing extended search are provided. The method includes receiving user-inputted keywords; extending the user-inputted keywords according to geographical information to acquire extended keywords; performing a search by using the extended keywords; and returning search results to the user. With the present technical solutions, privilege control can be effectively performed in a cloud storage system. With the present embodiments, more information may be provided to a user for reference.

    摘要翻译: 提供了一种用于执行扩展搜索的方法和装置。 该方法包括接收用户输入的关键字; 根据地理信息扩展用户输入的关键字,获取扩展关键词; 使用扩展关键字执行搜索; 并将搜索结果返回给用户。 利用现有的技术方案,可以在云存储系统中有效地执行特权控制。 利用本实施例,可以向用户提供更多的信息以供参考。

    Analyzing messages in social networks

    公开(公告)号:US10389677B2

    公开(公告)日:2019-08-20

    申请号:US15389536

    申请日:2016-12-23

    IPC分类号: H04L12/58 G06F17/27 G06Q50/00

    摘要: Embodiments of the invention provide a computer-implemented method, computing system and computer program product for analyzing a message in a social network. The method comprises identifying an entity from the message; detecting historical popularity of the entity in a social network; identifying a topic from the message; detecting historical popularity of the topic in the social network; and generating an entity-topic correlation factor for the entity and the topic based on the historical popularity of the entity and the historical popularity of the topic. Results obtained with embodiments of the invention may be provided to popularity prediction tools for improving popularity prediction of messages in social networks.

    COMPARING TABLES WITH SEMANTIC VECTORS
    7.
    发明申请

    公开(公告)号:US20190130029A1

    公开(公告)日:2019-05-02

    申请号:US15794943

    申请日:2017-10-26

    IPC分类号: G06F17/30

    摘要: A data processing system identifies a first topic for a first table, identifies a second topic for a second table, collects at least one first table attribute comprising at least one row name for the first table, and collects at least one second table attribute comprising at least one row name for the second table. The at least one first table attribute and the at least one second table attribute are placed in at least one semantic category. The at least one first table attribute is converted into at least one semantic vector for the first table, and the at least one second table attribute is converted into at least one semantic vector for the second table. The at least one semantic vector for the first table is compared with the at least one semantic vector for the second table to identify as related at least one row of the first table and at least one row of the second table. The at least one row of the first table and the at least one row of the second table are provided to a communication device with an identification as related.

    Comparing tables with semantic vectors

    公开(公告)号:US10997228B2

    公开(公告)日:2021-05-04

    申请号:US15794943

    申请日:2017-10-26

    IPC分类号: G06F16/35 G06F16/31 G06F16/21

    摘要: A data processing system identifies a first topic for a first table, identifies a second topic for a second table, collects at least one first table attribute comprising at least one row name for the first table, and collects at least one second table attribute comprising at least one row name for the second table. The at least one semantic vector for the first table is compared with the at least one semantic vector for the second table to identify as related at least one row of the first table and at least one row of the second table. The at least one row of the first table and the at least one row of the second table are provided to a communication device with an identification as related.

    Conformity determination of cross-regional affairs

    公开(公告)号:US10956915B2

    公开(公告)日:2021-03-23

    申请号:US15226055

    申请日:2016-08-02

    IPC分类号: G06Q30/00

    摘要: A method, a device and a computer program for conformity determination of cross-regional affairs. The method comprises obtaining characteristics from a description of an affair at least crossing a local region and a non-local region. The method further comprises generating a multi-level constraint based on the characteristics from a knowledge base, and the knowledge base includes regulations for cross-regional affairs, and the multi-level constraint includes a local constraint associated with the local region and a non-local constraint associated with the local region and the non-local region. Moreover, the method also comprises determining conformity of the affair to the multi-level constraint. The method can determine the conformity of the cross-regional affairs automatically, thereby reducing the consumption of human resources and the inconformity risk in the cross-regional affairs.

    Method and apparatus for performing extended search

    公开(公告)号:US10268771B2

    公开(公告)日:2019-04-23

    申请号:US14730905

    申请日:2015-06-04

    IPC分类号: G06F17/30 G06F7/24

    摘要: A method and apparatus for performing extended search are provided. The method includes receiving user-inputted keywords; extending the user-inputted keywords according to geographical information to acquire extended keywords; performing a search by using the extended keywords; and returning search results to the user. With the present technical solutions, privilege control can be effectively performed in a cloud storage system. With the present embodiments, more information may be provided to a user for reference.