-
Publication No.: US20150310096A1
Publication date: 2015-10-29
Application No.: US14695688
Filing date: 2015-04-24
Inventors: Shenghua Bao, Hong Lei Guo, Zhi Li Guo, Davide Pasetto, Wei Hong Qian, Zhong Su
IPC classification: G06F17/30
CPC classification: G06F16/367
Abstract: Comparing document contents is provided. An ontological concept is extracted from a text snippet of a corpus document. One or more feature vectors are constructed that include associative information that describes an ontology that includes the focused concept. A topic model is trained using the one or more feature vectors. First and second topic sets are respectively extracted from first and second documents using the topic model. One or more topics from the first topic set are matched, using the topic model, with one or more topics from the second topic set to construct a matched topic set. Semantic analyses are respectively performed on first and second text snippet sets, wherein the first and second text snippet sets are chosen based, at least in part, on the matched topic set. Text snippets are matched based, at least in part, on the first and second semantic analyses.
-
Publication No.: US20140040723A1
Publication date: 2014-02-06
Application No.: US13947197
Filing date: 2013-07-22
Inventors: Sheng Hua Bao, Ke Ke Cai, Hong Lei Guo, Zhong Su, Xian Wu, Li Zhang, Shuo Zhang
IPC classification: G06F17/22
CPC classification: G06F17/2247, G06F17/00, G06F17/2785, G06Q30/00, G06Q30/0277, G06Q30/0641, G06Q30/0643
Abstract: A method for enriching contents of a website includes obtaining a corpus from the current website and other websites, and extracting object features from the corpus, wherein the corpus comprises specifications of the object and user reviews about the object; according to the corpus, constructing multi-dimensional vectors for the extracted features; for a specified feature, making a similarity comparison of its multi-dimensional vector and the multi-dimensional vectors of other extracted features; determining features with similarities higher than a predetermined threshold as the same features; and reinforcing the current website with features different from those of the object on the current website and their corresponding attributes.
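The similarity step in the abstract of US20140040723A1 can be illustrated with a minimal sketch. It assumes cosine similarity as the comparison measure and a threshold of 0.9; the abstract names neither, and the function names `cosine` and `new_features`, the sample vectors, and the feature names are all hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def new_features(site_features, external_features, threshold=0.9):
    """Return names of external features that match no feature on the current site.

    A pair of features counts as "the same feature" when the similarity of
    their vectors is higher than the threshold (assumed value, not from the source).
    """
    missing = []
    for name, vec in external_features.items():
        if all(cosine(vec, site_vec) <= threshold for site_vec in site_features.values()):
            missing.append(name)
    return missing

# Hypothetical three-dimensional feature vectors.
site = {"battery life": [1.0, 0.0, 0.2]}
external = {
    "battery duration": [0.9, 0.1, 0.2],  # close to "battery life"
    "screen size": [0.0, 1.0, 0.1],       # unrelated
}
candidates = new_features(site, external)
```

In this toy data only the unrelated external feature remains as a candidate for enriching the current site; the near-duplicate is treated as already present.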
-
Publication No.: US10127442B2
Publication date: 2018-11-13
Application No.: US15177728
Filing date: 2016-06-09
Inventors: Ke Ke Cai, Hong Lei Guo, Zhi Li Guo, Feng Jin, Yong Qin, Zhong Su
Abstract: Embodiments of the present disclosure relate to non-sequential document comparison. A first plurality of segments in a first document and a second plurality of segments in a second document are obtained. In response to a first segment from the first plurality of segments being associated with a second segment from the second plurality of segments, a third segment from the first plurality of segments is associated with a fourth segment from the second plurality of segments.
-
Publication No.: US09230035B2
Publication date: 2016-01-05
Application No.: US14012085
Filing date: 2013-08-28
Inventors: Sheng Hua Bao, Hong Lei Guo, Zhili Guo, Zhong Su, Hui Jia Zhu
CPC classification: G06F17/3089, G06F17/30867, G06F17/30899
Abstract: A method and an apparatus for pushing specific content for a predetermined webpage, and a website server. The method for pushing specific content for text content on a predetermined webpage comprises: subjecting text content on a predetermined webpage to emotional analysis; determining a matching degree between a result of the emotional analysis and an emotion expressed by specific content to be pushed; and, in response to the determined matching degree satisfying a predetermined condition, combining a part of the text content with the specific content to be pushed, thereby forming user-specific content to be pushed. With the technology of the present invention, users can be kept from being put off by pushed content, and push accuracy can be improved.
-
Publication No.: US20150269174A1
Publication date: 2015-09-24
Application No.: US14730905
Filing date: 2015-06-04
Inventors: Keke Cai, Hong Lei Guo, Zhong Su, Hui Jia Zhu
IPC classification: G06F17/30
CPC classification: G06F17/3087, G06F7/24, G06F17/30241, G06F17/30424, G06F17/30542, G06F17/30672, G06F17/30867
Abstract: A method and apparatus for performing extended search are provided. The method includes receiving user-inputted keywords; extending the user-inputted keywords according to geographical information to acquire extended keywords; performing a search by using the extended keywords; and returning search results to the user. With the present technical solutions, privilege control can be effectively performed in a cloud storage system. With the present embodiments, more information may be provided to a user for reference.
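The four steps in the abstract of US20150269174A1 (receive keywords, extend them with geographical information, search, return results) can be sketched as follows. This is only an illustration under stated assumptions: the gazetteer mapping a place name to related place names, the substring-based search, and the names `GAZETTEER`, `extend_keywords`, and `search` are hypothetical and do not come from the patent.

```python
# Hypothetical gazetteer: lower-cased place name -> geographically related names.
GAZETTEER = {"beijing": ["Haidian", "Chaoyang"]}

def extend_keywords(keywords, gazetteer):
    """Return the original keywords followed by any geographically related terms."""
    extended = list(keywords)
    for kw in keywords:
        for related in gazetteer.get(kw.lower(), []):
            if related not in extended:
                extended.append(related)
    return extended

def search(keywords, documents, gazetteer):
    """Return documents containing any extended keyword (case-insensitive substring match)."""
    terms = [t.lower() for t in extend_keywords(keywords, gazetteer)]
    return [doc for doc in documents if any(t in doc.lower() for t in terms)]

docs = ["Cafe in Haidian", "Cafe in Shanghai"]
extended = extend_keywords(["Beijing", "hotel"], GAZETTEER)
hits = search(["Beijing"], docs, GAZETTEER)
```

With the toy gazetteer, a query for "Beijing" also retrieves the document that mentions only "Haidian", which is the kind of additional result the abstract refers to.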
-
Publication No.: US10389677B2
Publication date: 2019-08-20
Application No.: US15389536
Filing date: 2016-12-23
Inventors: Ke Ke Cai, Hong Lei Guo, Jian Min Jiang, Zhong Su, Chang Hua Sun, Guo Yu Tang
Abstract: Embodiments of the invention provide a computer-implemented method, computing system and computer program product for analyzing a message in a social network. The method comprises identifying an entity from the message; detecting historical popularity of the entity in a social network; identifying a topic from the message; detecting historical popularity of the topic in the social network; and generating an entity-topic correlation factor for the entity and the topic based on the historical popularity of the entity and the historical popularity of the topic. Results obtained with embodiments of the invention may be provided to popularity prediction tools for improving popularity prediction of messages in social networks.
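The abstract of US10389677B2 does not say how the entity-topic correlation factor is computed. As one possible reading, and purely as an assumption, the sketch below treats each historical popularity as a time series of mention counts and uses the Pearson correlation coefficient as the factor. The function name `correlation_factor` and the sample counts are hypothetical.

```python
import math

def correlation_factor(entity_popularity, topic_popularity):
    """Pearson correlation of two equal-length popularity series.

    Assumption: the patent abstract does not specify this formula; Pearson
    correlation is swapped in here only to make the idea concrete.
    Returns 0.0 when either series has no variation.
    """
    if len(entity_popularity) != len(topic_popularity):
        raise ValueError("series must have the same length")
    n = len(entity_popularity)
    mean_e = sum(entity_popularity) / n
    mean_t = sum(topic_popularity) / n
    cov = sum((e - mean_e) * (t - mean_t)
              for e, t in zip(entity_popularity, topic_popularity))
    var_e = sum((e - mean_e) ** 2 for e in entity_popularity)
    var_t = sum((t - mean_t) ** 2 for t in topic_popularity)
    denom = math.sqrt(var_e * var_t)
    return cov / denom if denom else 0.0

# Hypothetical daily mention counts for an entity and a topic.
entity_counts = [1, 2, 3, 4]
topic_counts = [2, 4, 6, 8]
factor = correlation_factor(entity_counts, topic_counts)
```

A factor near 1 would indicate that the entity and the topic rose and fell together historically; a downstream popularity predictor could use it as one input feature.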
-
Publication No.: US20190130029A1
Publication date: 2019-05-02
Application No.: US15794943
Filing date: 2017-10-26
Inventors: Ke Ke Cai, Hong Lei Guo, Hamid Reza Motahari Nezhad, Zhong Su, Li Zhang
IPC classification: G06F17/30
Abstract: A data processing system identifies a first topic for a first table, identifies a second topic for a second table, collects at least one first table attribute comprising at least one row name for the first table, and collects at least one second table attribute comprising at least one row name for the second table. The at least one first table attribute and the at least one second table attribute are placed in at least one semantic category. The at least one first table attribute is converted into at least one semantic vector for the first table, and the at least one second table attribute is converted into at least one semantic vector for the second table. The at least one semantic vector for the first table is compared with the at least one semantic vector for the second table to identify as related at least one row of the first table and at least one row of the second table. The at least one row of the first table and the at least one row of the second table are provided to a communication device with an identification as related.
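The row-matching flow in the abstract of US20190130029A1 (categorize row names, convert them to semantic vectors, compare the vectors) can be sketched as below. The token-to-category lexicon, the count-based vectors, cosine similarity, and the 0.8 threshold are all assumptions made for illustration; the abstract does not specify any of them, and the names `CATEGORY_OF`, `semantic_vector`, and `related_rows` are hypothetical.

```python
import math
from collections import Counter

# Hypothetical lexicon placing row-name tokens into semantic categories.
CATEGORY_OF = {
    "price": "cost", "cost": "cost", "fee": "cost",
    "weight": "mass", "mass": "mass",
}
CATEGORIES = ["cost", "mass"]  # fixed order of vector dimensions

def semantic_vector(row_name):
    """Count how many tokens of the row name fall into each semantic category."""
    counts = Counter(CATEGORY_OF[t] for t in row_name.lower().split() if t in CATEGORY_OF)
    return [counts[c] for c in CATEGORIES]

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def related_rows(rows_a, rows_b, threshold=0.8):
    """Return (row_a, row_b) pairs whose semantic vectors are similar enough."""
    return [
        (a, b)
        for a in rows_a
        for b in rows_b
        if cosine(semantic_vector(a), semantic_vector(b)) >= threshold
    ]

pairs = related_rows(["Unit price", "Net weight"], ["Total cost", "Mass"])
```

Here the rows are paired even though their names share no words, because the comparison happens at the level of semantic categories rather than literal strings.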
-
Publication No.: US10997228B2
Publication date: 2021-05-04
Application No.: US15794943
Filing date: 2017-10-26
Inventors: Ke Ke Cai, Hong Lei Guo, Hamid Reza Motahari Nezhad, Zhong Su, Li Zhang
Abstract: A data processing system identifies a first topic for a first table, identifies a second topic for a second table, collects at least one first table attribute comprising at least one row name for the first table, and collects at least one second table attribute comprising at least one row name for the second table. The at least one semantic vector for the first table is compared with the at least one semantic vector for the second table to identify as related at least one row of the first table and at least one row of the second table. The at least one row of the first table and the at least one row of the second table are provided to a communication device with an identification as related.
-
Publication No.: US10956915B2
Publication date: 2021-03-23
Application No.: US15226055
Filing date: 2016-08-02
Inventors: Ke Ke Cai, Hong Lei Guo, Zhi Li Guo, Feng Jin, Zhong Su
IPC classification: G06Q30/00
Abstract: A method, a device and a computer program for conformity determination of cross-regional affairs. The method comprises obtaining characteristics from a description of an affair at least crossing a local region and a non-local region. The method further comprises generating a multi-level constraint based on the characteristics from a knowledge base, wherein the knowledge base includes regulations for cross-regional affairs, and the multi-level constraint includes a local constraint associated with the local region and a non-local constraint associated with the local region and the non-local region. Moreover, the method also comprises determining conformity of the affair to the multi-level constraint. The method can determine the conformity of cross-regional affairs automatically, thereby reducing the consumption of human resources and the risk of nonconformity in cross-regional affairs.
-
Publication No.: US10268771B2
Publication date: 2019-04-23
Application No.: US14730905
Filing date: 2015-06-04
Inventors: Keke Cai, Hong Lei Guo, Zhong Su, Hui Jia Zhu
Abstract: A method and apparatus for performing extended search are provided. The method includes receiving user-inputted keywords; extending the user-inputted keywords according to geographical information to acquire extended keywords; performing a search by using the extended keywords; and returning search results to the user. With the present technical solutions, privilege control can be effectively performed in a cloud storage system. With the present embodiments, more information may be provided to a user for reference.