-
Publication number: US08538898B2
Publication date: 2013-09-17
Application number: US13118404
Application date: 2011-05-28
Applicant: Zhengdong Lu, Zaiqing Nie, Gang Luo, Yong Cao, Ji-Rong Wen, Wei-Ying Ma
Inventor: Zhengdong Lu, Zaiqing Nie, Gang Luo, Yong Cao, Ji-Rong Wen, Wei-Ying Ma
IPC: G06N5/00
CPC classification number: G06N99/005, G06F17/30616
Abstract: A “Name Disambiguator” provides various techniques for implementing an interactive framework for resolving or disambiguating entity names (associated with objects such as publications) for entity searches where two or more identical or similar names may refer to different entities. More specifically, the Name Disambiguator uses a combination of user input and automatic models to address the disambiguation problem. In various embodiments, the Name Disambiguator uses a two-part process, including: 1) a global SVM trained from large sets of documents or objects in a simulated interactive mode, and 2) further personalization of local SVM models (associated with individual names or groups of names such as, for example, a group of coauthors) derived from the global SVM model. The result of this process is that large sets of documents or objects are rapidly and accurately condensed or clustered into ordered sets that are organized by entity name.
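The two-part process in the abstract (a global SVM, then local SVM models derived from it) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the pairwise features (`shared coauthors`, `same venue`), the toy training pairs, the subgradient-descent trainer, the idea of personalizing by regularizing the local weights toward the global ones, and the union-find clustering step. The simulated interactive training mode mentioned in the abstract is not modeled.

```python
from itertools import combinations

def pair_features(a, b):
    """Hypothetical pairwise features for two publication records."""
    shared = len(set(a["coauthors"]) & set(b["coauthors"]))
    same_venue = 1.0 if a["venue"] == b["venue"] else 0.0
    return [float(shared), same_venue, 1.0]  # last entry is a bias term

def train_linear_svm(samples, w0=None, anchor=None, lam=0.01, lr=0.1, epochs=200):
    """Subgradient descent on hinge loss + (lam / 2) * ||w - anchor||^2."""
    dim = len(samples[0][0])
    w = list(w0) if w0 is not None else [0.0] * dim
    anchor = list(anchor) if anchor is not None else [0.0] * dim
    for _ in range(epochs):
        for x, y in samples:  # y is +1 (same entity) or -1 (different)
            margin = y * sum(wi * xi for wi, xi in zip(w, x))
            for i in range(dim):
                grad = lam * (w[i] - anchor[i])  # pull toward the anchor
                if margin < 1.0:
                    grad -= y * x[i]             # hinge-loss subgradient
                w[i] -= lr * grad
    return w

def cluster(records, w):
    """Union-find merge of every pair the model scores as the same entity."""
    parent = list(range(len(records)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(records)), 2):
        x = pair_features(records[i], records[j])
        if sum(wi * xi for wi, xi in zip(w, x)) > 0:
            parent[find(i)] = find(j)
    groups = {}
    for i in range(len(records)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())

# Part 1: global model trained on toy labeled pairs (assumed data).
global_pairs = [
    ([2.0, 1.0, 1.0], +1),
    ([1.0, 0.0, 1.0], +1),
    ([0.0, 1.0, 1.0], -1),
    ([0.0, 0.0, 1.0], -1),
]
w_global = train_linear_svm(global_pairs)

# Part 2: local model for one name, started from and anchored to the
# global weights, then adjusted with two pieces of user feedback.
feedback = [
    ([0.0, 1.0, 1.0], +1),  # user: same venue alone is enough for this name
    ([0.0, 0.0, 1.0], -1),
]
w_local = train_linear_svm(feedback, w0=w_global, anchor=w_global)

records = [
    {"coauthors": ["A", "B"], "venue": "KDD"},
    {"coauthors": ["A", "B", "C"], "venue": "KDD"},
    {"coauthors": ["D"], "venue": "KDD"},
    {"coauthors": ["E"], "venue": "ICML"},
]
global_clusters = cluster(records, w_global)
local_clusters = cluster(records, w_local)
```

In this toy setup the global model groups only the two records that share coauthors, while the personalized model also pulls in the third record once the user's feedback indicates that a shared venue is a reliable signal for this particular name. Anchoring the local weights to the global ones keeps the personalized model close to the general behavior wherever the feedback is silent.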
-
Publication number: US20120303557A1
Publication date: 2012-11-29
Application number: US13118404
Application date: 2011-05-28
Applicant: Zhengdong Lu, Zaiqing Nie, Gang Luo, Yong Cao, Ji-Rong Wen, Wei-Ying Ma
Inventor: Zhengdong Lu, Zaiqing Nie, Gang Luo, Yong Cao, Ji-Rong Wen, Wei-Ying Ma
IPC: G06F15/18
CPC classification number: G06N99/005, G06F17/30616
Abstract: A “Name Disambiguator” provides various techniques for implementing an interactive framework for resolving or disambiguating entity names (associated with objects such as publications) for entity searches where two or more identical or similar names may refer to different entities. More specifically, the Name Disambiguator uses a combination of user input and automatic models to address the disambiguation problem. In various embodiments, the Name Disambiguator uses a two-part process, including: 1) a global SVM trained from large sets of documents or objects in a simulated interactive mode, and 2) further personalization of local SVM models (associated with individual names or groups of names such as, for example, a group of coauthors) derived from the global SVM model. The result of this process is that large sets of documents or objects are rapidly and accurately condensed or clustered into ordered sets that are organized by entity name.
-