Document text processing using edge detection
    1.
    发明授权
    Document text processing using edge detection 有权
    使用边缘检测记录文本处理

    公开(公告)号:US09569413B2

    公开(公告)日:2017-02-14

    申请号:US13465833

    申请日:2012-05-07

    CPC classification number: G06F17/2241 G06F17/30719

    Abstract: A document is received that has a plurality of lines with text. This document includes text associated with at least one topic of interest and text not associated with the at least one topic of interest. Thereafter, it is determined, for each line in the document, a length of the line and a number of off-topic indicators with the off-topic indicators characterizing portions of the document as likely being not being associated with the at least one topic of interest. Thereafter, a density for each line can be determined based on the determined line length and the determined number of off-topic indicators. The determined densities for each line are used to identify portions of the documents likely associated with the at least one topic of interest so that data characterizing the identified portions of the document can be provided. Related apparatus, systems, techniques and articles are also described.

    Abstract translation: 收到具有多行文本的文档。 本文档包括与至少一个感兴趣的主题相关联的文本和与该至少一个感兴趣的主题不相关联的文本。 此后,对于文档中的每一行,确定具有表示文档部分的偏离主题指示符的线的长度和偏离主题指示符的数量可能不与至少一个主题相关联 利益。 此后,可以基于确定的行长度和确定的脱离主题指示数来确定每行的密度。 用于确定每行的密度用于识别可能与所述至少一个感兴趣的主题相关联的文档的部分,从而可以提供表征文档的所识别的部分的数据。 还描述了相关设备,系统,技术和物品。

    Document Text Processing Using Edge Detection
    2.
    发明申请
    Document Text Processing Using Edge Detection 有权
    使用边缘检测的文档处理

    公开(公告)号:US20130297999A1

    公开(公告)日:2013-11-07

    申请号:US13465833

    申请日:2012-05-07

    CPC classification number: G06F17/2241 G06F17/30719

    Abstract: A document is received that has a plurality of lines with text. This document includes text associated with at least one topic of interest and text not associated with the at least one topic of interest. Thereafter, it is determined, for each line in the document, a length of the line and a number of off-topic indicators with the off-topic indicators characterizing portions of the document as likely being not being associated with the at least one topic of interest. Thereafter, a density for each line can be determined based on the determined line length and the determined number of off-topic indicators. The determined densities for each line are used to identify portions of the documents likely associated with the at least one topic of interest so that data characterizing the identified portions of the document can be provided. Related apparatus, systems, techniques and articles are also described.

    Abstract translation: 收到具有多行文本的文档。 本文档包括与至少一个感兴趣的主题相关联的文本和与该至少一个感兴趣的主题不相关联的文本。 此后,对于文档中的每一行,确定具有表示文档部分的偏离主题指示符的线的长度和偏离主题指示符的数量可能不与至少一个主题相关联 利益。 此后,可以基于确定的行长度和确定的脱离主题指示数来确定每行的密度。 用于确定每行的密度用于识别可能与所述至少一个感兴趣的主题相关联的文档的部分,从而可以提供表征文档的所识别的部分的数据。 还描述了相关设备,系统,技术和物品。

    Entity Name Variant Generator
    3.
    发明申请
    Entity Name Variant Generator 审中-公开
    实体名称变体发生器

    公开(公告)号:US20130297634A1

    公开(公告)日:2013-11-07

    申请号:US13465848

    申请日:2012-05-07

    CPC classification number: G06F17/273 G06F17/278

    Abstract: Data is received that comprises an entity name. Thereafter, it is determined (i) whether there are any punctuation variations for the entity name, (ii) whether there is at least one character to drop from the entity name, and (iii) whether there are alternative equivalents of at least a portion of the entity name. After such determinations have been made, a plurality of variants for the entity name is generated based on a combination of each determined punctuation variation, determined at least one character to drop, and determined alternative equivalent. Related apparatus, systems, techniques and articles are also described.

    Abstract translation: 收到包含实体名称的数据。 此后,确定(i)实体名称是否存在任何标点符号变化,(ii)是否存在至少一个字符从实体名称中删除,以及(iii)是否存在至少一部分的替代等同物 的实体名称。 在进行这样的确定之后,基于确定的至少一个字符和下降的确定的标点符号变化的组合以及确定的替代等效物来生成用于实体名称的多个变体。 还描述了相关设备,系统,技术和物品。

    Enterprise Resource Planning System Entity Event Monitoring
    4.
    发明申请
    Enterprise Resource Planning System Entity Event Monitoring 审中-公开
    企业资源规划系统实体事件监控

    公开(公告)号:US20130297361A1

    公开(公告)日:2013-11-07

    申请号:US13465869

    申请日:2012-05-07

    CPC classification number: G06Q10/0631

    Abstract: A company is associated, in an enterprise resource planning system, with a plurality of business entities that each have at least one structured record used by the enterprise resource planning system to characterize the business entity. Thereafter, documents are obtained from a plurality of information sources that characterize events associated with each business entity. It is then determined, using pre-defined business rules, which of the events are pertinent to the company so that enhancement records can be generated for the events determined to be pertinent to the company. These enhancement records characterize the corresponding event and are linked to the structured record for the corresponding business entity. Related apparatus, systems, techniques and articles are also described.

    Abstract translation: 公司在企业资源计划系统中与多个业务实体相关联,每个业务实体至少有一个由企业资源规划系统使用的结构化记录来表征业务实体。 此后,从表示与每个业务实体相关联的事件的多个信息源获得文档。 然后,使用预定义的业务规则确定哪个事件与公司相关,以便可以为确定与公司相关的事件生成增强记录。 这些增强记录表征相应的事件,并链接到相应业务实体的结构化记录。 还描述了相关设备,系统,技术和物品。

    Creation of event types for news mining for enterprise resource planning

    公开(公告)号:US09639818B2

    公开(公告)日:2017-05-02

    申请号:US14015779

    申请日:2013-08-30

    Applicant: Mohammad Shami

    Inventor: Mohammad Shami

    CPC classification number: G06Q10/0631

    Abstract: An event type generator may provide a training set for classifying documents with respect to an event type. The event type generator may include a request handler to receive the event type and at least one example document, a text analyzer to extract first entities from the at least one example document, and a result manager to execute a first search against an indexed corpus of documents, to obtain first search results, and further to receive at least one selected document from the first search results. The request handler may extract second entities from the at least one selected document, and execute a second search against the indexed corpus of documents, to obtain second search results. The event type generator may thus provide the at least one example document, the first search results, and the second search results as the training set.

    System and method of record matching in a database
    6.
    发明授权
    System and method of record matching in a database 有权
    数据库中记录匹配的系统和方法

    公开(公告)号:US09218372B2

    公开(公告)日:2015-12-22

    申请号:US13565484

    申请日:2012-08-02

    CPC classification number: G06F17/30303

    Abstract: A system and method of record matching using regular expressions and finite state representations. In this manner, the time (or computational effort) involved in record matching is reduced.

    Abstract translation: 使用正则表达式和有限状态表示记录匹配的系统和方法。 以这种方式,减少了记录匹配所涉及的时间(或计算工作量)。

    News mining for enterprise resource planning
    7.
    发明授权
    News mining for enterprise resource planning 有权
    企业资源规划新闻挖掘

    公开(公告)号:US09443214B2

    公开(公告)日:2016-09-13

    申请号:US14015764

    申请日:2013-08-30

    Applicant: Mohammad Shami

    Inventor: Mohammad Shami

    CPC classification number: G06Q10/06315

    Abstract: A system may include a record generator to receive a plurality of documents associated with a plurality of suppliers and provide supplier-specific data records based thereon. The record generator may include an event classifier configured to execute a supplier-independent, event-based classification of each document, to thereby obtain event-classified documents. The record generator may include a supplier query generator configured to query the plurality of documents to obtain potential supplier matches from the plurality of suppliers, and a supplier match analyzer configured to analyze each potential supplier match of the potential supplier matches, to thereby obtain supplier matches. The record generator may include a supplier relevance analyzer configured to relate, for each event-classified document, any supplier identified therein to at least one event of the event-classified document, to thereby obtain supplier-event relationships. Thus, the record generator may provide supplier-specific data records, based on the supplier event relationship.

    Abstract translation: 系统可以包括记录生成器,用于接收与多个供应商相关联的多个文档,并且基于此提供供应商特定的数据记录。 记录生成器可以包括事件分类器,其被配置为执行每个文档的与供应商无关的基于事件的分类,从而获得事件分类的文档。 记录生成器可以包括供应商查询生成器,其被配置为查询多个文档以从多个供应商获得潜在的供应商匹配;以及供应商匹配分析器,其被配置为分析潜在供应商匹配的每个潜在供应商匹配,从而获得供应商匹配 。 记录生成器可以包括供应商相关性分析器,其被配置为将每个事件分类文档与其中识别的任何供应商与事件分类文档的至少一个事件相关联,从而获得供应商事件关系。 因此,记录生成器可以基于供应商事件关系来提供供应商特定的数据记录。

    CREATION OF EVENT TYPES FOR NEWS MINING FOR ENTERPRISE RESOURCE PLANNING
    8.
    发明申请
    CREATION OF EVENT TYPES FOR NEWS MINING FOR ENTERPRISE RESOURCE PLANNING 有权
    创新企业资源规划新闻采矿活动类型

    公开(公告)号:US20150066552A1

    公开(公告)日:2015-03-05

    申请号:US14015779

    申请日:2013-08-30

    Applicant: Mohammad Shami

    Inventor: Mohammad Shami

    CPC classification number: G06Q10/0631

    Abstract: An event type generator may provide a training set for classifying documents with respect to an event type. The event type generator may include a request handler to receive the event type and at least one example document, a text analyzer to extract first entities from the at least one example document, and a result manager to execute a first search against an indexed corpus of documents, to obtain first search results, and further to receive at least one selected document from the first search results. The request handler may extract second entities from the at least one selected document, and execute a second search against the indexed corpus of documents, to obtain second search results. The event type generator may thus provide the at least one example document, the first search results, and the second search results as the training set.

    Abstract translation: 事件类型生成器可以提供关于事件类型对文档进行分类的训练集。 事件类型生成器可以包括用于接收事件类型的请求处理程序和至少一个示例文档,从至少一个示例文档中提取第一实体的文本分析器,以及结果管理器,以针对索引的语料库执行第一搜索 文件,以获得第一搜索结果,并进一步从第一搜索结果接收至少一个选定的文档。 所述请求处理器可以从所述至少一个所选择的文档中提取第二实体,并针对索引的文档语料库执行第二搜索,以获得第二搜索结果。 因此,事件类型生成器可以提供至少一个示例文档,第一搜索结果和第二搜索结果作为训练集。

    System and Method of Record Matching in a Database
    9.
    发明申请
    System and Method of Record Matching in a Database 有权
    数据库中记录匹配的系统和方法

    公开(公告)号:US20140040313A1

    公开(公告)日:2014-02-06

    申请号:US13565484

    申请日:2012-08-02

    CPC classification number: G06F17/30303

    Abstract: A system and method of record matching using regular expressions and finite state representations. In this manner, the time (or computational effort) involved in record matching is reduced.

    Abstract translation: 使用正则表达式和有限状态表示记录匹配的系统和方法。 以这种方式,减少了记录匹配所涉及的时间(或计算工作量)。

    NEWS MINING FOR ENTERPRISE RESOURCE PLANNING
    10.
    发明申请
    NEWS MINING FOR ENTERPRISE RESOURCE PLANNING 有权
    企业资源规划新闻采矿

    公开(公告)号:US20150066567A1

    公开(公告)日:2015-03-05

    申请号:US14015764

    申请日:2013-08-30

    Applicant: Mohammad Shami

    Inventor: Mohammad Shami

    CPC classification number: G06Q10/06315

    Abstract: A system may include a record generator to receive a plurality of documents associated with a plurality of suppliers and provide supplier-specific data records based thereon. The record generator may include an event classifier configured to execute a supplier-independent, event-based classification of each document, to thereby obtain event-classified documents. The record generator may include a supplier query generator configured to query the plurality of documents to obtain potential supplier matches from the plurality of suppliers, and a supplier match analyzer configured to analyze each potential supplier match of the potential supplier matches, to thereby obtain supplier matches. The record generator may include a supplier relevance analyzer configured to relate, for each event-classified document, any supplier identified therein to at least one event of the event-classified document, to thereby obtain supplier-event relationships. Thus, the record generator may provide supplier-specific data records, based on the supplier event relationship.

    Abstract translation: 系统可以包括记录生成器,用于接收与多个供应商相关联的多个文档,并且基于此提供供应商特定的数据记录。 记录生成器可以包括事件分类器,其被配置为执行每个文档的与供应商无关的基于事件的分类,从而获得事件分类的文档。 记录生成器可以包括供应商查询生成器,其被配置为查询多个文档以从多个供应商获得潜在的供应商匹配;以及供应商匹配分析器,其被配置为分析潜在供应商匹配的每个潜在供应商匹配,从而获得供应商匹配 。 记录生成器可以包括供应商相关性分析器,其被配置为将每个事件分类文档与其中识别的任何供应商与事件分类文档的至少一个事件相关联,从而获得供应商事件关系。 因此,记录生成器可以基于供应商事件关系来提供供应商特定的数据记录。

Patent Agency Ranking