Method and system for extracting opinions from text documents
    1.
    发明授权
    Method and system for extracting opinions from text documents 有权
    从文本文件中提取意见的方法和系统

    公开(公告)号:US08200477B2

    公开(公告)日:2012-06-12

    申请号:US10692025

    申请日:2003-10-22

    Abstract: A method and system for extracting opinions about a subject of interest from a text document in which each sentence is analyzed individually to identify the opinions. The most relevant feature terms related to the subject are extracted from the document based on their relevancy scores. Candidate feature terms are definite noun phrases at the beginning of the sentences. For each sentence that refers to the subject or a feature term, the invention determines whether the sentence includes an opinion polarity about the subject or the feature term. The opinion polarity is detected by identifying opinion terms in the sentence using an opinion dictionary or an opinion rule base, parsing the sentence with an English parser to identify grammatical components in the sentence and their relationships, and finding a matching entry in the dictionary or the rule base.

    Abstract translation: 一种方法和系统,用于从单独分析每个句子的文本文档中提取有关感兴趣主题的意见,以识别意见。 根据相关性分数,从文档中提取与该主题相关的最相关的功能术语。 候选特征术语是句子开头的明确的名词短语。 对于涉及主题或特征术语的每个句子,本发明确定句子是否包括关于被摄体或特征项的意见极性。 通过使用意见词典或意见规则库识别句子中的意见词来检测意见极性,用英语解析器解析句子以识别句子中的语法成分及其关系,并在字典中找到匹配的条目或 规则库。

    Method and system for extracting opinions from text documents
    2.
    发明申请
    Method and system for extracting opinions from text documents 有权
    从文本文件中提取意见的方法和系统

    公开(公告)号:US20050091038A1

    公开(公告)日:2005-04-28

    申请号:US10692025

    申请日:2003-10-22

    Abstract: A method and system for extracting opinions about a subject of interest from a text document in which each sentence is analyzed individually to identify the opinions. The most relevant feature terms related to the subject are extracted from the document based on their relevancy scores. Candidate feature terms are definite noun phrases at the beginning of the sentences. For each sentence that refers to the subject or a feature term, the invention determines whether the sentence includes an opinion polarity about the subject or the feature term. The opinion polarity is detected by identifying opinion terms in the sentence using an opinion dictionary or an opinion rule base, parsing the sentence with an English parser to identify grammatical components in the sentence and their relationships, and finding a matching entry in the dictionary or the rule base.

    Abstract translation: 一种方法和系统,用于从单独分析每个句子的文本文档中提取有关感兴趣主题的意见,以识别意见。 根据相关性分数,从文档中提取与该主题相关的最相关的功能术语。 候选特征术语是句子开头的明确的名词短语。 对于涉及主题或特征术语的每个句子,本发明确定句子是否包括关于被摄体或特征项的意见极性。 通过使用意见词典或意见规则库识别句子中的意见词来检测意见极性,用英语解析器解析句子以识别句子中的语法成分及其关系,并在字典中找到匹配的条目或 规则库。

    SYSTEM FOR PROVIDING FEEDS FOR ENTITIES NOT ASSOCIATED WITH FEED SERVICES
    3.
    发明申请
    SYSTEM FOR PROVIDING FEEDS FOR ENTITIES NOT ASSOCIATED WITH FEED SERVICES 审中-公开
    用于为与饲料服务不相关的实体提供饲料的系统

    公开(公告)号:US20100082745A1

    公开(公告)日:2010-04-01

    申请号:US12239972

    申请日:2008-09-29

    CPC classification number: G06F16/958

    Abstract: A system is described for providing feeds for entities not associated with feed services. The system may include a processor, a memory and an interface. The memory may store an identifier of an entity, an update condition and a feed. The entity may include content, and the update condition may describe an update to the content. The interface may communicate with a device of the user. The processor may receive the identifier of the entity and the update condition of the entity via the interface. The processor may generate a feed for the entity and the processor may add the content to the feed when the content is updated in accordance with the update condition. The processor may then provide the feed to the device of the user via the interface.

    Abstract translation: 描述了用于为不与饲料服务相关联的实体提供饲料的系统。 该系统可以包括处理器,存储器和接口。 存储器可以存储实体的标识符,更新条件和馈送。 实体可以包括内容,并且更新条件可以描述对内容的更新。 接口可以与用户的设备进行通信。 处理器可以经由接口接收实体的标识符和实体的更新条件。 处理器可以为实体生成馈送,并且当根据更新条件更新内容时,处理器可以将内容添加到馈送。 处理器然后可以经由接口向用户的设备提供馈送。

    SYSTEM FOR ASSOCIATING DATA ITEMS WITH CONTEXT
    4.
    发明申请
    SYSTEM FOR ASSOCIATING DATA ITEMS WITH CONTEXT 审中-公开
    与上下文相关的数据项系统

    公开(公告)号:US20100082629A1

    公开(公告)日:2010-04-01

    申请号:US12239982

    申请日:2008-09-29

    CPC classification number: G06Q30/02 G06F16/24575 G06F16/24578

    Abstract: A system is described for associating data items with context. The system may include a processor, a memory and an interface. The processor may identify an action performed by a user and may determine the spatial, temporal, social and topical attributes of the action. The spatial attribute of the action may relate to the user's location, the temporal attribute may relate to the time the action was performed, the social attribute may relate to a social relation of the user, and the topical attribute may relate to a topic of interest to the user. The processor may store an association between a descriptor of the action, the spatial attribute of the action, the temporal attribute of the action, the social attribute of the action and the topical attribute of the action in the memory. The processor may use the stored association to provide a contextually relevant data item via the interface.

    Abstract translation: 描述了将数据项与上下文相关联的系统。 该系统可以包括处理器,存储器和接口。 处理器可以识别用户执行的动作,并且可以确定动作的空间,时间,社交和主题属性。 动作的空间属性可以涉及用户的位置,时间属性可以涉及动作被执行的时间,社会属性可能与用户的社会关系有关,并且主题属性可以涉及感兴趣的话题 给用户 处理器可以存储动作的描述符,动作的空间属性,动作的时间属性,动作的社会属性和动作的主题属性在存储器中的关联。 处理器可以使用所存储的关联来经由该接口提供与情境相关的数据项。

    System and method of automatic discovery of terms in a document that are relevant to a given target topic
    5.
    发明授权
    System and method of automatic discovery of terms in a document that are relevant to a given target topic 有权
    自动发现文档中与给定目标主题相关的术语的系统和方法

    公开(公告)号:US06651058B1

    公开(公告)日:2003-11-18

    申请号:US09439758

    申请日:1999-11-15

    Abstract: A computer program product is provided as an automatic mining system to discover terms that are relevant to a given target topic from a large databases of unstructured information such as the World Wide Web. The operation of the automatic mining system is performed in three stages: The first stage is carried out by a new terms discoverer for discovering the terms in a document, the second stage is carried out by a candidate terms discoverer for discovering potentially relevant terms, and the third stage is carried out by a relevant terms discoverer for refining or testing the discovered relevance to filter false relevance. The new terms discoverer includes a system for the automatic mining of patterns and relations, a system for the automatic mining of new relationships, and a system for selecting new terms from relations. In one embodiment, the system for the automatic mining of patterns and relations identifies a set of related terms on the WWW with a high degree of confidence, using a duality concept, and includes a terms database and two identifiers: a relation identifier and a pattern identifier. The system for the automatic mining of new relationships includes a database a knowledge module and a statistics module. The knowledge module includes a stemming unit, a synonym check unit, and a domain knowledge check unit. The candidate terms discoverer includes a metadata extractor, a document vector module, an association module, a filtering module, and a database. The relevant terms discoverer includes a stop word filter and a system for the automatic construction of generalization—specialization hierarchy of terms comprised of a terms database, an augmentation module, a generalization detection module, and a hierarchy database.

    Abstract translation: 提供计算机程序产品作为自动挖掘系统,以从诸如万维网的非结构化信息的大型数据库中发现与给定目标主题相关的术语。 自动采矿系统的运行分为三个阶段:第一阶段是由一个新的术语发现者进行的,用于发现文件中的术语,第二阶段是由候选词条发现者进行的,用于发现潜在的相关术语,以及 第三阶段由相关术语发现者进行,用于提炼或测试发现的相关性以过滤虚假相关性。 新术语发现者包括自动挖掘模式和关系的系统,用于自动挖掘新关系的系统以及从关系中选择新术语的系统。 在一个实施例中,用于自动挖掘模式和关系的系统使用二元性概念以高度的置信度在WWW上识别一组相关术语,并且包括术语数据库和两个标识符:关系标识符和模式 标识符 用于自动挖掘新关系的系统包括数据库知识模块和统计模块。 知识模块包括词干单元,同义词检查单元和域知识检查单元。 候选词语发现者包括元数据提取器,文档向量模块,关联模块,过滤模块和数据库。 相关术语发现者包括停止词过滤器和用于自动构建由术语数据库,增强模块,泛化检测模块和层次数据库组成的术语的泛化专业化层次结构的系统。

    Methods and Systems for Optimizing Webpage Content Based on a Screen Orientation of a Device
    6.
    发明申请
    Methods and Systems for Optimizing Webpage Content Based on a Screen Orientation of a Device 审中-公开
    基于设备屏幕方向优化网页内容的方法和系统

    公开(公告)号:US20100083163A1

    公开(公告)日:2010-04-01

    申请号:US12240323

    申请日:2008-09-29

    Abstract: Systems and methods for optimizing webpage content based on a screen orientation of a device are disclosed. Generally, a plurality of content chunks comprising a webpage to be displayed on a device is identified. An indication of a screen orientation of the device is received and webpage content to be displayed on the device is modified based at least in part on the screen orientation of the device, the identified plurality of content chunks, and a focus priority associated with each content chunk of the plurality of content chunks.

    Abstract translation: 公开了基于设备的屏幕方向优化网页内容的系统和方法。 通常,识别包括要显示在设备上的网页的多个内容块。 接收设备的屏幕方向的指示,并且至少部分地基于设备的屏幕取向,所识别的多个内容块以及与每个内容相关联的焦点优先级来修改要在设备上显示的网页内容 多个内容块的块。

    COMMUNITY CACHING NETWORKS
    7.
    发明申请
    COMMUNITY CACHING NETWORKS 有权
    社区缓存网络

    公开(公告)号:US20100082764A1

    公开(公告)日:2010-04-01

    申请号:US12240524

    申请日:2008-09-29

    CPC classification number: H04L12/6418

    Abstract: A system for sharing data within a network, the system including a first peer device coupled with the network that comprises local cache storage configured to store data comprising at least one entry designated as network accessible cache data and a cache control module operative to control access to the data stored in the local cache storage. The system further includes a second peer device coupled with the first peer device via the network where the second peer device is configured to request network accessible cache data stored in the local cache storage of the first peer device. Furthermore, the cache control module of the first peer device is configured to transmit at least a portion of the requested network accessible cache data to the second peer device in response to the request for network accessible data stored from the second peer device.

    Abstract translation: 一种用于在网络内共享数据的系统,所述系统包括与所述网络耦合的第一对等设备,所述第一对等设备包括本地高速缓存存储器,其被配置为存储包括指定为网络可访问高速缓存数据的至少一个条目的数据, 数据存储在本地缓存中。 该系统还包括经由网络与第一对等设备耦合的第二对等设备,其中第二对等设备被配置为请求存储在第一对等设备的本地高速缓存存储器中的网络可访问高速缓存数据。 此外,第一对等设备的高速缓存控制模块被配置为响应于从第二对等设备存储的网络可访问数据的请求,向第二对等设备发送所请求的网络可访问高速缓存数据的至少一部分。

    Dimension reduction for data mining application
    8.
    发明授权
    Dimension reduction for data mining application 失效
    数据挖掘应用的维度减少

    公开(公告)号:US6032146A

    公开(公告)日:2000-02-29

    申请号:US21038

    申请日:1998-02-09

    Abstract: A method, apparatus, and article of manufacture for a computer-implemented random reliability engine for computer-implemented dimension reduction using association rules for data mining application. The data mining is performed by the computer to retrieve data from a data store stored on a data storage device coupled to the computer. The data store has records that have multiple attributes. The multiple attributes of a table are clustered to produce a plurality of sets of attributes. Each set of attributes is clustered to obtain data mining attributes.

    Abstract translation: 一种用于计算机实现的随机可靠性引擎的方法,装置和制品,其使用用于数据挖掘应用的关联规则进行计算机实现的维度缩减。 数据挖掘由计算机执行以从存储在耦合到计算机的数据存储设备上的数据存储器中检索数据。 数据存储具有多个属性的记录。 表的多个属性被聚集以产生多组属性。 将每组属性聚类以获取数据挖掘属性。

    SYSTEM FOR PROVIDING CONTEXTUALLY RELEVANT DATA
    9.
    发明申请
    SYSTEM FOR PROVIDING CONTEXTUALLY RELEVANT DATA 审中-公开
    提供相关数据的系统

    公开(公告)号:US20100082398A1

    公开(公告)日:2010-04-01

    申请号:US12239964

    申请日:2008-09-29

    Abstract: A system is described for providing contextually relevant data. The system may include a processor, a memory and an interface. The processor may receive a query associated with a user. The processor may determine a spatial context, a temporal context, a social context and a topical context of the query. The spatial context may represent the location of the user, and the temporal context may represent a time of the query. The topical context may represent an item of interest to the user, and the social context may represent an item of interest to other users associated with the user. The processor may identify a plurality of search results for the query searched for. Each search result may be associated with one or more of the spatial context, the temporal context, the social context and the topical context. The processor may provide the identified search results to the user.

    Abstract translation: 描述了一种用于提供上下文相关数据的系统。 该系统可以包括处理器,存储器和接口。 处理器可以接收与用户相关联的查询。 处理器可以确定查询的空间上下文,时间上下文,社会语境和主题上下文。 空间上下文可以表示用户的位置,并且时间上下文可以表示查询的时间。 主题上下文可以表示用户感兴趣的项目,并且社交语境可以表示与用户相关联的其他用户感兴趣的项目。 处理器可以标识搜索到的查询的多个搜索结果。 每个搜索结果可以与空间上下文,时间上下文,社会语境和主题上下文中的一个或多个相关联。 处理器可以向用户提供所识别的搜索结果。

    System and method for the automatic mining of new relationships
    10.
    发明授权
    System and method for the automatic mining of new relationships 失效
    自动挖掘新关系的系统和方法

    公开(公告)号:US06539376B1

    公开(公告)日:2003-03-25

    申请号:US09440626

    申请日:1999-11-15

    Abstract: An automatic mining system that identifies a set of relevant terms from a large text database of unstructured information, such as the World Wide Web with a high degree of confidence. The automatic mining system includes a software program that enables the discovery of new relationships by association mining and refinement of co-occurrences, using automatic and iterative recognition of new binary relations through phrases that embody related pairs, by applying lexicographic and statistical techniques to classify the relations, and further by applying a minimal amount of domain knowledge of the relevance of the terms and relations. The automatic mining system includes a knowledge module and a statistics module. The knowledge module is comprised of a stemming unit, a synonym check unit, and a domain knowledge check unit. The stemming unit determines if the relation being analyzed shares a common root with a previously mined relation. The synonym check unit identifies the synonyms of the relation, and the domain knowledge check unit considers extrinsic factors for indications that would further clarify the relationship being mined. The statistics module optimizes the confidence level in the relationship.

    Abstract translation: 一种自动采矿系统,用于从非结构化信息的大型文本数据库(例如万维网)中高度信任地识别一组相关术语。 自动采矿系统包括一个软件程序,可以通过关联挖掘和共同出现的细化来发现新的关系,通过应用词典和统计技术来分类新的二元关系,通过使用相关对的短语自动和迭代地识别新的二元关系 关系,并进一步通过应用最少量的领域知识的术语和关系的相关性。 自动采矿系统包括知识模块和统计模块。 知识模块由词干单元,同义词检查单元和域知识检查单元组成。 干扰单元确定正在分析的关系是否与先前开采的关系共享共同的根。 同义词检查单元识别关系的同义词,域知识检查单元考虑用于进一步澄清正在开采的关系的指示的外在因素。 统计模块优化关系中的置信水平。

Patent Agency Ranking