Efficient lexical trending topic detection over streams of data using a modified sequitur algorithm
    1.
    Type: granted invention patent
    Status: in force

    Publication number: US08838599B2

    Publication date: 2014-09-16

    Application number: US12780850

    Filing date: 2010-05-14

    CPC classification number: G06F17/30616

    Abstract: Embodiments are directed towards a Modified Sequitur algorithm (MSA) that uses pipelining and indexed arrays to identify trending topics within a plurality of documents having user generated content (UGC). The documents are parallelized and distributed across a plurality of network devices, which place at least some of the received documents into a buffer. The MSA may then be applied to the documents within the buffer to identify n-grams or phrases within the documents' contents. The identified phrases are further analyzed to remove extraneous co-occurrences of phrases and/or words based on a part-of-speech analysis. A weighting of the remaining phrases is used to identify trending topic phrases. Links to content in the plurality of UGC documents that is associated with the trending topic phrases may then be displayed to a client device.
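
    The phrase-extraction and weighting steps described in this abstract can be illustrated with a minimal sketch. It is not the Modified Sequitur algorithm: plain n-gram counting stands in for the MSA, and the function names (`count_ngrams`, `trending_phrases`) and the add-one smoothed ratio used as a weight are assumptions made for illustration only.

```python
from collections import Counter


def count_ngrams(documents, n=2):
    """Count word n-grams across a buffer of documents (a stand-in for the MSA)."""
    counts = Counter()
    for text in documents:
        words = text.lower().split()
        for i in range(len(words) - n + 1):
            counts[tuple(words[i:i + n])] += 1
    return counts


def trending_phrases(buffer_docs, baseline_docs, n=2, top_k=3):
    """Rank phrases by how much more often they occur in the buffer than in a baseline.

    The add-one smoothed ratio is an assumed weighting; the abstract does not
    say how phrases are weighted.
    """
    current = count_ngrams(buffer_docs, n)
    baseline = count_ngrams(baseline_docs, n)
    scored = {
        phrase: count / (baseline[phrase] + 1)
        for phrase, count in current.items()
    }
    # Highest weight first; ties broken alphabetically for a stable order.
    ranked = sorted(scored.items(), key=lambda item: (-item[1], item[0]))
    return [" ".join(phrase) for phrase, _ in ranked[:top_k]]
```

    Calling `trending_phrases(buffer_docs, baseline_docs)` with the current buffer and an older sample of documents returns the phrases that are unusually frequent in the buffer. A fuller version would also filter phrases by part of speech, as the abstract mentions.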

    Push pull caching for social network information
    2.
    Type: granted invention patent
    Status: in force

    Publication number: US08655842B2

    Publication date: 2014-02-18

    Application number: US12542144

    Filing date: 2009-08-17

    Applicant: Zhichen Xu

    Inventor: Zhichen Xu

    Abstract: Embodiments are directed towards modifying a distribution of writers as either a push writer or a pull writer based on a cost model that decides for a given content reader whether it is more effective for the writer to be a pull writer or a push writer. A cache is maintained for each content reader for caching content items pushed by a push writer in the content writer's push list of writers when the content is generated. At query time, content items are pulled by the content reader based on writers in the content reader's pull list. One embodiment of the cost model employs data about a previous number of requests for content items for a given writer for a number of previous blended display results of content items. When a writer is determined to be popular, mechanisms are proposed for pushing content items to a plurality of content readers.
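
    The push/pull split can be sketched as follows. The cost rule here (push when a reader reads more often than the writer writes) is a hypothetical stand-in for the cost model in the abstract, and every name (`Reader`, `assign_writer`, `publish`, `read_feed`) is an assumption introduced for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Reader:
    name: str
    reads_per_day: float                       # how often this reader queries the feed
    push_list: set = field(default_factory=set)
    pull_list: set = field(default_factory=set)
    cache: list = field(default_factory=list)  # items pushed at write time


def assign_writer(reader, writer, writes_per_day):
    """Place `writer` on the reader's push list or pull list.

    Assumed cost rule: pushing costs one cache write per item and pulling
    costs one fetch per read, so push only when reads outnumber writes.
    """
    reader.push_list.discard(writer)
    reader.pull_list.discard(writer)
    if reader.reads_per_day > writes_per_day:
        reader.push_list.add(writer)
        return "push"
    reader.pull_list.add(writer)
    return "pull"


def publish(readers, writer, item):
    """At write time, cache the item for every reader that has the writer on its push list."""
    for reader in readers:
        if writer in reader.push_list:
            reader.cache.append((writer, item))


def read_feed(reader, latest_by_writer):
    """At query time, blend cached pushed items with items pulled from pull-list writers."""
    pulled = [
        (writer, latest_by_writer[writer])
        for writer in sorted(reader.pull_list)
        if writer in latest_by_writer
    ]
    return reader.cache + pulled
```

    Under this rule a frequent reader of an occasional writer receives pushed items, while a prolific writer is pulled only when the reader actually asks for the feed.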

    Distributing content indices
    3.
    Type: granted invention patent
    Status: lapsed

    Publication number: US08117215B2

    Publication date: 2012-02-14

    Application number: US12889740

    Filing date: 2010-09-24

    CPC classification number: G06F17/30864

    Abstract: A query-centric system and process for distributing reverse indices for a distributed content system. Relevance ranking techniques are used in organizing distributed system indices. Query-centric configuration subprocesses (1) analyze query data, partitioning terms for reverse index server(s) (RIS), (2) distribute each partitioned data set by generally localizing search terms for the RIS that have some query-centric correlation, and (3) generate and maintain a map for the partitioned reverse index system terms by mapping the terms for the reverse index to a plurality of different index server nodes. An indexing subprocess element builds distributed reverse indices from content host indices. Routines of the query execution use the map derived in the configuration to more efficiently return more relevant search results to the searcher.

    System and method for automatically organizing bookmarks through the use of tag data
    4.
    Type: granted invention patent
    Status: in force

    Publication number: US08010532B2

    Publication date: 2011-08-30

    Application number: US11624072

    Filing date: 2007-01-17

    CPC classification number: G06F17/30884

    Abstract: The present invention is directed towards systems and methods for organization of bookmarks. The method according to one embodiment comprises retrieving one or more bookmarks associated with one or more content items, a given bookmark generated by a user of a client device, and identifying one or more tags associated with one or more uniform resource locators corresponding to the one or more bookmarks. A bookmark folder hierarchy is created through use of a clustering algorithm on the basis of the one or more tags associated with the one or more uniform resource locators corresponding to the one or more bookmarks.

    Increasing peer privacy
    5.
    Type: granted invention patent
    Status: in force

    Publication number: US07865715B2

    Publication date: 2011-01-04

    Application number: US10084499

    Filing date: 2002-02-28

    Applicant: Zhichen Xu, Li Xiao

    Inventor: Zhichen Xu, Li Xiao

    CPC classification number: H04L63/0407, H04L63/0442

    Abstract: In a method for increasing peer privacy, a path for information is formed from a provider to a requestor through a plurality of peers in response to a received request for the information. Each peer of the plurality of peers receives a respective set-up message comprising a predetermined label and an identity of a next peer for the information. The information is transferred over the path in a message, where the message comprises a message label configured to determine a next peer according to the path in response to the message label matching the previously received predetermined label.
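
    The label-based forwarding described in this abstract can be sketched as follows. It assumes a single label shared along the whole path and direct in-process calls between peers; the names `Peer`, `setup`, `receive` and `build_path` are hypothetical and are not taken from the patent.

```python
class Peer:
    def __init__(self, name):
        self.name = name
        self.routes = {}      # label -> next Peer (None marks the end of the path)
        self.delivered = []   # payloads that terminated at this peer

    def setup(self, label, next_peer):
        """Handle a set-up message: remember which peer follows for this label."""
        self.routes[label] = next_peer

    def receive(self, label, payload):
        """Forward a message only if its label matches a previously received set-up label."""
        if label not in self.routes:
            return False      # unknown label: drop the message
        next_peer = self.routes[label]
        if next_peer is None:
            self.delivered.append(payload)
            return True
        return next_peer.receive(label, payload)


def build_path(peers, label):
    """Send each peer on the path its own set-up message (label plus next hop only)."""
    for current, following in zip(peers, peers[1:] + [None]):
        current.setup(label, following)
```

    Each peer stores only the label and its next hop, so in this sketch an intermediate peer cannot tell from its own state who the provider or the final requestor is.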

    AUTOMATED SCREEN SCRAPING VIA GRAMMAR INDUCTION
    6.
    Type: invention patent application
    Status: in force

    Publication number: US20100256974A1

    Publication date: 2010-10-07

    Application number: US12417773

    Filing date: 2009-04-03

    CPC classification number: G06F17/248, G06F17/2715

    Abstract: A method and a computer-readable medium are provided which perform screen scraping via grammar induction. The computer-readable medium stores instructions of the method, the instructions directing a computer processor to intercept display information transmitted to a computer-implemented display device representing information stored in a data source; induce a grammar via statistical analysis of the intercepted display information; provide the grammar to a parser-generator to generate a parser corresponding to the induced grammar; and perform screen scraping using the generated parser.

    Apparatus and method for controlling content access based on shared annotations for annotated users in a folksonomy scheme
    7.
    Type: granted invention patent
    Status: in force

    Publication number: US07761436B2

    Publication date: 2010-07-20

    Application number: US11325254

    Filing date: 2006-01-03

    CPC classification number: G06F17/30867, G06F17/30997

    Abstract: A method for sharing content with a user includes receiving from a user a first set of keywords for annotating an annotated user; receiving from the user a second set of keywords that designate whether annotated content annotated by at least one keyword included in the second set of keywords may be shared with the annotated user; storing in a data store a first association of the first set of keywords with the annotated user, and a second association of the second set of keywords with the annotated user; receiving a keyword selection for a select keyword and an identifier for the annotated user; and displaying on the client system content annotated by the select keyword if the annotated user is annotated by at least one keyword in the first set of keywords, and if the select keyword is included in the second set of keywords.
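
    The display condition at the end of this abstract reduces to a two-part check, sketched below. The function name `may_share` and the dictionary layout of the two stored associations are assumptions made for illustration.

```python
def may_share(select_keyword, user_id, user_annotations, shareable_keywords):
    """Return True when content annotated with `select_keyword` may be shown for `user_id`.

    user_annotations:   user id -> first set of keywords (annotations on the user)
    shareable_keywords: user id -> second set of keywords (annotations whose
                        content may be shared with that user)
    """
    # Condition 1: the user is annotated by at least one keyword in the first set.
    annotated = bool(user_annotations.get(user_id))
    # Condition 2: the selected keyword appears in the second set.
    allowed = select_keyword in shareable_keywords.get(user_id, set())
    return annotated and allowed
```

    For example, with `user_annotations = {"u1": {"family"}}` and `shareable_keywords = {"u1": {"vacation"}}`, content annotated "vacation" passes the check for `u1`, while content annotated "work" does not.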

    Communicating between wireless communities
    8.
    Type: granted invention patent
    Status: in force

    Publication number: US07643458B1

    Publication date: 2010-01-05

    Application number: US11137838

    Filing date: 2005-05-25

    CPC classification number: H04W40/20, H04L45/64

    Abstract: A message is received from a first wireless node in a first wireless community. The message is for a second wireless node in a second wireless community. Location information for the second wireless node is determined using a distributed hash table (DHT) overlay network. The message is routed to a second wireless community using the location information.
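
    The lookup step in this abstract can be sketched with a toy distributed hash table. The consistent-hashing ring, the class `LocationDHT`, and the helper `route` are assumptions for illustration; the abstract does not say which DHT design the overlay uses.

```python
import bisect
import hashlib


def ring_position(key):
    """Hash a string onto the identifier ring."""
    return int(hashlib.sha1(key.encode("utf-8")).hexdigest(), 16)


class LocationDHT:
    """Toy DHT: each key is stored on the first overlay node clockwise from its hash."""

    def __init__(self, overlay_nodes):
        self.ring = sorted((ring_position(n), n) for n in overlay_nodes)
        self.store = {n: {} for n in overlay_nodes}

    def responsible_node(self, key):
        positions = [pos for pos, _ in self.ring]
        # Wrap around to the first node when the hash falls past the last position.
        index = bisect.bisect_left(positions, ring_position(key)) % len(self.ring)
        return self.ring[index][1]

    def put(self, wireless_node, community):
        self.store[self.responsible_node(wireless_node)][wireless_node] = community

    def get(self, wireless_node):
        return self.store[self.responsible_node(wireless_node)].get(wireless_node)


def route(dht, message, destination_node):
    """Look up the destination's community and return the routing decision."""
    community = dht.get(destination_node)
    if community is None:
        raise KeyError(f"no location record for {destination_node}")
    return community, message
```

    A gateway in the first community would call `route(dht, message, "node-2")` and hand the message to whichever community the lookup returns.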

    Semantic file system
    9.
    Type: granted invention patent
    Status: in force

    Publication number: US07617250B2

    Publication date: 2009-11-10

    Application number: US10666577

    Filing date: 2003-09-22

    CPC classification number: G06F17/30067

    Abstract: A data model represents semantic information associated with objects stored in a file system. The data model includes a first object identifier, a second object identifier and a relation identifier. The first object identifier identifies a first object stored in the file system. The second object identifier identifies a second object stored in the file system, wherein the second object is related to the first object. The relation identifier identifies a relationship between the first object and the second object.
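
    The three-part data model in this abstract maps naturally onto a small record type. The sketch below is illustrative only: the class names `Relation` and `SemanticIndex`, the use of path strings as object identifiers, and the query method are assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Relation:
    first_object_id: str    # identifies the first object stored in the file system
    second_object_id: str   # identifies the related second object
    relation_id: str        # identifies how the two objects are related


class SemanticIndex:
    """In-memory collection of relations between file system objects."""

    def __init__(self):
        self.relations = set()

    def relate(self, first, second, relation):
        self.relations.add(Relation(first, second, relation))

    def related_to(self, first, relation):
        """List the objects that `first` is connected to by `relation`."""
        return sorted(
            r.second_object_id
            for r in self.relations
            if r.first_object_id == first and r.relation_id == relation
        )
```

    For instance, after `index.relate("/docs/report.pdf", "/docs/report.tex", "generated-from")`, the query `index.related_to("/docs/report.pdf", "generated-from")` returns the source file.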

    Creating expressway for overlay routing
    10.
    Type: granted invention patent
    Status: in force

    Publication number: US07554988B2

    Publication date: 2009-06-30

    Application number: US10237618

    Filing date: 2002-09-10

    Abstract: In a method for creating an expressway for overlay routing, an existing peer-to-peer network is organized into a plurality of zones. The plurality of zones is organized into a plurality of levels. Neighboring zones are identified for each zone of the plurality of zones. One or more representatives are identified for each neighboring zone. A routing table is created based on the plurality of zones, the neighboring zones, the one or more representatives, and the plurality of levels.
