发明授权
US08965893B2 System and method for grouping multiple streams of data 有权
用于对多个数据流进行分组的系统和方法

System and method for grouping multiple streams of data
摘要:
A document clustering system and method of assigning a document to a cluster of documents containing related content are provided. Each cluster is associated with a cluster summary describing the content of the documents in the cluster. The method comprises: determining, at a document clustering system, whether the document should be grouped with one or more previously created cluster summaries, the previously created cluster summaries being stored in a memory in a B-tree data structure; and if it is determined that the document should not be grouped with the one or more previously created cluster summaries, then creating, at a document clustering system, a cluster summary based on the content of the document and storing the created cluster summary in the B-tree data structure.
公开/授权文献
信息查询
0/0