DYNAMIC CLUSTERING OF SPARSE DATA UTILIZING HASH PARTITIONS

Invention Application

US20210326361A1 DYNAMIC CLUSTERING OF SPARSE DATA UTILIZING HASH PARTITIONS (Status: In Force)

Patent Title: DYNAMIC CLUSTERING OF SPARSE DATA UTILIZING HASH PARTITIONS
Application No.: US16852110

Application Date: 2020-04-17
Publication No.: US20210326361A1

Publication Date: 2021-10-21
Inventor: Fan Du, Yeuk-Yin Chan, Eunyee Koh, Ryan Rossi, Margarita Savova, Charles Menguy, Anup Rao
Applicant: Adobe Inc.
Applicant Address: San Jose, CA, US
Assignee: Adobe Inc.
Current Assignee: Adobe Inc.
Current Assignee Address: San Jose, CA, US
Main IPC: G06F16/28
IPC: G06F16/28; G06F16/22

Abstract:

The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing hash partitions to determine local densities and distances among users (or among other represented data points) for clustering sparse data into segments. For instance, the disclosed systems can generate hash signatures for users in a sparse dataset and can map users to hash partitions based on the hash signatures. The disclosed systems can further determine local densities and separation distances for particular users (or other represented data points) within the hash partitions. Upon determining local densities and separation distances for data points from the dataset, the disclosed systems can select a segment (or cluster of data points) grouped according to a hierarchy of a clustering algorithm, such as a density-peaks-clustering algorithm.
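
The flow described in the abstract (hash signatures, hash partitions, then per-partition local densities and separation distances) can be illustrated with a minimal Python sketch. This is not the claimed implementation. It assumes that each user is a non-empty set of feature strings, that the hash signature is a MinHash built on CRC32, that the partition key is the full signature, and that distance is Jaccard distance. The names `minhash_signature`, `partition_users`, and `density_and_separation`, along with the `cutoff` parameter, are hypothetical.

```python
import zlib
from collections import defaultdict


def minhash_signature(features, num_hashes=4):
    """MinHash signature of a non-empty feature set (assumed hashing scheme)."""
    return tuple(
        min(zlib.crc32(f"{seed}:{feat}".encode()) for feat in features)
        for seed in range(num_hashes)
    )


def partition_users(users, num_hashes=4):
    """Map each user id to the hash partition keyed by its signature."""
    partitions = defaultdict(list)
    for user_id, features in users.items():
        partitions[minhash_signature(features, num_hashes)].append(user_id)
    return partitions


def jaccard_distance(a, b):
    """Jaccard distance between two non-empty sets."""
    return 1.0 - len(a & b) / len(a | b)


def density_and_separation(members, users, cutoff=0.6):
    """Return {user: (local_density, separation_distance)} for one partition.

    local_density: number of other members closer than `cutoff`.
    separation_distance: distance to the nearest member ranked denser;
    the densest member gets its largest distance to any other member.
    """
    dist = {
        (u, v): jaccard_distance(users[u], users[v])
        for u in members for v in members if u != v
    }
    density = {
        u: sum(1 for v in members if v != u and dist[(u, v)] < cutoff)
        for u in members
    }
    # Ties in density are broken by user id so the ranking is a strict order.
    ranked = sorted(members, key=lambda u: (-density[u], u))
    result = {}
    for i, u in enumerate(ranked):
        denser = ranked[:i]
        if denser:
            sep = min(dist[(u, v)] for v in denser)
        else:
            # Assumption: a lone member is treated as maximally separated (1.0).
            sep = max((dist[(u, v)] for v in members if v != u), default=1.0)
        result[u] = (density[u], sep)
    return result


users = {
    "u1": {"a", "b", "c"},
    "u2": {"a", "b", "c"},
    "u3": {"a", "b", "d"},
    "u4": {"x", "y"},
}

# Score every user inside its own hash partition only.
scores = {}
for members in partition_users(users).values():
    scores.update(density_and_separation(members, users))
```

Comparing users only within a partition avoids computing all pairwise distances across the sparse dataset. In a density-peaks-style algorithm, users with both a high local density and a large separation distance are the usual candidates for cluster centers; that selection step is left out of this sketch.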

Published/Granted Literature

US11328002B2 Dynamic clustering of sparse data utilizing hash partitions. Publication/grant date: 2022-05-10

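IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F16/00	Information retrieval; database structures therefor; file system structures therefor
G06F16/20	. of structured data, e.g. relational data
G06F16/28	.. Databases characterised by their database models, e.g. relational or object models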