- Patent title: Identifying spam using near-duplicate detection for text and images
- Application number: US16510578
- Filing date: 2019-07-12
- Publication number: US11363064B2
- Publication date: 2022-06-14
- Inventor: Spandan Thakur
- Applicant: ADOBE INC.
- Applicant address: US CA San Jose
- Assignee: ADOBE INC.
- Current assignee: ADOBE INC.
- Current assignee address: US CA San Jose
- Agent: Shook, Hardy & Bacon L.L.P.
- Main classification: H04L9/30
- IPC classifications: H04L9/30 ; H04L29/06 ; H04L9/40 ; G06F16/14
Abstract:
Embodiments described herein provide systems, methods, and computer storage media for detecting spam by comparing hash values of content. In embodiments, hash values are generated based on the type of content and compared to other hash values in storage buckets. The similarity of content is determined by calculating the distance between two hash values and determining whether the distance exceeds a distance index. Counter values associated with hash values in storage are incremented when the distances between hash values exceed the distance index. Spam indications are communicated when the counter values associated with hash values exceed a count threshold.
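
The flow in the abstract (bucketed hash storage, a distance comparison, per-hash counters, and a count threshold) can be illustrated with a minimal Python sketch. This is not the patented implementation. The names `SpamDetector`, `observe`, `max_distance`, and `count_threshold` are hypothetical. The hash values are assumed to be fixed-width integers supplied by the caller, and Hamming distance is an assumed metric. The sketch also assumes that two hashes count as near-duplicates when their distance is within `max_distance`; the abstract phrases its comparison as the distance "exceeding" a distance index, so the direction of the test here is an interpretation, not a statement of the claims.

```python
from typing import Dict


def hamming_distance(a: int, b: int) -> int:
    """Number of bit positions in which two integer hashes differ."""
    return bin(a ^ b).count("1")


class SpamDetector:
    """Hypothetical sketch: per-content-type buckets of hash -> counter."""

    def __init__(self, max_distance: int = 3, count_threshold: int = 2) -> None:
        # Assumption: hashes within max_distance bits are near-duplicates.
        self.max_distance = max_distance
        # A spam indication is raised once a counter exceeds this value.
        self.count_threshold = count_threshold
        # One storage bucket per content type (for example "text", "image").
        self.buckets: Dict[str, Dict[int, int]] = {}

    def observe(self, content_type: str, hash_value: int) -> bool:
        """Record one hash and return True if it triggers a spam indication."""
        bucket = self.buckets.setdefault(content_type, {})
        matched = False
        spam = False
        for stored in bucket:
            if hamming_distance(stored, hash_value) <= self.max_distance:
                # Near-duplicate of a stored hash: bump that hash's counter.
                bucket[stored] += 1
                matched = True
                if bucket[stored] > self.count_threshold:
                    spam = True
        if not matched:
            # First sighting of this content: store it with a counter of 1.
            bucket[hash_value] = 1
        return spam


detector = SpamDetector(max_distance=2, count_threshold=2)
results = [
    detector.observe("text", 0b10110000),  # new hash, stored
    detector.observe("text", 0b10110001),  # 1 bit away, counter becomes 2
    detector.observe("text", 0b10110011),  # 2 bits away, counter becomes 3
]
print(results)
```

In this sketch, only the third submission raises a spam indication, because the stored hash's counter passes `count_threshold` at that point. Keeping a separate bucket per content type means a text hash is never compared against an image hash.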