发明授权
- 专利标题: Identifying terms
- 专利标题(中): 识别术语
-
申请号: US13459018申请日: 2012-04-27
-
公开(公告)号: US09123046B1公开(公告)日: 2015-09-01
- 发明人: Baris Yuksel , Ana Krulec
- 申请人: Baris Yuksel , Ana Krulec
- 申请人地址: US CA Mountain View
- 专利权人: Google Inc.
- 当前专利权人: Google Inc.
- 当前专利权人地址: US CA Mountain View
- 代理机构: Fish & Richardson P.C.
- 主分类号: G06Q10/10
- IPC分类号: G06Q10/10 ; G06Q30/02 ; G06Q10/06 ; G06Q30/06 ; G06Q10/08 ; G06Q30/00 ; G06N5/00
摘要:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, are described for identifying target terms, e.g., spam terms within a collection of documents. In one aspect, methods can include identifying spam terms by calculating a blacklist term frequency-inverse document frequency (BTF-IDF) score for multiple terms, and by selecting, as the spam terms, the terms that have scores above or below a threshold score. The multiple terms may be derived from documents that are associated with accounts that have been designated as spam accounts.
信息查询