-
公开(公告)号:US20170012912A1
公开(公告)日:2017-01-12
申请号:US15263238
申请日:2016-09-12
Applicant: Yahoo! Inc.
Inventor: Sharat Narayan , Vishwanath Tumkur Ramarao , Belle Tseng , Markus Weimer , Young Maeng , Jyh-Shin Shue
CPC classification number: H04L51/12 , G06F15/16 , G06Q10/107 , H04L51/046 , H04L61/2007 , H04L67/2866
Abstract: Embodiments are directed towards multi-level entity classification. An object associated with an entity is received. In one embodiment the object comprises and email and the entity comprises the IP address of a sending email server. If the entity has already been classified, as indicated by an entity classification cache, then a corresponding action is taken on the object. However, if the entity has not been classified, the entity is submitted to a fast classifier for classification. A feature collector concurrently fetches available features, including fast features and full features. The fast classifier classifies the entity based on the fast features, storing the result in the entity classification cache. Subsequent objects associated with the entity are processed based on the cached result of the fast classifier. Then, a full classifier classifies the entity based on at least the full features, storing the result in the entity classification cache.
Abstract translation: 实施例针对多级实体分类。 接收与实体相关联的对象。 在一个实施例中,对象包括和电子邮件,并且实体包括发送电子邮件服务器的IP地址。 如果实体已经被分类,如实体分类缓存所示,则对对象采取相应的动作。 但是,如果实体尚未分类,则将实体提交给快速分类器进行分类。 功能收集器同时提取可用功能,包括快速功能和完整功能。 快速分类器基于快速特征对实体进行分类,将结果存储在实体分类缓存中。 基于快速分类器的缓存结果来处理与实体相关联的后续对象。 然后,完整分类器至少基于全部特征对实体进行分类,将结果存储在实体分类缓存中。
-
公开(公告)号:US20180159808A1
公开(公告)日:2018-06-07
申请号:US15366443
申请日:2016-12-01
Applicant: Yahoo!, Inc.
Inventor: David Pal , Satyen Chandrakant Kale , Yongxin Xi , Ilambharathi Kanniah , Yuval Peduel , Zohar Shay Karnin , Jyh-Shin Shue
IPC: H04L12/58
Abstract: One or more computing devices, systems, and/or methods for message classification are provided. For example, a set of messages is clustered into a set of clusters. A cluster comprises messages with similar features (e.g., similar subject lines, message body content, sender information, recipient information, structure, user action such as reading or deleting, spam vote information, etc.). Cluster features are computed for the clusters based upon features of messages within such clusters. A first table, comprising cluster entries corresponding cluster features of clusters, and a second table, comprising message entries corresponding to clusters to which messages are assigned, are created. Message features of a message are created, using the first table and second table, based upon features of the message and cluster features of clusters to which the message is assigned. A message classifier is used to classify the message (e.g., spam, safe, a threat, etc.) based upon the message features.
-
公开(公告)号:US10594640B2
公开(公告)日:2020-03-17
申请号:US15366443
申请日:2016-12-01
Applicant: Yahoo!, Inc.
Inventor: David Pal , Satyen Chandrakant Kale , Yongxin Xi , Ilambharathi Kanniah , Yuval Peduel , Zohar Shay Karnin , Jyh-Shin Shue
IPC: H04L12/58
Abstract: One or more computing devices, systems, and/or methods for message classification are provided. For example, a set of messages is clustered into a set of clusters. A cluster comprises messages with similar features (e.g., similar subject lines, message body content, sender information, recipient information, structure, user action such as reading or deleting, spam vote information, etc.). Cluster features are computed for the clusters based upon features of messages within such clusters. A first table, comprising cluster entries corresponding cluster features of clusters, and a second table, comprising message entries corresponding to clusters to which messages are assigned, are created. Message features of a message are created, using the first table and second table, based upon features of the message and cluster features of clusters to which the message is assigned. A message classifier is used to classify the message (e.g., spam, safe, a threat, etc.) based upon the message features.
-
-