-
公开(公告)号:US20180159808A1
公开(公告)日:2018-06-07
申请号:US15366443
申请日:2016-12-01
Applicant: Yahoo!, Inc.
Inventor: David Pal , Satyen Chandrakant Kale , Yongxin Xi , Ilambharathi Kanniah , Yuval Peduel , Zohar Shay Karnin , Jyh-Shin Shue
IPC: H04L12/58
Abstract: One or more computing devices, systems, and/or methods for message classification are provided. For example, a set of messages is clustered into a set of clusters. A cluster comprises messages with similar features (e.g., similar subject lines, message body content, sender information, recipient information, structure, user action such as reading or deleting, spam vote information, etc.). Cluster features are computed for the clusters based upon features of messages within such clusters. A first table, comprising cluster entries corresponding cluster features of clusters, and a second table, comprising message entries corresponding to clusters to which messages are assigned, are created. Message features of a message are created, using the first table and second table, based upon features of the message and cluster features of clusters to which the message is assigned. A message classifier is used to classify the message (e.g., spam, safe, a threat, etc.) based upon the message features.
-
公开(公告)号:US10594640B2
公开(公告)日:2020-03-17
申请号:US15366443
申请日:2016-12-01
Applicant: Yahoo!, Inc.
Inventor: David Pal , Satyen Chandrakant Kale , Yongxin Xi , Ilambharathi Kanniah , Yuval Peduel , Zohar Shay Karnin , Jyh-Shin Shue
IPC: H04L12/58
Abstract: One or more computing devices, systems, and/or methods for message classification are provided. For example, a set of messages is clustered into a set of clusters. A cluster comprises messages with similar features (e.g., similar subject lines, message body content, sender information, recipient information, structure, user action such as reading or deleting, spam vote information, etc.). Cluster features are computed for the clusters based upon features of messages within such clusters. A first table, comprising cluster entries corresponding cluster features of clusters, and a second table, comprising message entries corresponding to clusters to which messages are assigned, are created. Message features of a message are created, using the first table and second table, based upon features of the message and cluster features of clusters to which the message is assigned. A message classifier is used to classify the message (e.g., spam, safe, a threat, etc.) based upon the message features.
-