Message recommendation using word isolation and clustering
    1.
    Granted invention patent
    Status: in force

    Publication number: US09245013B2

    Publication date: 2016-01-26

    Application number: US11927450

    Filing date: 2007-10-29

    IPC classification: G06F17/30

    Abstract: Network system provides a real-time adaptive recommendation set of documents with a high statistical measure of relevancy to the requestor device. The recommendation set is optimized based on analyzing text of documents of the interest set, categorizing these documents into clusters, extracting keywords representing the themes or concepts of documents in the clusters, and filtering a population of eligible documents accessible to the system utilizing site and/or Internet-wide search engines. The system is either automatically or manually invoked and it develops and presents the recommendation set in real time. The recommendation set may be presented as a greeting, notification, alert, HTML fragment, fax, voicemail, or automatic classification or routing of customer e-mail, personal e-mail, job postings, and offers for sale or exchange.
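
The pipeline named in this abstract (cluster the interest set, extract keywords per cluster, filter eligible documents) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the greedy cosine-similarity clustering, the term-frequency keyword choice, the stopword list, and all function names are hypothetical and are not taken from the patent.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (assumption, not from the patent).
STOPWORDS = {"the", "a", "an", "and", "or", "of", "to", "in", "for", "is"}

def tokenize(text):
    """Lowercase the text and keep alphabetic words that are not stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]

def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def cluster(docs, threshold=0.3):
    """Greedy single-pass clustering: a document joins the first cluster
    whose accumulated term counts are similar enough, else starts a new one."""
    clusters = []  # list of (centroid Counter, [document indices])
    for i, doc in enumerate(docs):
        vec = Counter(tokenize(doc))
        for centroid, members in clusters:
            if cosine(vec, centroid) >= threshold:
                centroid.update(vec)
                members.append(i)
                break
        else:
            clusters.append((vec, [i]))
    return clusters

def keywords(centroid, k=3):
    """Use the k most frequent terms of a cluster as its keywords."""
    return [word for word, _ in centroid.most_common(k)]

def recommend(interest_docs, candidates, k=3):
    """Keep candidates that share at least one cluster keyword,
    ordered by how many keywords they share."""
    kws = set()
    for centroid, _ in cluster(interest_docs):
        kws.update(keywords(centroid, k))
    scored = [(len(kws & set(tokenize(c))), c) for c in candidates]
    return [c for score, c in sorted(scored, key=lambda x: -x[0]) if score > 0]
```

A real system would query a search engine with the extracted keywords instead of scanning an in-memory candidate list; the list stands in for that step here.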

    Apparatus and methods for in-the-cloud identification of spam and/or malware
    2.
    Granted invention patent
    Status: in force

    Publication number: US08925087B1

    Publication date: 2014-12-30

    Application number: US12487959

    Filing date: 2009-06-19

    Abstract: One embodiment relates to an apparatus for in-the-cloud identification of spam and/or malware. The apparatus includes computer-readable code configured to be executed by the processor so as to receive queries, the queries including hash values embedded therein. The apparatus further includes computer-readable code configured to be executed by the processor so as to detect a group of hash codes which are similar and to identify the group as corresponding to an undesirable network outbreak. Another embodiment relates to an apparatus for in-the-cloud detection of spam and/or malware. The apparatus includes computer-readable code configured to be executed by the processor so as to receive an electronic message, calculate a locality-sensitive hash based on the message, embed the locality-sensitive hash into a query, and send the query to a central analysis system via a network interface. Other embodiments, aspects and features are also disclosed.
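
To make the two roles in this abstract concrete, here is a hedged sketch of a client-side locality-sensitive hash and a server-side grouping of similar hashes. The abstract does not say which hash is used; SimHash over word tokens is swapped in here as one well-known locality-sensitive scheme, and the function names, the Hamming-distance threshold, and the minimum group size are all illustrative assumptions.

```python
import hashlib
import re

def simhash(text, bits=64):
    """SimHash of a message: similar texts tend to yield hashes that
    differ in few bit positions (a stand-in for the unspecified LSH)."""
    weights = [0] * bits
    for token in re.findall(r"\w+", text.lower()):
        # Take 64 bits of an MD5 digest as the per-token hash.
        h = int.from_bytes(hashlib.md5(token.encode("utf-8")).digest()[:8], "big")
        for i in range(bits):
            weights[i] += 1 if (h >> i) & 1 else -1
    return sum(1 << i for i in range(bits) if weights[i] > 0)

def hamming(a, b):
    """Number of bit positions in which two hashes differ."""
    return bin(a ^ b).count("1")

def find_outbreaks(hashes, max_distance=8, min_group=3):
    """Server side: greedily group received hashes that lie within
    max_distance bits of a group's first member, and report the groups
    large enough to suggest an outbreak."""
    groups = []
    for h in hashes:
        for group in groups:
            if hamming(h, group[0]) <= max_distance:
                group.append(h)
                break
        else:
            groups.append([h])
    return [g for g in groups if len(g) >= min_group]
```

In this sketch a client would call `simhash(message_body)` and embed the resulting integer in its query; how the query is encoded and transported is left out because the abstract does not specify it.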

    Methods and apparatus for detecting botnet attacks
    3.
    Granted invention patent
    Status: in force

    Publication number: US08612523B1

    Publication date: 2013-12-17

    Application number: US11805464

    Filing date: 2007-05-22

    IPC classification: G06F15/16

    CPC classification: H04L63/1458 H04L63/1408

    Abstract: Botnet attacks may be detected by collecting samples of spam messages, forming clusters of related spam messages, and identifying the source or sources of the related spam messages. The related spam messages may be identified as those generated using the same template. For example, spam messages generated using the same image template, text template, or both may be deemed as related. To find related spam messages, images of spam messages may be extracted and compressed using a lossy compression algorithm. The compressed images may then be compared to one another to identify those generated using the same image template. The lossy compression algorithm may involve dividing an image into several blocks and then computing a value for each block for comparison.
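
The block-based comparison described in this abstract can be sketched as follows. The abstract only says that a value is computed per block; using the mean grayscale intensity as that value, representing an image as a list of pixel rows, and the tolerance and grid parameters are assumptions made for illustration, as are all function names.

```python
def block_signature(image, grid=2):
    """Lossy signature: split a grayscale image (list of rows of 0-255 ints)
    into grid x grid blocks and keep only each block's mean intensity.
    Assumes the image is at least grid pixels wide and high."""
    height, width = len(image), len(image[0])
    signature = []
    for by in range(grid):
        for bx in range(grid):
            rows = range(by * height // grid, (by + 1) * height // grid)
            cols = range(bx * width // grid, (bx + 1) * width // grid)
            pixels = [image[y][x] for y in rows for x in cols]
            signature.append(sum(pixels) // len(pixels))
    return signature

def same_template(sig_a, sig_b, tolerance=10):
    """Treat two images as generated from the same template when every
    block value differs by at most `tolerance`."""
    return all(abs(a - b) <= tolerance for a, b in zip(sig_a, sig_b))

def cluster_by_template(samples, grid=2, tolerance=10):
    """Group (source, image) spam samples by template and collect the
    sources seen for each group."""
    clusters = []  # list of (reference signature, set of sources)
    for source, image in samples:
        sig = block_signature(image, grid)
        for ref, sources in clusters:
            if same_template(ref, sig, tolerance):
                sources.add(source)
                break
        else:
            clusters.append((sig, {source}))
    return clusters
```

A cluster whose source set is large is the kind of signal the abstract associates with a botnet; the threshold for "large" is left to the caller.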

    Adversarial approach for identifying inappropriate text content in images
    4.
    Granted invention patent
    Status: in force

    Publication number: US08098939B2

    Publication date: 2012-01-17

    Application number: US11803963

    Filing date: 2007-05-16

    IPC classification: G06K9/72

    Abstract: An adversarial approach in detecting inappropriate text content in images. An expression from a listing of expressions may be selected. The listing of expressions may include words, phrases, or other textual content indicative of a particular type of message. Using the selected expression as a reference, the image is searched for a section that could be similar to the selected expression. The similarity between the selected expression and the section of the image may be in terms of shape. The section may be scored against the selected expression to determine how well the selected expression matches the section. The score may be used to determine whether or not the selected expression is present in the image.
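
One way to picture scoring an image section against a selected expression "in terms of shape" is a sliding-window comparison of binary bitmaps. This is only a sketch under assumptions: the expression is assumed to have been rendered to a small 0/1 bitmap beforehand, the score is the fraction of agreeing pixels, and the threshold and function names are hypothetical rather than taken from the patent.

```python
def score_section(image, template, top, left):
    """Fraction of pixels in the image section at (top, left) that agree
    with the rendered expression bitmap (both are rows of 0/1 values)."""
    t_height, t_width = len(template), len(template[0])
    matches = sum(
        image[top + y][left + x] == template[y][x]
        for y in range(t_height)
        for x in range(t_width)
    )
    return matches / (t_height * t_width)

def find_expression(image, template, threshold=0.9):
    """Slide the template over the image, keep the best-scoring section,
    and decide presence by comparing that score to the threshold.
    Returns (found, best_score, (top, left) of the best section)."""
    t_height, t_width = len(template), len(template[0])
    best_score, best_pos = 0.0, None
    for top in range(len(image) - t_height + 1):
        for left in range(len(image[0]) - t_width + 1):
            score = score_section(image, template, top, left)
            if score > best_score:
                best_score, best_pos = score, (top, left)
    return best_score >= threshold, best_score, best_pos
```

Repeating `find_expression` for each entry in the listing of expressions gives a per-expression verdict for the image.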

    System and method for adaptive text recommendation
    5.
    Invention patent application
    Status: in force

    Publication number: US20090089272A1

    Publication date: 2009-04-02

    Application number: US11003920

    Filing date: 2004-12-03

    IPC classification: G06F7/06 G06F17/30

    Abstract: Network system provides a real-time adaptive recommendation set of documents with a high statistical measure of relevancy to the requestor device. The recommendation set is optimized based on analyzing the text of documents of the interest set, categorizing these documents into clusters, extracting keywords representing the themes or concepts of documents in the clusters, and filtering a population of eligible documents accessible to the system utilizing site and/or Internet-wide search engines. The system is either automatically or manually invoked and it develops and presents the recommendation set in real time; for example, upon logging onto a web site or as the client views additional documents or pages of a website. The recommendation set may be presented as a greeting, notification, alert, HTML fragment, fax, voicemail, or automatic classification or routing of customer e-mail, personal e-mail, job postings, and offers for sale or exchange.

    Pure adversarial approach for identifying text content in images
    7.
    Invention patent application
    Status: in force

    Publication number: US20080131006A1

    Publication date: 2008-06-05

    Application number: US11893921

    Filing date: 2007-08-16

    IPC classification: G06K9/72

    Abstract: A pure adversarial optical character recognition (OCR) approach in identifying text content in images. An image and a search term are input to a pure adversarial OCR module, which searches the image for presence of the search term. The image may be extracted from an email by an email processing engine. The OCR module may split the image into several character-blocks that each has a reasonable probability of containing a character (e.g., an ASCII character). The OCR module may form a sequence of blocks that represent a candidate match to the search term and calculate the similarity of the candidate sequence to the search term. The OCR module may be configured to output whether or not the search term is found in the image and, if applicable, the location of the search term in the image.
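
The search over character-blocks described here can be sketched once the image has already been split into blocks. In this illustration each block is assumed to carry a mapping from candidate characters to probabilities, similarity is taken as the mean probability of the term's characters over a run of consecutive blocks, and the location is reported as a block index; these representations, the threshold, and the function names are assumptions and do not come from the patent.

```python
def sequence_similarity(blocks, term):
    """Mean probability that each block in the candidate sequence holds
    the corresponding character of the search term."""
    return sum(block.get(ch, 0.0) for block, ch in zip(blocks, term)) / len(term)

def search_term(blocks, term, threshold=0.6):
    """Try every run of consecutive character-blocks as a candidate match
    and report (found, index of the first block of the best match)."""
    best_score, best_index = 0.0, None
    for start in range(len(blocks) - len(term) + 1):
        candidate = blocks[start:start + len(term)]
        score = sequence_similarity(candidate, term)
        if score > best_score:
            best_score, best_index = score, start
    found = best_score >= threshold
    return found, (best_index if found else None)
```

Because the term drives the search, the sketch never commits to a single transcription of the image; it only asks how plausible the given term is at each position.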

    Adversarial approach for identifying inappropriate text content in images
    8.
    Invention patent application
    Status: in force

    Publication number: US20080131005A1

    Publication date: 2008-06-05

    Application number: US11803963

    Filing date: 2007-05-16

    IPC classification: G06K9/72

    Abstract: An adversarial approach in detecting inappropriate text content in images. An expression from a listing of expressions may be selected. The listing of expressions may include words, phrases, or other textual content indicative of a particular type of message. Using the selected expression as a reference, the image is searched for a section that could be similar to the selected expression. The similarity between the selected expression and the section of the image may be in terms of shape. The section may be scored against the selected expression to determine how well the selected expression matches the section. The score may be used to determine whether or not the selected expression is present in the image.

    System and method for adaptive text recommendation
    9.
    Granted invention patent
    Status: in force

    Publication number: US06845374B1

    Publication date: 2005-01-18

    Application number: US09723855

    Filing date: 2000-11-27

    IPC classification: G06F17/30

    Abstract: Network system provides a real-time adaptive recommendation set of documents with a high statistical measure of relevancy to the requestor device. The recommendation set is optimized based on analyzing text of documents of the interest set, categorizing these documents into clusters, extracting keywords representing the themes or concepts of documents in the clusters, and filtering a population of eligible documents accessible to the system utilizing site and/or Internet-wide search engines. The system is either automatically or manually invoked and it develops and presents the recommendation set in real time. The recommendation set may be presented as a greeting, notification, alert, HTML fragment, fax, voicemail, or automatic classification or routing of customer e-mail, personal e-mail, job postings, and offers for sale or exchange.
