Web forum crawling using skeletal links
    1.
    发明授权
    Web forum crawling using skeletal links 有权
    使用骨架链接的网页论坛抓取

    公开(公告)号:US08099408B2

    公开(公告)日:2012-01-17

    申请号:US12163895

    申请日:2008-06-27

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/30864

    摘要: A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.

    摘要翻译: 提供了一种用于识别用于爬行网站的网站的信息链接的方法和系统。 论坛搜寻器分析网页论坛的示例网页,以识别信息链接,然后通过确定为信息而不是遵循其他链接的链接抓取网页论坛。 论坛搜寻器系统基于它们是网站的整体结构的一部分还是用于选择分割到多个网页上的顺序信息来确定链接是否具有信息性。

    Forum web page clustering based on repetitive regions
    2.
    发明授权
    Forum web page clustering based on repetitive regions 有权
    基于重复区域的论坛网页聚类

    公开(公告)号:US08051083B2

    公开(公告)日:2011-11-01

    申请号:US12103712

    申请日:2008-04-16

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06Q10/10

    摘要: Described is a technology by which forum web pages are processed into clusters for classification purposes, including by determining repetitive regions between pages and associating pages that have similar repetitive regions into a common cluster. Patterns corresponding to the regions are determined, and a feature set based at least in part on those patterns (e.g., pattern frequency) is extracted from the page. The feature set of a page is compared against the feature set of another page to determine similarity therewith, e.g., via a feature space distance computation that is evaluated against a threshold distance.

    摘要翻译: 描述了一种技术,通过该技术将论坛网页处理成用于分类目的的群集,包括通过确定页面之间的重复区域并将具有相似重复区域的页面关联到公共群集中。 确定与区域对应的模式,并且至少部分地基于那些模式(例如,模式频率)从页面提取特征集。 将页面的特征集合与另一页面的特征集进行比较以确定其相似性,例如通过针对阈值距离评估的特征空间距离计算。

    Web forum crawling using skeletal links
    3.
    发明授权
    Web forum crawling using skeletal links 有权
    使用骨架链接的网页论坛抓取

    公开(公告)号:US08700600B2

    公开(公告)日:2014-04-15

    申请号:US13351952

    申请日:2012-01-17

    IPC分类号: G06F7/20 G06F17/30

    CPC分类号: G06F17/30864

    摘要: A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.

    摘要翻译: 提供了一种用于识别用于爬行网站的网站的信息链接的方法和系统。 论坛搜寻器分析网页论坛的示例网页,以识别信息链接,然后通过确定为信息而不是遵循其他链接的链接抓取网页论坛。 论坛搜寻器系统基于它们是网站的整体结构的一部分还是用于选择分割到多个网页上的顺序信息来确定链接是否具有信息性。

    WEB FORUM CRAWLING USING SKELETAL LINKS
    4.
    发明申请
    WEB FORUM CRAWLING USING SKELETAL LINKS 有权
    使用SKELETAL链接的WEB FORUM CRAWLING

    公开(公告)号:US20090327237A1

    公开(公告)日:2009-12-31

    申请号:US12163895

    申请日:2008-06-27

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/30864

    摘要: A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.

    摘要翻译: 提供了一种用于识别用于爬行网站的网站的信息链接的方法和系统。 论坛搜寻器分析网页论坛的示例网页,以识别信息链接,然后通过确定为信息而不是遵循其他链接的链接抓取网页论坛。 论坛搜寻器系统基于它们是网站的整体结构的一部分还是用于选择分割到多个网页上的顺序信息来确定链接是否具有信息性。

    WEB FORUM CRAWLING USING SKELETAL LINKS
    5.
    发明申请
    WEB FORUM CRAWLING USING SKELETAL LINKS 有权
    使用SKELETAL链接的WEB FORUM CRAWLING

    公开(公告)号:US20120117052A1

    公开(公告)日:2012-05-10

    申请号:US13351952

    申请日:2012-01-17

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864

    摘要: A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.

    摘要翻译: 提供了一种用于识别用于爬行网站的网站的信息链接的方法和系统。 论坛搜寻器分析网页论坛的示例网页,以识别信息链接,然后通过确定为信息而不是遵循其他链接的链接抓取网页论坛。 论坛搜寻器系统基于它们是网站的整体结构的一部分还是用于选择分割到多个网页上的顺序信息来确定链接是否具有信息性。

    FORUM WEB PAGE CLUSTERING BASED ON REPETITIVE REGIONS
    6.
    发明申请
    FORUM WEB PAGE CLUSTERING BASED ON REPETITIVE REGIONS 有权
    基于重复区域的论坛网页聚类

    公开(公告)号:US20090265363A1

    公开(公告)日:2009-10-22

    申请号:US12103712

    申请日:2008-04-16

    IPC分类号: G06F17/30

    CPC分类号: G06Q10/10

    摘要: Described is a technology by which forum web pages are processed into clusters for classification purposes, including by determining repetitive regions between pages and associating pages that have similar repetitive regions into a common cluster. Patterns corresponding to the regions are determined, and a feature set based at least in part on those patterns (e.g., pattern frequency) is extracted from the page. The feature set of a page is compared against the feature set of another page to determine similarity therewith, e.g., via a feature space distance computation that is evaluated against a threshold distance.

    摘要翻译: 描述了一种技术,通过该技术将论坛网页处理成用于分类目的的群集,包括通过确定页面之间的重复区域并将具有相似重复区域的页面关联到公共群集中。 确定与区域对应的模式,并且至少部分地基于那些模式(例如,模式频率)从页面提取特征集。 将页面的特征集合与另一页面的特征集进行比较以确定其相似性,例如通过针对阈值距离评估的特征空间距离计算。

    Automatically inserting advertisements into source video content playback streams
    7.
    发明授权
    Automatically inserting advertisements into source video content playback streams 有权
    自动将广告插入源视频内容回放流

    公开(公告)号:US09554093B2

    公开(公告)日:2017-01-24

    申请号:US11626251

    申请日:2007-01-23

    摘要: Systems and methods for automatically inserting advertisements into source video content playback streams are described. In one aspect, the systems and methods communicate a source video content playback stream to a video player to present source video to a user. During playback of the source video, and in response to receipt of a request from the user to navigate portions of the source video (e.g., a user command to fast forward the source video, rewind the source video, or other action), the systems and methods dynamically define a video advertisement clip insertion point (e.g., and insertion point based on a current playback position). The systems and methods then insert a contextually relevant and/or targeted video advertisement clip into the playback stream for presentation to the user.

    摘要翻译: 描述了将广告自动插入到源视频内容回放流中的系统和方法。 在一个方面,系统和方法将源视频内容播放流传送给视频播放器,以向用户呈现源视频。 在播放源视频期间,并且响应于接收到来自用户的请求以导航源视频的部分(例如,用户命令来快速转发源视频,倒带源视频或其他动作),系统 并且方法动态地定义视频广告剪辑插入点(例如,基于当前播放位置的插入点)。 然后,系统和方法将上下文相关和/或目标视频广告剪辑插入到播放流中以呈现给用户。

    Automatically Inserting Advertisements into Source Video Content Playback Streams
    8.
    发明申请
    Automatically Inserting Advertisements into Source Video Content Playback Streams 有权
    自动将广告插入源视频内容播放流

    公开(公告)号:US20070204310A1

    公开(公告)日:2007-08-30

    申请号:US11626251

    申请日:2007-01-23

    IPC分类号: H04N7/173

    摘要: Systems and methods for automatically inserting advertisements into source video content playback streams are described. In one aspect, the systems and methods communicate a source video content playback stream to a video player to present source video to a user. During playback of the source video, and in response to receipt of a request from the user to navigate portions of the source video (e.g., a user command to fast forward the source video, rewind the source video, or other action), the systems and methods dynamically define a video advertisement clip insertion point (e.g., and insertion point based on a current playback position). The systems and methods then insert a contextually relevant and/or targeted video advertisement clip into the playback stream for presentation to the user.

    摘要翻译: 描述了将广告自动插入到源视频内容回放流中的系统和方法。 在一个方面,系统和方法将源视频内容播放流传送给视频播放器,以向用户呈现源视频。 在播放源视频期间,并且响应于接收到来自用户的请求以导航源视频的部分(例如,用户命令来快速转发源视频,倒带源视频或其他动作),系统 并且方法动态地定义视频广告剪辑插入点(例如,基于当前播放位置的插入点)。 然后,系统和方法将上下文相关和/或目标视频广告剪辑插入到播放流中以呈现给用户。