发明授权
- 专利标题: Web forum crawling using skeletal links
- 专利标题(中): 使用骨架链接的网页论坛抓取
-
申请号: US13351952申请日: 2012-01-17
-
公开(公告)号: US08700600B2公开(公告)日: 2014-04-15
- 发明人: Lei Zhang , Wei-Ying Ma , Wei Lai , Jiangming Yang , Rui Cai
- 申请人: Lei Zhang , Wei-Ying Ma , Wei Lai , Jiangming Yang , Rui Cai
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 代理商 Carole Boelitz; Micky Minhas
- 主分类号: G06F7/20
- IPC分类号: G06F7/20 ; G06F17/30
摘要:
A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.
公开/授权文献
- US20120117052A1 WEB FORUM CRAWLING USING SKELETAL LINKS 公开/授权日:2012-05-10
信息查询