发明授权
- 专利标题: Configuring web crawler to extract web page information
- 专利标题(中): 配置网页抓取工具来提取网页信息
-
申请号: US14081105申请日: 2013-11-15
-
公开(公告)号: US09015144B2公开(公告)日: 2015-04-21
- 发明人: Yiming Sun , Qi Qiang , Boyang Cai , Xiaojun Jin , Zongyuan Wu
- 申请人: Alibaba Group Holding Limited
- 申请人地址: KY George Town, Grand Cayman
- 专利权人: Alibaba Group Holding Limited
- 当前专利权人: Alibaba Group Holding Limited
- 当前专利权人地址: KY George Town, Grand Cayman
- 代理机构: Van Pelt, Yi & James LLP
- 优先权: CN201110207897 20110722
- 主分类号: G06F17/30
- IPC分类号: G06F17/30 ; G06F7/00 ; G06Q30/02 ; G06Q10/10

摘要:
Web crawling configuration includes: obtaining a webpage comprising a plurality of receiving a user selection of a node in the webpage; presenting a set of web crawling configuration options pertaining to a web crawling action to be performed with respect to the node, the set of web crawling configuration options depending at least in part on a type of an element included in the node and comprising: a first option to perform a first web crawling action in the event that the node include a first type of the element; and a second option to perform a second web crawling action in the event that the node includes a second type of the element; receiving a user input specifying the web crawling configuration option; and storing user specified web crawling configuration option, performing the web crawling action on the node according to the user input, or both.
公开/授权文献
- US20140129541A1 CONFIGURING WEB CRAWLER TO EXTRACT WEB PAGE INFORMATION 公开/授权日:2014-05-08
信息查询