-
公开(公告)号:US09984130B2
公开(公告)日:2018-05-29
申请号:US14521206
申请日:2014-10-22
Applicant: GOOGLE INC.
Inventor: Hui Xu , Rupesh Kapoor , Erik Arjan Hendriks , Hao Fang , Cristian Tapus
CPC classification number: G06F17/3053 , G06F17/211 , G06F17/2229 , G06F17/2247 , G06F17/248 , G06F17/30887 , G06F17/30899
Abstract: Implementations include a batch-optimized render and fetch architecture. An example method performed by the architecture includes receiving a request from a batch process to render a web page and initializing a virtual clock and a task list for rendering the web page. The virtual clock stands still when a request for an embedded item is outstanding and when a task is ready to run. The method may also include generating a rendering result for the web page when the virtual clock matches a run time for a stop task in the task list, and providing the rendering result to the batch process. Another example method includes receiving a request from a batch process to render a web page, identifying an embedded item in the web page, and determining, based on a rewrite rule, that the embedded item has content that is duplicative of content for a previously fetched embedded item.
-
公开(公告)号:US20150287047A1
公开(公告)日:2015-10-08
申请号:US13921440
申请日:2013-06-19
Applicant: Google Inc.
Inventor: Jifeng Situ , Yihua Wu , Kun Fang , Hui Xu , Erik Arjan Hendriks , Changxun Wu , Neha Sugandh , Jianning Ding
IPC: G06Q30/02
CPC classification number: G06Q30/0201
Abstract: Provided is a process of extracting structured chain-store data from chain-store websites, the process including: identifying, via a processor, a store-locator webpage from a store website; querying the store-locator webpage for store locations in a geographic area; detecting a repeating pattern in a document object model (DOM) of a responsive webpage returned by the store website, the repeating pattern containing location information for stores in the geographic area; extracting, from the repeating pattern, location information for the stores in the geographic area; and storing the location information in a business listing repository.
Abstract translation: 提供从连锁店网站提取结构化连锁店数据的过程,其过程包括:通过处理器从商店网站识别商店定位器网页; 在地理区域中查询商店定位器网页上的商店位置; 检测由所述商店网站返回的响应网页的文档对象模型(DOM)中的重复模式,所述重复模式包含用于在所述地理区域中存储的位置信息; 从所述重复模式中提取所述地理区域中的商店的位置信息; 并将所述位置信息存储在商业信息库中。
-
公开(公告)号:US20150379014A1
公开(公告)日:2015-12-31
申请号:US14521206
申请日:2014-10-22
Applicant: GOOGLE INC.
Inventor: Hui Xu , Rupesh Kapoor , Erik Arjan Hendriks , Hao Fang , Cristian Tapus
CPC classification number: G06F17/3053 , G06F17/211 , G06F17/2229 , G06F17/2247 , G06F17/248 , G06F17/30887 , G06F17/30899
Abstract: Implementations include a batch-optimized render and fetch architecture. An example method performed by the architecture includes receiving a request from a batch process to render a web page and initializing a virtual clock and a task list for rendering the web page. The virtual clock stands still when a request for an embedded item is outstanding and when a task is ready to run. The method may also include generating a rendering result for the web page when the virtual clock matches a run time for a stop task in the task list, and providing the rendering result to the batch process. Another example method includes receiving a request from a batch process to render a web page, identifying an embedded item in the web page, and determining, based on a rewrite rule, that the embedded item has content that is duplicative of content for a previously fetched embedded item.
Abstract translation: 实现包括批量优化的渲染和提取架构。 由该架构执行的示例性方法包括从批处理接收请求以呈现网页并初始化虚拟时钟以及用于呈现网页的任务列表。 当嵌入式项目的请求未完成,任务准备运行时,虚拟时钟仍然停留。 该方法还可以包括当虚拟时钟与任务列表中的停止任务的运行时间匹配时,为网页生成呈现结果,以及将渲染结果提供给批处理。 另一示例性方法包括从批处理接收呈现网页的请求,识别网页中的嵌入项目,以及基于重写规则确定嵌入项目具有与之前提取的内容重复的内容 嵌入项目。
-
-