- Patent title: Distributed graph processing system that adopts a faster data loading technique that requires low degree of communication
- Application number: US15607985
- Filing date: 2017-05-30
- Publication number: US10534657B2
- Publication date: 2020-01-14
- Inventors: Siegfried Depner, Sungpack Hong, Thomas Manhardt, Jinsu Lee, Nicholas Roth, Hassan Chafi
- Applicant: Oracle International Corporation
- Applicant address: US CA Redwood Shores
- Patentee: Oracle International Corporation
- Current patentee: Oracle International Corporation
- Current patentee address: US CA Redwood Shores
- Agency: Hickman Palermo Becker Bingham LLP
- Agent: Brian N. Miller
- Main classification: G06F15/16
- IPC classification: G06F15/16; G06F11/00
Abstract:
Techniques minimize communication while loading a graph. In a distributed embodiment, each computer loads some edges of the graph. Each edge connects a source vertex (SV) to a destination vertex. For each SV of its edges, the computer hashes the SV to detect a tracking computer (TrC) that tracks which computer the SV resides on. Each computer informs the TrC that the SV originates an edge that resides on that computer. For each SV, the TrC detects whether the SV originates edges that reside on multiple providing computers (PCs). If so, the TrC selects a target computer (TaC) from the multiple PCs to host the SV, and instructs each PC, excluding the TaC, to transfer the SV and the edges connected to it to the TaC. A vertex's internal identifier indicates which computer hosts the vertex. The TrC maintains a mapping between external and internal identifiers.
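
The loading flow described in the abstract can be sketched as a single-process simulation. This is only an illustrative sketch, not the patented implementation: the function names (`tracker_of`, `load_graph`), the CRC32 hash, the "most edges wins" rule for choosing the target computer, and the `(computer, local index)` form of the internal identifier are all assumptions made for the example. Vertices that appear only as destinations are not handled here.

```python
import zlib
from collections import defaultdict


def tracker_of(vertex: str, n: int) -> int:
    """Hash an external vertex ID to the index of its tracking computer (TrC).

    Assumption: CRC32 modulo the number of computers; the abstract only says
    that the source vertex is hashed.
    """
    return zlib.crc32(vertex.encode("utf-8")) % n


def load_graph(partitions):
    """Simulate the loading protocol.

    partitions[i] is the list of (source, destination) edges that computer i
    loaded. Returns (hosted, id_maps, transferred):
      hosted[i]    edges that end up on computer i
      id_maps[t]   external ID -> internal ID mapping kept by tracker t
      transferred  number of edges that had to move between computers
    """
    n = len(partitions)

    # Step 1: every providing computer (PC) tells the TrC of each source
    # vertex how many of that vertex's edges it holds.
    reports = [defaultdict(dict) for _ in range(n)]
    for pc, edges in enumerate(partitions):
        counts = defaultdict(int)
        for src, _dst in edges:
            counts[src] += 1
        for src, count in counts.items():
            reports[tracker_of(src, n)][src][pc] = count

    # Step 2: each TrC selects a target computer (TaC) among the PCs.
    # Assumption: pick the PC holding the most edges (ties go to the lowest
    # index), so the fewest edges move. The internal ID is modeled as
    # (hosting computer, local index), so it reveals the host directly.
    id_maps = [dict() for _ in range(n)]
    next_local = [0] * n
    target_of = {}
    for trc in range(n):
        for src, providers in reports[trc].items():
            tac = max(providers, key=lambda pc: (providers[pc], -pc))
            target_of[src] = tac
            id_maps[trc][src] = (tac, next_local[tac])
            next_local[tac] += 1

    # Step 3: every PC other than the TaC transfers the vertex's edges to it.
    hosted = [[] for _ in range(n)]
    transferred = 0
    for pc, edges in enumerate(partitions):
        for edge in edges:
            tac = target_of[edge[0]]
            hosted[tac].append(edge)
            if tac != pc:
                transferred += 1
    return hosted, id_maps, transferred


if __name__ == "__main__":
    parts = [[("a", "b"), ("a", "c")], [("a", "d"), ("b", "a")]]
    hosted, id_maps, moved = load_graph(parts)
    print(hosted, moved)
```

In this sketch only per-vertex counts travel to the tracker in the first step, and edges move only when a source vertex is split across computers, which is the communication saving the abstract describes.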