Recovery strategy for a stream processing system

    Publication No.: US10606711B2

    Publication Date: 2020-03-31

    Application No.: US15954014

    Application Date: 2018-04-16

    Abstract: The technology disclosed relates to discovering multiple previously unknown and undetected technical problems in fault tolerance and data recovery mechanisms of modern stream processing systems. In addition, it relates to providing technical solutions to these previously unknown and undetected problems. In particular, the technology disclosed relates to discovering the problem of modification of batch size of a given batch during its replay after a processing failure. This problem results in over-count when the input during replay is not a superset of the input fed at the original play. Further, the technology disclosed discovers the problem of inaccurate counter updates in replay schemes of modern stream processing systems when one or more keys disappear between a batch's first play and its replay. This problem is exacerbated when data in batches is merged or mapped with data from an external data store.
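The over-count problem described above can be illustrated with a minimal Python sketch (the names `ReplaySafeCounter`, `play`, and `totals` are hypothetical, not from the patent), assuming each batch carries a stable batch identifier so that a replay can overwrite, rather than add to, that batch's earlier contribution:

```python
from collections import defaultdict

class ReplaySafeCounter:
    """Sketch: per-key counters that stay accurate when a batch is
    replayed with a modified composition after a processing failure.

    A naive global increment over-counts when the replayed input is not
    a superset of the input fed at the original play. Here, each batch's
    contribution is recorded separately and overwritten on replay."""

    def __init__(self):
        self._per_batch = {}  # batch_id -> {key: count}

    def play(self, batch_id, records):
        contribution = defaultdict(int)
        for key in records:
            contribution[key] += 1
        # Replay overwrites the earlier contribution instead of adding
        # to it, so keys that disappeared between a batch's first play
        # and its replay no longer inflate the totals.
        self._per_batch[batch_id] = dict(contribution)

    def totals(self):
        totals = defaultdict(int)
        for contribution in self._per_batch.values():
            for key, count in contribution.items():
                totals[key] += count
        return dict(totals)
```

For example, if batch 7 first plays with keys `["a", "a", "b"]` and, after a failure, replays with only `["a"]`, the totals reflect the replay rather than double-counting both plays.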

    Stream Processing Task Deployment Using Precompiled Libraries

    Publication No.: US20190250947A1

    Publication Date: 2019-08-15

    Application No.: US16396522

    Application Date: 2019-04-26

    Abstract: The technology disclosed provides a novel and innovative technique for compact deployment of application code to stream processing systems. In particular, the technology disclosed relates to obviating the need of accompanying application code with its dependencies during deployment (i.e., creating fat jars) by operating a stream processing system within a container defined over worker nodes of whole machines and initializing the worker nodes with precompiled dependency libraries having precompiled classes. Accordingly, the application code is deployed to the container without its dependencies, and, once deployed, the application code is linked with the locally stored precompiled dependencies at runtime. In implementations, the application code is deployed to the container running the stream processing system between 300 milliseconds and 6 seconds. This is drastically faster than existing deployment techniques that take anywhere between 5 to 15 minutes for deployment.
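The thin-deployment idea can be sketched as follows; this is a simplified Python analogy (the names `WorkerContainer`, `deploy`, and the dictionary-based "app code" are hypothetical), assuming dependency libraries are installed once when the container is provisioned and application code references them by name at runtime:

```python
class WorkerContainer:
    """Sketch: a container whose worker nodes are initialized with
    precompiled dependency libraries, so application code ships without
    its dependencies and is linked locally at runtime."""

    def __init__(self, precompiled_libs):
        # Installed once at container setup, not on every deploy.
        self._libs = dict(precompiled_libs)

    def deploy(self, app_code):
        # Only the application code crosses the wire; its dependency
        # references resolve against the local precompiled libraries.
        def run(*args):
            deps = {name: self._libs[name] for name in app_code["requires"]}
            return app_code["main"](deps, *args)
        return run

# A stand-in "precompiled library" and a thin application deployment:
libs = {"json_codec": {"encode": repr}}
container = WorkerContainer(libs)
app = {"requires": ["json_codec"],
       "main": lambda deps, x: deps["json_codec"]["encode"](x)}
task = container.deploy(app)
```

Because the deployment payload excludes dependencies, it stays small, which is the mechanism behind the sub-second deployment times the abstract claims.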

    Providing strong ordering in multi-stage streaming processing

    Publication No.: US10191768B2

    Publication Date: 2019-01-29

    Application No.: US14986365

    Application Date: 2015-12-31

    Abstract: The technology disclosed relates to providing strong ordering in multi-stage processing of near real-time (NRT) data streams. In particular, it relates to maintaining current batch-stage information for a batch at a grid-scheduler in communication with a grid-coordinator that controls dispatch of batch-units to the physical threads for a batch-stage. This includes operating a computing grid, and queuing data from the NRT data streams as batches in pipelines for processing over multiple stages in the computing grid. Also included is determining, for a current batch-stage, batch-units pending dispatch, in response to receiving the current batch-stage information; identifying physical threads that processed batch-units for a previous batch-stage on which the current batch-stage depends and have registered pending tasks for the current batch-stage; and dispatching the batch-units for the current batch-stage to the identified physical threads subsequent to complete processing of the batch-units for the previous batch-stage.

    PROVIDING STRONG ORDERING IN MULTI-STAGE STREAMING PROCESSING (invention application, pending-published)

    Publication No.: US20170075721A1

    Publication Date: 2017-03-16

    Application No.: US14986365

    Application Date: 2015-12-31

    CPC classification number: G06F9/4881 G06F9/466 G06F2209/484 G06F2209/485

    Abstract: The technology disclosed relates to providing strong ordering in multi-stage processing of near real-time (NRT) data streams. In particular, it relates to maintaining current batch-stage information for a batch at a grid-scheduler in communication with a grid-coordinator that controls dispatch of batch-units to the physical threads for a batch-stage. This includes operating a computing grid, and queuing data from the NRT data streams as batches in pipelines for processing over multiple stages in the computing grid. Also included is determining, for a current batch-stage, batch-units pending dispatch, in response to receiving the current batch-stage information; identifying physical threads that processed batch-units for a previous batch-stage on which the current batch-stage depends and have registered pending tasks for the current batch-stage; and dispatching the batch-units for the current batch-stage to the identified physical threads subsequent to complete processing of the batch-units for the previous batch-stage.


    Managing resource allocation in a stream processing framework

    Publication No.: US11086688B2

    Publication Date: 2021-08-10

    Application No.: US16200365

    Application Date: 2018-11-26

    Abstract: The technology disclosed herein relates to method, system, and computer program product (computer-readable storage device) embodiments for managing resource allocation in a stream processing framework. An embodiment operates by configuring an allocation of a task sequence and machine resources to a container, partitioning a data stream into a plurality of batches arranged for parallel processing by the container via the machine resources allocated to the container, and running the task sequence, running at least one batch of the plurality of batches. Some embodiments may also include changing the allocation responsive to a determination of an increase in data volume, and may further include changing the allocation to a previous state of the allocation, responsive to a determination of a decrease in data volume. Additionally, time-based throughput of the data stream may be monitored for a given worker node configured to run a batch of the plurality of batches.
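The partitioning and volume-responsive allocation steps can be sketched as follows; this is an illustrative Python fragment (the names `partition`, `Allocation`, `scale_up`, and `revert` are hypothetical), assuming round-robin assignment keeps batch sizes balanced and that reverting restores the allocation's previous state:

```python
def partition(stream, num_batches):
    """Sketch: partition a data stream into batches arranged for
    parallel processing by the container's machine resources."""
    batches = [[] for _ in range(num_batches)]
    for i, record in enumerate(stream):
        batches[i % num_batches].append(record)  # round-robin assignment
    return batches

class Allocation:
    """Sketch: machine-resource allocation that grows on a data-volume
    increase and can return to its previous state on a decrease."""

    def __init__(self, workers):
        self.workers = workers
        self._history = []

    def scale_up(self, workers):
        # Remember the current state so a later decrease can revert to it.
        self._history.append(self.workers)
        self.workers = workers

    def revert(self):
        if self._history:
            self.workers = self._history.pop()
```

For instance, an allocation of 4 workers scaled up to 8 on a volume spike returns to 4 when `revert` is called after the volume drops.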

    Handling multiple task sequences in a stream processing framework

    Publication No.: US10198298B2

    Publication Date: 2019-02-05

    Application No.: US14986351

    Application Date: 2015-12-31

    Abstract: The technology disclosed improves existing stream processing systems by enabling resources within the infrastructure of a stream processing system to be both scaled up and scaled down. In particular, the technology disclosed relates to a dispatch system for a stream processing system that adapts its behavior according to a computational capacity of the system based on a run-time evaluation. The technical solution includes, during run-time execution of a pipeline, comparing a count of available physical threads against a set number of logically parallel threads. When the count of available physical threads equals or exceeds the number of logically parallel threads, the solution includes concurrently processing the batches at the physical threads. When there are fewer available physical threads than logically parallel threads, the solution includes multiplexing the batches sequentially over the available physical threads.
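The run-time dispatch decision described above reduces to a small branch; a minimal Python sketch (the function name `dispatch` and the returned thread-to-batches plan are hypothetical):

```python
def dispatch(batches, available_threads, logical_parallelism):
    """Sketch: concurrent processing when physical capacity covers the
    logical parallelism; sequential multiplexing when it does not."""
    if available_threads >= logical_parallelism:
        # Enough capacity: each batch gets its own physical thread.
        return {i: [b] for i, b in enumerate(batches)}
    # Fewer physical threads than logically parallel threads:
    # multiplex the batches sequentially over what is available.
    plan = {t: [] for t in range(available_threads)}
    for i, b in enumerate(batches):
        plan[i % available_threads].append(b)
    return plan
```

With three batches and only two physical threads, one thread runs two batches back to back; with capacity to spare, every batch runs concurrently on its own thread.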

    Managing resource allocation in a stream processing framework

    Publication No.: US10146592B2

    Publication Date: 2018-12-04

    Application No.: US14994131

    Application Date: 2016-01-12

    Abstract: The technology disclosed relates to managing resource allocation to task sequences in a stream processing framework. In particular, it relates to operating a computing grid that includes machine resources, with heterogeneous containers defined over whole machines and some containers including multiple machines. It also includes initially allocating multiple machines to a first container, initially allocating a first set of stateful task sequences to the first container, and running the first set of stateful task sequences as multiplexed units of work under control of a container-scheduler, where each unit of work for a first task sequence runs to completion on first machine resources in the first container, unless it overruns a time-out, before a next unit of work for a second task sequence runs multiplexed on the first machine resources. It further includes automatically modifying the number of machine resources and/or the number of task sequences assigned to a container.
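The multiplexed, run-to-completion-with-time-out behavior can be sketched as follows; this is a simplified simulation in Python (the name `multiplex` and the `(unit_name, cost)` representation are hypothetical), assuming each unit's cost is known and units that would overrun the time-out are requeued instead of run:

```python
from itertools import zip_longest

def multiplex(sequences, timeout):
    """Sketch: units of work from several task sequences run multiplexed
    on one container's machine resources. Each unit runs to completion
    unless it would overrun the time-out, in which case it is requeued."""
    order, requeued = [], []
    # Interleave one unit of work from each task sequence per round.
    for round_units in zip_longest(*sequences):
        for unit in round_units:
            if unit is None:
                continue  # this sequence has no unit left this round
            name, cost = unit
            if cost > timeout:
                requeued.append(name)   # unit overran its time-out
            else:
                order.append(name)      # unit ran to completion
    return order, requeued
```

The interleaving is what keeps one long task sequence from monopolizing the container's machine resources between units of work.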

    MAINTAINING THROUGHPUT OF A STREAM PROCESSING FRAMEWORK WHILE INCREASING PROCESSING LOAD

    Publication No.: US20180253335A1

    Publication Date: 2018-09-06

    Application No.: US15973230

    Application Date: 2018-05-07

    CPC classification number: G06F9/505 G06F3/0613 G06F3/0631 G06F3/067 G06F9/5061

    Abstract: The technology disclosed relates to maintaining throughput of a stream processing framework while increasing processing load. In particular, it relates to defining a container over at least one worker node that has a plurality of workers, with one worker utilizing a whole core within a worker node, and queuing data from one or more incoming near real-time (NRT) data streams in multiple pipelines that run in the container and have connections to at least one common resource external to the container. It further relates to concurrently executing the pipelines at a number of workers as batches, and limiting simultaneous connections to the common resource to the number of workers by providing a shared connection to a set of batches running on a same worker regardless of the pipelines to which the batches in the set belong.
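The connection-capping mechanism can be sketched as follows; a minimal Python illustration (the `Worker` class and `connection` method are hypothetical), assuming every batch on a worker obtains the external connection through that worker rather than opening its own:

```python
class Worker:
    """Sketch: batches running on the same worker share one connection
    to a common external resource, so simultaneous connections are
    capped at the number of workers, not the number of batches."""

    def __init__(self, connect):
        self._connect = connect
        self._conn = None

    def connection(self):
        if self._conn is None:
            self._conn = self._connect()  # opened once per worker, lazily
        return self._conn

opened = []
def connect():
    conn = object()
    opened.append(conn)  # track how many real connections were opened
    return conn

workers = [Worker(connect) for _ in range(2)]
# Six batches from several pipelines, multiplexed over two workers:
for i in range(6):
    workers[i % 2].connection()
```

Six batches touch the external resource, but only two connections are ever opened, one per worker, regardless of which pipelines the batches belong to.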

    Recovery strategy for a stream processing system

    Publication No.: US09946593B2

    Publication Date: 2018-04-17

    Application No.: US15004887

    Application Date: 2016-01-22

    Abstract: The technology disclosed relates to discovering multiple previously unknown and undetected technical problems in fault tolerance and data recovery mechanisms of modern stream processing systems. In addition, it relates to providing technical solutions to these previously unknown and undetected problems. In particular, the technology disclosed relates to discovering the problem of modification of batch size of a given batch during its replay after a processing failure. This problem results in over-count when the input during replay is not a superset of the input fed at the original play. Further, the technology disclosed discovers the problem of inaccurate counter updates in replay schemes of modern stream processing systems when one or more keys disappear between a batch's first play and its replay. This problem is exacerbated when data in batches is merged or mapped with data from an external data store.

    MANAGING PROCESSING OF LONG TAIL TASK SEQUENCES IN A STREAM PROCESSING FRAMEWORK

    Publication No.: US20170083378A1

    Publication Date: 2017-03-23

    Application No.: US14986419

    Application Date: 2015-12-31

    CPC classification number: G06F9/5038 G06F9/5072 G06F9/5088 G06F17/30516

    Abstract: The technology disclosed relates to managing processing of long tail task sequences in a stream processing framework. In particular, it relates to operating a computing grid that includes a plurality of physical threads which processes data from one or more near real-time (NRT) data streams for multiple task sequences, and queuing data from the NRT data streams as batches in multiple pipelines using a grid-coordinator that controls dispatch of the batches to the physical threads. The method also includes assigning a priority-level to each of the pipelines using a grid-scheduler, wherein the grid-scheduler initiates execution of a first number of batches from a first pipeline before execution of a second number of batches from a second pipeline, responsive to respective priority levels of the first and second pipelines.
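The priority-driven batch ordering can be sketched with a priority queue; the function name `schedule_batches` and the `(name, priority, batches)` tuples are hypothetical, and the sketch assumes a lower number means a higher priority-level:

```python
import heapq

def schedule_batches(pipelines):
    """Sketch: a grid-scheduler assigns a priority-level to each
    pipeline and initiates execution of batches from a higher-priority
    pipeline before batches from a lower-priority one."""
    heap = [(priority, name, list(batches))
            for name, priority, batches in pipelines]
    heapq.heapify(heap)  # lower number = higher priority
    order = []
    while heap:
        priority, name, batches = heapq.heappop(heap)
        order.extend(f"{name}:{b}" for b in batches)
    return order
```

A high-priority pipeline's batch is drained before a long-tail pipeline's batches, which is how long-tail task sequences are kept from delaying hot ones.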
