-
1.
公开(公告)号:US11314767B2
公开(公告)日:2022-04-26
申请号:US16943314
申请日:2020-07-30
Applicant: salesforce.com, inc.
Inventor: Zhidong Ke , Utsavi Benani , Kevin Terusaki , Yifeng Liu , Percy Mehta , Heng Zhang
Abstract: Managing mutations in a data lake environment. A mutation request to cause write operations that modify data objects or structures within an environment for collecting unformatted raw data is received. The environment has at least a data table and a notification table. An entry is written to the data table with a streaming job configured to receive and process the mutation request. Entries to the data table specify at least records indicating changes to objects in the environment based on ingestion processing for the environment for collecting unformatted raw data and based on the mutation request. A corresponding entry is written to the notification table in response to a successful write attempt to the data table. The notification table entry has information about data table entries for a specified period. At least one data consumer is notified that the data table has been modified.
-
2.
公开(公告)号:US20220035829A1
公开(公告)日:2022-02-03
申请号:US16943314
申请日:2020-07-30
Applicant: salesforce.com, inc.
Inventor: Zhidong Ke , Utsavi Benani , Kevin Terusaki , Yifeng Liu , Percy Mehta , Heng Zhang
Abstract: Managing mutations in a data lake environment. A mutation request to cause write operations that modify data objects or structures within an environment for collecting unformatted raw data is received. The environment has at least a data table and a notification table. An entry is written to the data table with a streaming job configured to receive and process the mutation request. Entries to the data table specify at least records indicating changes to objects in the environment based on ingestion processing for the environment for collecting unformatted raw data and based on the mutation request. A corresponding entry is written to the notification table in response to a successful write attempt to the data table. The notification table entry has information about data table entries for a specified period. At least one data consumer is notified that the data table has been modified.
-
3.
公开(公告)号:US20220012259A1
公开(公告)日:2022-01-13
申请号:US16922989
申请日:2020-07-07
Applicant: salesforce.com, inc.
Inventor: Zhidong Ke , Kevin Terusaki , Yifeng Liu , Utsavi Benani , Heng Zhang
Abstract: Techniques and mechanisms for ingesting data through an atomic transaction are disclosed. Raw data is received from multiple disparate sources to be consumed in an environment that does not support atomic write operations to data consumers. The environment has at least a data table and a notification table. A write to an entry in the data table having an associated version is attempted. The data table entry corresponds to the data to be consumed. A write to a corresponding entry to the notification table is attempted in response to a successful write attempt to the data table. The notification table entry includes information about the corresponding data table entry. The version associated with the data table is modified in response to successful writes of both the data table entry and the notification table entry. At least one data consumer is notified that the data table version has been modified.
-
公开(公告)号:US20220345458A1
公开(公告)日:2022-10-27
申请号:US17238180
申请日:2021-04-22
Applicant: salesforce.com, inc.
Inventor: Jayanth Parayil Kumarji , Priyadarshini Mitra , Kevin Terusaki , Zhidong Ke , Yifeng Liu , Utsavi Benani , Heng Zhang , Evan Jiang , Birva Joshi , Yogesh Patel
IPC: H04L29/06
Abstract: Various techniques and mechanisms for sharing remote resources among a trusted group are disclosed. A credential management agent utilizes a resource credential for a first user to access a secure resource corresponding to the first user for a second user by at least validating a second user and validating a consent of the first user to allow the second user to access the secure resource using the resource credential for the first user. The secure resource resides on a remote server system accessible via one or more application program interfaces (APIs). A platform management agent provides an interface for shared resource-agnostic credential sharing. The platform management agent validates credentials for the second user as belonging to a trusted group and forwards a request for access to the secure resource for the second user to the credential management agent. In response to receiving the resource credential for the first user to access the secure resource, the platform management agent accesses the secure resource on behalf of the second user to allow the second user to utilize the secure resource.
-
5.
公开(公告)号:US20220171696A1
公开(公告)日:2022-06-02
申请号:US17108875
申请日:2020-12-01
Applicant: salesforce.com, inc.
Inventor: Heng Zhang , Utsavi Benani , Zhidong Ke
Abstract: Mechanisms and techniques for providing an isolated runtime environment are disclosed. Scoped properties are generated utilizing a namespace identifier. The namespace identifier is an identifier used to define a scope of at least an isolated application runtime context. Scoped objects are generated corresponding to one or more dependent services utilizing the one or more scoped properties and the namespace identifier. A target application is deployed to a container corresponding to the isolated application runtime context. The target application interacts with at least some of the dependent services utilizing the namespace identifier. At least one integration test is run on the target application in the container. The integration tests refer to the isolated application runtime context utilizing the namespace identifier.
-
6.
公开(公告)号:US20220012214A1
公开(公告)日:2022-01-13
申请号:US16922986
申请日:2020-07-07
Applicant: salesforce.com, inc.
Inventor: Zhidong Ke , Kevin Terusaki , Yifeng Liu , Utsavi Benani , Heng Zhang , Aaron Zhang , Shreedhar Sundaram , Mahalaxmi Sanathkumar
IPC: G06F16/18 , G06F16/174 , G06F16/17 , G06F16/176 , G06F16/22 , G06F11/07
Abstract: Techniques and mechanisms for incremental data ingestion are disclosed. Raw data is received from multiple disparate sources to be consumed in an environment for collecting unformatted raw data. The environment has at least a delta data table and a delta notification table. A write to an entry in the delta data table is attempted. Entries to the delta data table specify at least records indicating changes to objects in the environment. A write a corresponding entry to the delta notification table is attempted in response to a successful write attempt to the delta data table. The delta notification table entry includes information about delta data table entries for a specified period. At least one data consumer is notified that the delta data table has been modified.
-
公开(公告)号:US20210232604A1
公开(公告)日:2021-07-29
申请号:US16775306
申请日:2020-01-29
Applicant: Salesforce.com, Inc.
Inventor: Shreedhar Sundaram , Mahalaxmi Sanathkumar , Aaron Zhang , Parin Kenia , Violet Gong , Priya Sethuraman , Zhidong Ke , Kevin Terusak
IPC: G06F16/28 , G06F16/23 , G06F16/2453
Abstract: A request may be received to transform records in a data lake that match one or more query criteria. Data lake records that match the criteria may be retrieved. The retrieved records may include two data lake records associated with the same data lake record identifier. A transformed record may be generated based on time stamps associated with the retrieved records.
-
公开(公告)号:US11488082B2
公开(公告)日:2022-11-01
申请号:US16366922
申请日:2019-03-27
Applicant: salesforce.com, inc.
Inventor: Utsavi Benani , Zhidong Ke , Scott W. Bishel , Praveen Innamuri , Kevin Terusaki , Shan-Cheng Ho
Abstract: Described is a method and system for verifying the end-to-end distribution of messages within an on-demand services platform. To provide the ability to monitor and trace such messages, the system may include a specialized and queryable datastore (or database) that allows the system to track the distribution and acknowledgement of messages throughout the services platform. Accordingly, a specialized verification process may be initiated to query the database to verify the receipt of a message has been acknowledged by various components within the system. In addition, the verification process may automatically retry the distribution the messages to ensure particular downstream components have received the message. Accordingly, the system alleviates the need to manual re-crawl various message source to ensure the end-to-end distribution of such messages.
-
9.
公开(公告)号:US20220092048A1
公开(公告)日:2022-03-24
申请号:US17026061
申请日:2020-09-18
Applicant: Salesforce.com, inc.
Inventor: Zhidong Ke , Yifeng Liu , Heng Zhang , Utsavi Benani , Kevin Terusaki , Priyadarshini Mitra
IPC: G06F16/22 , G06F16/28 , G06F16/2457 , G06F16/23 , G06F16/27
Abstract: Architectures and techniques to provide an extract-once framework for data ingestion into a data lake. A data consumption job to ingest data to multiple tables within a data collection platform is started. Checkpoint metadata corresponding to the data consumption job is retrieved from a checkpoint metadata store. A subset of processes from the data consumption job are performed. Checkpoint metadata is updated in response to completion of the subset of processes. A subsequent subset of processes from the data consumption job is performed. Checkpoint metadata is updated in response to completion of each of the at least one subsequent subset of processes from the data consumption job. Batch metadata is updated in response to completion of the data consumption job.
-
10.
公开(公告)号:US20200250007A1
公开(公告)日:2020-08-06
申请号:US16264399
申请日:2019-01-31
Applicant: salesforce.com, inc.
Inventor: Zhidong Ke , Kevin Terusaki , Praveen Innamuri , Narek Asadorian
IPC: G06F9/50 , G06F16/174 , G06F9/52
Abstract: Described is a system and method for compacting data into customized (e.g. optimal) file sizes for processing by computing resources. The mechanism may leverage various computing resources such as a cluster computing frameworks combined with a stream processing platform to efficiently process the activity data. For example, activity data of an organization may be processed by a set of jobs (or sub-jobs) as part of a data stream by a set of distributed computing resources. In order to efficiently process such data, the mechanism may compact the data into customized (e.g. optimal) file sizes. For example, the customized file sizes may provide an optimal (or near optimal) amount of data to be processed by each job, for example, to improve performance.
-
-
-
-
-
-
-
-
-