- Title: Near-real-time data processing with partition files
- Application number: US17175491
- Filing date: 2021-02-12
- Publication number: US11416312B1
- Publication date: 2022-08-16
- Inventors: Xu Liu, Steve Chun-Hao Hu, Abhishank Sahu, Yingji Ju, Gunter Leeb, Jose Fernandez, Swadhin Ajay Thakkar, William Edward Miao, Sravanthi Pereddy, Jordan Robert Fitzgibbon, Raveena Dayani
- Applicant: Microsoft Technology Licensing, LLC.
- Applicant address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC.
- Current assignee: Microsoft Technology Licensing, LLC.
- Current assignee address: US WA Redmond
- Agency: Workman Nydegger
- Main classification: G06F9/46
- IPC classifications: G06F9/46; G06F9/52; G06F9/50
Abstract:
Embodiments disclosed herein are related to implementing a near-real-time stream processing system using the same distributed file system as a batch processing system. A data container and partition files are generated according to a partition window that specifies a time range that controls when data is to be included in the partition files. The data container is scanned to determine if the partition files are within a partition lifetime window that specifies a time range that controls how long the partition files are active for processing. For each partition file within the lifetime window, processing tasks are created based on an amount of data included in the partition files. The data in the partition files is accessed and the processing tasks are performed. Information about the partition files is recorded in a configuration data store.
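The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the window lengths, the bytes-per-task threshold, the rule for deciding when a file is still active, and every name used here (`PartitionFile`, `partition_start`, `is_active`, `task_count`, `scan_container`) are assumptions chosen for illustration, and a plain list and dict stand in for the data container and the configuration data store.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from math import ceil

# All three values are illustrative assumptions, not taken from the patent.
PARTITION_WINDOW = timedelta(minutes=5)   # time range covered by one partition file
LIFETIME_WINDOW = timedelta(hours=1)      # how long a partition file stays active
BYTES_PER_TASK = 64 * 1024 * 1024         # data volume handled by one task

EPOCH = datetime(1970, 1, 1)


@dataclass
class PartitionFile:
    name: str
    window_start: datetime  # start of the partition window this file covers
    size_bytes: int


def partition_start(event_time: datetime) -> datetime:
    """Floor an event time to the start of its partition window."""
    windows_since_epoch = (event_time - EPOCH) // PARTITION_WINDOW
    return EPOCH + windows_since_epoch * PARTITION_WINDOW


def is_active(pf: PartitionFile, now: datetime) -> bool:
    """Assumed rule: a file is active for LIFETIME_WINDOW after its window starts."""
    return pf.window_start <= now < pf.window_start + LIFETIME_WINDOW


def task_count(pf: PartitionFile) -> int:
    """Create tasks in proportion to the amount of data in the file."""
    return ceil(pf.size_bytes / BYTES_PER_TASK)


def scan_container(container, now, config_store):
    """Scan the container, plan tasks for active files, and record file info."""
    plan = {}
    for pf in container:
        if not is_active(pf, now):
            continue  # outside the lifetime window: no longer processed
        plan[pf.name] = task_count(pf)
        config_store[pf.name] = {
            "window_start": pf.window_start.isoformat(),
            "size_bytes": pf.size_bytes,
            "tasks": plan[pf.name],
        }
    return plan
```

Calling `scan_container` with a list of `PartitionFile` objects, the current time, and an empty dict returns a task count per active file and fills the dict with one record per active file. Sizing the task count from `size_bytes` mirrors the abstract's statement that tasks are created based on the amount of data in the partition files; the fixed threshold is only one possible way to do that.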
Published/granted documents
- US20220261297A1 NEAR-REAL-TIME DATA PROCESSING WITH PARTITION FILES Publication/grant date: 2022-08-18