-
Publication No.: US12007992B2
Publication date: 2024-06-11
Application No.: US17818878
Filing date: 2022-08-10
Applicant: Microsoft Technology Licensing, LLC
Inventor: Rahul Potharaju , Wentao Wu , Terry Y. Kim , Lev Novik , Apoorve Dave
IPC: G06F16/2453 , G06F16/22 , G06F16/25
CPC classification number: G06F16/24542 , G06F16/2272 , G06F16/254
Abstract: Methods, systems and computer program products are described herein that provide a serverless, multi-engine, multi-user data lake indexing subsystem and application programming interface. Indexes are defined as derived datasets and stored on the data lake in a universal format that enables disparate engines to create and/or discover indexes for workload optimization. Embodiments of indexes enable stateful control and management of an index via metadata included in the index and stored on the data lake.
-
Publication No.: US11677635B2
Publication date: 2023-06-13
Application No.: US17379725
Filing date: 2021-07-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Navendu Jain , Rahul Potharaju
IPC: H04L43/065 , H04L43/0823 , H04L43/028 , H04L43/0876 , H04L43/062 , H04L43/04 , H04L41/14 , H04L41/06 , H04L43/045
CPC classification number: H04L41/145 , H04L41/06 , H04L43/028 , H04L43/04 , H04L43/045 , H04L43/062 , H04L43/065 , H04L43/0823 , H04L43/0876
Abstract: A hierarchical network analytics system operated by a computing device or system is described. In some example techniques, the analytics system may determine results of a plurality of first level analyses each based at least in part on results of a respective plurality of data queries that return respective subsets of a plurality of types of network data. The analytics system may determine a result of a second level analysis based at least in part on results of the plurality of first level analyses.
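The two-level structure described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the sample data, the thresholds, and the names `query`, `first_level_analysis`, and `second_level_analysis` are all hypothetical, chosen only to show first-level analyses built from data queries and a second-level analysis built from the first-level results.

```python
from statistics import mean

# Hypothetical network data, keyed by data type (all values are made up).
NETWORK_DATA = {
    "latency_ms": {"switch-a": [2.0, 3.0, 4.0], "switch-b": [20.0, 30.0, 40.0]},
    "packet_loss_pct": {"switch-a": [0.0, 0.2], "switch-b": [3.0, 5.0]},
}


def query(data_type, device):
    """Data query: return the subset of one type of network data for one device."""
    return NETWORK_DATA[data_type][device]


def first_level_analysis(data_type, devices):
    """First-level analysis: combine several data queries over one data type."""
    return {device: mean(query(data_type, device)) for device in devices}


def second_level_analysis(first_level_results, thresholds):
    """Second-level analysis: keep devices that exceed the threshold
    in every first-level result (an assumed combination rule)."""
    devices = set.intersection(*(set(r) for r in first_level_results.values()))
    return sorted(
        d for d in devices
        if all(first_level_results[t][d] > thresholds[t] for t in first_level_results)
    )


devices = ["switch-a", "switch-b"]
first = {t: first_level_analysis(t, devices) for t in NETWORK_DATA}
suspects = second_level_analysis(first, {"latency_ms": 10.0, "packet_loss_pct": 1.0})
print(suspects)
```

With the sample data above, only `switch-b` exceeds both assumed thresholds, so it is the only device the second-level analysis reports.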
-
Publication No.: US20210334242A1
Publication date: 2021-10-28
Application No.: US16856600
Filing date: 2020-04-23
Applicant: Microsoft Technology Licensing, LLC
Inventor: Rahul Potharaju , Terry Y. Kim , Wentao Wu
IPC: G06F16/185 , G06F16/182 , G06F16/13 , G06F16/14 , G06F16/17
Abstract: Methods, systems, apparatuses, and computer program products are described herein for the generation and traversal of a hierarchical index structure. The structure indexes search keys from data ingested from different data sources and enables efficient retrieval of the keys. As data is ingested, index nodes are generated at the lowest level of the structure. The nodes are analyzed to determine whether such nodes comprise duplicate keys. Responsive to determining that they do, a new index node is generated at a higher level of the structure. This process results in a DAG comprising orphan nodes including different search keys. When processing a query for search keys, the orphan index nodes are initially analyzed for the keys. Upon finding a search key, its child nodes are recursively searched until location information specifying the location of ingested data in which the search key is located is found.
-
Publication No.: US11288271B2
Publication date: 2022-03-29
Application No.: US16885878
Filing date: 2020-05-28
Applicant: Microsoft Technology Licensing, LLC
Inventor: Rahul Potharaju , Wentao Wu
IPC: G06F16/2453 , G06F16/901 , G06F16/2458
Abstract: Methods, systems and computer program products are described herein that enable data workload optimization through “what-if” modeling of indexes and index recommendation. In an example aspect, a system is configured to accept a workload comprising a plurality of queries directed at data having a first physical data layout, generate a set of candidate indexes based on the plurality of queries, enumerate index configurations based on the set of candidate indexes, each index configuration comprising a subset of the set of candidate indexes, generate a hierarchical graph of the index configurations, search the hierarchical graph for a recommended index configuration comprising an index configuration with the lowest estimated cost while pruning index configurations not considered from the graph of index configurations to generate a pruned graph, execute a graph query against the pruned graph generating a graph query result and perform an optimization operation based on the graph query result.
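The core of the recommendation step (enumerate index configurations as subsets of the candidate indexes, then pick the one with the lowest estimated cost) can be sketched in Python as follows. This is a simplified illustration, not the patented method: it enumerates subsets exhaustively and omits the hierarchical graph, the pruning, and the graph query. The workload, the three cost constants, and every function name are assumptions made for the example.

```python
from itertools import combinations

# Hypothetical workload: each query is the set of columns it filters on.
WORKLOAD = [{"user_id"}, {"user_id", "country"}, {"country"}, {"ts"}]

FULL_SCAN_COST = 100.0    # assumed cost of a query with no usable index
INDEXED_COST = 10.0       # assumed cost when an index covers the query's columns
MAINTENANCE_COST = 15.0   # assumed upkeep charged per index in a configuration


def candidate_indexes(workload):
    """One candidate index per distinct set of filter columns."""
    return sorted({tuple(sorted(q)) for q in workload})


def estimated_cost(config, workload):
    """'What-if' estimate: cost the workload as if the indexes in config existed."""
    total = MAINTENANCE_COST * len(config)
    for q in workload:
        covered = any(set(index) == q for index in config)
        total += INDEXED_COST if covered else FULL_SCAN_COST
    return total


def recommend(workload, max_indexes):
    """Enumerate every configuration of up to max_indexes candidates
    and return one with the lowest estimated cost."""
    candidates = candidate_indexes(workload)
    configs = [
        c for k in range(max_indexes + 1) for c in combinations(candidates, k)
    ]
    return min(configs, key=lambda c: estimated_cost(c, workload))


best = recommend(WORKLOAD, max_indexes=2)
print(best, estimated_cost(best, WORKLOAD))
```

No index is ever built here: `estimated_cost` only models what the workload would cost if a configuration existed, which is the sense of "what-if" used in the abstract. Exhaustive enumeration grows exponentially with the number of candidates, which is why a real system would need a search strategy with pruning.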
-
Publication No.: US20210357406A1
Publication date: 2021-11-18
Application No.: US16867010
Filing date: 2020-05-05
Applicant: Microsoft Technology Licensing, LLC
Inventor: Rahul Potharaju , Wentao Wu
IPC: G06F16/2453 , G06F16/27 , G06F16/2458 , G06F16/22
Abstract: Methods, systems and computer program products are described herein that enable data workload optimization through “what-if” modeling of indexes and index recommendation. In an example aspect, a system is configured to accept a workload in the form of a plurality of data queries, extract indexable columns from the queries, generate candidate indexes from the indexable columns, select and then build the best candidate indexes. Furthermore, the system may receive a query, generate a query plan for the query configured to use one or more of the built candidate indexes, and execute the query plan to generate a query result.
-
Publication No.: US20210334256A1
Publication date: 2021-10-28
Application No.: US16859733
Filing date: 2020-04-27
Applicant: Microsoft Technology Licensing, LLC
Inventor: Rahul Potharaju , Terry Y. Kim , Wentao Wu
IPC: G06F16/22 , G06F16/245 , G06F16/248
Abstract: Methods, systems, apparatuses, and computer program products are directed to the generation of a global index structure. Agents executing on different data sources locally pre-process (e.g., format, filter, compress, encode, serialize, etc.) data generated thereby and index such data. The agents also manage the resources thereof to perform the pre-processing and indexing operations. Each index generated by an agent is formatted as a plurality of index nodes. The index nodes and pre-processed data are provided to backend server(s) that maintain the global index structure and store the data in a globally distributed file system, which aids in recovery from unexpected disasters. The backend server(s) generate the global index structure based on the index nodes. As new index nodes are received by the backend servers, the backend servers merge the newly-received index nodes with the global index structure. Global index structure traversal techniques for retrieving search keys are also described herein.
-
Publication No.: US11057266B2
Publication date: 2021-07-06
Application No.: US16357202
Filing date: 2019-03-18
Applicant: Microsoft Technology Licensing, LLC
Inventor: Navendu Jain , Rahul Potharaju
Abstract: Described herein are various technologies pertaining to providing assistance to an operator in a data center with respect to failures in the data center. An alarm is received, and a failing device is identified based upon content of the alarm. Failure conditions of the alarm are mapped to a failure symptom that may be exhibited by the failing device, and troubleshooting options previously employed to mitigate the failure symptom are retrieved from historical data. Labels are respectively assigned to the troubleshooting options, where a label is indicative of a probability that a troubleshooting option to which the label has been assigned will mitigate the failure symptom.
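The flow in this abstract (map alarm conditions to a failure symptom, retrieve previously used troubleshooting options, label each with a probability) can be sketched in Python. This is only an illustration under stated assumptions: the history records, the condition-to-symptom mapping, and the function `label_options` are hypothetical, and the empirical success rate is used here as one possible probability label, not necessarily the one the patent uses.

```python
from collections import Counter

# Hypothetical history of (failure symptom, troubleshooting option, mitigated?) records.
HISTORY = [
    ("link_flap", "reseat_cable", True),
    ("link_flap", "reseat_cable", True),
    ("link_flap", "reseat_cable", False),
    ("link_flap", "replace_optic", True),
    ("link_flap", "reboot_switch", False),
    ("high_temp", "replace_fan", True),
]

# Hypothetical mapping from an alarm's failure condition to a failure symptom.
CONDITION_TO_SYMPTOM = {
    "port down/up repeated": "link_flap",
    "temp > 80C": "high_temp",
}


def label_options(alarm_condition):
    """Return (option, probability) pairs for the alarm's symptom, best first.

    The probability label is the fraction of past attempts in which the
    option mitigated the symptom (an assumed estimator).
    """
    symptom = CONDITION_TO_SYMPTOM[alarm_condition]
    tried, worked = Counter(), Counter()
    for past_symptom, option, mitigated in HISTORY:
        if past_symptom == symptom:
            tried[option] += 1
            worked[option] += mitigated  # True counts as 1, False as 0
    labels = {option: worked[option] / tried[option] for option in tried}
    return sorted(labels.items(), key=lambda kv: kv[1], reverse=True)


for option, probability in label_options("port down/up repeated"):
    print(f"{option}: {probability:.2f}")
```

An operator would then see the options ranked by how often each one resolved the same symptom in the past. With so few records the estimates are noisy, so a practical system would likely also account for sample size.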
-
Publication No.: US20200259716A1
Publication date: 2020-08-13
Application No.: US16822845
Filing date: 2020-03-18
Applicant: Microsoft Technology Licensing, LLC
Inventor: Navendu Jain , Rahul Potharaju
IPC: H04L12/24
Abstract: A hierarchical network analytics system operated by a computing device or system is described. In some example techniques, the analytics system may determine results of a plurality of first level analyses each based at least in part on results of a respective plurality of data queries that return respective subsets of a plurality of types of network data. The analytics system may determine a result of a second level analysis based at least in part on results of the plurality of first level analyses.
-
Publication No.: US09973392B2
Publication date: 2018-05-15
Application No.: US14057757
Filing date: 2013-10-18
Applicant: Microsoft Technology Licensing, LLC
Inventor: Navendu Jain , Rahul Potharaju
IPC: G06F15/173 , H04L12/24 , H04L12/26
CPC classification number: H04L41/145 , H04L41/06 , H04L43/028 , H04L43/04 , H04L43/045 , H04L43/062 , H04L43/065 , H04L43/0823 , H04L43/0876
Abstract: A hierarchical network analytics system operated by a computing device or system is described. In some example techniques, the analytics system may determine results of a plurality of first level analyses each based at least in part on results of a respective plurality of data queries that return respective subsets of a plurality of types of network data. The analytics system may determine a result of a second level analysis based at least in part on results of the plurality of first level analyses.
-
Publication No.: US09779141B2
Publication date: 2017-10-03
Application No.: US14106763
Filing date: 2013-12-14
Applicant: Microsoft Technology Licensing, LLC
Inventor: Navendu Jain , Luhui Hu , Liyuan Zhang , Rahul Potharaju , Vitaly Voloshin , Mingshi Wang , Joseph K. W. Chan
IPC: G06F17/30
CPC classification number: G06F17/3053 , G06F17/30542 , G06F17/3066 , G06F17/30699
Abstract: Disclosed herein is a system and method for searching or processing queries for searching for documents contained in a domain specific knowledge base. The system takes a query and generates from the query a modified version of the query by passing the query through one or more filters in a query processor. The query processor adds or removes terms from the query. The query processor can add or recognize that two words that appear to be separate words actually identify a specific software entity or can determine that a number appearing in a query is not just a number but refers to a specific version or a number relevant to the specific problem.
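The filter chain described in this abstract can be sketched in Python as a sequence of functions that each rewrite the query's terms. Everything below is hypothetical: the entity vocabulary, the stop-word list, the `version:` tag, and the filter names are assumptions for illustration, and a real domain-specific knowledge base would supply its own.

```python
import re

# Hypothetical domain vocabulary; a real knowledge base would supply these.
KNOWN_ENTITIES = {
    ("sql", "server"): "sql_server",
    ("active", "directory"): "active_directory",
}
STOP_WORDS = {"the", "a", "on", "in", "is"}


def entity_filter(tokens):
    """Join two adjacent words that together name a known software entity."""
    out, i = [], 0
    while i < len(tokens):
        pair = tuple(tokens[i:i + 2])
        if pair in KNOWN_ENTITIES:
            out.append(KNOWN_ENTITIES[pair])
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out


def version_filter(tokens):
    """Tag a number that directly follows an entity as a version of that entity."""
    entities = set(KNOWN_ENTITIES.values())
    out = []
    for tok in tokens:
        if out and out[-1] in entities and re.fullmatch(r"\d+(\.\d+)*", tok):
            out.append(f"version:{tok}")
        else:
            out.append(tok)
    return out


def stop_word_filter(tokens):
    """Remove terms that carry no search value."""
    return [t for t in tokens if t not in STOP_WORDS]


def process_query(query):
    """Pass the query through each filter in turn and return the modified terms."""
    tokens = query.lower().split()
    for query_filter in (entity_filter, version_filter, stop_word_filter):
        tokens = query_filter(tokens)
    return tokens


print(process_query("The SQL Server 2012 install is failing on 2 nodes"))
# ['sql_server', 'version:2012', 'install', 'failing', '2', 'nodes']
```

In the example, `2012` is tagged as a version because it follows a recognized entity, while `2` stays a plain number because it does not, which mirrors the distinction the abstract draws between a version number and "just a number".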