-
Publication number: US12265528B1
Publication date: 2025-04-01
Application number: US18187553
Filing date: 2023-03-21
Applicant: Amazon Technologies, Inc.
Inventor: Wuwei Lan , Patrick Ng , Zhiguo Wang , Ramesh M. Nallapati , Henghui Zhu , Anuj Chauhan , Sudipta Sengupta , Stephen Michael Ash , Bing Xiang , Gregory David Adams
IPC: G06F16/00 , G06F16/22 , G06F16/242 , G06F16/2457 , G06F16/248 , G06F16/25 , G06N3/0455 , G06N3/0499
Abstract: Techniques for handling natural language query processing are described. In some examples, a sequence-to-sequence model is used to handle a natural language query. Post-processing of a result of the sequence-to-sequence model utilizes fine-grained information from an entity linker. In some examples, the sequence-to-sequence model and aspects of a natural language query pipeline are used to handle a natural language query.
-
Publication number: US20230325384A1
Publication date: 2023-10-12
Application number: US18182303
Filing date: 2023-03-10
Applicant: Amazon Technologies, Inc.
Inventor: Ramesh M Nallapati , Zhiguo Wang , Bing Xiang , Patrick Ng , Yung Haw Wang , Mukul Karnik , Nanyan Li , Sharanabasappa Parashuram Revadigar , Timothy Jones , Stephen Michael Ash , Sudipta Sengupta , Gregory David Adams , Deepak Shantha Murthy , Douglas Scott Cerny , Stephanie Weeks , Hanbo Li
IPC: G06F16/2452 , G06F16/242 , G06F40/295 , G06N20/00
CPC classification number: G06F16/24522 , G06F16/243 , G06F16/2423 , G06F40/295 , G06N20/00
Abstract: Interactive assistances for executing natural language queries to data sets may be performed. A natural language query may be received. Candidate entity linkages may be determined between an entity recognized in the natural language query and columns in data sets. The candidate linkages may be ranked according to confidence scores which may be evaluated to detect ambiguity for an entity linkage. Candidate entity linkages may be provided to a user via an interface to select an entity linkage to use as part of completing the natural language query.
-
Publication number: US20220261413A1
Publication date: 2022-08-18
Application number: US17687492
Filing date: 2022-03-04
Applicant: Amazon Technologies, Inc.
Inventor: Timothy Jones , Andrew Borthwick , Sergei Dobroshinsky , Shehzad Qureshi , Stephen Michael Ash , Pedrito Uriah Maynard-Zhang , Chethan Kommaranahalli Rudramuni , Abhishek Sharma , Juliana Saussy , Adam Lawrence Joseph Heinermann , Alaykumar Navinchandra Desai , Mehul A. Shah , Mehul Y. Shah , Anurag Windlass Gupta , Prajakta Datta Damle
Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
-
Publication number: US11269911B1
Publication date: 2022-03-08
Application number: US16199115
Filing date: 2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Timothy Jones , Andrew Borthwick , Sergei Dobroshinsky , Shehzad Qureshi , Stephen Michael Ash , Pedrito Uriah Maynard-Zhang , Chethan Kommaranahalli Rudramuni , Abhishek Sharma , Juliana Saussy , Adam Lawrence Joseph Heinermann , Alaykumar Navinchandra Desai , Mehul A. Shah , Mehul Y. Shah , Anurag Windlass Gupta , Prajakta Datta Damle
Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
-
Publication number: US11120064B2
Publication date: 2021-09-14
Application number: US16197222
Filing date: 2018-11-20
Applicant: Amazon Technologies, Inc.
Inventor: Stephen Michael Ash
IPC: G06F16/338 , G06F16/335 , G06F40/284 , G06F40/263 , G06F16/35
Abstract: A data records service is configured to receive original data records and, in parallel, generate a transliterated version of the original data record into a phonetic based language. Individual fields of data records can be transliterated by identifying a primary language, generating language specific tokens for individual text portions, and transliterating the token. The records processing service can then execute matching models on both original data records and transliterated data records to detect matching data records.
-
Publication number: US11113254B1
Publication date: 2021-09-07
Application number: US16587902
Filing date: 2019-09-30
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Borthwick , Stephen Michael Ash
IPC: G06F16/215 , G06F16/23
Abstract: Techniques for scaling record linkage via elimination of highly overlapped blocks are described. A method for scaling record linkage via elimination of highly overlapped blocks includes identifying a first plurality of blocks based at least on a plurality of records stored in a storage service of a provider network, identifying a plurality of sets of matching blocks from the first plurality of blocks, deleting the plurality of sets of matching blocks except for a first block from each set from the plurality of sets of matching blocks, and iteratively performing dynamic blocking based at least on the first block to generate subsequent pluralities of blocks until the subsequent pluralities of blocks are below a threshold size.
-
Publication number: US11086940B1
Publication date: 2021-08-10
Application number: US16588296
Filing date: 2019-09-30
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Borthwick , Stephen Michael Ash
IPC: G06F16/9035 , G06F16/901 , G06K9/62 , G06F16/906
Abstract: Techniques for scalable parallel elimination of approximately subsumed sets are described. A method for scalable parallel elimination of approximately subsumed sets includes identifying a first plurality of blocks based at least on a plurality of records stored in a storage service of a provider network, determining a plurality of subsumption relationships between blocks from the first plurality of blocks, retaining a first subset of the first plurality of blocks and demoting a second subset of the first plurality of blocks based at least on the plurality of subsumption relationships, and iteratively performing dynamic blocking based at least on the first subset of the plurality of matching blocks and the second subset of the plurality of matching blocks to generate subsequent pluralities of blocks.
-
Publication number: US12007988B2
Publication date: 2024-06-11
Application number: US18182303
Filing date: 2023-03-10
Applicant: Amazon Technologies, Inc.
Inventor: Ramesh M Nallapati , Zhiguo Wang , Bing Xiang , Patrick Ng , Yung Haw Wang , Mukul Karnik , Nanyan Li , Sharanabasappa Parashuram Revadigar , Timothy Jones , Stephen Michael Ash , Sudipta Sengupta , Gregory David Adams , Deepak Shantha Murthy , Douglas Scott Cerny , Stephanie Weeks , Hanbo Li
IPC: G06F16/2452 , G06F16/242 , G06F40/295 , G06N20/00
CPC classification number: G06F16/24522 , G06F16/2423 , G06F16/243 , G06F40/295 , G06N20/00
Abstract: Interactive assistances for executing natural language queries to data sets may be performed. A natural language query may be received. Candidate entity linkages may be determined between an entity recognized in the natural language query and columns in data sets. The candidate linkages may be ranked according to confidence scores which may be evaluated to detect ambiguity for an entity linkage. Candidate entity linkages may be provided to a user via an interface to select an entity linkage to use as part of completing the natural language query.
-
Publication number: US11941016B2
Publication date: 2024-03-26
Application number: US17687492
Filing date: 2022-03-04
Applicant: Amazon Technologies, Inc.
Inventor: Timothy Jones , Andrew Borthwick , Sergei Dobroshinsky , Shehzad Qureshi , Stephen Michael Ash , Pedrito Uriah Maynard-Zhang , Chethan Kommaranahalli Rudramuni , Abhishek Sharma , Juliana Saussy , Adam Lawrence Joseph Heinermann , Alaykumar Navinchandra Desai , Mehul A. Shah , Mehul Y. Shah , Anurag Windlass Gupta , Prajakta Datta Damle
CPC classification number: G06F16/254 , G06F9/543 , G06N20/00
Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
-
Publication number: US11500865B1
Publication date: 2022-11-15
Application number: US17219706
Filing date: 2021-03-31
Applicant: Amazon Technologies, Inc.
Inventor: Jun Wang , Zhiguo Wang , Sharanabasappa Parashuram Revadigar , Ramesh M Nallapati , Bing Xiang , Stephen Michael Ash , Timothy Jones , Sudipta Sengupta , Rishav Chakravarti , Patrick Ng , Jiarong Jiang , Hanbo Li , Donald Harold Rivers Weidner
IPC: G06F7/00 , G06F16/2452 , G06F40/295 , G06N20/00 , G06F16/242
Abstract: Multiple stage filtering may be implemented for natural language query processing pipelines. Natural language queries may be received at a natural language query processing system and processed through a query language processing pipeline. The query language processing pipeline may filter candidate linkages for a natural language query before performing further filtering of the candidate linkages in the natural language query processing pipeline as part of generating an intermediate representation used to execute the natural language query.