-
公开(公告)号:US12014155B2
公开(公告)日:2024-06-18
申请号:US17847115
申请日:2022-06-22
Applicant: Amazon Technologies, Inc.
Inventor: Praphruetpong Athiwaratkun , Yuchen Tian , Mingyue Shang , Zijian Wang , Ramesh M Nallapati , Parminder Bhatia , Andrew Oliver Arnold , Bing Xiang , Sudipta Sengupta , Yanitsa Donchev , Srinivas Iragavarapu , Matthew Lee , Vamshidhar Krishnamurthy Dantu , Atul Deo , Ankur Deepak Desai
IPC: G06F8/33
CPC classification number: G06F8/33
Abstract: Pre-fix matching may constrain the generation of next token predictions. Input text to perform a next token prediction may be received. Multiple tokens may be determined from the input text, including a partial token. From possible tokens, one or more matching possible tokens with the partial token may be identified. Next token predictions may then be filtered using the identified possible tokens in order to ensure that the partial token is matched.
-
公开(公告)号:US20230418566A1
公开(公告)日:2023-12-28
申请号:US17847113
申请日:2022-06-22
Applicant: Amazon Technologies, Inc.
Inventor: Praphruetpong Athiwaratkun , Zixuan Lin , Ramana Keerthi , Zijian Wang , Yuchen Tian , Hantian Ding , Sri Ranga Akhilesh Bontala , Matthew Lee , Yanitsa Donchev , Ramesh M Nallapati , Parminder Bhatia , Andrew Oliver Arnold , Bing Xiang , Sudipta Sengupta , Rama Krishna Sandeep Pokkunuri , Srinivas Iragavarapu , Atul Deo , Ankur Deepak Desai
CPC classification number: G06F8/33 , G06F8/447 , G06F11/3608
Abstract: Evaluation data sets may be programmatically generated for code generation models. An evaluation data set is obtained that includes items that correspond to different evaluation tests for a code generation system. The individual items of the evaluation data set maybe converted, including the conversion of a function signature for the items, the test statements for the items and using a code generation system to generate the body of the function.
-
公开(公告)号:US20230418565A1
公开(公告)日:2023-12-28
申请号:US17847112
申请日:2022-06-22
Applicant: Amazon Technologies, Inc.
Inventor: Sathish Arumugam Selvaraj , Qiang Yu , Venkat Rakshith Reddy Swamireddy , Matthew Lee , Lei Gao , Wei Fang , Rama Krishna Sandeep Pokkunuri , Ramesh M Nallapati , Srinivas Iragavarapu , Alexander Johannes Smola , Sudipta Sengupta , Wasi Uddin Ahmad , Parminder Bhatia , Atul Deo , Ankur Deepak Desai , Bing Xiang , Andrew Oliver Arnold
IPC: G06F8/33 , G06F16/332
CPC classification number: G06F8/33 , G06F16/3322
Abstract: Code completion suggestions may be proactively obtained and validated. An event that triggers obtaining a code completion suggestion for inclusion in a code file being edited using an integrated development environment may be detected. The code completion suggestion may be obtained. The characters of the code completion suggestion may be compared with characters added to the code file after the detection of the event that triggered obtaining the code completion suggestion to determine whether the code completion suggestion is valid. A valid code completion suggestion may then be displayed.
-
14.
公开(公告)号:US11797535B1
公开(公告)日:2023-10-24
申请号:US17105092
申请日:2020-11-25
Applicant: Amazon Technologies, Inc.
Inventor: Stefano Stefani , Sudipta Sengupta , Julio Delgado Mangas , James Laurence Finnerty , Ronak Bharat Shah , Sumeetkumar V. Maru
IPC: G06F16/00 , G06F16/2453 , G06N20/00 , G06F16/2455 , G06F16/248
CPC classification number: G06F16/24542 , G06F16/248 , G06F16/24552 , G06F16/24553 , G06N20/00
Abstract: Techniques for batch mode execution for calls to remote services are described. A method of batch mode execution for calls to remote services may include generating, by a query service of a provider network, a query plan to optimize a query for batch processing of data, the query plan including at least a function reference to a function provided by at least one service of the provider network, executing the query plan to invoke the function associated with the function reference, wherein a batch function generates a request including a batch of service calls to be processed by the at least one service, sends the request including the batch of service calls to the at least one service, and obtains a plurality of machine learning responses from the at least one service, and generating a query response based on the plurality of responses.
-
公开(公告)号:US11475067B2
公开(公告)日:2022-10-18
申请号:US16698080
申请日:2019-11-27
Applicant: Amazon Technologies, Inc.
Inventor: Cicero Nogueira Dos Santos , Xiaofei Ma , Peng Xu , Ramesh M. Nallapati , Bing Xiang , Sudipta Sengupta , Zhiguo Wang , Patrick Ng
IPC: G06F40/30 , G06F16/9032 , G06K9/62 , G06F16/9038 , G06N20/00 , G06F16/903 , G06F16/93 , G06F40/20
Abstract: Techniques for generation of synthetic queries from customer data for training of document querying machine learning (ML) models as a service are described. A service may receive one or more documents from a user, generate a set of question and answer pairs from the one or more documents from the user using a machine learning model trained to predict a question from an answer, and store the set of question and answer pairs generated from the one or more documents from the user. The question and answer pairs may be used to train another machine learning model, for example, a document ranking model, a passage ranking model, a question/answer model, or a frequently asked question (FAQ) model.
-
公开(公告)号:US11467835B1
公开(公告)日:2022-10-11
申请号:US16199129
申请日:2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Sudipta Sengupta , Poorna Chand Srinivas Perumalla , Jalaja Kurubarahalli , Samuel Oshin , Cory Pruce , Jun Wu , Eftiquar Shaikh , Pragya Agarwal , David Thomas , Karan Kothari , Daniel Evans , Umang Wadhwa , Mark Klunder , Rahul Sharma , Zdravko Pantic , Dominic Rajeev Divakaruni , Andrea Olgiati , Leo Dirac , Nafea Bshara , Bratin Saha , Matthew Wood , Swaminathan Sivasubramanian , Rajankumar Singh
Abstract: Techniques for partitioning data flow operations between execution on a compute instance and an attached accelerator instance are described. A set of operations supported by the accelerator is obtained. A set of operations associated with the data flow is obtained. An operation in the set of operations associated with the data flow is identified based on the set of operations supported by the accelerator. The accelerator executes the first operation.
-
公开(公告)号:US11449796B2
公开(公告)日:2022-09-20
申请号:US16578060
申请日:2019-09-20
Applicant: Amazon Technologies, Inc.
Inventor: Sangil Song , Yongsik Yoon , Kamal Kant Gupta , Saileshwar Krishnamurthy , Stefano Stefani , Sudipta Sengupta , Jaeyun Noh
IPC: G06F7/00 , G06N20/00 , G06F16/242 , G06F16/2453 , G06N5/04
Abstract: Techniques for making machine learning inference calls for database query processing are described. In some embodiments, a method of making machine learning inference calls for database query processing may include generating a first batch of machine learning requests based at least on a query to be performed on data stored in a database service, wherein the query identifies a machine learning service, sending the first batch of machine learning requests to an input buffer of an asynchronous request handler, the asynchronous request handler to generate a second batch of machine learning requests based on the first batch of machine learning requests, and obtaining a plurality of machine learning responses from an output buffer of the asynchronous request handler, the machine learning responses generated by the machine learning service using a machine learning model in response to receiving the second batch of machine learning requests.
-
公开(公告)号:US10853129B1
公开(公告)日:2020-12-01
申请号:US16358355
申请日:2019-03-19
Applicant: Amazon Technologies, Inc.
Inventor: Sudipta Sengupta , Haifeng He , Pejus Manoj Das , Poorna Chand Srinivas Perumalla , Wei Xiao , Shirley Xue Yi Leung , Vladimir Mitrovic , Yongcong Luo , Jiacheng Guo , Stefano Stefani , Matthew Shawn Wilson
Abstract: Implementations detailed herein include description of a computer-implemented method to migrate a machine learning model from one accelerator portion (such as a portion of a graphical processor unit (GPU)) to a different accelerator portion. In some instances, a state of the first accelerator portion is persisted, the second accelerator portion is configured, the first accelerator portion is then detached from a client application instance, and at least a portion of an inference request is performed using the loaded at least a portion of the machine learning model on the second accelerator portion that had been configured.
-
公开(公告)号:US12106222B2
公开(公告)日:2024-10-01
申请号:US18112036
申请日:2023-02-21
Applicant: Amazon Technologies, Inc.
Inventor: Sudipta Sengupta , Randy Renfu Huang , Ron Diamant , Vignesh Vivekraja
Abstract: Methods and systems for training a neural network are provided. In one example, an apparatus comprises a memory that stores instructions; and a hardware processor configured to execute the instructions to: control a neural network processor to perform a loss gradient operation to generate data gradients; after the loss gradient operation completes, control the neural network processor to perform a forward propagation operation to generate intermediate outputs; control the neural network processor to perform a backward propagation operation based on the data gradients and the intermediate outputs to generate weight gradients; receive the weight gradients from the neural network processor; and update weights of a neural network based on the weight gradients.
-
公开(公告)号:US12007988B2
公开(公告)日:2024-06-11
申请号:US18182303
申请日:2023-03-10
Applicant: Amazon Technologies, Inc.
Inventor: Ramesh M Nallapati , Zhiguo Wang , Bing Xiang , Patrick Ng , Yung Haw Wang , Mukul Karnik , Nanyan Li , Sharanabasappa Parashuram Revadigar , Timothy Jones , Stephen Michael Ash , Sudipta Sengupta , Gregory David Adams , Deepak Shantha Murthy , Douglas Scott Cerny , Stephanie Weeks , Hanbo Li
IPC: G06F16/2452 , G06F16/242 , G06F40/295 , G06N20/00
CPC classification number: G06F16/24522 , G06F16/2423 , G06F16/243 , G06F40/295 , G06N20/00
Abstract: Interactive assistances for executing natural language queries to data sets may be performed. A natural language query may be received. Candidate entity linkages may be determined between an entity recognized in the natural language query and columns in data sets. The candidate linkages may be ranked according to confidence scores which may be evaluated to detect ambiguity for an entity linkage. Candidate entity linkages may be provided to a user via an interface to select an entity linkage to use as part of completing the natural language query.
-
-
-
-
-
-
-
-
-