Patent search ap:("Amazon Technologies Page Inc") AND inv:"Sudipta Sengupta"

11.

发明授权
Constrained prefix matching for generating next token predictions 有权

公开(公告)号：US12014155B2

公开(公告)日：2024-06-18

申请号：US17847115

申请日：2022-06-22

Applicant: Amazon Technologies, Inc.

Inventor： Praphruetpong Athiwaratkun , Yuchen Tian , Mingyue Shang , Zijian Wang , Ramesh M Nallapati , Parminder Bhatia , Andrew Oliver Arnold , Bing Xiang , Sudipta Sengupta , Yanitsa Donchev , Srinivas Iragavarapu , Matthew Lee , Vamshidhar Krishnamurthy Dantu , Atul Deo , Ankur Deepak Desai

IPC: G06F8/33

CPC classification number: G06F8/33

Abstract: Pre-fix matching may constrain the generation of next token predictions. Input text to perform a next token prediction may be received. Multiple tokens may be determined from the input text, including a partial token. From possible tokens, one or more matching possible tokens with the partial token may be identified. Next token predictions may then be filtered using the identified possible tokens in order to ensure that the partial token is matched.

12.

发明公开
PROGRAMMATICALLY GENERATING EVALUATION DATA SETS FOR CODE GENERATION MODELS 审中-公开

公开(公告)号：US20230418566A1

公开(公告)日：2023-12-28

申请号：US17847113

申请日：2022-06-22

Applicant: Amazon Technologies, Inc.

Inventor： Praphruetpong Athiwaratkun , Zixuan Lin , Ramana Keerthi , Zijian Wang , Yuchen Tian , Hantian Ding , Sri Ranga Akhilesh Bontala , Matthew Lee , Yanitsa Donchev , Ramesh M Nallapati , Parminder Bhatia , Andrew Oliver Arnold , Bing Xiang , Sudipta Sengupta , Rama Krishna Sandeep Pokkunuri , Srinivas Iragavarapu , Atul Deo , Ankur Deepak Desai

IPC: G06F8/33 , G06F11/36 , G06F8/41

CPC classification number: G06F8/33 , G06F8/447 , G06F11/3608

Abstract: Evaluation data sets may be programmatically generated for code generation models. An evaluation data set is obtained that includes items that correspond to different evaluation tests for a code generation system. The individual items of the evaluation data set maybe converted, including the conversion of a function signature for the items, the test statements for the items and using a code generation system to generate the body of the function.

13.

发明公开
VALIDATING AND PROVIDING PROACTIVELY GENERATED CODE SUGGESTIONS 审中-公开

公开(公告)号：US20230418565A1

公开(公告)日：2023-12-28

申请号：US17847112

申请日：2022-06-22

Applicant: Amazon Technologies, Inc.

Inventor： Sathish Arumugam Selvaraj , Qiang Yu , Venkat Rakshith Reddy Swamireddy , Matthew Lee , Lei Gao , Wei Fang , Rama Krishna Sandeep Pokkunuri , Ramesh M Nallapati , Srinivas Iragavarapu , Alexander Johannes Smola , Sudipta Sengupta , Wasi Uddin Ahmad , Parminder Bhatia , Atul Deo , Ankur Deepak Desai , Bing Xiang , Andrew Oliver Arnold

IPC: G06F8/33 , G06F16/332

CPC classification number: G06F8/33 , G06F16/3322

Abstract: Code completion suggestions may be proactively obtained and validated. An event that triggers obtaining a code completion suggestion for inclusion in a code file being edited using an integrated development environment may be detected. The code completion suggestion may be obtained. The characters of the code completion suggestion may be compared with characters added to the code file after the detection of the event that triggered obtaining the code completion suggestion to determine whether the code completion suggestion is valid. A valid code completion suggestion may then be displayed.

14.

发明授权
Use of batch mode function execution in database engines to enable efficient calls to remote services 有权

公开(公告)号：US11797535B1

公开(公告)日：2023-10-24

申请号：US17105092

申请日：2020-11-25

Applicant: Amazon Technologies, Inc.

Inventor： Stefano Stefani , Sudipta Sengupta , Julio Delgado Mangas , James Laurence Finnerty , Ronak Bharat Shah , Sumeetkumar V. Maru

IPC: G06F16/00 , G06F16/2453 , G06N20/00 , G06F16/2455 , G06F16/248

CPC classification number: G06F16/24542 , G06F16/248 , G06F16/24552 , G06F16/24553 , G06N20/00

Abstract: Techniques for batch mode execution for calls to remote services are described. A method of batch mode execution for calls to remote services may include generating, by a query service of a provider network, a query plan to optimize a query for batch processing of data, the query plan including at least a function reference to a function provided by at least one service of the provider network, executing the query plan to invoke the function associated with the function reference, wherein a batch function generates a request including a batch of service calls to be processed by the at least one service, sends the request including the batch of service calls to the at least one service, and obtains a plurality of machine learning responses from the at least one service, and generating a query response based on the plurality of responses.

15.

发明授权
Systems, apparatuses, and methods to generate synthetic queries from customer data for training of document querying machine learning models 有权

公开(公告)号：US11475067B2

公开(公告)日：2022-10-18

申请号：US16698080

申请日：2019-11-27

Applicant: Amazon Technologies, Inc.

Inventor： Cicero Nogueira Dos Santos , Xiaofei Ma , Peng Xu , Ramesh M. Nallapati , Bing Xiang , Sudipta Sengupta , Zhiguo Wang , Patrick Ng

IPC: G06F40/30 , G06F16/9032 , G06K9/62 , G06F16/9038 , G06N20/00 , G06F16/903 , G06F16/93 , G06F40/20

Abstract: Techniques for generation of synthetic queries from customer data for training of document querying machine learning (ML) models as a service are described. A service may receive one or more documents from a user, generate a set of question and answer pairs from the one or more documents from the user using a machine learning model trained to predict a question from an answer, and store the set of question and answer pairs generated from the one or more documents from the user. The question and answer pairs may be used to train another machine learning model, for example, a document ranking model, a passage ranking model, a question/answer model, or a frequently asked question (FAQ) model.

16.

发明授权
Framework integration for instance-attachable accelerator 有权

公开(公告)号：US11467835B1

公开(公告)日：2022-10-11

申请号：US16199129

申请日：2018-11-23

Applicant: Amazon Technologies, Inc.

Inventor： Sudipta Sengupta , Poorna Chand Srinivas Perumalla , Jalaja Kurubarahalli , Samuel Oshin , Cory Pruce , Jun Wu , Eftiquar Shaikh , Pragya Agarwal , David Thomas , Karan Kothari , Daniel Evans , Umang Wadhwa , Mark Klunder , Rahul Sharma , Zdravko Pantic , Dominic Rajeev Divakaruni , Andrea Olgiati , Leo Dirac , Nafea Bshara , Bratin Saha , Matthew Wood , Swaminathan Sivasubramanian , Rajankumar Singh

IPC: G06F3/00 , G06F9/30 , G06F9/38

Abstract: Techniques for partitioning data flow operations between execution on a compute instance and an attached accelerator instance are described. A set of operations supported by the accelerator is obtained. A set of operations associated with the data flow is obtained. An operation in the set of operations associated with the data flow is identified based on the set of operations supported by the accelerator. The accelerator executes the first operation.

17.

发明授权
Machine learning inference calls for database query processing 有权

公开(公告)号：US11449796B2

公开(公告)日：2022-09-20

申请号：US16578060

申请日：2019-09-20

Applicant: Amazon Technologies, Inc.

Inventor： Sangil Song , Yongsik Yoon , Kamal Kant Gupta , Saileshwar Krishnamurthy , Stefano Stefani , Sudipta Sengupta , Jaeyun Noh

IPC: G06F7/00 , G06N20/00 , G06F16/242 , G06F16/2453 , G06N5/04

Abstract: Techniques for making machine learning inference calls for database query processing are described. In some embodiments, a method of making machine learning inference calls for database query processing may include generating a first batch of machine learning requests based at least on a query to be performed on data stored in a database service, wherein the query identifies a machine learning service, sending the first batch of machine learning requests to an input buffer of an asynchronous request handler, the asynchronous request handler to generate a second batch of machine learning requests based on the first batch of machine learning requests, and obtaining a plurality of machine learning responses from an output buffer of the asynchronous request handler, the machine learning responses generated by the machine learning service using a machine learning model in response to receiving the second batch of machine learning requests.

18.

发明授权
Accelerator based inference service 有权

公开(公告)号：US10853129B1

公开(公告)日：2020-12-01

申请号：US16358355

申请日：2019-03-19

Applicant: Amazon Technologies, Inc.

Inventor： Sudipta Sengupta , Haifeng He , Pejus Manoj Das , Poorna Chand Srinivas Perumalla , Wei Xiao , Shirley Xue Yi Leung , Vladimir Mitrovic , Yongcong Luo , Jiacheng Guo , Stefano Stefani , Matthew Shawn Wilson

IPC: G06F9/46 , G06F9/48 , G06N20/00 , G06N5/04 , G06F9/50 , G06N3/08 , G06F9/455

Abstract: Implementations detailed herein include description of a computer-implemented method to migrate a machine learning model from one accelerator portion (such as a portion of a graphical processor unit (GPU)) to a different accelerator portion. In some instances, a state of the first accelerator portion is persisted, the second accelerator portion is configured, the first accelerator portion is then detached from a client application instance, and at least a portion of an inference request is performed using the loaded at least a portion of the machine learning model on the second accelerator portion that had been configured.

19.

发明授权
Neural network training under memory restraint 有权

公开(公告)号：US12106222B2

公开(公告)日：2024-10-01

申请号：US18112036

申请日：2023-02-21

Applicant: Amazon Technologies, Inc.

Inventor： Sudipta Sengupta , Randy Renfu Huang , Ron Diamant , Vignesh Vivekraja

IPC: G06N3/084 , G06N3/04

CPC classification number: G06N3/084 , G06N3/04

Abstract: Methods and systems for training a neural network are provided. In one example, an apparatus comprises a memory that stores instructions; and a hardware processor configured to execute the instructions to: control a neural network processor to perform a loss gradient operation to generate data gradients; after the loss gradient operation completes, control the neural network processor to perform a forward propagation operation to generate intermediate outputs; control the neural network processor to perform a backward propagation operation based on the data gradients and the intermediate outputs to generate weight gradients; receive the weight gradients from the neural network processor; and update weights of a neural network based on the weight gradients.

20.

发明授权
Interactive assistance for executing natural language queries to data sets 有权

公开(公告)号：US12007988B2

公开(公告)日：2024-06-11

申请号：US18182303

申请日：2023-03-10

Applicant: Amazon Technologies, Inc.

Inventor： Ramesh M Nallapati , Zhiguo Wang , Bing Xiang , Patrick Ng , Yung Haw Wang , Mukul Karnik , Nanyan Li , Sharanabasappa Parashuram Revadigar , Timothy Jones , Stephen Michael Ash , Sudipta Sengupta , Gregory David Adams , Deepak Shantha Murthy , Douglas Scott Cerny , Stephanie Weeks , Hanbo Li

IPC: G06F16/2452 , G06F16/242 , G06F40/295 , G06N20/00

CPC classification number: G06F16/24522 , G06F16/2423 , G06F16/243 , G06F40/295 , G06N20/00

Abstract: Interactive assistances for executing natural language queries to data sets may be performed. A natural language query may be received. Candidate entity linkages may be determined between an entity recognized in the natural language query and columns in data sets. The candidate linkages may be ranked according to confidence scores which may be evaluated to detect ambiguity for an entity linkage. Candidate entity linkages may be provided to a user via an interface to select an entity linkage to use as part of completing the natural language query.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification