-
公开(公告)号:US20210174238A1
公开(公告)日:2021-06-10
申请号:US16578060
申请日:2019-09-20
Applicant: Amazon Technologies, Inc.
Inventor: Sangil SONG , Yongsik YOON , Kamal Kant GUPTA , Saileshwar KRISHNAMURTHY , Stefano STEFANI , Sudipta SENGUPTA , Jaeyun NOH
IPC: G06N20/00 , G06F16/242 , G06F16/2453 , G06N5/04
Abstract: Techniques for making machine learning inference calls for database query processing are described. In some embodiments, a method of making machine learning inference calls for database query processing may include generating a first batch of machine learning requests based at least on a query to be performed on data stored in a database service, wherein the query identifies a machine learning service, sending the first batch of machine learning requests to an input buffer of an asynchronous request handler, the asynchronous request handler to generate a second batch of machine learning requests based on the first batch of machine learning requests, and obtaining a plurality of machine learning responses from an output buffer of the asynchronous request handler, the machine learning responses generated by the machine learning service using a machine learning model in response to receiving the second batch of machine learning requests.
-
公开(公告)号:US20200311617A1
公开(公告)日:2020-10-01
申请号:US16001548
申请日:2018-06-06
Applicant: Amazon Technologies, Inc.
Inventor: Charles Drummond SWAN , Edo LIBERTY , Steven Andrew LOEPPKY , Stefano STEFANI , Alexander Johannes SMOLA , Swaminathan SIVASUBRAMANIAN , Craig WILEY , Richard Shawn BICE , Thomas Albert FAULHABER, JR. , Taylor GOODHART
Abstract: Techniques for using scoring algorithms utilizing containers for flexible machine learning inference are described. In some embodiments, a request to host a machine learning (ML) model within a service provider network on behalf of a user is received, the request identifying an endpoint to perform scoring using the ML model. An endpoint is initialized as a container running on a virtual machine based on a container image and used to score data and return a result of said scoring to a user device.
-
公开(公告)号:US20190156816A1
公开(公告)日:2019-05-23
申请号:US15922495
申请日:2018-03-15
Applicant: Amazon Technologies, Inc.
Inventor: Ashish SINGH , Deepikaa SURESH , Vasanth PHILOMIN , Rajkumar GULABANI , Vladimir ZHUKOV , Swaminathan SIVASUBRAMANIAN , Vikram Sathyanarayana ANBAZHAGAN , Praveen Kumar AKARAPU , Stefano STEFANI
Abstract: Techniques for automated speech recognition (ASR) are described. A user can upload an audio file to a storage location. The user then provides the ASR service with a reference to the audio file. An ASR engine analyzes the audio file, using an acoustic model to divide the audio data into words, and a language model to identify the words spoken in the audio file. The acoustic model can be trained using audio sentence data, enabling the transcription service to accurately transcribe lengthy audio data. The results are punctuated and normalized, and the resulting transcript is returned to the user.
-
公开(公告)号:US20190156124A1
公开(公告)日:2019-05-23
申请号:US15926745
申请日:2018-03-20
Applicant: Amazon Technologies, Inc.
Inventor: Nitin SINGHAL , Vivek BHADAURIA , Ranju DAS , Gaurav D. GHARE , Roman GOLDENBERG , Stephen GOULD , Kuang HAN , Jonathan Andrew HEDLEY , Gowtham JEYABALAN , Vasant MANOHAR , Andrea OLGIATI , Stefano STEFANI , Joseph Patrick TIGHE , Praveen Kumar Udayakumar , Renjun ZHANG
IPC: G06K9/00
CPC classification number: G06K9/00744 , G06F16/71 , G06K9/00228 , G06K9/00718 , G06K9/00765 , G06K2009/00738
Abstract: Techniques for analyzing stored video upon a request are described. For example, a method of receiving a first application programming interface (API) request to analyze a stored video, the API request to include a location of the stored video and at least one analysis action to perform on the stored video; accessing the location of the stored video to retrieve the stored video; segmenting the accessed video into chunks; processing each chunk with a chunk processor to perform the at least one analysis action, each chunk processor to utilize at least one machine learning model in performing the at least one analysis action; joining the results of the processing of each chunk to generate a final result; storing the final result; and providing the final result to a requestor in response to a second API request is described.
-
公开(公告)号:US20210173896A1
公开(公告)日:2021-06-10
申请号:US15934277
申请日:2018-03-23
Applicant: Amazon Technologies, Inc.
Inventor: Poorna Chand Srinivas PERUMALLA , Pracheer GUPTA , Stefano STEFANI
IPC: G06F17/30
Abstract: Techniques are described for a nearest neighbor search service that enables users to perform nearest neighbor searches. The nearest neighbor search service includes an interface that enables users to create collections of searchable vectors, add and update vectors to a collection, delete vectors from a collection, and perform searches for nearest neighbors to a given vector. The nearest neighbor search service enables users to add, update, and delete vectors of a collection in real-time while also enabling users to perform searches at the same time.
-
公开(公告)号:US20190164080A1
公开(公告)日:2019-05-30
申请号:US15822061
申请日:2017-11-24
Applicant: Amazon Technologies, Inc.
Inventor: Stefano STEFANI , Steven Andrew LOEPPKY , Thomas Albert FAULHABER, JR. , Craig WILEY , Edo LIBERTY
Abstract: Techniques for auto-scaling hosted machine learning models for production inference are described. A machine learning model can be deployed in a hosted environment such that the infrastructure supporting the machine learning model scales dynamically with demand so that performance is not impacted. The model can be auto-scaled using reactive techniques or predictive techniques.
-
7.
公开(公告)号:US20190156247A1
公开(公告)日:2019-05-23
申请号:US15919628
申请日:2018-03-13
Applicant: Amazon Technologies, Inc.
Inventor: Thomas Albert FAULHABER, JR. , Edo LIBERTY , Stefano STEFANI , Zohar KARNIN , Craig WILEY , Steven Andrew LOEPPKY , Swaminathan SIVASUBRAMANIAN , Alexander Johannes SMOLA , Taylor GOODHART
Abstract: Techniques for dynamic accuracy-based experimentation and deployment of machine learning (ML) models are described. Inference traffic flowing to ML models and the accuracy of the models is analyzed and used to ensure that better performing models are executed more often via model selection. A predictive component can evaluate which model is more likely to be accurate for certain input data elements. Ensemble techniques can combine inference results of multiple ML models to aim to achieve a better overall result than any individual model could on its own.
-
公开(公告)号:US20150379050A1
公开(公告)日:2015-12-31
申请号:US14845220
申请日:2015-09-03
Applicant: Amazon Technologies, Inc.
Inventor: David Craig YANACEK , Mohammad Zeeshan QURESHI , Jai VASANTH , Pejus Manoj DAS , Stefano STEFANI , Stuart Henry SEELYE MARSHALL , Wei XIAO
IPC: G06F17/30
Abstract: Methods and apparatus for configurable-capacity time-series tables are disclosed. A schedule of database table management operations, including at least an operation to change a throughput constraint associated with a table in response to a triggering event, is generated. The table is instantiated with an initial throughput constraint in accordance with the schedule. Work requests directed to the table are accepted based on the initial throughput constraint. The throughput constraint is modified in response to the triggering event. Subsequent work requests are accepted based on the modified throughput constraint.
Abstract translation: 公开了可配置容量时间序列表的方法和装置。 生成数据库表管理操作的调度表,其包括响应于触发事件而至少改变与表关联的吞吐量约束的操作。 该表根据时间表以初始吞吐量约束进行实例化。 基于初始吞吐量约束接受指向表的工作请求。 响应于触发事件修改吞吐量约束。 基于修改的吞吐量约束接受后续工作请求。
-
-
-
-
-
-
-