-
公开(公告)号:US20210157857A1
公开(公告)日:2021-05-27
申请号:US16698080
申请日:2019-11-27
Applicant: Amazon Technologies, Inc.
Inventor: Cicero NOGUEIRA DOS SANTOS , Xiaofei MA , Peng XU , Ramesh M. NALLAPATI , Bing XIANG , Sudipta SENGUPTA , Zhiguo WANG , Patrick NG
IPC: G06F16/9032 , G06N20/00 , G06F16/9038 , G06K9/62
Abstract: Techniques for generation of synthetic queries from customer data for training of document querying machine learning (ML) models as a service are described. A service may receive one or more documents from a user, generate a set of question and answer pairs from the one or more documents from the user using a machine learning model trained to predict a question from an answer, and store the set of question and answer pairs generated from the one or more documents from the user. The question and answer pairs may be used to train another machine learning model, for example, a document ranking model, a passage ranking model, a question/answer model, or a frequently asked question (FAQ) model.