Utilizing artificial intelligence-based machine translation to augment document summarization

    公开(公告)号:US10679015B1

    公开(公告)日:2020-06-09

    申请号:US14981536

    申请日:2015-12-28

    Abstract: Technologies are disclosed for utilizing artificial intelligence-based machine translation to augment document summarization. Text can be extracted from a document in a first language. Machine translation can be utilized to translate the text from the first language to a second language. The translated text can be used to identify documents in the second language that include support for the translated text. A user interface can be provided that indicates the number of documents in the second language that provide support for the extracted text. Documents in the first language can also be translated to the second language. Documents that provide support for a text string can be identified in the documents translated to the second language and in other documents in the second language. A user interface can be provided that indicates the number of documents in the first language and the second language that provide support for the text.

    Multi-domain machine translation system with training data clustering and dynamic domain adaptation

    公开(公告)号:US10437933B1

    公开(公告)日:2019-10-08

    申请号:US15238101

    申请日:2016-08-16

    Abstract: A machine translation system capable of clustering training data and performing dynamic domain adaptation is disclosed. An unsupervised domain clustering process is utilized to identify domains in general training data that can include in-domain training data and out-of-domain training data. Segments in the general training data are then assigned to the domains in order to create domain-specific training data. The domain-specific training data is then utilized to create domain-specific language models, domain-specific translation models, and domain-specific model weights for the domains. An input segment to be translated can be assigned to a domain at translation time. The domain-specific model weights for the assigned domain can be utilized to translate the input segment.

Patent Agency Ranking