- Patent title: Cross-lingual information retrieval and information extraction
- Application number: US16415988
- Filing date: 2019-05-17
- Publication number: US11531824B2
- Publication date: 2022-12-20
- Inventors: Bonan Min, Rabih Zbib, Zhongqiang Huang
- Applicant: Raytheon BBN Technologies Corp.
- Applicant address: Cambridge, MA, US
- Patentee: Raytheon BBN Technologies Corp.
- Current patentee: Raytheon BBN Technologies Corp.
- Current patentee address: Cambridge, MA, US
- Agent: Schwegman Lundberg & Woessner, P.A.
- Main classification: G06F17/00
- IPC classifications: G06F17/00; G06F40/58; G06F16/9032; G06N20/00; G06F16/2457; G06N3/08
Abstract:
A machine accesses a query in a first natural language. The machine identifies an event corresponding to the query. The machine computes, using a cross-lingual information retrieval module, a ranked list of documents in a second natural language that are related to the event. At least a portion of documents in the ranked list are selected from a collection of documents in the second natural language that are not annotated with events. The cross-lingual information retrieval module is trained using a dataset comprising annotated documents in the first natural language and translations of the annotated documents into the second natural language. Each annotated document is annotated with one or more events. The machine provides an output representing at least a portion of the ranked list of documents in the second natural language. The second natural language is different from the first natural language.
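
The flow in the abstract (query in one language, event identification, ranking of unannotated documents in another language, training on annotated documents plus their translations) can be sketched in a few lines of Python. This is only a toy illustration, not the patented method. It swaps the trained cross-lingual retrieval module for a smoothed per-event term log-odds score, and every name and data item in it (`TRAIN`, `EVENT_KEYWORDS`, `identify_event`, `train`, `score`, `rank`, the event labels, the English and Spanish sentences) is a hypothetical assumption introduced for the example.

```python
import math
from collections import Counter, defaultdict

# Hypothetical training set: (first-language text, its translation, event labels).
# Only the first-language side is annotated; labels are projected onto the translation.
TRAIN = [
    ("rebels attack the village", "los rebeldes atacan la aldea", {"Attack"}),
    ("troops attack the city", "las tropas atacan la ciudad", {"Attack"}),
    ("the president will visit the city", "el presidente visitará la ciudad", {"Meet"}),
    ("leaders meet in the village", "los líderes se reúnen en la aldea", {"Meet"}),
]

# Hypothetical keyword table standing in for the event-identification step.
EVENT_KEYWORDS = {"attack": "Attack", "meeting": "Meet", "visit": "Meet"}


def identify_event(query):
    """Return the first event whose keyword appears in the query, else None."""
    for token in query.lower().split():
        if token in EVENT_KEYWORDS:
            return EVENT_KEYWORDS[token]
    return None


def train(dataset):
    """Count second-language terms per event using the projected annotations."""
    counts = defaultdict(Counter)
    for _source, translation, events in dataset:
        for event in events:
            counts[event].update(translation.lower().split())
    return counts


def score(doc, event, counts):
    """Mean add-one-smoothed log-odds of the doc's terms for `event` vs. other events."""
    vocab = {term for c in counts.values() for term in c}
    inside = counts[event]
    outside = Counter()
    for other, c in counts.items():
        if other != event:
            outside.update(c)
    n_in = sum(inside.values()) + len(vocab)
    n_out = sum(outside.values()) + len(vocab)
    tokens = doc.lower().split()
    if not tokens:
        return 0.0
    total = sum(
        math.log((inside[t] + 1) / n_in) - math.log((outside[t] + 1) / n_out)
        for t in tokens
    )
    return total / len(tokens)


def rank(query, docs, counts):
    """Rank unannotated second-language docs by relatedness to the query's event."""
    event = identify_event(query)
    if event is None:
        return []
    return sorted(docs, key=lambda d: score(d, event, counts), reverse=True)


if __name__ == "__main__":
    model = train(TRAIN)
    collection = [
        "los soldados atacan la ciudad",
        "el presidente visitará la aldea",
        "se reúnen los líderes",
    ]
    for doc in rank("attack on the city", collection, model):
        print(doc)
```

With the toy data above, an English query mentioning "attack" ranks the Spanish document containing "atacan" first, even though none of the documents in the ranked collection carries an event annotation. A real system would replace both the keyword table and the log-odds score with learned models.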