Publication No.: US20230066143A1
Publication Date: 2023-03-02
Application No.: US17464534
Filing Date: 2021-09-01
Inventors: Liviu Sebastian Matei, Filip Trojan, Marc Michiel Bron, Andrew Kenneth Hind, Yingzhao Zhou, Maria-Monica Petrica, Rajesh Ashwinbhai Shah
Abstract: A document may be received as part of a request to identify similar documents in a collection of documents. However, the received document and the documents in the collection may have different schemas or formats. To provide semantic context to the search and allow similarity scores to be generated between different document types, a configuration may be accessed that defines how to generate queries from one schema into another schema. The configuration may map queries between different fields in both schemas. Results of the multiple queries can be combined to generate a weighted combination for each document that can be used as a similarity score between different document types.
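The idea in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the application: the field names, the weights, the `CONFIG` layout, and the use of token-overlap (Jaccard) scoring as a stand-in for a real search query are all assumptions made for illustration.

```python
from typing import Dict, List

# Hypothetical configuration: each rule maps a field in the incoming
# document's schema to a field in the collection's schema, with a weight.
CONFIG: List[dict] = [
    {"source_field": "title", "target_field": "subject", "weight": 2.0},
    {"source_field": "body", "target_field": "description", "weight": 1.0},
]


def tokens(text: str) -> set:
    """Lowercase whitespace tokenization (a deliberately simple stand-in)."""
    return set(text.lower().split())


def field_score(query_text: str, target_text: str) -> float:
    """Jaccard overlap between two field values, in [0, 1]."""
    q, t = tokens(query_text), tokens(target_text)
    if not q or not t:
        return 0.0
    return len(q & t) / len(q | t)


def similarity_scores(
    document: Dict[str, str],
    collection: Dict[str, Dict[str, str]],
    config: List[dict],
) -> Dict[str, float]:
    """Run one field-to-field query per rule and combine the per-rule
    results into a weighted average for each document in the collection."""
    total_weight = sum(rule["weight"] for rule in config)
    scores: Dict[str, float] = {}
    for doc_id, candidate in collection.items():
        weighted = 0.0
        for rule in config:
            query_text = document.get(rule["source_field"], "")
            target_text = candidate.get(rule["target_field"], "")
            weighted += rule["weight"] * field_score(query_text, target_text)
        scores[doc_id] = weighted / total_weight if total_weight else 0.0
    return scores


# Usage: the incoming document uses title/body, the collection uses
# subject/description, and the configuration bridges the two schemas.
incoming = {"title": "reset password", "body": "user cannot login"}
tickets = {
    "a": {"subject": "reset password", "description": "user cannot login"},
    "b": {"subject": "printer jam", "description": "paper stuck"},
}
scores = similarity_scores(incoming, tickets, CONFIG)
```

Dividing by the total weight keeps each combined score in the range 0 to 1, so scores stay comparable when rules are added or reweighted. In a real system each rule would presumably issue a query against a search index rather than compare strings directly.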