- 专利标题: CROSS-CONTEXT NATURAL LANGUAGE MODEL GENERATION
-
申请号: US17210311申请日: 2021-03-23
-
公开(公告)号: US20210295822A1公开(公告)日: 2021-09-23
- 发明人: Adam Tomkins , Walter Bender , Carlos Fernández Musoles , Richard Graves , Dipanwita Das
- 申请人: Sorcero, Inc.
- 申请人地址: US DC Washington
- 专利权人: Sorcero, Inc.
- 当前专利权人: Sorcero, Inc.
- 当前专利权人地址: US DC Washington
- 主分类号: G10L15/06
- IPC分类号: G10L15/06 ; G10L15/197 ; G06F16/332 ; G10L15/16 ; G06F40/20
摘要:
Provided is a method including obtaining a corpus and an associated set of domain indicators. The method includes learning a set of vectors in an embedding space based on n-grams of the corpus. The method includes updating ontology graphs comprising a set of vertices and edges associating the set of vertices with each other. The method also includes determining a vector cluster using hierarchical clustering based on distances of the set of vectors with respect to each other in the embedding space and determining a hierarchy of the ontology graphs based on a set of domain indicators of a respective set of vertices corresponding to vectors of the vector cluster. The method also includes updating an index based on the ontology graphs.
公开/授权文献
- US11151982B2 Cross-context natural language model generation 公开/授权日:2021-10-19
信息查询