发明授权
- 专利标题: Dataset adaptation for high-performance in specific natural language processing tasks
-
申请号: US15852167申请日: 2017-12-22
-
公开(公告)号: US10942954B2公开(公告)日: 2021-03-09
- 发明人: David Martinez Iraola , Sheng Hua Bao , Donna K. Byron , Priscilla Moraes
- 申请人: International Business Machines Corporation
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: Patterson + Sheridan, LLP
- 主分类号: G06F16/332
- IPC分类号: G06F16/332 ; G06N20/00 ; G06F16/33
摘要:
Systems, methods, and computer program products to perform an operation comprising identifying a first available dataset having a degree of similarity to a received input dataset that exceeds a similarity threshold, determining, based on a plurality of features of the first available dataset and a plurality of features of the input dataset, a set of recommendations for transforming the input dataset, and transforming a text of the input dataset based on the set of recommendations and to optimize the input dataset for processing by a natural language processing (NLP) algorithm.
公开/授权文献
信息查询