Dataset adaptation for high-performance in specific natural language processing tasks
摘要:
Systems, methods, and computer program products to perform an operation comprising identifying a first available dataset having a degree of similarity to a received input dataset that exceeds a similarity threshold, determining, based on a plurality of features of the first available dataset and a plurality of features of the input dataset, a set of recommendations for transforming the input dataset, and transforming a text of the input dataset based on the set of recommendations and to optimize the input dataset for processing by a natural language processing (NLP) algorithm.
信息查询
0/0