-
公开(公告)号:US11556705B2
公开(公告)日:2023-01-17
申请号:US17083510
申请日:2020-10-29
Applicant: International Business Machines Corporation
Inventor: Alexander Brooks , Gabriel Goodhart , Sukriti Sharma , Nhan Hoang , Jalal Mahmud , Gaurav Kumbhat , Amita Misra , Zachary Branson
IPC: G06F40/20 , G06F40/289 , G06F40/253 , G06F9/50
Abstract: An input text that is also transmitted to a text processing service (e.g., a cloud based text processing service) is received. Characterizing information (e.g., contiguous parts of speech, terms used per part of speech, payload length, etc.) is extracted from the input text. A text payload is generated using the characterizing information. A performance test is run on the text payload. The performance test can include performing at least one selected from a group consisting of: sentiment analysis on the text payload, entity analysis on the text payload, content classification on the text payload, and syntax analysis on the text payload. The performance test can yield a processing time required to perform the performance test. Memory and processing power resource allocation to the text processing service can be altered based on the processing time of the performance test.
-
公开(公告)号:US20210326719A1
公开(公告)日:2021-10-21
申请号:US16850985
申请日:2020-04-16
Applicant: International Business Machines Corporation
Inventor: Jalal Mahmud , Amita Misra , Pritam Gundecha , Zhe Liu , Rama Kalyani T. Akkiraju , Xiaotong Liu , Anbang Xu
Abstract: A method, system, and a computer program product automatically select training data for updating a model by applying human-annotated training data to a model to generate results that are evaluated to identify correct case results and false case results that are categorized into error type categories for use in building error models corresponding to the error type categories, where each error model is built from at least failed case results belonging to a corresponding error type, and where unlabeled data samples are applied to each error model to compute an error likelihood for each unlabeled data sample with respect to each error type category, thereby enabling the selection and display of unlabeled data samples for annotation by a subject matter expert based on a computed error likelihood for the one or more unlabeled data samples in a specified error type category meeting or exceeding an error threshold requirement.
-
公开(公告)号:US11526802B2
公开(公告)日:2022-12-13
申请号:US16451693
申请日:2019-06-25
Applicant: International Business Machines Corporation
Inventor: Zhe Liu , Amita Misra , Pritam Gundecha , Jalal Mahmud , Yash Bhalgat
IPC: G06F16/951 , G06N20/00 , G06K9/62 , G06F17/18
Abstract: A method and a system for model training are provided. The method can include training a first classifier, a second classifier, and a third classifier with subsets of a labeled dataset. The method can also include predicting a pseudo labeled dataset from an unlabeled dataset using the first classifier, the second classifier, and the third classifier. The method further includes assigning a role to the first classifier, to the second classifier, and to the third classifier. The method can further include selecting a teaching sample dataset from the pseudo labeled dataset based on the role assigned to the third classifier, wherein the third classifier is assigned a role of a student. The method can also include retraining the third classifier with the teaching sample dataset in conjunction with a subset of the labeled dataset.
-
公开(公告)号:US11386273B2
公开(公告)日:2022-07-12
申请号:US16687098
申请日:2019-11-18
Applicant: International Business Machines Corporation
Inventor: Amita Misra , Jalal Mahmud , Saurabh Tripathy
IPC: G06F40/30 , G06N20/00 , G06N3/08 , G06F40/247 , G06N5/04 , G06N3/04 , G06F40/205 , G06F40/242 , G06F40/253
Abstract: A method, system and computer-usable medium are disclosed for sentiment detection based on applying an antonym dictionary to a natural language processing (NLP) system. A binary classifier is trained to predict negation cues, where a constituency parse tree is used to create rules for scope detection. The trained binary classifier, a list of conversational negation terms, and a list of antonyms are used to annotate content that considers negation cues and scope for the created rules.
-
公开(公告)号:US11443209B2
公开(公告)日:2022-09-13
申请号:US16850985
申请日:2020-04-16
Applicant: International Business Machines Corporation
Inventor: Jalal Mahmud , Amita Misra , Pritam Gundecha , Zhe Liu , Rama Kalyani T. Akkiraju , Xiaotong Liu , Anbang Xu
Abstract: A method, system, and a computer program product automatically select training data for updating a model by applying human-annotated training data to a model to generate results that are evaluated to identify correct case results and false case results that are categorized into error type categories for use in building error models corresponding to the error type categories, where each error model is built from at least failed case results belonging to a corresponding error type, and where unlabeled data samples are applied to each error model to compute an error likelihood for each unlabeled data sample with respect to each error type category, thereby enabling the selection and display of unlabeled data samples for annotation by a subject matter expert based on a computed error likelihood for the one or more unlabeled data samples in a specified error type category meeting or exceeding an error threshold requirement.
-
公开(公告)号:US20220138419A1
公开(公告)日:2022-05-05
申请号:US17083510
申请日:2020-10-29
Applicant: International Business Machines Corporation
Inventor: Alexander Brooks , Gabriel Goodhart , SUKRITI SHARMA , Nhan Hoang , Jalal Mahmud , Gaurav Kumbhat , Amita Misra , Zachary Branson
IPC: G06F40/253 , G06F9/50 , G06F40/289
Abstract: An input text that is also transmitted to a text processing service is received. Characterizing information is extracted from the input text. A text payload is generated using the characterizing information. A performance test is run on the text payload.
-
公开(公告)号:US20210149995A1
公开(公告)日:2021-05-20
申请号:US16687098
申请日:2019-11-18
Applicant: International Business Machines Corporation
Inventor: Amita Misra , Jalal Mahmud , Saurabh Tripathy
Abstract: A method, system and computer-usable medium are disclosed for sentiment detection based on applying an antonym dictionary to a natural language processing (NLP) system. A binary classifier is trained to predict negation cues, where a constituency parse tree is used to create rules for scope detection. The trained binary classifier, a list of conversational negation terms, and a list of antonyms are used to annotate content that considers negation cues and scope for the created rules.
-
-
-
-
-
-