USING DOMAIN SPECIFIC VOCABULARIES TO SPELLCHECK INPUT STRINGS

    Publication No.: US20220269857A1

    Publication Date: 2022-08-25

    Application No.: US17182080

    Application Date: 2021-02-22

    Abstract: Provided are a computer program product, system, and method for using domain specific vocabularies to spellcheck input strings. A domain specific vocabulary comprising domain specific tokens is determined for a user. The input strings in the input text from the user are processed to determine whether they match the domain specific tokens in the domain specific vocabulary. Input strings that are determined to match the domain specific tokens are indicated as not to be spellchecked against a global vocabulary available to users across domains.
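
The flow described in this abstract can be sketched roughly as follows. This is a minimal illustration only, not the claimed implementation: the vocabularies, the `spellcheck` function, and the simple regex tokenizer are all hypothetical names and choices introduced here. It assumes a user maps to a single domain and that "spellchecking" means a plain membership test against the global vocabulary.

```python
import re

# Hypothetical vocabularies, invented for illustration only.
GLOBAL_VOCABULARY = {"the", "patient", "was", "given", "for", "pain"}
DOMAIN_VOCABULARIES = {
    "medical": {"acetaminophen", "tachycardia"},
}


def spellcheck(text, user_domain):
    """Return (skipped, misspelled) token lists for the input text.

    Tokens found in the user's domain specific vocabulary are marked as
    skipped and never compared with the global vocabulary. All other
    tokens are checked against the global vocabulary.
    """
    # Determine the domain specific vocabulary for this user (assumed lookup).
    domain_tokens = DOMAIN_VOCABULARIES.get(user_domain, set())
    skipped, misspelled = [], []
    # Assumed tokenizer: lowercase runs of ASCII letters.
    for token in re.findall(r"[A-Za-z]+", text.lower()):
        if token in domain_tokens:
            skipped.append(token)      # indicated as not to be spellchecked
        elif token not in GLOBAL_VOCABULARY:
            misspelled.append(token)   # flagged by the global vocabulary
    return skipped, misspelled


skipped, misspelled = spellcheck(
    "The patient was given acetaminophen for pian", "medical"
)
print(skipped)
print(misspelled)
```

With the sample vocabularies above, the domain term is exempted from the global check while the ordinary typo is still flagged. A user in a domain without that term would see it flagged as well.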

PRE-PROCESSING A TABLE IN A DOCUMENT FOR NATURAL LANGUAGE PROCESSING

    Publication No.: US20220230012A1

    Publication Date: 2022-07-21

    Application No.: US17155077

    Application Date: 2021-01-21

    Abstract: Provided are a computer program product, system, and method for pre-processing a table in a document for natural language processing. A table in a document is parsed to extract column headers, row headers, and data cells, which are processed to determine an initial set of a main element comprising an entity whose value is to be extracted, a conditional element that refines the entity, and a value element comprising a value for the entity. A user selection is received of at least one of the column headers, row headers, and data cells for at least one of the main element, conditional element, and value element in the initial set to produce a modified set of the main element, conditional element, and value element. The modified set is provided to a natural language processing engine to perform natural language processing of the document including the table, using the modified set.
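
The pre-processing steps in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the claimed method: `ElementSet`, `parse_table`, `initial_set`, and `apply_user_selection` are hypothetical names, the table is assumed to be a list of rows whose first row and first column hold the headers, and the initial assignment (row headers as main, column headers as conditional, cells as values) is an assumed heuristic. The hand-off to a natural language processing engine is not shown.

```python
from dataclasses import dataclass


@dataclass
class ElementSet:
    main: list         # entities whose values are to be extracted
    conditional: list  # elements that refine the entity
    value: list        # values for the entity


def parse_table(rows):
    """Split a table into column headers, row headers, and data cells."""
    column_headers = rows[0][1:]
    row_headers = [row[0] for row in rows[1:]]
    data_cells = [row[1:] for row in rows[1:]]
    return column_headers, row_headers, data_cells


def initial_set(column_headers, row_headers, data_cells):
    """Assumed heuristic for the initial main/conditional/value assignment."""
    return ElementSet(
        main=list(row_headers),
        conditional=list(column_headers),
        value=[cell for row in data_cells for cell in row],
    )


def apply_user_selection(initial, main=None, conditional=None, value=None):
    """Produce a modified set; unspecified elements keep their initial value."""
    return ElementSet(
        main=list(initial.main) if main is None else list(main),
        conditional=(
            list(initial.conditional) if conditional is None else list(conditional)
        ),
        value=list(initial.value) if value is None else list(value),
    )


# Hypothetical sample table: first row and first column are headers.
table = [
    ["Plan", "2020", "2021"],
    ["Basic", "10", "12"],
    ["Premium", "20", "25"],
]
cols, row_names, cells = parse_table(table)
initial = initial_set(cols, row_names, cells)
# The user narrows the conditional element to a single column header.
modified = apply_user_selection(initial, conditional=["2021"])
print(initial)
print(modified)
```

In this sketch the modified set, rather than the initial one, is what would be passed on to the downstream engine.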