Abstract:
A linguistic rewriting rule for use in linguistic processing of an ordered sequence of linguistic tokens includes a token pattern recognition rule that matches the ordered sequence of linguistic tokens with a syntactical pattern. The token pattern recognition rule incorporates a character pattern recognition rule to match characters contained in an ambiguous portion of the ordered sequence of linguistic tokens with a character pattern defining a corresponding portion of the syntactical pattern.
Abstract:
A system, apparatus, method, and computer program product encoding the method are provided for expectation fulfillment evaluation. The system includes a natural language processing component that extracts sets of normalized tasks from an input expectation document and an input fulfillment document. A task list comparison component compares the two sets of tasks and identifies each match between a normalized task in the first set and a normalized task in the second set, each normalized task in the first set which has no matching task in the second set, and each normalized task in the second set which has no matching task in the first set. A report generator outputs a report based on the comparison. The report may further include one or more of statistics generated from the comparison, information on an opinion generated by opinion mining a third document, and as a list of the normalized tasks and an indication of whether the tasks were fulfilled, derived from analysis of temporal expression in the two documents. The system may be implemented as software in memory by an associated computer processor.
Abstract:
A computer-implemented system and method are provided for warning a user of a missing attachment to an email. The method may include automatically recognizing a natural language of text of an email and selecting a keyword list from a plurality of keyword lists, based on the recognized natural language. Each keyword list is associated with a respective natural language and includes at least one keyword. At least one of the keyword lists includes a multi-sense keyword having a plurality of senses. A first of the plurality of senses is recognized as referring to an attachment and a second of the plurality of senses is recognized as not referring to an attachment. The text of the email is processed to identify an instance, where present, of a keyword that is in the selected keyword list and, for a keyword which is a multi-sense keyword, at least one sense-related rule is applied to a portion of the text which includes the instance of the multi-sense keyword. Based on the application of the at least one sense-related rule, where the email lacks an attachment, a notification is provided to the user.
Abstract:
A computer implemented electronic out-of-office message analysis system and method are disclosed. The method includes, for each of a plurality of users, receiving a user-generated electronic out-of-office message in a natural language in which a time window of absence and at least one alternate named contact are expressed and, based on the out-of-office message, generating a structured representation of the out-of-office message which links the alternate contact to a normalized representation of the time window. The structured representation of the out-of-office message is stored in a database. From the database it can be determined whether a current user's out-of-office message conflicts with another user's out-of-office message. If a conflict is detected, the current user can be notified.
Abstract:
A computer-implemented system and method are provided for warning a user of a missing attachment to an email. The method may include automatically recognizing a natural language of text of an email and selecting a keyword list from a plurality of keyword lists, based on the recognized natural language. Each keyword list is associated with a respective natural language and includes at least one keyword. At least one of the keyword lists includes a multi-sense keyword having a plurality of senses. A first of the plurality of senses is recognized as referring to an attachment and a second of the plurality of senses is recognized as not referring to an attachment. The text of the email is processed to identify an instance, where present, of a keyword that is in the selected keyword list and, for a keyword which is a multi-sense keyword, at least one sense-related rule is applied to a portion of the text which includes the instance of the multi-sense keyword. Based on the application of the at least one sense-related rule, where the email lacks an attachment, a notification is provided to the user.
Abstract:
A parser for parsing text includes a tokenizing module which divides the text into an ordered sequence of linguistic tokens. A morphological module associates parts of speech with the linguistic tokens. A detection module identifies candidate titles of creative works, such as works of art. A filtering module filters the candidate titles of works to exclude citations of direct speech from the candidate titles of works. A comparison module compares any remaining candidate titles of works with titles of works in an associated knowledge base. The comparison module annotates the text when a match is found.