Abstract:
A computer implemented electronic out-of-office message analysis system and method are disclosed. The method includes, for each of a plurality of users, receiving a user-generated electronic out-of-office message in a natural language in which a time window of absence and at least one alternate named contact are expressed and, based on the out-of-office message, generating a structured representation of the out-of-office message which links the alternate contact to a normalized representation of the time window. The structured representation of the out-of-office message is stored in a database. From the database it can be determined whether a current user's out-of-office message conflicts with another user's out-of-office message. If a conflict is detected, the current user can be notified.
Abstract:
A method for identifying header/footer content of a document, in order to sequence text fragments comprising recognizable text blocks as derived from the document. The textual variability of lines comprised of text blocks, including the different kinds of text blocks within the line is analyzed for assessment of textual variability. Header/footer zones are defined by textual content having a low textual variability. An alternative embodiment identifies pagination constructs by comparing selected text-boxes for similarity and proximity and clustering the text boxes satisfying a predetermined similarity value, wherein the clustered text boxes are deemed to comprise pagination constructs.
Abstract:
A computer-implemented method, device, and computer readable medium transform a markup language document from a digital form to a user-specified form on a display device. Based on a configuration file, a digital markup language document is processed. For a current navigated-to page in the markup language document, context is set to a page node, and a page transformation is performed by the computer. A selection language expression is evaluated, and a node transformation is performed. The node transformation may include setting context, determining the type of decoration associated with the current context, reading the selection language expressions, computing a decoration parameter value for each of the decoration parameters associated with each declaration, and creating and displaying a decoration based on the computed decoration parameter values. The steps may be repeated for remaining markup language node and for each remaining decoration declaration.
Abstract:
A method for detection of page numbers in a document includes identifying a plurality of text fragments associated with a plurality of pages of a document. From the identified text fragments, at least one sequence is identified. Each identified sequence includes a plurality of terms. Each term of the sequence is derived from a text fragment selected from the plurality text fragments. The terms of an identified sequence comply with at least one predefined numbering scheme which defines a form and an incremental state of the terms in a sequence. A subset of the identified sequences which cover at least some of the pages of the document is computed. Terms of at least some of the subset of the identified sequences are construed as page numbers of pages of the document. Additional page numbers may be identified by considering one or more features of the terms in the subset of identified sequences.
Abstract:
The system comprises an improved document monitoring agent in which user evaluations are used to decide whether a changed document should be saved in the system or not. The evaluation of importance of the change in a document is performed by one or more users who collaboratively monitor a networked document, typically identified by a URL. By providing a user evaluation interface, it is possible for users to indicate their evaluation of the significance of the change. As such, only significantly changed documents, as indicated by the users themselves, are saved. Thus, a more efficient saving is obtained while at the same time reducing the risk of discarding potentially interesting changed documents which would have been discarded by conventional monitoring agents.
Abstract:
An interactive document processing system includes a display screen that is coupled to a processing device. In addition, the document processing system includes an input device for viewing all or a portion of a hardcopy document on a work surface. The input device generates signals representing the appearance of the hardcopy document from representative electronic data stored in a print memory. In one embodiment, the processing device is connected to the input device to receive the signals, determine the position of a moving indicator relative to the document, and correlate the determined position with the electronic representation of the hardcopy document. The display screen provides information related to user commands that edit and/or select portions of the electronic representation of the hardcopy document.
Abstract:
A method and apparatus for vacuum arc deposition of carbon on a substrate inhibits or eliminates emission of contaminating carbon particles in the ion plasma by maintaining an elevated local plasma pressure at the cathode or target surface, thereby minimizing the role of heat conduction in the creation of the particles and strongly increasing the electron emission cooling effects.
Abstract:
A method for manufacturing a bioresorbable device, said method comprising: providing an anodic material; providing a cathodic material, said anodic and cathodic materials forming a galvanic couple; vapor depositing simultaneously said anodic and cathodic materials on a substrate to obtain a bioresorbable material; and processing said bioresorbable material to form said bioresorbable device. Said vapor deposition of said anodic and cathodic materials is performed under conditions such that bioresorption of said device is promoted by galvanic corrosion between said anodic and cathodic materials.
Abstract:
An initial organizational table for a document is determined based on textual similarity between entries of the organizational table and target text fragments and not taking into account text formatting. A classifier is trained to identify text fragment pairs consisting of entries of the organizational table and corresponding target text fragments based at least in part on text formatting features. The training employs a training set of examples annotated based on the initial organizational table. The initial organizational table is updated using the trained classifier.
Abstract:
In a method for identifying a table of contents in a document, an ordered sequence of text fragments is derived from the document. A table of contents is selected as a contiguous sub-sequence of the ordered sequence of text fragments satisfying the criteria: (i) entries defined by text fragments of the table of contents each have a link to a target text fragment having textual similarity with the entry; (ii) no target text fragment lies within the table of contents; and (iii) the target text fragments have an ascending ordering corresponding to an ascending ordering of the entries defining the target text fragments.