Abstract:
An embodiment provides a computer-implemented system and method for providing visual suggestions for cluster classification. One or more clusters comprising uncoded documents from a set are obtained. A different set of reference documents that are each classified with a code is designated. A cluster center in one of the clusters is identified. The cluster center is compared to one or more of the reference documents. Those of the reference documents that are similar to the cluster are identified based on the comparison. The classification codes of each of the similar reference documents are visually represented as a suggestion for assigning one of the classification codes to the cluster.
Abstract:
A system and method for providing inclusion-based electronically stored information item classification suggestions with the aid of a digital computer are provided. A set of reference electronically stored information items is designated. A set of uncoded electronically stored information items is obtained. One or more of the coded reference electronically stored information items are combined with the set. A score vector is generated for each of the combined uncoded and reference electronically stored information items. The combined uncoded electronically stored information items are grouped into a plurality of clusters. The score vectors are used to generate a suggestion for assigning one of the classification codes. The visual representations of each of the classification codes of the reference electronically stored information items are displayed as a further suggestion for assigning one of the classification codes. An assignment of one of the classification codes is received.
Abstract:
A computer-implemented system and method for providing classification suggestions via concept injection is provided. Clusters of uncoded concepts are accessed. Each uncoded concept represents one or more documents including that concept. One or more uncoded concepts in one of the clusters is compared to a set of reference concepts. Each reference concept is associated with a classification code. One or more of the reference concepts most similar to the one or more uncoded concepts is placed into the cluster. A neighborhood is generated for one of the uncoded concepts in the cluster. The neighborhood includes at least one of the reference concepts placed in the cluster. One of the classification codes representative of the neighborhood of reference concepts is determined and provided as a suggestion for classification of the uncoded concept.
Abstract:
An embodiment provides a computer-implemented system and method for providing visual suggestions for cluster classification. One or more clusters comprising uncoded documents from a set are obtained. A different set of reference documents that are each classified with a code is designated. A cluster center in one of the clusters is identified. The cluster center is compared to one or more of the reference documents. Those of the reference documents that are similar to the cluster are identified based on the comparison. The classification codes of each of the similar reference documents are visually represented as a suggestion for assigning one of the classification codes to the cluster.
Abstract:
A computer-implemented system and method for inclusion-based electronically stored information item cluster visual representation is provided. A set of reference electronically stored information items is maintained. A subset of the electronically stored information items is selected from the set, each associated with a classification code, each of the classification codes associated with a visual representation different from the visual representations of the remaining classification codes. The subset is combined with a set of uncoded electronically stored information items, each associated with a visual representation different from the visual representations of the classification codes. The combined electronically stored information items are grouped into clusters. Each of the clusters is visually represented, including displaying the visual representation associated with the code of each of the reference electronically stored information items in that cluster and the visual representation associated with each of the uncoded electronically stored information item in that cluster.
Abstract:
A computer-implemented system and method for providing classification suggestions is provided. A set of uncoded documents is maintained. One of the uncoded documents is selected and compared with a set of reference documents, each associated with a classification. Those reference documents that are similar to the uncoded document are identified. Relationships between the uncoded document and each reference document are identified by counting a number of similar reference documents associated with each different classification. The classification having a highest count of similar reference documents is selected for the selected uncoded document as a suggestion.
Abstract:
A computer-implemented system and method for visually suggesting classification for inclusion-based document cluster spines are provided. A set of reference documents each associated with a classification code is designated. A different set of uncoded documents is obtained. One or more of the coded reference documents are combined with a plurality of uncoded documents into a combined document set. The documents in the combined document set are grouped into clusters. The clusters are organized along one or more spines, each spine including a vector. A visual suggestion for assigning one of the classification codes to one of the spines is provided, including visually representing each of the reference concepts in the clusters along that spine.
Abstract:
A computer-implemented system and method for inclusion-based electronically stored information item cluster visual representation is provided. A set of reference electronically stored information items is maintained. A subset of the electronically stored information items is selected from the set, each associated with a classification code, each of the classification codes associated with a visual representation different from the visual representations of the remaining classification codes. The subset is combined with a set of uncoded electronically stored information items, each associated with a visual representation different from the visual representations of the classification codes. The combined electronically stored information items are grouped into clusters. Each of the clusters is visually represented, including displaying the visual representation associated with the code of each of the reference electronically stored information items in that cluster and the visual representation associated with each of the uncoded electronically stored information item in that cluster.
Abstract:
A computer-implemented system and method for visually suggesting classification for inclusion-based document cluster spines are provided. A set of reference documents each associated with a classification code is designated. A different set of un-coded documents is obtained. One or more of the coded reference documents are combined with a plurality of un-coded documents into a combined document set. The documents in the combined document set are grouped into clusters. The clusters are organized along one or more spines, each spine including a vector. A visual suggestion for assigning one of the classification codes to one of the spines is provided, including visually representing each of the reference concepts in the clusters along that spine.
Abstract:
A computer-implemented system and method for providing visual classification suggestions for inclusion-based concept clusters are provided. Reference concepts each associated with a classification code are designated. One or more of the reference concepts are grouped with a plurality of uncoded concepts into a grouped concept set. Clusters are generated, each including a portion of the uncoded concepts and the reference concepts of the grouped concept set. A visual suggestion for assigning one of the classification codes to one of the clusters including visually representing each of the reference concepts in that cluster is provided.