摘要:
In a method for identifying an unknown product a library of absorbance spectra of known products is measured and stored in a library. A quick search using clustering techniques is conducted to narrow the search to a few products, followed by an exhaustive search of the spectra of the few products. More specifically, principal component analysis is applied to the absorbance spectra to generate product score vectors extending into principal component inside model space which are divided into clusters and subclusters in accordance with their relative proximity. Hyperspheres are constructed around each vector and an envelope is constructed to enclose each cluster surrounding the hyperspheres within the cluster. The absorbance spectrum of the unknown product to be identified is measured and an unknown product score vector is determined from the unknown product spectrum projecting in principal component inside model space of the clusters. It is determined whether or not the unknown product score vector falls within one of the envelopes and if so the product score vector is projected into the principal component inside model space of that cluster and it is determined whether or not the unknown product score vector falls within any of the subclusters divided from the cluster. This process is repeated until the unknown product score vector is found to lie in a cluster which is not further subdivided. In this manner, the search is narrowed to a few products. An exhaustive search is then carried out to match the spectrum of the unknown product with the spectra of the known products corresponding to the undivided subcluster.
摘要:
A system and method for unsupervised clustering of audio data segments in an audio data recording containing speech from multiple speakers including the steps of: 1) providing a portion of the audio data containing speech from all of the speakers; 2) forming initial clusters by dividing the portion of the audio data into segments, each of which includes an ordered data set; 3) computing the pairwise distance between each pair of clusters using a likelihood ration independent of the order of data within the segments; and 4) combining into a new cluster the two clusters with a minimum pairwise distance. These steps are repeated until a number of clusters equal to the number of speakers is obtained.
摘要:
To provide a data management system and a data management program enabling a person not having reference authority of a display name of a node of an ordinary hierarchical structure to refer to data associated with a node in an ordinary hierarchical structure, a data management system which, when one of the nodes of a hierarchical structure constituted by a plurality of nodes is specified, notifies at least some values of purchased product data associated with the specified node notifies the ordinary hierarchical structure to a person having reference authority of a display name of a node of the ordinary hierarchical structure, and notifies, to a person not having reference authority of the display name of the node of the ordinary hierarchical structure, a substitute hierarchical structure constituted by nodes of which value of at least one item of the purchased product data is the display name and with which purchased product data including this value is associated.
摘要:
An approach is provided that improves a question answering (QA) computer system by automatically generating relationship templates. Event patterns are extracted from data in a corpus utilized by the QA computer system. The extracted event patterns are analyzed with the analysis resulting in a number of clusters of related event patterns. Relationship templates are then created from the plurality of clusters of related event patterns and these relationship templates are then utilized to visually interact with the corpus.
摘要:
An approach is provided that improves a question answering (QA) computer system by automatically generating relationship templates. Event patterns are extracted from data in a corpus utilized by the QA computer system. The extracted event patterns are analyzed with the analysis resulting in a number of clusters of related event patterns. Relationship templates are then created from the plurality of clusters of related event patterns and these relationship templates are then utilized to visually interact with the corpus.
摘要:
Systems and methods for data reduction including organizing data of an event stream into a file access table concurrently with receiving the event stream, the data including independent features and dependent features. A frequent pattern tree (FP-Tree) is built including nodes corresponding to the dependent features according to a frequency of occurrence of the dependent features relative to the independent features. Each single path in the FP-Tree is merged into a special node corresponding to segments of dependent features to produce a reduced FP-Tree. All path combinations in the reduced FP-Tree are identified. A compressible file access template (CFAT) is generated corresponding to each of the path combinations. The data of the event stream is compressed with the CFATs to reduce the dependent features to special events representing the dependent features.
摘要:
The disclosed subject matter involves identifying clusters and segments of a population of data for use in a recommendation service. Clusters of members or items are formed, where the clusters, or partitions are close to being equal in size. Items are distributed based on similarities identified with matrix factorization. The items are formed into clusters based on the similarities and the clusters are used in training of a generalized linear mixed model treating the clusters as random-level effects. The trained model may be used in the recommendation service. Other embodiments are described and claimed.
摘要:
Methods, apparatuses, computer program products, and systems are provided for training a network to act as an overhanging structure detector using an unsupervised machine learning technique. An image is received that was captured by an image capturing device in communication with a probe apparatus on board a vehicle, wherein the image comprises at least a portion of a roadway. A sky projection is generated based on at least a portion of the image. It is determined whether the sky projection comprises a feature that defines a feature direction that is substantially non-vertical. Responsive to determining that the sky projection does comprise a feature that defines a feature direction that is substantially non-vertical, it is determined that the image comprises an overhanging structure.
摘要:
A method performed by an apparatus is described. The method includes receiving a first set of object data corresponding to a first journey. The method also includes receiving a second set of object data corresponding to a second journey. The method further includes determining a similarity metric between the first set of object data and the second set of object data. The similarity metric indicates a distance between the first set of object data and the second set of object data for at least one object. The method additionally includes clustering the first set of object data and the second set of object data for the at least one object based on the similarity metric to produce at least one object cluster. The method also includes producing map data based on the at least one object cluster.
摘要:
Disclosed is a learning device. A feature-quantity calculation unit extracts a feature quantity from each feature point of a learning image. An acquisition unit acquires a classifier already obtained by learning as a transfer classifier. A classifier generation unit substitutes feature quantities into weak classifiers constituting the transfer classifier, calculates error rates of the weak classifiers on the basis of classification results of the weak classifiers and a weight of the learning image, and iterates a process of selecting a weak classifier of which the error rate is minimized a plurality of times. In addition, the classifier generation unit generates a classifier for detecting a detection target by linearly coupling a plurality of selected weak classifiers.