摘要:
A method for displaying a plurality of images is provided. The method includes: obtaining one or more image sets based on the plurality of images, wherein a similarity degree between each pair of images in each image set is greater than a similarity threshold; identifying an operation instruction triggered on at least one of the image sets; if the operation instruction triggered on the at least one of the image sets satisfies a predetermined updating condition, updating the similarity threshold; and displaying the plurality of images based on the updated similarity threshold.
摘要:
An estimator training method and a pose estimating method using a depth image are disclosed, in which the estimator training method may train an estimator configured to estimate a pose of an object, based on an association between synthetic data and real data, and the pose estimating method may estimate the pose of the object using the trained estimator.
摘要:
A system for automatically extracting interesting structures or areas (e.g., built-up structures such as buildings, tents, etc.) from HR/VHR satellite imagery data using corresponding LR satellite imagery data. The system breaks down HR/VHR input satellite images into a plurality of components (e.g., groups of pixels), organizes the components into a first hierarchical data structure (e.g., a Max-Tree), generates a second hierarchical data structure (e.g., a KD-Tree) from feature elements (e.g., spectral and shape characteristics) of the components, uses LR satellite imagery data to categorize components as being of interest or not, uses the feature elements of the categorized components to train the second data structure to be able to classify all components of the first data structure as being of interest or not, classifies the components of the first data structure with the trained second data structure, and then maps components classified as being of interest into a resultant image.
摘要:
Texture features of images are calculated for recognition of pixel sets in the images as one category among multiple candidate categories. For example, wavelet transformation is applied to obtain a wavelet vector. Via analyzing components of the wavelet vector, one pixel set may be recognized as part of an architecture object or a natural plant object. In addition, line segments within the pixel sets may be calculated and their statistics result may be used for recognizing different objects.
摘要:
Systems and methods for generating visual words define initial inter-visual word relationships between a plurality of visual words; define visual word-image relationships between the plurality of visual words and a plurality of images; define inter-image relationships between the plurality of images; generate revised inter-visual word relationships in a vector space based on the initial inter-visual word relationships, the inter-image relationships, and the visual word-image relationships; and generate higher-level visual words in the vector space based on the revised inter-visual word relationships.
摘要:
A computer implemented method and apparatus for automatically identifying a representative image for an image group. The method comprises dividing an image group into one or more clusters based on an average time gap of the image group, wherein the images in the image group are in sequential timestamp order wherein the average time gap is calculated using a time span calculated from the timestamp of a first image in the image group to the timestamp of a last image in the image group; recursively dividing a largest cluster in the one or more clusters to determine a resultant cluster, wherein the resultant cluster comprises no time gaps larger than the average time gap of the resultant cluster; and identifying a representative image from the resultant cluster as an image representative for the image group.
摘要:
A model is provided that produces predicted sensor data as a function of at least one input feature that includes an adjustable setting of a cooling infrastructure. The model is able to model a non-linear relationship between the predicted sensor data and the adjustable setting.
摘要:
An adequate solution for computer vision applications is arrived at more efficiently and, with more automation, enables users with limited or no special image processing and pattern recognition knowledge to create reliable vision systems for their applications. Computer rendering of CAD models is used to automate the dataset acquisition process and labeling process. In order to speed up the training data preparation while maintaining the data quality, a number of processed samples are generated from one or a few seed images.
摘要:
A system and method for tagging an image of an individual in a plurality of photos is disclosed herein. A feature vector of an individual is used to analyze a set of photos on a social networking website such as www.facebook.com to determine if an image of the individual is present in a photo of the set of photos. Photos having an image of the individual are tagged preferably by listing a URL or URI for each of the photos in a database.
摘要:
A computer-implemented technique can receive a plurality of photos and automatically select a subset of the plurality of photos having a high degree of representativeness by jointly maximizing both photo quality and photo diversity to obtain a photo album. The technique can determine one or more clusters for the photo album using a hierarchical clustering algorithm, and store the photo album according to the one or more clusters. The technique can control the manner in which the photo album is displayed using the one or more clusters. The technique can adjust at least one of the one or more clusters and the automatic photo album generation based on user input. The user input can include at least one of adding, deleting, and moving a photo with respect to the one or more clusters. The technique can then re-cluster, automatically generate a new photo album, and/or adjust the presentation.