Abstract:
This disclosure provides method and systems of classifying a digital image of an object. Specifically, according to one exemplary embodiment, an object classifier is trained using a constrained MI-SVM (multiple instance-support vector machine) approach whereby training images of objects are sampled to generate a collection of image regions associated with an object type and viewpoint, and the classifier is trained to determine an appropriate mid-level representation of the training image which is discriminative.
Abstract:
The disclosed embodiments are related to a method and system for creating a digital image album implementable on a computing device. The method includes receiving a plurality of digital images. The method further includes creating a first signature corresponding to each of the plurality of digital images. The method further includes comparing the first signature corresponding to each of the plurality of digital images with one or more second signatures. Each of the second signatures corresponds to each of one or more prototype digital albums. The method further includes selecting one or more digital images from the plurality of digital images based on the comparison to create the digital image album.
Abstract:
A method for learning visual attribute labels for images includes, from textual comments associated with a corpus of images, identifying a set of candidate textual labels that are predictive of aesthetic scores associated with images in the corpus. The candidate labels in the set are clustered into a plurality of visual attribute clusters based on similarity and each of the clusters assigned a visual attribute label. For each of the visual attribute labels, a classifier is trained using visual representations of images in the corpus and respective visual attribute labels. The visual attribute labels are evaluated, based on performance of the trained classifier. A subset of the visual attribute labels is retained, based on the evaluation. The visual attribute labels can be used in processes such as image retrieval, image labeling, and the like.
Abstract:
An image selection method includes receiving a collection of images and optionally, filtering the collection of images. The filtering may include removing images that are near-duplicates of other images in the collection. A plurality of features is extracted from each of the images in the optionally-filtered collection. The optionally-filtered collection of images is considered as a time-ordered sequence of images and is segmented to form a sequence of segments. Each segment includes at least one of the images. The segmenting of the sequence of images is based on the extracted features and positions of the images in the sequence of images. Images from the segments are selected to form a sub-collection of the images.
Abstract:
An image selection method includes receiving a collection of images and optionally, filtering the collection of images. The filtering may include removing images that are near-duplicates of other images in the collection. A plurality of features is extracted from each of the images in the optionally-filtered collection. The optionally-filtered collection of images is considered as a time-ordered sequence of images and is segmented to form a sequence of segments. Each segment includes at least one of the images. The segmenting of the sequence of images is based on the extracted features and positions of the images in the sequence of images. Images from the segments are selected to form a sub-collection of the images.
Abstract:
This disclosure provides method and systems of classifying a digital image of an object. Specifically, according to one exemplary embodiment, an object classifier is trained using a constrained MI-SVM (multiple instance-support vector machine) approach whereby training images of objects are sampled to generate a collection of image regions associated with an object type and viewpoint, and the classifier is trained to determine an appropriate mid-level representation of the training image which is discriminative.
Abstract:
A system and method for searching a finite collection of images using at least one semantic network. Upon receipt of a query from a user that includes a theme and one or more initial keywords, a set of keywords based on the theme and including the initial keywords is generated from one or more semantic networks corresponding to the theme and/or initial keywords. When the finite collection of images includes suitable metadata, a result set is generated of images corresponding to the expanded set of keywords. When the finite collection includes images lacking in metadata, a remote third-party image collection is searched with the set of keywords to obtain a result set that is used to train visual classifiers as to visual concepts associated with the keywords. The classifiers are used to classify the images in the finite collection lacking metadata and the search of the finite collection is performed with the set of keywords to generate a result set.
Abstract:
The disclosed embodiments are related to a method and system for creating a digital image album implementable on a computing device. The method includes receiving a plurality of digital images. The method further includes creating a first signature corresponding to each of the plurality of digital images. The method further includes comparing the first signature corresponding to each of the plurality of digital images with one or more second signatures. Each of the second signatures corresponds to each of one or more prototype digital albums. The method further includes selecting one or more digital images from the plurality of digital images based on the comparison to create the digital image album.
Abstract:
A system and method for searching a finite collection of images using at least one semantic network. Upon receipt of a query from a user that includes a theme and one or more initial keywords, a set of keywords based on the theme and including the initial keywords is generated from one or more semantic networks corresponding to the theme and/or initial keywords. When the finite collection of images includes suitable metadata, a result set is generated of images corresponding to the expanded set of keywords. When the finite collection includes images lacking in metadata, a remote third-party image collection is searched with the set of keywords to obtain a result set that is used to train visual classifiers as to visual concepts associated with the keywords. The classifiers are used to classify the images in the finite collection lacking metadata and the search of the finite collection is performed with the set of keywords to generate a result set.
Abstract:
A method, a system, and a computer program product for extracting one or more images from a storage medium. A search model is selected based on the availability of a semantically related aesthetic model. A search model includes a generic aesthetic model if the semantically related aesthetic model for query is not available. A semantic score and an aesthetic score are computed based on the selected search model. The images are further ranked based on the semantic and aesthetic score.