Abstract:
A transition in an MPEG sequence including anchor pictures and bidirectionally predicted (B) pictures is encoded by a) coding first and second anchor pictures and b) coding a transition in the sequence by inserting B pictures into the bitstream to create the transition from the first anchor picture to the second anchor picture.
Abstract:
A method for simultaneously recording motion and still images includes the steps of: capturing a motion image sequence and accompanying audio of a scene with a digital video camera adapted to record both motion and higher resolution still images; simultaneously capturing a still image sequence having a higher resolution and lower frame rate than the motion image sequence; compressing the motion image sequence using interframe compression, compressing the accompanying audio, and storing the compressed motion image and audio data; and compressing the still images using intraframe coding and storing the compressed still image data.
Abstract:
A method of storing a digital image file includes the steps of: storing a header defined by a standard; storing image data encoded according to the standard after the header; and storing an image tail containing non-standard meta-data at the end of the image data.
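The header, image data, and tail arrangement described above can be illustrated with a minimal sketch. It is not the method claimed in the abstract: the JSON payload, the `TAIL_MAGIC` marker, and the 4-byte length field are hypothetical choices made only for illustration. The sketch assumes that a reader of the standard format stops at the end of the standard image data and ignores trailing bytes.

```python
import json
import struct

# Hypothetical marker for locating the tail; not defined by any image standard.
TAIL_MAGIC = b"XMETA1"


def append_tail(standard_bytes, metadata):
    """Return the standard-encoded bytes followed by a non-standard metadata tail."""
    payload = json.dumps(metadata, sort_keys=True).encode("utf-8")
    # Layout: [standard header + image data][payload][magic][payload length]
    return standard_bytes + payload + TAIL_MAGIC + struct.pack(">I", len(payload))


def read_tail(file_bytes):
    """Return the metadata dict stored in the tail, or None if no tail is present."""
    trailer_len = len(TAIL_MAGIC) + 4
    if len(file_bytes) < trailer_len:
        return None
    if file_bytes[-trailer_len:-4] != TAIL_MAGIC:
        return None
    (payload_len,) = struct.unpack(">I", file_bytes[-4:])
    start = len(file_bytes) - trailer_len - payload_len
    if start < 0:
        return None
    return json.loads(file_bytes[start:start + payload_len].decode("utf-8"))
```

Placing the length field last lets a tail-aware reader find the metadata by seeking from the end of the file, without parsing the standard image data first.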
Abstract:
A method for automatically classifying images into events for composing and authoring of a multimedia image program on a recordable optical disc comprises the steps of: (a) receiving a plurality of images, each having a date of image capture, a time of image capture, or both; (b) determining one or more largest time differences of the plurality of images based on clustering of the images; (c) separating the plurality of images into events using one or more boundaries between events, the boundaries corresponding to the one or more largest time differences; (d) specifying at least one multimedia feature that is related to each event; (e) encoding the images between event boundaries and the at least one multimedia feature associated therewith into an event bitstream; and (f) writing each event bitstream to the recordable optical disc, whereby each event is authored into a separate section of the recordable optical disc.
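Steps (b) and (c) can be sketched roughly as follows. The abstract derives the largest time differences from clustering; this sketch substitutes a simpler rule and takes the number of events as a caller-supplied parameter, which is an assumption. The function name `split_into_events` is hypothetical.

```python
from datetime import datetime


def split_into_events(capture_times, num_events):
    """Split capture times into events at the (num_events - 1) largest time gaps."""
    if num_events < 1:
        raise ValueError("num_events must be at least 1")
    times = sorted(capture_times)
    if not times:
        return []
    # Gap between each pair of consecutive images, paired with its position.
    gaps = [(times[i + 1] - times[i], i) for i in range(len(times) - 1)]
    # Positions of the largest gaps become the event boundaries.
    largest = sorted(gaps, reverse=True)[:num_events - 1]
    boundaries = sorted(index for _, index in largest)
    events, start = [], 0
    for index in boundaries:
        events.append(times[start:index + 1])
        start = index + 1
    events.append(times[start:])
    return events
```

Each returned list holds the capture times of one event in chronological order; steps (d) through (f) would then attach multimedia features and write one bitstream per event.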
Abstract:
A method for automatically organizing digitized photographic images into events based on spoken annotations comprises the steps of: providing natural-language text based on spoken annotations corresponding to at least some of the photographic images; extracting predetermined information from the natural-language text that characterizes the annotations of the images; segmenting the images into events by examining each annotation for the presence of certain categories of information which are indicative of a boundary between events; and identifying each event by assembling the categories of information into event descriptions. The invention further comprises the step of summarizing each event by selecting and arranging the event descriptions in a suitable manner, such as in a photographic album.
Abstract:
A method for representing a video sequence including a time sequence of input video frames, the input video frames including some common scene content that is common to all of the input video frames and some dynamic scene content that changes between at least some of the input video frames. Affine transforms are determined to align the common scene content in the input video frames. A common video frame including the common scene content is determined by forming a sparse combination of a first set of basis functions. A dynamic video frame is determined for each input video frame by forming a sparse combination of a second set of basis functions, wherein the dynamic video frames can be combined with the respective affine transforms and the common video frame to provide reconstructed video frames.
Abstract:
A method for determining an estimated clutter level of an input digital image based on an inequality index. The inequality index is determined by partitioning the input digital image into small sub-images and analyzing the sub-images to determine a set of image features. The image features are associated with a set of designated reference features, and the inequality index is determined based on the statistical variation of the reference features. The inequality index is compared to a predefined threshold to classify the input digital image as a rich-content image or a low-content image. For rich-content images, the estimated clutter level is determined responsive to a set of scene content features relating to spatial structures or semantic content of the input digital image, the scene content features being determined by analyzing the input digital image. For low-content images, the estimated clutter level is determined responsive to an overall luminance level.
Abstract:
A method for determining an estimated clutter level of an input digital image based on an inequality index. The inequality index is determined by analyzing the input digital image to determine a set of image features. The image features are associated with a set of designated reference features, and the inequality index is determined based on the statistical variation of the reference features. A set of scene content features relating to spatial structures or semantic content of the input digital image is determined by analyzing the input digital image. The estimated clutter level is determined responsive to the inequality index and the scene content features.
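The abstract does not say which inequality index is used. As a rough illustration only, the sketch below assumes that each image feature is assigned to its nearest reference feature and that the index is the Gini coefficient of the resulting assignment counts. Both choices, and the function names, are assumptions rather than the claimed method.

```python
def nearest_reference(feature, references):
    """Index of the reference feature closest to `feature` (squared Euclidean distance)."""
    return min(
        range(len(references)),
        key=lambda k: sum((a - b) ** 2 for a, b in zip(feature, references[k])),
    )


def gini(counts):
    """Gini coefficient of non-negative counts: 0 means perfectly even usage."""
    xs = sorted(counts)
    n, total = len(xs), sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum((i + 1) * x for i, x in enumerate(xs))
    return (2.0 * weighted) / (n * total) - (n + 1.0) / n


def inequality_index(features, references):
    """Assumed index: how unevenly the image features spread over the reference features."""
    counts = [0] * len(references)
    for feature in features:
        counts[nearest_reference(feature, references)] += 1
    return gini(counts)
```

Under these assumptions, an image whose features all map to one reference feature scores high, while one that uses every reference feature equally scores zero.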
Abstract:
Generating a tag layout from a set of tags and an ordering of the set of tags, wherein each tag includes a text label and a size for the text label, is disclosed. The system includes a processor-accessible memory for receiving an ordered set of tags, each tag including a text label and a size for the text label, and at least one closed shape corresponding to a space for the tag layout. The system further includes a processor for generating the tag layout by computing a scale factor for either the closed shape or the size of the text labels in the set of tags such that all the tags in the set of tags fit within the closed shape, and the processor stores the generated tag layout in the memory.
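One way to picture the scale-factor computation is the sketch below. It makes several assumptions that the abstract does not state: the closed shape is an axis-aligned rectangle, tags are placed in order with greedy row wrapping, label width is estimated as character count times `char_aspect` times the scaled size, and the scale is applied to the labels rather than to the shape. The search also assumes that a layout which fits at one scale fits at every smaller scale, which greedy wrapping does not strictly guarantee. The value returned is always one that was checked to fit.

```python
def layout_height(tags, scale, box_width, char_aspect=0.6):
    """Total height of a greedy row-wrapped layout, or None if a tag is wider than the box."""
    x = row_height = total = 0.0
    for label, size in tags:
        height = size * scale
        width = len(label) * char_aspect * height  # rough width estimate (assumption)
        if width > box_width:
            return None
        if x + width > box_width:  # tag does not fit on this row: start a new one
            total += row_height
            x = row_height = 0.0
        x += width
        row_height = max(row_height, height)
    return total + row_height


def fit_scale(tags, box_width, box_height, iterations=40):
    """Binary-search a label scale factor at which every tag fits inside the rectangle."""
    if not tags or box_width <= 0 or box_height <= 0:
        raise ValueError("need at least one tag and a positive-size rectangle")

    def fits(scale):
        height = layout_height(tags, scale, box_width)
        return height is not None and height <= box_height

    low, high = 0.0, 1.0
    while fits(high) and high < 1e9:  # grow the upper bound until it no longer fits
        low, high = high, high * 2.0
    for _ in range(iterations):
        mid = (low + high) / 2.0
        if fits(mid):
            low = mid
        else:
            high = mid
    return low
```

Scaling the rectangle instead of the labels, the other option the abstract mentions, would amount to dividing `box_width` and `box_height` by the same factor.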