摘要:
An example apparatus for calibrating texture cameras includes an image receiver to receive a depth image from a depth camera and a color image from a texture camera. The apparatus also includes a feature extractor to extract features from the depth image and the color image. The apparatus further includes a feature tester to detect that the extracted features from the depth image and the color image exceed a quality threshold. The apparatus includes a misalignment detector to detect a misalignment between the extracted features from depth image and the extracted features from color image exceeds a misalignment threshold. The apparatus also further includes a calibrator to modify calibration parameters for the texture camera to reduce the detected misalignment between the extracted features from the depth image and the extracted features from the color image below a misalignment threshold.
摘要:
The present invention describes a system and method for controlling interactive video, including a remote control device allowing the user to interact with the video. The method allows bringing the “lean-forward” hypervideo experience to a “lay-back” environment such as the TV screen. The method also allows for the creation of a single system that combines both the social aspects of the “lay-back” and the “lean-forward experience” into a new type of rich media viewing experience.
摘要:
A method determines similarity of objects depicted in images when the images pertain to different modalities. The method includes obtaining images that depict the objects and that pertain to the different modalities. An embedding function is applied to each of the images. The embedding function is selected from a set of two or more embedding functions, each of the embedding functions corresponding to a modality of the different modalities, the selected embedding function corresponding to the modality of the image to which it is applied. Application of the embedding function maps that image to a representation in a representation space such that when the images are mapped to the representation space, a distance between the representations of the images is indicative of a similarity of their depicted objects. The similarity of the depicted objects is determined based on the location of the corresponding representations in the representation space.
摘要:
A camera intrinsic calibration may be performed using an object geometry. An intrinsic camera matrix may then be recovered. A homography is fit between object and camera coordinate systems. View transformations are finally recovered.
摘要:
A projector illuminates an object, within the field of view of a camera, with a sequence of code patterns. The camera captures the illuminated object and provides object images to a decoder to convert the code patterns into code. A transition locator locates discontinuities in the code pattern images. A dequantizer reconstructs a range image from those discontinuities and said code.
摘要:
A method of cropping a representation of a face for electronic processing, said method comprising: selecting a first geodesic contour about an invariant reference point on said face, setting a region within said first geodesic contour as a first mask, selecting a second geodesic contour about a boundary of said identified first region, setting a region within said second geodesic contour as a second mask, and forming a final mask from a union of said first mask and said second mask.
摘要:
A method for resource allocation for video encoder to achieve the minimum sequence cost within given resource budgets. Optimal video encoder design by deriving the optimal sequence order and frame type selection is invented. In order to achieve computationally practical resource allocation, the current invention utilizes various encoder model and buffer model. The models allow the optimization procedure to assess the best encoding design without actually performing the computationally expensive encoding. Efficient optimization algorithm is also derived to substantially reduce the computations required to search for the optimal action sequence.
摘要:
The present invention describes a system and method for controlling interactive video, including a remote control device allowing the user to interact with the video. The method allows bringing the “lean-forward” hypervideo experience to a “lay-back” environment such as the TV screen. The method also allows for the creation of a single system that combines both the social aspects of the “lay-back” and the “lean-forward experience” into a new type of rich media viewing experience.
摘要:
A method for resource allocation for video encoder to achieve optimal picture quality within a given resource budget. Making a video encoder utilize the computational complexity, bitrate and other resources in an optimal way while maintaining optimal quality is a complicated optimization problem. A subset of this resource allocation problem, optimizing the tradeoff between bitrate versus quality is called rate-distortion optimization and is performed in most modern encoders. In order to achieve a computationally practical solution of the resource allocation problem, the current invention partitions the video content into a number of regions based on their characteristics and assesses resource allocation among regions to achieve the optimal quality within the resource budget limit. To maintain the computation tractable, the invention relies on bit production model and distortion model for the underlying video content to assess the quality and resource usage instead of actually conducting video compression. An iterative optimization algorithm has been developed to implement the invention.