Abstract:
A method and system determines flows by first acquiring a video of the flows with a camera, wherein the flows are pedestrians in a scene, wherein the video includes a set of frames. Motion vectors are extracted from each frame in the set, and a data matrix is constructed from the motion vectors in the set of frames. A low rank Koopman operator is determined from the data matrix and a spectrum of the low rank Koopman operator is analyzed to determine a set of Koopman modes. Then, the frames are segmented into independent flows according to a clustering of the Koopman modes.
Abstract:
A method processes a signal by first constructing a graph from the signal, and then determining a graph matrix from the graph and the signal. A Krylov-based subspace is determined based on the graph matrix and the signal. A filter for the Krylov subspace is determined. The filter transforms the signal to produce a filtered signal, which is output.
Abstract:
A method compresses a point cloud composed of a plurality of points in a three-dimensional (3D) space by first acquiring the point cloud with a sensor, wherein each point is associated with a 3D coordinate and at least one attribute. The point cloud is partitioned into an array of 3D blocks of elements, wherein some of the elements in the 3D blocks have missing points. For each 3D block, attribute values for the 3D block are predicted based on the attribute values of neighboring 3D blocks, resulting in a 3D residual block. A 3D transform is applied to each 3D residual block using locations of occupied elements to produce transform coefficients, wherein the transform coefficients have a magnitude and sign. The transform coefficients are entropy encoded according the magnitudes and sign bits to produce a bitstream.
Abstract:
A method processes keypoint trajectories in a video, wherein the keypoint trajectories describe motion of a plurality of keypoints across pictures of the video over time, by first acquiring the video of a scene using a camera. Kkeypoints and associated feature descriptors are detected in each picture. The keypoints and associated features descriptors are matched between neighboring pictures to generate keypoint trajectories. Then, the keypoint trajectories are coded predictively into a bitstream, which is outputted.
Abstract:
Videos of a scene are processed for view synthesis. The videos are acquired by corresponding cameras arranged so that a view of each camera overlaps with the view of at least one other camera. For each current block, disparity vectors are obtained from neighboring blocks. A depth block is based on a corresponding reference depth image and the disparity vectors. A prediction block is generated based on the depth block using backward warping of a motion field. Then, predictive coding for the current block using the prediction block. Backward mapping can also be performed in the spatial domain.
Abstract:
A method processes a signal represented as a graph by first determining a graph spectral transform based on the graph. In a spectral domain, parameters of a graph filter are estimated using a training data set of unenhanced and corresponding enhanced signals. The graph filter is derived based on the graph spectral transform and the estimated graph filter parameters. Then, the signal is processed using the graph filter to produce an output signal. The processing can enhance signals such as images b denoising or interpolating missing samples.
Abstract:
An input segment of an input video is encoded by first extracting and storing, for each segment of previously encoded videos, a set of reference features. The set of input features are matched with each set of the reference features to produce a set of scores. The reference segments having largest scores are selected to produce a first reduced set of reference segments. A rate-distortion cost for each reference segment in the first reduced set of reference segments is estimated. The reference segments in the first reduced set of reference segments is selected to produce a second reduced set of reference segments. Then, the input segment are encoded based on second reduced set of reference segments.
Abstract:
Videos of a scene are processed for view synthesis. The videos are acquired by corresponding cameras arranged so that a view of each camera overlaps with the view of at least one other camera. For each current block, motion or disparity vector is obtained from neighboring blocks. A depth block is based on a corresponding reference depth image and the motion or disparity vector. A prediction block is generated based on the depth block using backward warping of a motion field. Then, predictive coding for the current block using the prediction block. Backward mapping can also be performed in the spatial domain.
Abstract:
Systems and methods for a point cloud decoder including a processor to decode each block in a set of blocks from a point cloud, so as to obtain a decoded point cloud. Wherein each block includes a set of points, such that for each block the processor is to decode a set of prediction residuals from a compressed bitstream. Use a predetermined location in the block, and compute for each prediction residual in the set of prediction residuals, a position of a point by adding the prediction residual to the predetermined location, so as to obtain a set of decoded points for the block. Wherein the decoded points for the blocks in the set of blocks represent the decoded point cloud.
Abstract:
Systems and methods for determining a pattern in time series data representing an operation of a machine. A memory to store and provide a set of training data examples generated by a sensor of the machine, wherein each training data example represents an operation of the machine for a period of time ending with a failure of the machine. A processor configured to iteratively partition each training data example into a normal region and an abnormal region, determine a predictive pattern absent from the normal regions and present in each abnormal region only once, and determine a length of the abnormal region. Outputting the predictive pattern via an output interface in communication with the processor or storing the predictive pattern in memory, wherein the predictive pattern is a predictive estimate of an impending failure and assists in management of the machine.