Abstract:
The video receiving apparatus is capable of transmitting and receiving data via a communications network. The video receiving apparatus includes a video extractor, a controller, a calibration executor, and an input unit to which a video signal is input. The video extractor extracts a partial video, which is to be used for video recognition processing, from the video signal. The controller transmits, to a video recognition apparatus, either the partial video or content recognition information formed from the partial video, thereby requesting the video recognition apparatus to perform the video recognition processing. The controller then acquires a result of the video recognition processing from the video recognition apparatus, and acquires additional information based on the result of the video recognition processing from an additional-information delivering apparatus. The calibration executor performs calibration processing in cooperation with the video recognition apparatus, thereby setting a predetermined parameter.
Abstract:
Methods, systems, and media for transforming fingerprints to detect unauthorized media content items are provided. The method comprises: receiving criteria relating to an application of a circumvention technique to one or more video content items, wherein the criteria include abuse criteria that describe the circumvention technique and a transform for use with the one or more video content items to which the circumvention technique was applied; generating an abuse query that includes at least a portion of the abuse criteria that describe the circumvention technique; determining, from a plurality of video content items, a subset of video content items responsive to the abuse query; applying, for each video content item in the subset of video content items, the transform to the video content item to obtain a transformed video content item; generating, for each transformed video content item, a fingerprint that represents the transformed video content item; and comparing the fingerprint of the transformed video content item to a plurality of fingerprints associated with reference video content items to determine whether the video content item corresponding to the transformed video content item matches one of the reference video content items.
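The query, transform, fingerprint, and compare steps above can be sketched as follows. This is only an illustration under stated assumptions: frames are toy lists of 8-bit values, the circumvention technique is assumed to be horizontal mirroring, the abuse query is a plain substring match on a hypothetical `description` field, and an exact SHA-256 digest stands in for a real perceptual fingerprint (which would tolerate small differences). All function and field names are hypothetical.

```python
import hashlib

def fingerprint(frames):
    """Stand-in fingerprint: SHA-256 over all frame bytes (exact match only)."""
    digest = hashlib.sha256()
    for frame in frames:
        digest.update(bytes(frame))
    return digest.hexdigest()

def undo_mirror(frames):
    """Assumed transform: reverse each row to undo horizontal mirroring."""
    return [list(reversed(frame)) for frame in frames]

def find_matches(videos, abuse_term, transform, reference_fingerprints):
    """Return (video id, reference id) pairs for videos that match after the transform."""
    matches = []
    for video in videos:
        # Abuse query: keep only videos whose description mentions the technique.
        if abuse_term not in video["description"]:
            continue
        fp = fingerprint(transform(video["frames"]))
        if fp in reference_fingerprints:
            matches.append((video["id"], reference_fingerprints[fp]))
    return matches

reference_frames = [[1, 2, 3], [4, 5, 6]]
references = {fingerprint(reference_frames): "ref-1"}
uploads = [
    {"id": "v1", "description": "mirrored clip", "frames": [[3, 2, 1], [6, 5, 4]]},
    {"id": "v2", "description": "original vlog", "frames": [[3, 2, 1], [6, 5, 4]]},
]
result = find_matches(uploads, "mirrored", undo_mirror, references)
```

Here `v2` is never fingerprinted because it does not respond to the abuse query, which mirrors the subset-selection step in the abstract.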
Abstract:
A method for pushing information is disclosed. During video playback, a client acquires statistical characteristic information of the current video frame in real time. The client searches a first mapping relationship table, established by the client and mapping statistical characteristic information to index values, for the index value that matches the acquired statistical characteristic information, and sends that index value to a cloud server. The cloud server searches a second mapping relationship table, established by the cloud server and mapping index values to push information, for the push information that corresponds to the index value. Finally, the client receives and plays or displays the push information. A system for pushing information is also provided.
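The two-table lookup described above can be sketched as follows. This is a minimal illustration, assuming the statistical characteristic is a coarse luminance histogram; the table contents, the `frame_signature` helper, and the in-process `cloud_lookup` function standing in for the cloud server are all hypothetical.

```python
def frame_signature(pixels, bins=4):
    """Hypothetical statistic: histogram of 8-bit luminance values over `bins` buckets."""
    counts = [0] * bins
    for p in pixels:
        counts[min(p * bins // 256, bins - 1)] += 1
    return tuple(counts)

# First table (client side, assumed contents): signature -> index value.
FIRST_TABLE = {(2, 0, 0, 2): 17}
# Second table (server side, assumed contents): index value -> push information.
SECOND_TABLE = {17: "Promo: season finale tonight"}

def client_find_index(pixels):
    """Client step: map the current frame's statistic to an index value."""
    return FIRST_TABLE.get(frame_signature(pixels))

def cloud_lookup(index):
    """Server step: map the index value to push information."""
    return SECOND_TABLE.get(index)

def push_for_frame(pixels):
    """Full round trip; returns None when the frame has no matching index."""
    index = client_find_index(pixels)
    if index is None:
        return None
    return cloud_lookup(index)

message = push_for_frame([0, 10, 200, 255])
```

Only the small index value crosses the network in this arrangement, not the frame or its statistics.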
Abstract:
A method for associating metadata with multimedia content based on finding matches to similar multimedia content. A given input multimedia content is matched to at least one other multimedia content that has corresponding metadata. Upon determination of a match, the corresponding metadata is used as metadata of the given multimedia content. When a large amount of multimedia data is compared, a ranked list of metadata is provided. The most appropriate metadata is associated with the given multimedia content based on various criteria. The method can be implemented in any application that involves large-scale content-based clustering, recognition, and classification of multimedia data, such as content tracking, video filtering, multimedia taxonomy generation, video fingerprinting, speech-to-text, audio classification, object recognition, video search, and any other application requiring content-based signature generation and matching for large content volumes, such as the web and other large-scale databases.
Abstract:
Systems and methods are described for identifying the video content as spherical video or non-spherical video in response to determining that frame scores and video scores satisfy a threshold level. For example, a plurality of image frames can be extracted from video content, classified in a dual stage process, and scored according to particular classification and scoring mechanisms.
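One way to read the two-level scoring above is sketched here. The abstract does not specify the classifiers or how scores are combined, so this sketch assumes that per-frame scores are already available, that the video score is the fraction of frames passing a frame threshold, and that both threshold values are arbitrary placeholders.

```python
def is_spherical(frame_scores, frame_threshold=0.5, video_threshold=0.6):
    """Return True when enough frames look spherical (assumed aggregation rule)."""
    if not frame_scores:
        return False
    # Stage 1 (assumed): each frame passes if its score meets the frame threshold.
    passing = sum(1 for score in frame_scores if score >= frame_threshold)
    # Stage 2 (assumed): the video score is the fraction of passing frames.
    video_score = passing / len(frame_scores)
    return video_score >= video_threshold

decision = is_spherical([0.9, 0.8, 0.2, 0.7])
```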
Abstract:
A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method selects an entity from a plurality of entities identifying characteristics of a video item, where the video item has associated metadata. The computer-implemented method receives probabilities of existence of the entity in video frames of the video item, and selects a video frame determined to comprise the entity responsive to determining the video frame having a probability of existence of the entity greater than zero. The computer-implemented method determines a scaling factor for the probability of existence of the entity using the metadata of the video item, and determines an adjusted probability of existence of the entity by using the scaling factor to adjust the probability of existence of the entity. The computer-implemented method labels the video frame with the adjusted probability of existence.
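The labeling steps above can be sketched as follows. The abstract does not say how the scaling factor is derived from the metadata, so this sketch assumes a simple rule: boost the probability when the entity name appears in any metadata field and damp it otherwise. The `boost` and `damp` values and both function names are hypothetical.

```python
def scaling_factor(entity, metadata, boost=1.5, damp=0.5):
    """Assumed rule: boost if the entity name occurs in the metadata text."""
    text = " ".join(str(value) for value in metadata.values()).lower()
    return boost if entity.lower() in text else damp

def label_frames(entity, frame_probs, metadata):
    """Map frame index -> adjusted probability for frames with probability > 0."""
    scale = scaling_factor(entity, metadata)
    labels = {}
    for index, prob in enumerate(frame_probs):
        if prob > 0:
            # Clamp so the adjusted value remains a valid probability.
            labels[index] = min(1.0, prob * scale)
    return labels

labels = label_frames("dog", [0.0, 0.4, 0.8], {"title": "My Dog at the park"})
```

Frame 0 receives no label because its probability is zero, matching the selection condition in the abstract.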
Abstract:
In an embodiment, digital video frames in a flow are subjected to a method of extraction of features including the operations of: extracting from the video frames respective sequences of pairs of keypoints/descriptors limiting to a threshold value the number of pairs extracted for each frame; sending the sequences extracted from an extractor module to a server for processing with a bitrate value variable in time; receiving the aforesaid bitrate value variable in time at the extractor as target bitrate for extraction; and limiting the number of pairs extracted by the extractor to a threshold value variable in time as a function of the target bitrate.
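The bitrate-driven limit above can be sketched as follows. The abstract does not give the mapping from target bitrate to the per-frame threshold, so this sketch assumes a fixed encoded size per keypoint/descriptor pair and a fixed frame rate, and it keeps the pairs with the strongest detector response. All names and numbers are hypothetical.

```python
def max_pairs(target_bitrate_bps, fps, bits_per_pair):
    """Per-frame pair budget implied by the target bitrate (assumed linear model)."""
    if fps <= 0 or bits_per_pair <= 0:
        raise ValueError("fps and bits_per_pair must be positive")
    return max(0, int(target_bitrate_bps // (fps * bits_per_pair)))

def limit_pairs(pairs, limit):
    """Keep the `limit` pairs with the highest response; pairs are (response, descriptor)."""
    return sorted(pairs, key=lambda pair: pair[0], reverse=True)[:limit]

budget = max_pairs(64000, 25, 512)
kept = limit_pairs([(0.1, "a"), (0.9, "b"), (0.5, "c")], 2)
```

When the server reports a new target bitrate, the extractor would recompute the budget and apply it to subsequent frames.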
Abstract:
Provided are devices, computer-program products, and methods for removing redundant data associated with frames. For example, a method can include receiving an initial frame, determining initial cue data for the initial frame, and sending the initial cue data to a server. The method can further include receiving a new frame and determining new cue data for the new frame. The method can further include identifying a pixel value range. The method can further include determining a pixel value difference between an initial pixel data sample and a new pixel data sample. The method can further include determining the pixel value difference is within the pixel value range and updating the new cue data by removing the new pixel data sample from the new cue data when the pixel value difference is within the pixel value range. The method can further include sending the updated new cue data to the server.
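The pruning step above can be sketched as follows, assuming cue data is a mapping from a sample location to a single pixel value and that a sample is redundant when it differs from the initial sample at the same location by no more than the pixel value range. The data layout and the function name are hypothetical.

```python
def prune_cue_data(initial_cue, new_cue, pixel_value_range):
    """Drop samples from new_cue that are within the range of the initial sample."""
    updated = {}
    for location, value in new_cue.items():
        previous = initial_cue.get(location)
        if previous is not None and abs(value - previous) <= pixel_value_range:
            continue  # Redundant: the server already holds a close enough value.
        updated[location] = value
    return updated

initial = {"a": 100, "b": 50}
new = {"a": 102, "b": 90, "c": 10}
to_send = prune_cue_data(initial, new, 5)
```

Only the changed or new samples are then sent to the server.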
Abstract:
Video sequence processing is described with various filtering rules applied to extract dominant features for content-based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two-pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames. This reduces the effective area of images that are refined by complex filters, which provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts, and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.