-
Publication No.: US10984542B2
Publication date: 2021-04-20
Application No.: US16981240
Filing date: 2018-11-20
Inventors: Yangyu Chen, Yi He, Lei Li
Abstract: A method for determining a geometric transformation relation for images is provided. The method includes: obtaining a first image and a second image collection; obtaining coordinates of three non-collinear pixels in the first image to form a coordinate collection; generating a target matrix according to the coordinate collection; for at least two second images in the second image collection, obtaining coordinates of pixels in the at least two second images corresponding to the pixels indicated by the coordinates in the coordinate collection, to form a corresponding coordinate collection; according to the corresponding coordinate collection, generating a target column vector matching a column number of the target matrix; pre-multiplying the target matrix by the target column vector to obtain a transformation column vector; and determining the geometric transformation relation of the first image and the at least two second images according to the transformation column vector.
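One way to read the steps in this abstract is as a three-point affine fit, sketched below. This is an illustrative assumption and not the patented procedure: the "target matrix" is taken to be the inverse of the 3x3 matrix built from the three source points, and it is reused for each second image. The function names `build_target_matrix` and `solve_affine` are hypothetical.

```python
from fractions import Fraction


def inverse_3x3(m):
    """Invert a 3x3 matrix via the adjugate; raises if the matrix is singular."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    if det == 0:
        raise ValueError("points are collinear; the matrix is singular")
    adjugate = [
        [e * i - f * h, c * h - b * i, b * f - c * e],
        [f * g - d * i, a * i - c * g, c * d - a * f],
        [d * h - e * g, b * g - a * h, a * e - b * d],
    ]
    return [[Fraction(v) / Fraction(det) for v in row] for row in adjugate]


def build_target_matrix(src_points):
    """Assumed 'target matrix': inverse of rows [x, y, 1] for the three source pixels."""
    return inverse_3x3([[x, y, 1] for x, y in src_points])


def mat_vec(matrix, vector):
    """Multiply a matrix by a column vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def solve_affine(target_matrix, dst_points):
    """Return (a, b, c, d, e, f) with x' = a*x + b*y + c and y' = d*x + e*y + f."""
    xs = [p[0] for p in dst_points]
    ys = [p[1] for p in dst_points]
    a, b, c = mat_vec(target_matrix, xs)
    d, e, f = mat_vec(target_matrix, ys)
    return (a, b, c, d, e, f)


# The matrix is computed once from the first image and reused per second image.
source = [(0, 0), (1, 0), (0, 1)]
target_matrix = build_target_matrix(source)
params = solve_affine(target_matrix, [(2, 3), (4, 3), (2, 5)])
```

Because the inverse depends only on the first image, fitting each additional second image costs two matrix-vector products, which is a plausible motivation for separating the matrix from the column vector.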
-
Publication No.: US20200273483A1
Publication date: 2020-08-27
Application No.: US16652028
Filing date: 2018-12-29
Abstract: An audio fingerprint extraction method and device are provided. The method includes: converting an audio signal to a spectrogram; determining one or more characteristic points in the spectrogram; in the spectrogram, determining one or more masks for the characteristic points, each of the one or more masks including a plurality of spectrum regions; determining mean energy of each of the spectrum regions; determining one or more audio fingerprint bits according to the mean energy of the plurality of spectrum regions in the one or more masks; judging credibility of the audio fingerprint bits to determine one or more weight bits; and combining the audio fingerprint bits and the weight bits to obtain an audio fingerprint.
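The bit and weight-bit steps can be illustrated with a minimal sketch. Several details here are assumptions not stated in the abstract: each mask is modelled as a pair of rectangular regions, a fingerprint bit is the sign of the difference in mean energy, and a bit is judged credible when the absolute difference reaches a hypothetical `threshold`.

```python
def region_mean(spectrogram, region):
    """Mean energy of a rectangular region (t0, t1, f0, f1), end-exclusive."""
    t0, t1, f0, f1 = region
    values = [spectrogram[t][f] for t in range(t0, t1) for f in range(f0, f1)]
    return sum(values) / len(values)


def fingerprint_bits(spectrogram, masks, threshold):
    """Return (bits, weight_bits) for a list of masks.

    Each mask is assumed to be a pair of regions; the bit encodes which
    region has more mean energy, the weight bit whether the gap is large.
    """
    bits, weight_bits = [], []
    for region_a, region_b in masks:
        diff = region_mean(spectrogram, region_a) - region_mean(spectrogram, region_b)
        bits.append(1 if diff > 0 else 0)
        weight_bits.append(1 if abs(diff) >= threshold else 0)
    return bits, weight_bits


# Toy spectrogram indexed as [time][frequency]: low bins quiet, high bins loud.
toy = [[1, 1, 5, 5],
       [1, 1, 5, 5]]
low, high = (0, 2, 0, 2), (0, 2, 2, 4)
bits, weight_bits = fingerprint_bits(toy, [(low, high), (high, low), (low, low)], threshold=2)
```

In the third mask both regions are identical, so the difference is zero and the resulting bit is marked as not credible.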
-
Publication No.: US11874869B2
Publication date: 2024-01-16
Application No.: US16962416
Filing date: 2018-12-29
IPC classification: G06F16/41, G06F16/438, G06F18/22, G06F18/2113
CPC classification: G06F16/41, G06F16/438, G06F18/2113, G06F18/22
Abstract: The present disclosure relates to a media retrieval method and apparatus, wherein the method comprises: acquiring a media feature of a query media as a first media feature which contains a plurality of first media feature units; performing, according to each individual first media feature unit, a first ranking on a plurality of existing media, and taking, according to a result of the first ranking, the first k existing media as a first candidate media set, where k is a positive integer; and performing, according to the plurality of first media feature units arranged in sequence, a second ranking on the first candidate media set, and taking, according to a result of the second ranking, the first n first candidate media as a retrieval result, where n is a positive integer.
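The two-stage ranking can be sketched as follows. The scoring rules are assumptions chosen for illustration: the first ranking counts how many units of an existing media item occur anywhere in the query, and the second ranking uses the longest contiguous run of units shared in order. The names `first_ranking`, `longest_common_run`, and `retrieve` are hypothetical.

```python
def first_ranking(query_units, database, k):
    """Order-insensitive pass: rank media by how many of their units occur in the query."""
    query_set = set(query_units)
    scores = {
        media_id: sum(1 for unit in units if unit in query_set)
        for media_id, units in database.items()
    }
    ranked = sorted(scores, key=lambda media_id: (-scores[media_id], media_id))
    return ranked[:k]


def longest_common_run(a, b):
    """Length of the longest contiguous run of units shared by a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0] * (len(b) + 1)
        for j, y in enumerate(b, start=1):
            if x == y:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def retrieve(query_units, database, k, n):
    """Order-sensitive pass over the top-k candidates; returns the top n."""
    candidates = first_ranking(query_units, database, k)
    ranked = sorted(
        candidates,
        key=lambda media_id: (-longest_common_run(query_units, database[media_id]), media_id),
    )
    return ranked[:n]


database = {"a": [1, 2, 3, 4], "b": [4, 3, 2, 1], "c": [9, 9, 9, 9]}
candidates = first_ranking([1, 2, 3, 4], database, k=2)
result = retrieve([1, 2, 3, 4], database, k=2, n=1)
```

Items "a" and "b" tie in the first pass because they contain the same units; only the sequence-aware second pass separates them.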
-
Publication No.: US11055536B2
Publication date: 2021-07-06
Application No.: US16768527
Filing date: 2018-12-29
Inventors: Yi He, Lei Li, Cheng Yang, Gen Li, Yitan Li
IPC classification: G06K9/00
Abstract: A video feature extraction method and device are provided. The method includes: extracting one or more frames from a video object to obtain one or more frames of images; obtaining one or more shift vectors for each of the one or more frames of images; using each of the one or more shift vectors and taking any pixel in each of the one or more frames of images as a starting point, determining a terminal point directed by the shift vector from the starting point; determining an image feature of the one or more frames of images according to a difference between the starting point and the terminal point; and determining a video feature of the video object according to the image feature of the one or more frames of images.
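A minimal sketch of the shift-vector idea follows, under assumptions the abstract does not spell out: frames are grayscale grids, the "difference" is the summed brightness difference between each starting pixel and its terminal pixel, and each shift vector contributes one bit given by the sign of that sum. The function names are hypothetical.

```python
def frame_feature(frame, shift_vectors):
    """One bit per shift vector: 1 if starting pixels are brighter overall than terminal pixels."""
    height, width = len(frame), len(frame[0])
    bits = []
    for dx, dy in shift_vectors:
        total = 0
        for y in range(height):
            for x in range(width):
                tx, ty = x + dx, y + dy
                # Skip starting points whose terminal point falls outside the frame.
                if 0 <= tx < width and 0 <= ty < height:
                    total += frame[y][x] - frame[ty][tx]
        bits.append(1 if total > 0 else 0)
    return bits


def video_feature(frames, shift_vectors):
    """Video feature as the sequence of per-frame image features."""
    return [frame_feature(frame, shift_vectors) for frame in frames]


# A frame whose left column is bright and right column is dark.
frame = [[9, 1],
         [9, 1]]
feature = video_feature([frame], [(1, 0), (-1, 0), (0, 1)])
```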
-
Publication No.: US11783808B2
Publication date: 2023-10-10
Application No.: US17985795
Filing date: 2022-11-11
CPC classification: G10L15/083, G10L15/005, G10L15/20, G10L15/22, G10L2015/223
Abstract: Embodiments of the present disclosure disclose an audio content recognition method and apparatus, an electronic device and a non-transitory computer-readable medium. A specific implementation of the method includes: obtaining a voice fragment collection and a non-voice fragment collection by segmenting audio; determining a type and language information of each voice fragment in the voice fragment collection; and obtaining, for each voice fragment in the voice fragment collection, a first recognition result by performing voice recognition on the voice fragment based on the type and the language information of the voice fragment. In the implementation, speaking and music fragments in the audio are recognized by different models, so that both kinds of audio content may be recognized more accurately. Moreover, audio in different languages is recognized by different models, thereby further improving voice recognition accuracy.
-
Publication No.: US20210357673A1
Publication date: 2021-11-18
Application No.: US16981226
Filing date: 2018-12-10
Inventors: Cheng Yang, Yi He, Lei Li
Abstract: A method and a device for processing feature points of an image are provided. A specific embodiment of the method includes: obtaining an image to be processed; determining weights of the feature points of the image to be processed to obtain a weight set; and, according to the weights, selecting a target number of feature points as target feature points of the image to be processed. The weights include a texture weight; the texture weight is directly proportional to the color change range of pixels in a target-sized image region in which the feature point is located. The embodiment can reduce the number of feature points of the image and thereby relieve the storage pressure of the feature points of the image.
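The weight-based selection can be sketched as below. The abstract only says the texture weight is proportional to the color change range in a region around the point; this sketch assumes a grayscale image, a square window of hypothetical half-width `half_size`, and uses the range (max minus min) itself as the weight.

```python
def texture_weight(image, point, half_size):
    """Color change range (max - min) in a square window centred on the point, clipped to the image."""
    x, y = point
    height, width = len(image), len(image[0])
    values = [
        image[row][col]
        for row in range(max(0, y - half_size), min(height, y + half_size + 1))
        for col in range(max(0, x - half_size), min(width, x + half_size + 1))
    ]
    return max(values) - min(values)


def select_feature_points(image, points, target_count, half_size=1):
    """Keep the target_count points with the largest texture weight."""
    ranked = sorted(points, key=lambda p: texture_weight(image, p, half_size), reverse=True)
    return ranked[:target_count]


image = [[0, 0, 0, 0],
         [0, 0, 0, 0],
         [0, 0, 0, 255],
         [0, 0, 100, 255]]
points = [(0, 0), (3, 3), (1, 2)]  # (x, y) coordinates
kept = select_feature_points(image, points, target_count=2)
```

Points in flat regions, such as `(0, 0)` here, get a weight of zero and are dropped first, which is how the selection reduces the number of stored feature points.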
-
Publication No.: US20210027478A1
Publication date: 2021-01-28
Application No.: US16981240
Filing date: 2018-11-20
Inventors: Yangyu Chen, Yi He, Lei Li
Abstract: A method for determining a geometric transformation relation for images is provided. The method includes: obtaining a first image and a second image collection; obtaining coordinates of three non-collinear pixels in the first image to form a coordinate collection; generating a target matrix according to the coordinate collection; for at least two second images in the second image collection, obtaining coordinates of pixels in the at least two second images corresponding to the pixels indicated by the coordinates in the coordinate collection, to form a corresponding coordinate collection; according to the corresponding coordinate collection, generating a target column vector matching a column number of the target matrix; pre-multiplying the target matrix by the target column vector to obtain a transformation column vector; and determining the geometric transformation relation of the first image and the at least two second images according to the transformation column vector.
-
Publication No.: US11593582B2
Publication date: 2023-02-28
Application No.: US16979784
Filing date: 2018-12-29
Inventors: Yi He, Lei Li, Cheng Yang, Gen Li, Yitan Li
Abstract: The disclosure relates to a method and device for comparing media features, the method comprising: obtaining first media feature sequences of a first media object and second media feature sequences of a second media object, wherein the first media feature sequence comprises a plurality of first media feature units arranged in sequence, and the second media feature sequence comprises a plurality of second media feature units arranged in sequence; determining unit similarities between the first media feature units and the second media feature units; determining a similarity matrix between the first media feature sequences and the second media feature sequences according to the unit similarities; and determining a similarity of the first media object and the second media object according to the similarity matrix.
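A minimal sketch of the similarity-matrix comparison follows. Two choices here are assumptions rather than anything the abstract states: unit similarity is the fraction of matching bits between equal-length binary units, and the overall similarity is the best mean along any diagonal of the matrix with at least `min_length` cells (a diagonal corresponds to the two sequences aligned at a fixed time offset).

```python
def unit_similarity(u, v):
    """Fraction of positions at which two equal-length feature units agree."""
    same = sum(1 for a, b in zip(u, v) if a == b)
    return same / len(u)


def similarity_matrix(seq_a, seq_b):
    """Entry [i][j] is the similarity between unit i of seq_a and unit j of seq_b."""
    return [[unit_similarity(u, v) for v in seq_b] for u in seq_a]


def sequence_similarity(matrix, min_length=2):
    """Best mean similarity along any diagonal with at least min_length cells."""
    rows, cols = len(matrix), len(matrix[0])
    best = 0.0
    for offset in range(-(rows - 1), cols):
        cells = [matrix[i][i + offset] for i in range(rows) if 0 <= i + offset < cols]
        if len(cells) >= min_length:
            best = max(best, sum(cells) / len(cells))
    return best


# seq_b contains seq_a shifted by one unit.
seq_a = [(0, 0), (1, 1), (0, 1)]
seq_b = [(1, 0), (0, 0), (1, 1), (0, 1)]
score = sequence_similarity(similarity_matrix(seq_a, seq_b))
```

The `min_length` guard keeps a single coincidentally matching unit in a corner of the matrix from producing a perfect score.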
-
Publication No.: US11265598B2
Publication date: 2022-03-01
Application No.: US16958513
Filing date: 2018-12-29
Inventors: Yi He, Lei Li, Cheng Yang, Gen Li, Yitan Li
IPC classification: H04N21/44, H04N21/234
Abstract: The present invention relates to a method and a device for determining a duplicate video, wherein the method comprises: acquiring multiple types of video features of a query video; according to the multiple types of video features of the query video, sequentially comparing a plurality of existing videos respectively to obtain a sequence comparison result; performing a first ranking on the plurality of existing videos according to the sequence comparison result, and taking the first n existing videos as first candidate videos according to a result of the first ranking, where n is a positive integer; and determining a duplication level of the query video according to the sequence comparison result of the first candidate videos.
-
Publication No.: US11182426B2
Publication date: 2021-11-23
Application No.: US16636579
Filing date: 2018-12-29
IPC classification: G06F16/683, G06F16/638, G06F16/632, G10L19/02, G10L25/18, G10L25/54
Abstract: An audio retrieval and recognition method and device are provided. The method includes acquiring an audio fingerprint of a query audio; and recognizing the query audio according to the audio fingerprint to obtain a recognition result. The audio fingerprint includes a first part configured for indicating a content feature of the query audio and a second part configured for indicating credibility of the first part.
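How a credibility part might be used during recognition can be sketched as below. This is an assumption for illustration only: the first part is modelled as content bits, the second part as one credibility flag per bit, and matching uses a Hamming distance restricted to bits that both fingerprints flag as credible. The names `weighted_distance` and `recognize` are hypothetical.

```python
def weighted_distance(query_bits, query_weights, ref_bits, ref_weights):
    """Mismatch rate over the bits that both fingerprints mark as credible."""
    compared = 0
    mismatched = 0
    for qb, qw, rb, rw in zip(query_bits, query_weights, ref_bits, ref_weights):
        if qw and rw:
            compared += 1
            if qb != rb:
                mismatched += 1
    # With no credible bits in common, treat the pair as maximally distant.
    return mismatched / compared if compared else 1.0


def recognize(query, references):
    """Return the name of the reference fingerprint closest to the query.

    query is (bits, weights); references maps a name to (bits, weights).
    """
    q_bits, q_weights = query
    return min(
        references,
        key=lambda name: weighted_distance(q_bits, q_weights, *references[name]),
    )


query = ([1, 0, 1, 1], [1, 1, 1, 0])  # last bit is flagged as unreliable
references = {
    "song_a": ([1, 0, 1, 0], [1, 1, 1, 1]),
    "song_b": ([0, 1, 1, 1], [1, 1, 1, 1]),
}
match = recognize(query, references)
```

The query's last bit disagrees with "song_a", but because it is flagged as not credible it does not count against the match.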