-
公开(公告)号:US11463631B2
公开(公告)日:2022-10-04
申请号:US17025255
申请日:2020-09-18
Inventor: Henan Zhang , Xin Li , Fu Li , Tianwei Lin , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
Abstract: Embodiments of the present disclosure provide a method and apparatus for generating an image. The method may include: receiving a first image including a face input by a user in an interactive scene; presenting the first image to the user; inputting the first image into a pre-trained generative adversarial network in a backend to obtain a second image output by the generative adversarial network; where the generative adversarial network uses face attribute information generated based on the input image as a constraint; and presenting the second image to the user in response to obtaining the second image output by the generative adversarial network in the backend.
-
公开(公告)号:US20220270289A1
公开(公告)日:2022-08-25
申请号:US17743402
申请日:2022-05-12
Inventor: Wei Zhang , Xiaoqing Ye , Xiao Tan , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
IPC: G06T7/73 , G06T7/194 , G06T7/593 , H04N13/128
Abstract: A method and device for detecting a vehicle pose, relating to the fields of computer vision and automatic driving. The specific implementation solution comprises: inputting a vehicle left view point image and a vehicle right view point image into a part prediction and mask segmentation network model, and determining foreground pixel points and part coordinates thereof in a reference image; converting coordinates of the foreground pixels in the reference image into coordinates of the foreground pixels in a camera coordinate system so as to obtain a pseudo-point cloud, and fusing part coordinate of the foreground pixels and the pseudo-point cloud to obtain fused pseudo-point cloud; and inputting the fused pseudo-point cloud into a pre-trained pose prediction model to obtain a pose information of the vehicle to be detected.
-
公开(公告)号:US11416967B2
公开(公告)日:2022-08-16
申请号:US17024253
申请日:2020-09-17
Inventor: Chao Li , Shilei Wen , Errui Ding
Abstract: Embodiments of the present disclosure provide a video processing method, a video processing device and a related non-transitory computer readable storage medium. The method includes the following. Frame sequence data of a low-resolution video to be converted is obtained. Pixel tensors of each frame in the frame sequence data are inputted into a pre-trained neural network model to obtain high-resolution video frame sequence data corresponding to the video to be converted output by the neural network model. The neural network model obtains the high-resolution video frame sequence data based on high-order pixel information of each frame in the frame sequence data.
-
公开(公告)号:US11363271B2
公开(公告)日:2022-06-14
申请号:US17125370
申请日:2020-12-17
Inventor: Chao Li , Yukang Ding , Dongliang He , Fu Li , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
IPC: H04N19/132 , H04N19/172 , G06K9/62 , G06N3/04
Abstract: A method for video frame interpolation, a related electronic device and a storage medium is disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame and is inserted between the (i−1)th frame and the ith frame.
-
公开(公告)号:US20210240983A1
公开(公告)日:2021-08-05
申请号:US17236050
申请日:2021-04-21
Inventor: Jidong Peng , Cheng Wan , Cheng Chen , Shilei Wen , Ke Sun , Chengliang Luo
Abstract: A method and an apparatus for building extraction, and a storage medium are provided in the present disclosure. The method includes: obtaining remote sensing image data and user behavior associated data of a target area, in which the user behavior associated data and the remote sensing image data have a spatiotemporal correlation; generating new channel data according to the user behavior associated data and the remote sensing image data and extracting a building in the target area according to the remote sensing image data and the new channel data.
-
6.
公开(公告)号:US11538286B2
公开(公告)日:2022-12-27
申请号:US16710464
申请日:2019-12-11
Inventor: Wei Zhang , Xiao Tan , Hao Sun , Shilei Wen , Errui Ding
Abstract: A method and apparatus for vehicle damage assessment, an electronic device, and a computer-readable storage medium are provided. The method may include: extracting, from an input image, a first feature characterizing a part of a vehicle and a second feature characterizing a damage type of the vehicle; integrating the first feature and the second feature to generate a third feature characterizing a corresponding relation between the part and the damage type; converting the third feature into a characteristic vector; and determining a damage recognition result based on the characteristic vector. According to the technical solution of the disclosure, users can rapidly and accurately learn about the damage condition of the vehicle by providing pictures or videos of the damaged vehicle, thus providing an objective basis for subsequent damage assessment, claim settlement, and repair.
-
公开(公告)号:US11379696B2
公开(公告)日:2022-07-05
申请号:US16817419
申请日:2020-03-12
Inventor: Zhigang Wang , Jian Wang , Shilei Wen , Errui Ding , Hao Sun
Abstract: The present disclosure provides a pedestrian re-identification method and apparatus, computer device and readable medium. The method comprises: collecting a target image and a to-be-identified image including a pedestrian image; obtaining a feature expression of the target image and a feature expression of the to-be-identified image respectively, based on a pre-trained feature extraction model; wherein the feature extraction model is obtained by training based on a self-attention feature of a base image as well as a co-attention feature of the base image relative to a reference image; identifying whether a pedestrian in the to-be-identified image is the same pedestrian as that in the target image according to the feature expression of the target image and the feature expression of the to-be-identified image. According to the pedestrian re-identification method of the present disclosure, the accuracy of the pedestrian re-identification can be effectively improved when the feature extraction model is used to perform the pedestrian re-identification.
-
公开(公告)号:US20210319062A1
公开(公告)日:2021-10-14
申请号:US17182960
申请日:2021-02-23
Inventor: Xiang Long , Ping Wang , Fu Li , Dongliang He , Hao Sun , Shilei Wen
IPC: G06F16/783 , G06K9/00 , G06K9/62
Abstract: Embodiments of the present disclosure disclose a method and apparatus for searching a video segment, a device and a medium, and relate to the field of video data search. The method includes: sampling video frames from a target video and videos to be searched in a video library, and extracting features from the sampled frames; matching the target video and the videos to be searched according to the extracted features to determine a candidate video to be searched that matches the target video; determining at least one candidate video segment from the determined candidate video, and calculating a degree of matching between the target video and each candidate video segment based on the extracted features of each sampled frame; and determining a video segment matching the target video in the videos to be searched according to the calculated degree of matching between the target video and each candidate video segment.
-
9.
公开(公告)号:US11615140B2
公开(公告)日:2023-03-28
申请号:US17144523
申请日:2021-01-08
Inventor: Xiang Long , Dongliang He , Fu Li , Xiang Zhao , Tianwei Lin , Hao Sun , Shilei Wen , Errui Ding
IPC: G06F16/738 , G06V20/40 , G06F18/214 , G06F18/25
Abstract: A method includes screening, by a video-clip screening module in a video description model, a plurality of video proposal clips acquired from a video to be analyzed, to acquire a plurality of video clips suitable for description. The plural video proposal clips acquired from the video to be analyzed may be screened by the video-clip screening module to acquire the plural video clips suitable for description; and then, each video clip is described by a video-clip describing module, thus avoiding description of all the video proposal clips, only describing the screened video clips which have strong correlation with the video and are suitable for description, removing the interference of the description of the video clips which are not suitable for description in the description of the video, guaranteeing the accuracy of the final descriptions of the video clips, and improving the quality of the descriptions of the video clips.
-
公开(公告)号:US20200320303A1
公开(公告)日:2020-10-08
申请号:US16905482
申请日:2020-06-18
Inventor: Dongliang He , Xiang Zhao , Jizhou Huang , Fu Li , Xiao Liu , Shilei Wen
IPC: G06K9/00 , G06F16/783 , G06N3/08
Abstract: A method and an apparatus for grounding a target video clip in a video are provided. The method includes: determining a current video clip in the video based on a current position; acquiring descriptive information indicative of a pre-generated target video clip descriptive feature, and executing a target video clip determining step which includes: determining current state information of the current video clip, wherein the current state information includes information indicative of a feature of the current video clip; generating a current action policy based on the descriptive information and the current state information, the current action policy being indicative of a position change of the current video clip in the video; the method further comprises: in response to reaching a preset condition, using a video clip resulting from executing the current action policy on the current video clip as the target video clip.
-
-
-
-
-
-
-
-
-