-
Publication number: US11803981B2
Publication date: 2023-10-31
Application number: US17613617
Filing date: 2020-05-29
Applicant: Mobileye Vision Technologies Ltd.
Inventor: Gideon Stein , Itay Blumenthal , Nadav Shaag , Jeffrey Moskowitz , Natalie Carlebach
IPC: G06T7/579 , G06T7/292 , H04N13/25 , B60R1/22 , G06V10/774 , G06V20/58 , G06V10/82 , G06T3/00 , G06T17/05 , G06N3/045 , G06V10/25 , G06V20/56 , G08G1/16 , G06T17/00
CPC classification number: G06T7/579 , B60R1/22 , G06N3/045 , G06T3/0093 , G06T7/292 , G06T17/00 , G06T17/05 , G06V10/25 , G06V10/7747 , G06V10/82 , G06V20/56 , G06V20/58 , G08G1/166 , H04N13/25 , G06T2200/04 , G06T2200/08 , G06T2207/10016 , G06T2207/10021 , G06T2207/20081 , G06T2207/20084 , G06T2207/30252 , G08G1/165
Abstract: Various systems and methods for modeling a scene. A device for modeling a scene includes a hardware interface to obtain a time-ordered sequence of images representative of a scene, the time-ordered sequence including a plurality of images, one of the sequence of images being a current image, the scene captured by a monocular imaging system; and processing circuitry to: provide a data set to an artificial neural network (ANN) to produce a three-dimensional structure of the scene, the data set including: a portion of the sequence of images, the portion of the sequence of images including the current image; and motion of a sensor that captured the sequence of images; and model the scene using the three-dimensional structure of the scene, wherein the three-dimensional structure is determined for both moving and fixed objects in the scene.
-
Publication number: US11798255B2
Publication date: 2023-10-24
Application number: US17563205
Filing date: 2021-12-28
Applicant: Korea Electronics Technology Institute
Inventor: Han Mu Park , Jin Yea Jang , Yoon Young Jeong , Sa Im Shin
CPC classification number: G06V10/421 , G06V10/7747 , G06V10/84 , G09B21/009
Abstract: There are provided a method for segmenting a sign language video by gloss to recognize a sign language sentence, and a method for training. According to an embodiment, a sign language video segmentation method receives an input of a sign language sentence video, and segments the inputted sign language sentence video by gloss. Accordingly, there is suggested a method for segmenting a sign language sentence video by gloss, analyzing various gloss sequences from the linguistic perspective, understanding meanings robustly in spite of various changes in sentences, and translating sign language into appropriate Korean sentences.
-
Publication number: US11787339B2
Publication date: 2023-10-17
Application number: US16947386
Filing date: 2020-07-30
Applicant: Magna Electronics Inc.
Inventor: Shweta Suresh Daga , Jigneshkumar Natvarlal Vasoya
IPC: G06V10/22 , B60R1/00 , G06V10/44 , G06V20/56 , G06F18/214 , G06V10/764 , G06V10/774
CPC classification number: B60R1/00 , G06F18/2148 , G06V10/22 , G06V10/44 , G06V10/764 , G06V10/7747 , G06V20/56 , B60R2300/10 , B60R2300/20 , B60R2300/808 , B60R2300/8086
Abstract: A trailer assist system for a vehicle includes a camera disposed at a rear portion of a vehicle and having a field of view exterior and at least rearward of the vehicle, the field of view encompassing at least a portion of a trailer coupler of a trailer stationary a distance from the vehicle. The camera captures image data that is representative of at least the trailer coupler of the trailer. An ECU includes an image processor operable to process image data captured by the camera. The ECU, responsive to image processing at the ECU of image data captured by the camera, determines a location of the trailer coupler using a detector model, which is based on an ensemble regression tree algorithm.
-
Publication number: US20230326186A1
Publication date: 2023-10-12
Application number: US17705595
Filing date: 2022-03-28
Applicant: International Business Machines Corporation
IPC: G06V10/762 , G06V10/74 , G06V10/774
CPC classification number: G06V10/7747 , G06V10/761 , G06V10/762
Abstract: An automated data labeling method, system, and computer program product that includes composing a semantically-named anchor vector derived from a source dataset into a sequence that defines a location description for target data items based on a generalization of distances into Cayley-Menger content and outputting a label for a target data item based on the location description.
-
Publication number: US20230316729A1
Publication date: 2023-10-05
Application number: US17711951
Filing date: 2022-04-01
Applicant: DeepMind Technologies Limited
Inventor: Dan-Andrei Calian , Sven Adrian Gowal , Timothy Arthur Mann , András György
IPC: G06V10/774 , G06V10/82 , G06V10/776
CPC classification number: G06V10/7747 , G06V10/82 , G06V10/776
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for processing a network input using a trained neural network with network parameters to generate an output for a machine learning task. The training includes: receiving a set of training examples each including a training network input and a reference output; for each training iteration, generating a corrupted network input for each training network input using a corruption neural network; updating perturbation parameters of the corruption neural network using a first objective function based on the corrupted network inputs; generating an updated corrupted network input for each training network input based on the updated perturbation parameters; and generating a network output for each updated corrupted network input using the neural network; for each training example, updating the network parameters using a second objective function based on the network output and the reference output.
-
Publication number: US20230316591A1
Publication date: 2023-10-05
Application number: US17709895
Filing date: 2022-03-31
Applicant: Adobe Inc.
Inventor: Zhixin Shu , Zhe Lin , Yuchen Liu , Yijun Li , Richard Zhang
IPC: G06T11/00 , G06V10/40 , G06V10/774
CPC classification number: G06T11/00 , G06V10/40 , G06V10/7747
Abstract: Techniques for identity preserved controllable facial image manipulation are described that support generation of a manipulated digital image based on a facial image and a render image. For instance, a facial image depicting a facial representation of an individual is received as input. A feature space including an identity parameter and at least one other visual parameter is extracted from the facial image. An editing module edits one or more of the visual parameters and preserves the identity parameter. A renderer generates a render image depicting a morphable model reconstruction of the facial image based on the edit. The render image and facial image are encoded, and a generator of a neural network is implemented to generate a manipulated digital image based on the encoded facial image and the encoded render image.
-
Publication number: US20230316471A1
Publication date: 2023-10-05
Application number: US17657171
Filing date: 2022-03-30
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Igal Avishai , Gal Bitan , Roy Yam
IPC: G06T5/20 , G06V10/778 , H04N5/232 , G06V10/774 , G06V10/762 , G06T9/00 , G06V10/22
CPC classification number: G06T5/20 , G06V10/778 , H04N5/23229 , G06T2207/20081 , G06V10/762 , G06T9/00 , G06V10/22 , G06V10/7747
Abstract: Methods and apparatuses for correcting bad image pixels are described. The described sensor-independent image processing techniques leverage one or more dynamic dictionaries of learned filters for bad pixel correction (e.g., where a camera leverages such dictionaries to efficiently identify filters to accurately adjust and correct bad pixel values). For example, a dictionary may store filters that are learned offline (via a self-supervised learning algorithm implemented at a server using known images and ground truth bad pixel correction values). To select a filter for a bad pixel correction operation, a camera may encode an image patch surrounding a bad pixel (into an encoded patch descriptor) and search the dictionary for a matching patch descriptor key. The camera may then apply the filter (value) corresponding to the searched patch descriptor (key) of the dictionary to the image patch to correct the bad pixel and generate a corrected output image.
-
Publication number: US20230315209A1
Publication date: 2023-10-05
Application number: US17710888
Filing date: 2022-03-31
Applicant: SONY GROUP CORPORATION
IPC: G06F3/01 , G06V10/82 , G06V10/774 , H04L67/04
CPC classification number: G06F3/017 , G06V10/82 , G06V10/7747 , H04L67/04
Abstract: An electronic device for gesture recognition on resource-constrained devices is provided. The electronic device controls storage of a plurality of first consecutive image frames in a first buffer of a first length. The plurality of first consecutive image frames corresponds to the first length. The electronic device recognizes a first hand sign of a plurality of hand signs in a first subset of image frames of the plurality of first consecutive image frames. The electronic device controls storage of the recognized first hand sign in a second buffer of a second length based on the determination that a ratio of a number of the first subset of image frames and the first length is one of equal to or greater than the threshold. The electronic device determines a gesture corresponding to one or more hand signs of the plurality of hand signs stored in the second buffer.
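Illustrative note (not part of the patent record): the two-buffer scheme in the abstract above might be sketched as follows. This is a minimal reading of the abstract, not the claimed implementation. The class name `GestureBuffer`, the per-frame sign labels (assumed to come from some upstream classifier, with `None` for frames showing no sign), the gesture lookup table, and the choice to clear the frame buffer after a sign is accepted are all assumptions made for the example.

```python
from collections import Counter, deque


class GestureBuffer:
    """Hypothetical sketch of a frame buffer feeding a hand-sign buffer."""

    def __init__(self, first_len, second_len, threshold, gestures):
        self.frames = deque(maxlen=first_len)   # first buffer: per-frame sign labels
        self.signs = deque(maxlen=second_len)   # second buffer: accepted hand signs
        self.threshold = threshold              # minimum fraction of agreeing frames
        self.gestures = gestures                # assumed map: tuple of signs -> gesture

    def push(self, frame_sign):
        """Add one frame's label; return a gesture name if the stored signs match one."""
        self.frames.append(frame_sign)
        if len(self.frames) < self.frames.maxlen:
            return None  # first buffer is not full yet
        labelled = [s for s in self.frames if s is not None]
        if labelled:
            sign, count = Counter(labelled).most_common(1)[0]
            # Accept the sign only if enough frames in the buffer agree on it.
            if count / self.frames.maxlen >= self.threshold:
                self.signs.append(sign)
                self.frames.clear()  # assumption: start a fresh window after acceptance
        return self.gestures.get(tuple(self.signs))


if __name__ == "__main__":
    gb = GestureBuffer(4, 2, 0.75, {("fist", "open"): "release"})
    for label in ["fist", "fist", None, "fist", "open", "open", "open", "open"]:
        result = gb.push(label)
    print(result)
```

With a first buffer of 4 frames and a threshold of 0.75, a sign is stored once at least 3 of the 4 buffered frames agree on it; the gesture is then looked up from the sequence of stored signs.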
-
Publication number: US20230298332A1
Publication date: 2023-09-21
Application number: US17874940
Filing date: 2022-07-27
Inventor: Do-Yeon KIM , Jong-Won CHOI , Yong-Hyun JEONG , Young-Min RO , Pyoung-Geon KIM
IPC: G06V10/82 , G06V10/778 , G06V10/774 , G06V10/764
CPC classification number: G06V10/82 , G06V10/764 , G06V10/7747 , G06V10/778
Abstract: A method and apparatus for training a fake image discriminative model according to an embodiment of the present disclosure includes generating one or more fake images for a real image by selecting one or more encoding layers and one or more decoding layers from a generator network of an autoencoder structure, generating a training image set based on the one or more fake images, and training a classifier for discriminating a fake image by using the training image set.
-
Publication number: US11756288B2
Publication date: 2023-09-12
Application number: US17569232
Filing date: 2022-01-05
Applicant: BAIDU USA LLC
Inventor: Xinyang Zhang , Zhisheng Hu , Zhenyu Zhong
IPC: G06K9/00 , G06V10/774 , G06T5/50
CPC classification number: G06V10/7747 , G06T5/50 , G06T2207/10048 , G06T2207/20081 , G06T2207/20084 , G06T2207/20221
Abstract: The present disclosure provides an image processing method and apparatus, an electronic device and a storage medium, which relate to the field of computer technology, and more particularly to artificial intelligence technology including computer vision, deep learning and the like. The image processing method includes: recognizing the image to be processed to determine attribute information of each object included in the image to be processed; determining a target thermal image to be recognized according to the attribute information of each object and the image to be processed; reconstructing the target thermal image to generate a first reconstructed image; and determining whether the image to be processed includes an object of a preset class according to a difference between the first reconstructed image and the target thermal image.