SELF-DIRECTED VISUAL INTELLIGENCE SYSTEM
    3.
    发明公开

    公开(公告)号:US20240062522A1

    公开(公告)日:2024-02-22

    申请号:US17968986

    申请日:2022-10-19

    CPC classification number: G06V10/774 G06V20/46

    Abstract: There is provided a self-directed visual intelligence system, The self-directed visual intelligence system according to an embodiment prepares data necessary for training a visual intelligence model when a change in a visual context of a real world is recognized, configures a visual intelligence model and configures training data of the visual intelligence model, based on the changed visual context of the real world, trains the configured visual intelligence model with the training data, and evaluates performance of the trained visual intelligence model. Accordingly, the visual intelligence model is corrected/improved in a self-directed way according to a change in a visual context of a real world, and is grown/advanced by itself, so that performance of the visual intelligence model is maintained in a best state even in response to any change in the context of the real world.

    IMAGE-BASED LANE DETECTION AND EGO-LANE RECOGNITION METHOD AND APPARATUS

    公开(公告)号:US20210334553A1

    公开(公告)日:2021-10-28

    申请号:US17137832

    申请日:2020-12-30

    Abstract: A method and an apparatus for detecting a lane is provided. The lane detection apparatus according to an embodiment includes: an acquisition unit configured to acquire a front image of a vehicle; and a processor configured to input the image acquired through the acquisition unit to an AI model, and to detect information of a lane on a road, and the AI model is trained to detect lane information that is expressed in a plane form from an input image. Accordingly, data imbalance between a lane area and a non-lane area can be solved by using the AI model which learns/predicts lane information that is expressed in a plane form, not in a segment form such as a straight line or curved line.

    REAL-TIME 3D HUMAN OBJECT RECONSTRUCTION APPARATUS AND METHOD BASED ON MONOCULAR COLOR IMAGE

    公开(公告)号:US20250104344A1

    公开(公告)日:2025-03-27

    申请号:US18573736

    申请日:2022-12-13

    Abstract: There are provided an apparatus and a method for reconstructing a 3D human object in real time based on a monocular color image. A 3D human object reconstruction apparatus according to an embodiment extracts a pixel-aligned feature from a monocular image, extracts a ray-invariant feature from the pixel-aligned feature, generates encoded position information by encoding position information of a point, predicts a SD of a point from the ray-invariant feature and the encoded position information which are extracted, and reconstructs a 3D human object by using the predicted SD. Accordingly, the ray-invariant feature extracted from the pixel-aligned feature, and the encoded position information are used, so that an amount of computation for predicting SDs of points of a 3D space can be noticeably reduced and a speed can be remarkably enhanced.

    BACKBONE NETWORK LEARNING METHOD AND SYSTEM BASED ON SELF-SUPERVISED LEARNING AND MULTI-HEAD FOR VISUAL INTELLIGENCE

    公开(公告)号:US20240394546A1

    公开(公告)日:2024-11-28

    申请号:US18225304

    申请日:2023-07-24

    Abstract: There is provided a learning method and system of a backbone network for visual intelligence based on self-supervised learning and multi-head. A network learning system according to an embodiment generates a plurality of first modified vectors by modifying a first feature vector outputted from a teacher network, generates a plurality of second modified vectors by modifying a second feature vector outputted from a student network, calculates a loss by using the first modified vectors and the second modified vectors, and optimizes parameters of the student network. Accordingly, the effect of learning by knowledge distillation may be enhanced by training the backbone network for visual intelligence like group learning is performed by various teacher networks and student networks.

Patent Agency Ranking