Differencing Based Self-Supervised Pretraining for Change Detection (D-SSCD)

    公开(公告)号:US20230123493A1

    公开(公告)日:2023-04-20

    申请号:US17502729

    申请日:2021-10-15

    Abstract: A computer implemented network for executing a self-supervised scene change detection method, wherein at least one image pair with images captured at different instances of time is processed to detect structural changes caused by an appearance or disappearance of an object in the image pair, and wherein a self-supervised pretraining method is employed that utilizes an unlabelled image pair or pairs to learn representations for scene change detection, and wherein the aligned image pair is subjected to a differencing based self-supervised pre-training method to maximize a correlation between changed regions in the images which provide the structural changes that occur in the image pairs.

    Differencing based self-supervised scene change detection (D-SSCD) with temporal consistency

    公开(公告)号:US12062188B2

    公开(公告)日:2024-08-13

    申请号:US17691397

    申请日:2022-03-10

    CPC classification number: G06T7/20 G06N3/088 G06T2207/20081 G06T2207/20084

    Abstract: A computer implemented network for executing a self-supervised scene change detection method in which image pairs (T0, T1) from different time instances are subjected to random photometric transformations to obtain two pairs of augmented images (T0→T0′, T0″; T1→T1′, T1″), which augmented images are passed into an encoder (fθ) and a projection head (gϕ) to provide corresponding feature representations. Absolute feature differencing is applied over the outputs of the projection head (gϕ) to obtain difference representations (d1, d2) of changed features between the pair of images, and a self-supervised objective function (LSSL) is applied on the difference representations d1 and d2 to maximize a cross-correlation of the changed features, wherein d1 and d2 are defined as









    d
    1

    =



    "\[LeftBracketingBar]"



    g

    (

    f

    (

    T
    0


    )

    )

    -

    g

    (

    f

    (

    T
    0


    )

    )




    "\[RightBracketingBar]"








    d
    2

    =



    "\[LeftBracketingBar]"



    g

    (

    f

    (

    T
    0


    )

    )

    -

    g

    (

    f

    (

    T
    1


    )

    )




    "\[RightBracketingBar]"







    (
    1
    )







    Furthermore, an invariant prediction and change consistency loss is applied in the D-SSCD Network to reduce the effects of differences in the lighting conditions or camera viewpoints by enhancing the image alignment between the temporal images in the decision and feature space.

    Differencing Based Self-Supervised Scene Change Detection (D-SSCD) with Temporal Consistency

    公开(公告)号:US20230289977A1

    公开(公告)日:2023-09-14

    申请号:US17691397

    申请日:2022-03-10

    CPC classification number: G06T7/20 G06N3/088 G06T2207/20081 G06T2207/20084

    Abstract: A computer implemented network for executing a self-supervised scene change detection method in which image pairs (T0, T1) from different time instances are subjected to random photometric transformations to obtain two pairs of augmented images (T0 → T 0′, T 0‴ ; T1 → T 1′, T1″), which augmented images are passed into an encoder (fθ) and a projection head (gϕ) to provide corresponding feature representations. Absolute feature differencing is applied over the outputs of the projection head (gϕ) to obtain difference representations (d1, d2) of changed features between the pair of images, and a self-supervised objective function (LSSL) is applied on the difference representations d1 and d2 to maximize a cross-correlation of the changed features, wherein d1 and d2 are defined as










    d
    1

    =


    g


    f




    T


    0






    g


    f




    T


    1












    d
    2

    =


    g


    f




    T


    0






    g


    f




    T


    1













    ­­­(1)







    Furthermore, an invariant prediction and change consistency loss is applied in the D-SSCD Network to reduce the effects of differences in the lighting conditions or camera viewpoints by enhancing the image alignment between the temporal images in the decision and feature space.

Patent Agency Ranking