-
1.
公开(公告)号:US20240062057A1
公开(公告)日:2024-02-22
申请号:US17818506
申请日:2022-08-09
Applicant: Adobe Inc.
Inventor: Surgan Jandial , Nikaash Puri , Balaji Krishnamurthy
CPC classification number: G06N3/08 , G06N3/0454
Abstract: This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that regularize learning targets for a student network by leveraging past state outputs of the student network with outputs of a teacher network to determine a retrospective knowledge distillation loss. For example, the disclosed systems utilize past outputs from a past state of a student network with outputs of a teacher network to compose student-regularized teacher outputs that regularize training targets by making the training targets similar to student outputs while preserving semantics from the teacher training targets. Additionally, the disclosed systems utilize the student-regularized teacher outputs with student outputs of the present states to generate retrospective knowledge distillation losses. Then, in one or more implementations, the disclosed systems compound the retrospective knowledge distillation losses with other losses of the student network outputs determined on the main training tasks to learn parameters of the student networks.
-
公开(公告)号:US11080817B2
公开(公告)日:2021-08-03
申请号:US16673574
申请日:2019-11-04
Applicant: Adobe Inc.
Inventor: Kumar Ayush , Surgan Jandial , Mayur Hemani , Balaji Krishnamurthy , Ayush Chopra
Abstract: Generating a synthesized image of a person wearing clothing is described. A two-dimensional reference image depicting a person wearing an article of clothing and a two-dimensional image of target clothing in which the person is to be depicted as wearing are received. To generate the synthesized image, a warped image of the target clothing is generated via a geometric matching module, which implements a machine learning model trained to recognize similarities between warped and non-warped clothing images using multi-scale patch adversarial loss. The multi-scale patch adversarial loss is determined by sampling patches of different sizes from corresponding locations of warped and non-warped clothing images. The synthesized image is generated on a per-person basis, such that the target clothing fits the particular body shape, pose, and unique characteristics of the person.
-
公开(公告)号:US20210133919A1
公开(公告)日:2021-05-06
申请号:US16673574
申请日:2019-11-04
Applicant: Adobe Inc.
Inventor: Kumar Ayush , Surgan Jandial , Mayur Hemani , Balaji Krishnamurthy , Ayush Chopra
Abstract: Generating a synthesized image of a person wearing clothing is described. A two-dimensional reference image depicting a person wearing an article of clothing and a two-dimensional image of target clothing in which the person is to be depicted as wearing are received. To generate the synthesized image, a warped image of the target clothing is generated via a geometric matching module, which implements a machine learning model trained to recognize similarities between warped and non-warped clothing images using multi-scale patch adversarial loss. The multi-scale patch adversarial loss is determined by sampling patches of different sizes from corresponding locations of warped and non-warped clothing images. The synthesized image is generated on a per-person basis, such that the target clothing fits the particular body shape, pose, and unique characteristics of the person.
-
公开(公告)号:US20240362941A1
公开(公告)日:2024-10-31
申请号:US18140143
申请日:2023-04-27
Applicant: Adobe Inc.
Inventor: Silky Singh , Surgan Jandial , Shripad Vilasrao Deshmukh , Milan Aggarwal , Mausoom Sarkar , Balaji Krishnamurthy , Arneh Jain , Abhinav Java
IPC: G06V30/262 , G06V30/14 , G06V30/19 , G06V30/414
CPC classification number: G06V30/274 , G06V30/1444 , G06V30/19147 , G06V30/414
Abstract: A corrective noise system receives an electronic version of a fillable form generated by a segmentation network and receives a correction to a segmentation error in the electronic version of the fillable form. The corrective noise system is trained to generate noise that represents the correction and superimpose the noise on the fillable form. The corrective noise system is further trained to identify regions in a corpus of forms that are semantically similar to a region that was subject to the correction. The generated noise is propagated to the semantically similar regions in the corpus of forms and the noisy corpus of forms is provided as input to the segmentation network. The noise causes the segmentation network to accurately identify fillable regions in the corpus of forms and output a segmented version of the corpus of forms having improved fidelity without retraining or otherwise modifying the segmentation network.
-
公开(公告)号:US20240330351A1
公开(公告)日:2024-10-03
申请号:US18190686
申请日:2023-03-27
Applicant: Adobe Inc.
Inventor: Abhinav Java , Surgan Jandial , Shripad Vilasrao Deshmukh , Milan Aggarwal , Mausoom Sarkar , Balaji Krishnamurthy , Arneh Jain
IPC: G06F16/383 , G06F16/332 , G06V30/19 , G06V30/412
CPC classification number: G06F16/383 , G06F16/332 , G06V30/19147 , G06V30/412
Abstract: Form structure similarity detection techniques are described. A content processing system, for instance, receives a query snippet that depicts a query form structure. The content processing system generates a query layout string that includes semantic indicators to represent the query form structure and generates candidate layout strings that represent form structures from a target document. The content processing system calculates similarity scores between the query layout string and the candidate layout strings. Based on the similarity scores, the content processing system generates a target snippet for display that depicts a form structure that is structurally similar to the query form structure. The content processing system is further operable to generate a training dataset that includes image pairs of snippets depicting form structures that are structurally similar. The content processing system utilizes the training dataset to train a machine learning model to perform form structure similarity matching.
-
公开(公告)号:US11874902B2
公开(公告)日:2024-01-16
申请号:US17160862
申请日:2021-01-28
Applicant: Adobe Inc.
Inventor: Pinkesh Badjatiya , Surgan Jandial , Pranit Chawla , Mausoom Sarkar , Ayush Chopra
IPC: G06F18/25 , G06N3/04 , G06F16/538 , G06F16/532 , G06F16/535 , G06F18/214
CPC classification number: G06F18/253 , G06F16/532 , G06F16/535 , G06F16/538 , G06F18/214 , G06F18/251 , G06N3/04
Abstract: Techniques are disclosed for text conditioned image searching. A methodology implementing the techniques according to an embodiment includes receiving a source image and a text query defining a target image attribute. The method also includes decomposing the source image into image content and style feature vectors and decomposing the text query into text content and style feature vectors, wherein image style is descriptive of image content and text style is descriptive of text content. The method further includes composing a global content feature vector based on the text content feature vector and the image content feature vector and composing a global style feature vector based on the text style feature vector and the image style feature vector. The method further includes identifying a target image that relates to the global content feature vector and the global style feature vector so that the target image relates to the target image attribute.
-
公开(公告)号:US20240296337A1
公开(公告)日:2024-09-05
申请号:US18178225
申请日:2023-03-03
Applicant: ADOBE INC.
Inventor: Surgan Jandial , Tarun Ram Menta , Akash Sunil Patil , Chirag Agarwal , Mausoom Sarkar , Balaji Krishnamurthy
IPC: G06N3/096
CPC classification number: G06N3/096
Abstract: Systems and methods for transfer learning are provided. According to one aspect, a method for transfer learning includes obtaining a target dataset, a source dataset, and a machine learning model trained on the source dataset; selecting a hard subset of the target dataset based on a similarity between the hard subset and the source dataset; computing a transferability metric for the target dataset based on the hard subset of the target dataset; and training the machine learning model using the target dataset based on the transferability metric.
-
公开(公告)号:US20240119122A1
公开(公告)日:2024-04-11
申请号:US18045542
申请日:2022-10-11
Applicant: ADOBE INC.
Inventor: Shripad Vilasrao Deshmukh , Surgan Jandial , Abhinav Java , Milan Aggarwal , Mausoom Sarkar , Arneh Jain , Balaji Krishnamurthy
IPC: G06K9/62
CPC classification number: G06K9/6269 , G06K9/6259 , G06K9/6285
Abstract: Systems and methods for data augmentation are provided. One aspect of the systems and methods include receiving an image that is misclassified by a classification network; computing an augmentation image based on the image using an augmentation network; and generating an augmented image by combining the image and the augmentation image, wherein the augmented image is correctly classified by the classification network.
-
公开(公告)号:US11797823B2
公开(公告)日:2023-10-24
申请号:US16793551
申请日:2020-02-18
Applicant: Adobe Inc.
Inventor: Ayush Chopra , Balaji Krishnamurthy , Mausoom Sarkar , Surgan Jandial
IPC: G06N3/04 , G06N3/084 , G06F18/214 , G06N3/047 , G06V10/764 , G06V10/82
CPC classification number: G06N3/04 , G06F18/214 , G06N3/047 , G06N3/084 , G06V10/764 , G06V10/82
Abstract: Generating a machine learning model that is trained using retrospective loss is described. A retrospective loss system receives an untrained machine learning model and a task for training the model. The retrospective loss system initially trains the model over warm-up iterations using task-specific loss that is determined based on a difference between predictions output by the model during training on input data and a ground truth dataset for the input data. Following the warm-up training iterations, the retrospective loss system continues to train the model using retrospective loss, which is model-agnostic and constrains the model such that a subsequently output prediction is more similar to the ground truth dataset than the previously output prediction. After determining that the model's outputs are within a threshold similarity to the ground truth dataset, the model is output with its current parameters as a trained model.
-
公开(公告)号:US11720651B2
公开(公告)日:2023-08-08
申请号:US17160893
申请日:2021-01-28
Applicant: Adobe Inc.
Inventor: Pinkesh Badjatiya , Surgan Jandial , Pranit Chawla , Mausoom Sarkar , Ayush Chopra
IPC: G06F18/25 , G06N3/04 , G06F16/583 , G06F16/532 , G06F16/538 , G06F18/214
CPC classification number: G06F18/253 , G06F16/532 , G06F16/538 , G06F16/5846 , G06F18/214 , G06F18/251 , G06N3/04
Abstract: Techniques are disclosed for text-conditioned image searching. A methodology implementing the techniques includes decomposing a source image into visual feature vectors associated with different levels of granularity. The method also includes decomposing a text query (defining a target image attribute) into feature vectors associated with different levels of granularity including a global text feature vector. The method further includes generating image-text embeddings based on the visual feature vectors and the text feature vectors to encode information from visual and textual features. The method further includes composing a visio-linguistic representation based on a hierarchical aggregation of the image-text embeddings to encode visual and textual information at multiple levels of granularity. The method further includes identifying a target image that includes the visio-linguistic representation and the global text feature vector, so that the target image relates to the target image attribute, and providing the target image as an image search result.
-
-
-
-
-
-
-
-
-