Automatically segmenting images based on natural language phrases

    公开(公告)号:US10089742B1

    公开(公告)日:2018-10-02

    申请号:US15458887

    申请日:2017-03-14

    摘要: The invention is directed towards segmenting images based on natural language phrases. An image and an n-gram, including a sequence of tokens, are received. An encoding of image features and a sequence of token vectors are generated. A fully convolutional neural network identifies and encodes the image features. A word embedding model generates the token vectors. A recurrent neural network (RNN) iteratively updates a segmentation map based on combinations of the image feature encoding and the token vectors. The segmentation map identifies which pixels are included in an image region referenced by the n-gram. A segmented image is generated based on the segmentation map. The RNN may be a convolutional multimodal RNN. A separate RNN, such as a long short-term memory network, may iteratively update an encoding of semantic features based on the order of tokens. The first RNN may update the segmentation map based on the semantic feature encoding.

    Image Object Segmentation Using Examples
    3.
    发明申请
    Image Object Segmentation Using Examples 有权
    使用实例的图像对象分割

    公开(公告)号:US20170039723A1

    公开(公告)日:2017-02-09

    申请号:US14817731

    申请日:2015-08-04

    IPC分类号: G06T7/00 G06K9/62

    摘要: Systems and methods are disclosed herein for using one or more computing devices to automatically segment an object in an image by referencing a dataset of already-segmented images. The technique generally involves identifying a patch of an already-segmented image in the dataset based on the patch of the already-segmented image being similar to an area of the image including a patch of the image. The technique further involves identifying a mask of the patch of the already-segmented image, the mask representing a segmentation in the already-segmented image. The technique also involves segmenting the object in the image based on at least a portion of the mask of the patch of the already-segmented image.

    摘要翻译: 本文公开的系统和方法用于使用一个或多个计算设备通过参考已经分割的图像的数据集自动地分割图像中的对象。 该技术通常涉及基于已经分段的图像的片段类似于包括图像的片段的图像的区域来识别数据集中的已经分割的图像的片段。 该技术还涉及识别已经分割的图像的斑块的掩模,该掩码表示已经分割的图像中的分割。 该技术还涉及基于已经分割的图像的补片的掩模的至少一部分来分割图像中的对象。

    EDITING DIGITAL IMAGES UTILIZING A NEURAL NETWORK WITH AN IN-NETWORK RENDERING LAYER

    公开(公告)号:US20180253869A1

    公开(公告)日:2018-09-06

    申请号:US15448206

    申请日:2017-03-02

    摘要: The present disclosure includes methods and systems for generating modified digital images utilizing a neural network that includes a rendering layer. In particular, the disclosed systems and methods can train a neural network to decompose an input digital image into intrinsic physical properties (e.g., such as material, illumination, and shape). Moreover, the systems and methods can substitute one of the intrinsic physical properties for a target property (e.g., a modified material, illumination, or shape). The systems and methods can utilize a rendering layer trained to synthesize a digital image to generate a modified digital image based on the target property and the remaining (unsubstituted) intrinsic physical properties. Systems and methods can increase the accuracy of modified digital images by generating modified digital images that realistically reflect a confluence of intrinsic physical properties of an input digital image and target (i.e., modified) properties.

    Labeling objects in image scenes
    5.
    发明授权
    Labeling objects in image scenes 有权
    在图像场景中标记对象

    公开(公告)号:US09396546B2

    公开(公告)日:2016-07-19

    申请号:US14159658

    申请日:2014-01-21

    IPC分类号: G06K9/00 G06T7/00

    摘要: Disclosed are various embodiments labeling objects using multi-scale partitioning, rare class expansion, and/or spatial context techniques. An input image may be partitioned using different scale values to produce a different set of superpixels for each of the different scale values. Potential object labels for superpixels in each different set of superpixels of the input image may be assessed by comparing descriptors of the superpixels in each different set of superpixels of the input image with descriptors of reference superpixels in labeled reference images. An object label may then be assigned for a pixel of the input image based at least in part on the assessing of the potential object labels.

    摘要翻译: 公开了使用多尺度分割,稀有类扩展和/或空间上下文技术标记对象的各种实施例。 可以使用不同的比例值对输入图像进行分区,以针对不同比例值中的每一者产生不同的超像素组。 可以通过将输入图像的每个不同的超像素集合中的超像素的描述符与标记的参考图像中的参考超像素的描述符进行比较来评估输入图像的每个不同的超像素集合中的超像素的潜在对象标签。 至少部分地基于对潜在对象标签的评估,可以为输入图像的像素分配对象标签。

    Predicting Patch Displacement Maps Using A Neural Network

    公开(公告)号:US20190114818A1

    公开(公告)日:2019-04-18

    申请号:US15785386

    申请日:2017-10-16

    摘要: Predicting patch displacement maps using a neural network is described. Initially, a digital image on which an image editing operation is to be performed is provided as input to a patch matcher having an offset prediction neural network. From this image and based on the image editing operation for which this network is trained, the offset prediction neural network generates an offset prediction formed as a displacement map, which has offset vectors that represent a displacement of pixels of the digital image to different locations for performing the image editing operation. Pixel values of the digital image are copied to the image pixels affected by the operation by: determining the vectors pixels that correspond to the image pixels affected by the image editing operation and mapping the pixel values of the image pixels represented by the determined offset vectors to the affected pixels. According to this mapping, the pixel values of the affected pixels are set, effective to perform the image editing operation.

    Forecasting multiple poses based on a graphical image

    公开(公告)号:US10096125B1

    公开(公告)日:2018-10-09

    申请号:US15481564

    申请日:2017-04-07

    摘要: A forecasting neural network receives data and extracts features from the data. A recurrent neural network included in the forecasting neural network provides forecasted features based on the extracted features. In an embodiment, the forecasting neural network receives an image, and features of the image are extracted. The recurrent neural network forecasts features based on the extracted features, and pose is forecasted based on the forecasted features. Additionally or alternatively, additional poses are forecasted based on additional forecasted features.

    Digital Image Completion Using Deep Learning
    10.
    发明申请

    公开(公告)号:US20190114748A1

    公开(公告)日:2019-04-18

    申请号:US15785359

    申请日:2017-10-16

    摘要: Digital image completion using deep learning is described. Initially, a digital image having at least one hole is received. This holey digital image is provided as input to an image completer formed with a framework that combines generative and discriminative neural networks based on learning architecture of the generative adversarial networks. From the holey digital image, the generative neural network generates a filled digital image having hole-filling content in place of holes. The discriminative neural networks detect whether the filled digital image and the hole-filling digital content correspond to or include computer-generated content or are photo-realistic. The generating and detecting are iteratively continued until the discriminative neural networks fail to detect computer-generated content for the filled digital image and hole-filling content or until detection surpasses a threshold difficulty. Responsive to this, the image completer outputs the filled digital image with hole-filling content in place of the holey digital image's holes.