Generating modified digital images utilizing a dispersed multimodal selection model

Granted invention patent

US11594077B2 Generating modified digital images utilizing a dispersed multimodal selection model (in force)

Patent title: Generating modified digital images utilizing a dispersed multimodal selection model
Application number: US17025477
Filing date: 2020-09-18
Publication number: US11594077B2
Publication date: 2023-02-28
Inventors: Trung Bui, Zhe Lin, Walter Chang, Nham Le, Franck Dernoncourt
Applicant: Adobe Inc.
Applicant address: San Jose, CA, US
Assignee: Adobe Inc.
Current assignee: Adobe Inc.
Current assignee address: San Jose, CA, US
Agency: Keller Preece PLLC
Main classification: G06V40/20
IPC classifications: G06V40/20; G06N3/04; G10L15/26; G10L15/25

Abstract:

The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.
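The flow the abstract describes (parse the verbal input into a command, choose a selection model from the verbal and gesture input, identify the matching pixels, then apply the edit) can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the patented implementation: the neural networks are replaced by trivial stand-in functions, and every name here (`VerbalCommand`, `parse_verbal_input`, `select_by_gesture`, `select_bright_region`, `choose_selector`, `brighten`, `edit_image`) is hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Set, Tuple

Pixel = Tuple[int, int]   # (row, col)
Image = List[List[int]]   # grayscale image as nested lists, values 0..255


@dataclass
class VerbalCommand:
    action: str  # editing action, e.g. "brighten"
    target: str  # object term, e.g. "this" or "sky"


def parse_verbal_input(text: str) -> VerbalCommand:
    """Stand-in for the natural language processing neural network (assumption):
    first word is the action, last word is the target."""
    words = text.lower().split()
    return VerbalCommand(action=words[0], target=words[-1])


def select_by_gesture(image: Image, gesture: Optional[Pixel]) -> Set[Pixel]:
    """Stand-in for a gesture-driven selection network: the 3x3 patch
    around the indicated point, clipped to the image bounds."""
    if gesture is None:
        return set()
    r0, c0 = gesture
    return {
        (r, c)
        for r in range(r0 - 1, r0 + 2)
        for c in range(c0 - 1, c0 + 2)
        if 0 <= r < len(image) and 0 <= c < len(image[0])
    }


def select_bright_region(image: Image, gesture: Optional[Pixel]) -> Set[Pixel]:
    """Stand-in for a semantic selection network: treats every pixel
    with value >= 200 as the named object (e.g. "sky")."""
    return {
        (r, c)
        for r, row in enumerate(image)
        for c, value in enumerate(row)
        if value >= 200
    }


Selector = Callable[[Image, Optional[Pixel]], Set[Pixel]]


def choose_selector(command: VerbalCommand, gesture: Optional[Pixel]) -> Selector:
    """Pick a selection model from the verbal and gesture input together."""
    if gesture is not None and command.target in {"this", "that", "here"}:
        return select_by_gesture
    return select_bright_region


def brighten(image: Image, pixels: Set[Pixel], amount: int = 40) -> Image:
    """Editing action: raise the selected pixels, capped at 255, on a copy."""
    out = [row[:] for row in image]
    for r, c in pixels:
        out[r][c] = min(255, out[r][c] + amount)
    return out


# Hypothetical registry of editing actions; unknown actions raise KeyError.
ACTIONS: Dict[str, Callable[[Image, Set[Pixel]], Image]] = {"brighten": brighten}


def edit_image(image: Image, verbal_input: str,
               gesture: Optional[Pixel] = None) -> Image:
    command = parse_verbal_input(verbal_input)
    selector = choose_selector(command, gesture)
    pixels = selector(image, gesture)
    return ACTIONS[command.action](image, pixels)
```

For example, `edit_image(image, "brighten this", gesture=(0, 0))` routes to the gesture-based stand-in, while `edit_image(image, "brighten the sky")` routes to the semantic stand-in; in both cases the input image is left unmodified and a new image is returned.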

Publication/grant documents

US20210004576A1 GENERATING MODIFIED DIGITAL IMAGES UTILIZING A DISPERSED MULTIMODAL SELECTION MODEL Publication/grant date: 2021-01-07

Information lookup

Espacenet

IPC classification:

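G	Physics
G06	Computing; calculating or counting
G06V	Image or video recognition or understanding
G06V40/00	Recognition of biometric, human-related or animal-related patterns in image or video data
G06V40/20	. Movements or behaviour, e.g. gesture recognition (facial expression recognition: G06V40/16)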