Granted invention patent
- Patent title: Generating modified digital images utilizing a dispersed multimodal selection model
- Application number: US17025477
- Filing date: 2020-09-18
- Publication (announcement) number: US11594077B2
- Publication (announcement) date: 2023-02-28
- Inventors: Trung Bui, Zhe Lin, Walter Chang, Nham Le, Franck Dernoncourt
- Applicant: Adobe Inc.
- Applicant address: US CA San Jose
- Patentee: Adobe Inc.
- Current patentee: Adobe Inc.
- Current patentee address: US CA San Jose
- Agency: Keller Preece PLLC
- Main classification: G06V40/20
- IPC classification: G06V40/20; G06N3/04; G10L15/26; G10L15/25
Abstract:
The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.
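The flow described in the abstract (parse the verbal input into a command, select a vision model based on which inputs are present, identify the target pixels, then apply the edit) can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: `parse_verbal_command`, `region_around_gesture`, `bright_object`, `select_vision_model`, and `apply_edit` are simple rule-based stand-ins for the neural networks the disclosure refers to, and the image is a plain grid of grayscale values.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

Image = List[List[int]]    # grayscale pixels, 0-255
Mask = List[List[bool]]    # True where a pixel belongs to the target object
Point = Tuple[int, int]    # (row, col) of a gesture such as a tap


@dataclass
class VerbalCommand:
    action: str            # "remove" or "brighten" in this sketch
    target: Optional[str]  # last word of the utterance, e.g. "this" or "lamp"


def parse_verbal_command(utterance: str) -> VerbalCommand:
    """Assumed stand-in for the natural language processing neural network."""
    words = utterance.lower().split()
    action = next((w for w in words if w in ("remove", "brighten")), None)
    if action is None:
        raise ValueError(f"no supported action in: {utterance!r}")
    target = words[-1] if len(words) > 1 else None
    return VerbalCommand(action, target)


def region_around_gesture(image: Image, command: VerbalCommand,
                          gesture: Optional[Point]) -> Mask:
    """Assumed stand-in for a gesture-driven vision network: 3x3 window."""
    r0, c0 = gesture
    return [[abs(r - r0) <= 1 and abs(c - c0) <= 1 for c in range(len(row))]
            for r, row in enumerate(image)]


def bright_object(image: Image, command: VerbalCommand,
                  gesture: Optional[Point]) -> Mask:
    """Assumed stand-in for a verbal-only vision network: simple threshold."""
    return [[px > 128 for px in row] for row in image]


VisionModel = Callable[[Image, VerbalCommand, Optional[Point]], Mask]
VISION_MODELS: Dict[str, VisionModel] = {
    "gesture": region_around_gesture,
    "verbal": bright_object,
}


def select_vision_model(command: VerbalCommand, gesture: Optional[Point]) -> str:
    """Pick a model based on which inputs are available."""
    return "gesture" if gesture is not None else "verbal"


def apply_edit(image: Image, mask: Mask, action: str) -> Image:
    """Perform the editing action on the masked pixels only."""
    def edit(px: int) -> int:
        return 0 if action == "remove" else min(255, px + 50)
    return [[edit(px) if m else px for px, m in zip(row, mask_row)]
            for row, mask_row in zip(image, mask)]


def modify_image(image: Image, utterance: str,
                 gesture: Optional[Point] = None) -> Image:
    command = parse_verbal_command(utterance)
    model = VISION_MODELS[select_vision_model(command, gesture)]
    mask = model(image, command, gesture)
    return apply_edit(image, mask, command.action)
```

For example, `modify_image(image, "remove this", gesture=(2, 2))` routes through the gesture-driven stand-in, while `modify_image(image, "brighten the lamp")` falls back to the verbal-only one. The dictionary dispatch is just one convenient way to show the selection step; the patent record does not specify how selection is implemented.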