-
公开(公告)号:US20240112668A1
公开(公告)日:2024-04-04
申请号:US18529170
申请日:2023-12-05
Applicant: Adobe Inc.
Inventor: Amol Jindal , Somya Jain , Ajay Bedi
IPC: G10L15/04 , G06F40/253 , G06F40/30
CPC classification number: G10L15/04 , G06F40/253 , G06F40/30
Abstract: A media edit point selection process can include a media editing software application programmatically converting speech to text and storing a timestamp-to-text map. The map correlates text corresponding to speech extracted from an audio track for the media clip to timestamps for the media clip. The timestamps correspond to words and some gaps in the speech from the audio track. The probability of identified gaps corresponding to a grammatical pause by the speaker is determined using the timestamp-to-text map and a semantic model. Potential edit points corresponding to grammatical pauses in the speech are stored for display or for additional use by the media editing software application. Text can optionally be displayed to a user during media editing.
-
公开(公告)号:US20220414149A1
公开(公告)日:2022-12-29
申请号:US17902457
申请日:2022-09-02
Applicant: Adobe Inc.
Inventor: Amol Jindal , Subham Gupta , Poonam Bhalla , Krishna Singh Karki , Ajay Bedi
IPC: G06F16/738 , G06F16/732 , G06F16/735 , G06F16/78 , G06F16/783
Abstract: A system identifies a video comprising frames associated with content tags. The system detects features for each frame of the video. The system identifies, based on the detected features, scenes of the video. The system determines, for each frame for each scene, a frame score that indicates a number of content tags that match the other frames within the scene. The system selects, for each scene, a set of key frames that represent the scene based on the determined frame scores. The system receives a search query comprising a keyword. The system generates, for display, search results responsive to the search query including a dynamic preview of the video. The dynamic preview comprises an arrangement of frames of the video corresponding to each scene of the video. Each of the arrangement of frames is selected from the selected set of key frames representing the respective scene of the video.
-
13.
公开(公告)号:US20220343647A1
公开(公告)日:2022-10-27
申请号:US17811897
申请日:2022-07-12
Applicant: Adobe Inc.
Inventor: Amol Jindal , Ajay Bedi
IPC: G06V20/20 , G06F16/532 , G06V10/46 , G06V10/75 , G06K9/62
Abstract: Techniques are provided for identifying objects (such as products within a physical store) within a captured video scene and indicating which of object in the captured scene matches a desired object requested by a user. The matching object is then displayed in an accentuated manner to the user in real-time (via augmented reality). Object identification is carried out via a multimodal methodology. Objects within the captured video scene are identified using a neural network trained to identify different types of objects. The identified objects can then be compared against a database of pre-stored images of the desired product to determine if a close match is found. Additionally, text on the identified objects is analyzed and compared to the text of the desired object. Based on either or both identification methods, the desired object is indicated to the user on their display, via an augmented reality graphic.
-
公开(公告)号:US10998007B2
公开(公告)日:2021-05-04
申请号:US16588662
申请日:2019-09-30
Applicant: Adobe Inc.
Inventor: Ajay Bedi , Amol Jindal
IPC: G11B27/34 , H04N21/472 , H04N21/845 , G06K9/00
Abstract: This disclosure relates to methods, non-transitory computer readable media, and systems that can generate a context-aware-video-progress bar including a video-scene-proportionate timeline with time-interval sections sized according to relative scene proportions within time intervals of a video. In some implementations, for instance, the disclosed systems determine relative proportions of scenes within a video across time intervals of the video and generate a video-scene-proportionate timeline comprising time-interval sections sized proportionate to the relative proportions of scenes across the time intervals. By integrating the video-scene-proportionate timeline within a video-progress bar, the disclosed systems generate a context-aware-video-progress bar for a video. Such a context-aware-video-progress bar can facilitate more precise and intelligent scrubbing through a video, a dynamic graphical user interface for navigating within and identifying frames of the video, and a flexible user-friendly tool for quickly identifying scenes.
-
15.
公开(公告)号:US20200082586A1
公开(公告)日:2020-03-12
申请号:US16128904
申请日:2018-09-12
Applicant: Adobe Inc.
Inventor: Amol Jindal , Vivek Mishra , Neha Sharan , Anmol Dhawan
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating and providing composition effect tutorials for creating and editing digital content based on a metadata composite structure. For example, the disclosed systems can generate and/or access a metadata composite structure that includes nodes corresponding to composition effects applied to a digital content item, where a given node can include location information indicating where a composition effect is applied relative to a digital content item. The disclosed systems can further generate a tutorial to guide a user to implement a selected composition effect by identifying composition effects of nodes that correspond to a location selected within a composition interface and presenting instructions for a particular composition effect.
-
公开(公告)号:US11941049B2
公开(公告)日:2024-03-26
申请号:US17902457
申请日:2022-09-02
Applicant: Adobe Inc.
Inventor: Amol Jindal , Subham Gupta , Poonam Bhalla , Krishna Singh Karki , Ajay Bedi
IPC: G06F16/738 , G06F16/732 , G06F16/735 , G06F16/78 , G06F16/783
CPC classification number: G06F16/738 , G06F16/7328 , G06F16/735 , G06F16/783 , G06F16/7837 , G06F16/7867
Abstract: A system identifies a video comprising frames associated with content tags. The system detects features for each frame of the video. The system identifies, based on the detected features, scenes of the video. The system determines, for each frame for each scene, a frame score that indicates a number of content tags that match the other frames within the scene. The system selects, for each scene, a set of key frames that represent the scene based on the determined frame scores. The system receives a search query comprising a keyword. The system generates, for display, search results responsive to the search query including a dynamic preview of the video. The dynamic preview comprises an arrangement of frames of the video corresponding to each scene of the video. Each of the arrangement of frames is selected from the selected set of key frames representing the respective scene of the video.
-
17.
公开(公告)号:US11694440B2
公开(公告)日:2023-07-04
申请号:US17811897
申请日:2022-07-12
Applicant: Adobe Inc.
Inventor: Amol Jindal , Ajay Bedi
IPC: G06V20/00 , G06V20/20 , G06F16/532 , G06V10/46 , G06V10/75 , G06V30/10 , G06F18/2413 , G06Q30/0601
CPC classification number: G06V20/20 , G06F16/532 , G06F18/24147 , G06V10/462 , G06V10/751 , G06V30/10 , G06Q30/0625
Abstract: Techniques are provided for identifying objects (such as products within a physical store) within a captured video scene and indicating which of object in the captured scene matches a desired object requested by a user. The matching object is then displayed in an accentuated manner to the user in real-time (via augmented reality). Object identification is carried out via a multimodal methodology. Objects within the captured video scene are identified using a neural network trained to identify different types of objects. The identified objects can then be compared against a database of pre-stored images of the desired product to determine if a close match is found. Additionally, text on the identified objects is analyzed and compared to the text of the desired object. Based on either or both identification methods, the desired object is indicated to the user on their display, via an augmented reality graphic.
-
公开(公告)号:US11500927B2
公开(公告)日:2022-11-15
申请号:US16591847
申请日:2019-10-03
Applicant: Adobe Inc.
Inventor: Amol Jindal , Subham Gupta , Poonam Bhalla , Krishna Singh Karki , Ajay Bedi
IPC: G06F16/783 , G06F16/738 , G06F16/78 , G06F16/735 , G06F16/732
Abstract: Certain embodiments involve adaptive search results for multimedia search queries to provide dynamic previews. For instance, a computing system receives a search query that includes a keyword. The computing system identifies, based on the search query, a video file having keyframes with content tags that match the search query. The computing system determines matching scores for respective keyframes of the identified video file. The computing system generates a dynamic preview from at least two keyframes having the highest matching scores.
-
公开(公告)号:US11294556B1
公开(公告)日:2022-04-05
申请号:US17141745
申请日:2021-01-05
Applicant: Adobe Inc.
Inventor: Amol Jindal
IPC: G06F3/0484 , G06F3/04845 , G06F3/0485 , G06F3/0482 , G06F3/04817
Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize a multi-panel graphical user interface for modifying digital images. For example, in one or more embodiments, the disclosed systems divide the graphical user interface of a client device into first panel and a second panel. Further, the disclosed systems provide different portions of a digital image for display within the first and second panels. In some implementations, the disclosed systems receive a user interaction with the portion of the digital image displayed within the first panel. Based on the received user interaction, the disclosed systems modify the second portion of the digital image displayed within the second panel.
-
公开(公告)号:US20220068258A1
公开(公告)日:2022-03-03
申请号:US17008427
申请日:2020-08-31
Applicant: Adobe Inc.
Inventor: Amol Jindal , Somya Jain , Ajay Bedi
IPC: G10L15/04 , G06F40/30 , G06F40/253
Abstract: A media edit point selection process can include a media editing software application programmatically converting speech to text and storing a timestamp-to-text map. The map correlates text corresponding to speech extracted from an audio track for the media clip to timestamps for the media clip. The timestamps correspond to words and some gaps in the speech from the audio track. The probability of identified gaps corresponding to a grammatical pause by the speaker is determined using the timestamp-to-text map and a semantic model. Potential edit points corresponding to grammatical pauses in the speech are stored for display or for additional use by the media editing software application. Text can optionally be displayed to a user during media editing.
-
-
-
-
-
-
-
-
-