Invention Grant
- Patent Title: AI-assisted sound effect generation for silent video
-
Application No.: US16848512Application Date: 2020-04-14
-
Publication No.: US11381888B2Publication Date: 2022-07-05
- Inventor: Sudha Krishnamurthy
- Applicant: Sony Interactive Entertainment Inc.
- Applicant Address: JP Tokyo
- Assignee: Sony Interactive Entertainment Inc.
- Current Assignee: Sony Interactive Entertainment Inc.
- Current Assignee Address: JP Tokyo
- Agency: JDI Patent
- Agent Joshua D. Isenberg; Robert A. Pullman
- Main IPC: H04N21/854
- IPC: H04N21/854 ; G06F16/638 ; G06N3/04 ; G06V20/40 ; G10L13/027 ; H04N21/439 ; H04N21/44 ; H04N21/845 ; G06F16/63

Abstract:
Sound effect recommendations for visual input are generated by training machine learning models that learn coarse-grained and fine-grained audio-visual correlations from a reference visual, a positive audio signal, and a negative audio signal. A trained Sound Recommendation Network is configured to output an audio embedding and a visual embedding and use the audio embedding and visual embedding to compute a correlation distance between an image frame or video segment and one or more audio segments retrieved from a database. The correlation distances for the one or more audio segments in the database are sorted and one or more audio segments with the closest correlation distance from the sorted audio correlation distances are determined. The audio segment with the closest audio correlation distance is applied to the input image frame or video segment.
Public/Granted literature
- US20210321172A1 AI-ASSISTED SOUND EFFECT GENERATION FOR SILENT VIDEO Public/Granted day:2021-10-14
Information query