-
公开(公告)号:US20200286504A1
公开(公告)日:2020-09-10
申请号:US16296122
申请日:2019-03-07
Applicant: ADOBE INC.
Inventor: Prem Seetharaman , Gautham J. Mysore , Bryan A. Pardo
IPC: G10L25/60 , G10L25/30 , G10L25/84 , G10L21/0232
Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for sound quality prediction and real-time feedback about sound quality, such as room acoustics quality and background noise. Audio data can be sampled from a live sound source and stored in an audio buffer. The audio data in the buffer is analyzed to calculate a stream of values of one or more sound quality measures, such as speech transmission index and signal-to-noise ratio. Speech transmission index can be calculated using a convolution neural network configured to predict speech transmission index from reverberant speech. The stream of values can be used to provide real-time feedback about sound quality of the audio data. For example, a visual indicator on a graphical user interface can be updated based on consistency of the values over time. The real-time feedback about sound quality can help users optimize their recording setup.
-
公开(公告)号:US11138989B2
公开(公告)日:2021-10-05
申请号:US16296122
申请日:2019-03-07
Applicant: ADOBE INC.
Inventor: Prem Seetharaman , Gautham J. Mysore , Bryan A. Pardo
IPC: G10L25/60 , G10L25/30 , G10L21/0232 , G10L25/84 , G10L21/0208
Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for sound quality prediction and real-time feedback about sound quality, such as room acoustics quality and background noise. Audio data can be sampled from a live sound source and stored in an audio buffer. The audio data in the buffer is analyzed to calculate a stream of values of one or more sound quality measures, such as speech transmission index and signal-to-noise ratio. Speech transmission index can be calculated using a convolution neural network configured to predict speech transmission index from reverberant speech. The stream of values can be used to provide real-time feedback about sound quality of the audio data. For example, a visual indicator on a graphical user interface can be updated based on consistency of the values over time. The real-time feedback about sound quality can help users optimize their recording setup.
-