Media generating and editing system that generates audio playback in alignment with transcribed text
摘要:
A media generating and editing system that generates audio playback in alignment with text that has been automatically transcribed from the audio. A transcript data file that includes a plurality of text words transcribed from audio words included in the audio data is stored. Timing data is paired with the text words indicating locations in the audio data of the corresponding audio words from which the text words are transcribed. The audio data is provided for playback at a user device. The text words are displayed on a display screen at a user device and a visual marker is displayed on the display screen to indicate the text words on the display screen in time alignment with the audio playback of the corresponding audio words at the user device. The text words in the transcript data file are amended in response to inputs from the user device.
公开/授权文献
信息查询
0/0