-
公开(公告)号:US08510112B1
公开(公告)日:2013-08-13
申请号:US11469129
申请日:2006-08-31
申请人: Alistair Conkie , Ann Syrdal
发明人: Alistair Conkie , Ann Syrdal
CPC分类号: G10L13/06 , G10L2021/0135
摘要: A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database, identifying segments in the labeled audio files that have varying pronunciations based on language differences, modifying the identified segments in the primary speech database using selected mappings, enhancing the primary speech database by substituting the modified segments for the corresponding identified database segments in the primary speech database, and storing the enhanced primary speech database for use in speech synthesis.
摘要翻译: 公开了增强用于语音合成的语音数据库的系统,方法和计算机可读介质。 该方法可以包括在主语音数据库中标记音频文件,基于语言差异识别具有不同发音的标记音频文件中的片段,使用所选择的映射修改主语音数据库中的所识别的片段,通过将主要语音数据库替换为 用于主语音数据库中相应识别的数据库段的修改段,以及存储用于语音合成的增强型主语音数据库。
-
公开(公告)号:US08428443B2
公开(公告)日:2013-04-23
申请号:US11716995
申请日:2007-03-12
申请人: Andrej Ljolje , Ann Syrdal , Alistair Conkie
发明人: Andrej Ljolje , Ann Syrdal , Alistair Conkie
CPC分类号: H04N9/87 , H04N5/765 , H04N5/782 , H04N21/4334 , H04N21/4398 , H04N21/440281 , H04N21/4621
摘要: A method of providing modified media content is disclosed that includes providing media content to a destination device via a network, where the media content comprises video data and audio data have a first viewing rate. The method further includes receiving data indicating a selection of a second viewing rate via the network and modifying the media content to produce modified media content having approximately the second viewing rate. The modified media content includes modified video data and modified audio data synchronized at approximately the second viewing rate.
摘要翻译: 公开了一种提供修改的媒体内容的方法,其包括经由网络向目的地设备提供媒体内容,其中,媒体内容包括视频数据和音频数据具有第一观看速率。 该方法还包括接收经由网络指示选择第二观看速率的数据,并修改媒体内容以产生具有大约第二观看速率的修改的媒体内容。 修改的媒体内容包括修改的视频数据和大约第二观看速率同步的修改的音频数据。
-
公开(公告)号:US07912718B1
公开(公告)日:2011-03-22
申请号:US11469089
申请日:2006-08-31
申请人: Alistair Conkie , Ann Syrdal
发明人: Alistair Conkie , Ann Syrdal
IPC分类号: G10L13/00
CPC分类号: G10L13/06
摘要: A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database and a secondary speech database, enhancing the primary speech database by placing the labeled audio files from the secondary speech database into the primary speech database, and storing the enhanced primary speech database for use in speech synthesis.
摘要翻译: 公开了增强用于语音合成的语音数据库的系统,方法和计算机可读介质。 该方法可以包括在主语音数据库和辅助语音数据库中标记音频文件,通过将来自辅助语音数据库的标记的音频文件放入主语音数据库中来增强主语音数据库,并且存储增强的主语音数据库以用于 语音综合。
-
公开(公告)号:US20080235741A1
公开(公告)日:2008-09-25
申请号:US11725591
申请日:2007-03-19
申请人: Andrej Ljolje , Ann Syrdal , Alistair Conkie
发明人: Andrej Ljolje , Ann Syrdal , Alistair Conkie
IPC分类号: H04N7/173
CPC分类号: H04N21/6373 , H04N5/4401 , H04N5/765 , H04N5/775 , H04N5/783 , H04N7/56 , H04N9/8063 , H04N21/2335 , H04N21/234381 , H04N21/2393 , H04N21/4307 , H04N21/4325 , H04N21/47202 , H04N21/6587
摘要: A method and system of providing media content is disclosed. In a particular embodiment, the method includes receiving media content from a content source at a set-top box device. The media content includes video data having a first playback rate and audio data having the first playback rate. The method further includes transforming the audio data via a non-linear transformation to produce modified audio data having a second playback rate, modifying the video data to produce modified video data having the second playback rate, and synchronizing the modified audio data and the modified video data to produce modified media content having the second playback rate. A network-based media content storage device and associated logic to provide adjusted rate audio content are also disclosed.
摘要翻译: 公开了提供媒体内容的方法和系统。 在特定实施例中,该方法包括在机顶盒设备处从内容源接收媒体内容。 媒体内容包括具有第一播放速率的视频数据和具有第一播放速率的音频数据。 该方法还包括经由非线性变换来变换音频数据以产生具有第二播放速率的修改的音频数据,修改视频数据以产生具有第二播放速率的修改的视频数据,以及使修改的音频数据和修改的视频同步 数据以产生具有第二播放速率的修改的媒体内容。 还公开了一种基于网络的媒体内容存储设备和相关逻辑以提供经调整的速率音频内容。
-
公开(公告)号:US20080226256A1
公开(公告)日:2008-09-18
申请号:US11716995
申请日:2007-03-12
申请人: Andrej Ljolje , Ann Syrdal , Alistair Conkie
发明人: Andrej Ljolje , Ann Syrdal , Alistair Conkie
IPC分类号: H04N5/91
CPC分类号: H04N9/87 , H04N5/765 , H04N5/782 , H04N21/4334 , H04N21/4398 , H04N21/440281 , H04N21/4621
摘要: A method of providing modified media content is disclosed that includes providing media content to a destination device via a network, where the media content comprises video data and audio data have a first viewing rate. The method further includes receiving data indicating a selection of a second viewing rate via the network and modifying the media content to produce modified media content having approximately the second viewing rate. The modified media content includes modified video data and modified audio data synchronized at approximately the second viewing rate.
摘要翻译: 公开了一种提供修改的媒体内容的方法,其包括经由网络向目的地设备提供媒体内容,其中媒体内容包括视频数据和音频数据具有第一观看速率。 该方法还包括接收经由网络指示选择第二观看速率的数据,并修改媒体内容以产生具有大约第二观看速率的修改的媒体内容。 修改的媒体内容包括修改的视频数据和大约第二观看速率同步的修改的音频数据。
-
公开(公告)号:US08312492B2
公开(公告)日:2012-11-13
申请号:US11725591
申请日:2007-03-19
申请人: Andrej Ljolje , Ann Syrdal , Alistair Conkie
发明人: Andrej Ljolje , Ann Syrdal , Alistair Conkie
CPC分类号: H04N21/6373 , H04N5/4401 , H04N5/765 , H04N5/775 , H04N5/783 , H04N7/56 , H04N9/8063 , H04N21/2335 , H04N21/234381 , H04N21/2393 , H04N21/4307 , H04N21/4325 , H04N21/47202 , H04N21/6587
摘要: A method and system of providing media content is disclosed. In a particular embodiment, the method includes receiving media content from a content source at a set-top box device. The media content includes video data having a first playback rate and audio data having the first playback rate. The method further includes transforming the audio data via a non-linear transformation to produce modified audio data having a second playback rate, modifying the video data to produce modified video data having the second playback rate, and synchronizing the modified audio data and the modified video data to produce modified media content having the second playback rate. A network-based media content storage device and associated logic to provide adjusted rate audio content are also disclosed.
摘要翻译: 公开了提供媒体内容的方法和系统。 在特定实施例中,该方法包括在机顶盒设备处从内容源接收媒体内容。 媒体内容包括具有第一播放速率的视频数据和具有第一播放速率的音频数据。 该方法还包括经由非线性变换来变换音频数据以产生具有第二播放速率的修改的音频数据,修改视频数据以产生具有第二播放速率的修改的视频数据,以及使修改的音频数据和修改的视频同步 数据以产生具有第二播放速率的修改的媒体内容。 还公开了一种基于网络的媒体内容存储设备和相关逻辑以提供经调整的速率音频内容。
-
7.
公开(公告)号:US20050256716A1
公开(公告)日:2005-11-17
申请号:US10845364
申请日:2004-05-13
申请人: Srinivas Bangalore , Junlan Feng , Mazin Rahim , Juergen Schroeter , David Schulz , Ann Syrdal
发明人: Srinivas Bangalore , Junlan Feng , Mazin Rahim , Juergen Schroeter , David Schulz , Ann Syrdal
IPC分类号: G10L13/00
CPC分类号: G10L13/033 , G10L13/00 , G10L13/02 , G10L13/06 , G10L13/08 , G10L15/197
摘要: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.
摘要翻译: 公开了用于为特定应用产生定制的文本到语音语音的系统和方法。 该方法包括通过选择用于生成与域相关联的自定义文本到语音语音的语音来生成自定义文本到语音语音,从预先存在的文本数据源收集与域相关联的文本数据,并使用收集的 文本数据,通过搜索合成语音单元的预先存在的库存来选择适合于该域的语音单元,或者通过记录所选合成质量水平的最小库存来生成合成语音单元的域内库存。 使用合成语音单元的域内库存来生成域的文本到语音定制语音。 还可以使用主动学习技术来识别问题短语,其中只需要几分钟的记录数据来传送高质量的TTS定制语音。
-
-
-
-
-
-