-
Publication number: US20150347399A1
Publication date: 2015-12-03
Application number: US14620142
Filing date: 2015-02-11
Applicant: Microsoft Technology Licensing, LLC
Inventor: Anthony Aue , Arul A. Menezes , Jonas Nils Lindblom , Fredrik Furesjö , Pierre P.N. Greborio
CPC classification number: G06F17/289 , H04M3/42 , H04M3/4936 , H04M11/10 , H04M2201/39 , H04M2201/40 , H04M2203/2061 , H04M2242/12 , H04W4/14 , H04W4/18
Abstract: Call audio of a call between a source user speaking a source language and a target user speaking a target language is received from a remote source user device of a source user via a communication network of a communication system, the call audio comprising speech of the source user in the source language. An automatic speech recognition procedure is performed on the call audio. A translation of the source user's speech is generated in the target language using the results of the speech recognition procedure. A translated synthetic speech audio version of the source user's speech is mixed with the source user's call audio and/or with translated audio of the target user's speech in the source language. The mixed audio signal is transmitted to a remote target user device of the target user via the communication network for outputting to at least the target user during the call.
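The mixing step described in the abstract above could be sketched roughly as follows. This is a minimal illustration, not the method claimed in the application: it assumes both streams are mono sequences of float samples in [-1.0, 1.0] at the same sample rate, and the function name `mix_audio` and the gain parameters are hypothetical.

```python
from typing import List, Sequence


def mix_audio(
    original: Sequence[float],
    translated: Sequence[float],
    original_gain: float = 0.3,    # assumed: source speech kept quieter
    translated_gain: float = 1.0,  # assumed: synthetic translation at full level
) -> List[float]:
    """Mix the source user's call audio with translated synthetic speech."""
    length = max(len(original), len(translated))
    mixed: List[float] = []
    for i in range(length):
        # Treat the shorter stream as silence once it runs out.
        a = original[i] if i < len(original) else 0.0
        b = translated[i] if i < len(translated) else 0.0
        sample = original_gain * a + translated_gain * b
        # Clip to the valid sample range to avoid overflow on playback.
        mixed.append(max(-1.0, min(1.0, sample)))
    return mixed
```

The resulting list would then be encoded and sent to the target user device; encoding and transport are outside the scope of this sketch.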
-
Publication number: US20160170970A1
Publication date: 2016-06-16
Application number: US14569343
Filing date: 2014-12-12
Applicant: Microsoft Technology Licensing, LLC
Inventor: Jonas Nils Lindblom , Steve James Pearce , Christian Wendt
IPC: G06F17/28
CPC classification number: G06F17/28 , G06F17/289 , G10L13/033 , G10L15/26 , G10L21/003 , H04L51/063 , H04M3/42 , H04M2203/2061 , H04M2242/12
Abstract: There is provided an apparatus comprising at least one processor and a memory comprising code that, when executed on the at least one processor, causes the apparatus to receive an input user setting relating to relative volumes of the speech data in a preferred language and speech data in a non-preferred language when the speech data is played-out; and cause play-out of received speech data so that the volume of the played-out speech data is set in dependence on the user input and whether the received speech data is in the preferred language or the non-preferred language.
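One way to picture the volume behaviour described in the abstract above is a single user "balance" setting that determines the play-out gain for preferred-language and non-preferred-language speech. The sketch below is an assumption for illustration only: the names `playout_gain` and `apply_playout_volume`, and the choice of a linear balance in [0.0, 1.0], do not come from the application.

```python
from typing import List, Sequence


def playout_gain(balance: float, is_preferred: bool) -> float:
    """Return the play-out gain for a block of speech data.

    balance is the hypothetical user setting: 1.0 plays only the
    preferred language, 0.0 plays only the non-preferred language.
    """
    if not 0.0 <= balance <= 1.0:
        raise ValueError("balance must be between 0.0 and 1.0")
    return balance if is_preferred else 1.0 - balance


def apply_playout_volume(
    samples: Sequence[float], balance: float, is_preferred: bool
) -> List[float]:
    """Scale received speech samples according to the user setting."""
    gain = playout_gain(balance, is_preferred)
    return [s * gain for s in samples]
```

With a balance of 0.75, for example, preferred-language speech is played at gain 0.75 and non-preferred-language speech at gain 0.25.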
-
Publication number: US20150350451A1
Publication date: 2015-12-03
Application number: US14622311
Filing date: 2015-02-13
Applicant: Microsoft Technology Licensing, LLC
Inventor: Anthony Aue , Arul A. Menezes , Jonas Nils Lindblom , Fredrik Furesjö , Pierre P.N. Greborio
CPC classification number: H04M3/568 , G06F17/289 , H04M3/42 , H04M3/567 , H04M2201/50 , H04M2203/2061 , H04M2242/12 , H04M2250/58 , H04N7/157
Abstract: The disclosure pertains to a communication system for effecting a voice or video call between at least a source user speaking a source language and a target user speaking a target language. A translation procedure is performed on call audio of the call to generate an audio translation of the source user's speech in the target language for outputting to the target user. A notification is outputted to the target user to notify the target user of a change in the behaviour of the translation procedure, the change relating to the generation of the translation.
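The notification behaviour in the abstract above can be illustrated with a small state tracker that emits a message only when the translation procedure changes behaviour. This is a sketch under stated assumptions: the class name `TranslationNotifier`, the state labels, and the message text are all hypothetical and are not taken from the application.

```python
from typing import Callable, Optional


class TranslationNotifier:
    """Notify the target user when the translator's behaviour changes."""

    def __init__(self, notify: Callable[[str], None]) -> None:
        self._notify = notify               # callback that outputs the notification
        self._state: Optional[str] = None   # last known translator state

    def update(self, state: str) -> None:
        """Record the current state; notify only if it differs from the last one."""
        if state != self._state:
            self._state = state
            self._notify(f"translator is now {state}")


# Usage: collect notifications in a list instead of showing them in a UI.
messages = []
notifier = TranslationNotifier(messages.append)
notifier.update("listening")
notifier.update("listening")    # no change, so no notification
notifier.update("translating")
```

In a real client the callback might drive a visual indicator or an audio cue rather than appending to a list.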
-
Publication number: US09614969B2
Publication date: 2017-04-04
Application number: US14622311
Filing date: 2015-02-13
Applicant: Microsoft Technology Licensing, LLC
Inventor: Anthony Aue , Arul A. Menezes , Jonas Nils Lindblom , Fredrik Furesjö , Pierre P. N. Greborio
CPC classification number: H04M3/568 , G06F17/289 , H04M3/42 , H04M3/567 , H04M2201/50 , H04M2203/2061 , H04M2242/12 , H04M2250/58 , H04N7/157
Abstract: The disclosure pertains to a communication system for effecting a voice or video call between at least a source user speaking a source language and a target user speaking a target language. A translation procedure is performed on call audio of the call to generate an audio translation of the source user's speech in the target language for outputting to the target user. A notification is outputted to the target user to notify the target user of a change in the behavior of the translation procedure, the change relating to the generation of the translation.
-