专利检索 ap:("AT&T INTELLECTUAL PROPERTY I, L.P.") AND inv:"Horst J. Schroeter" 第 9 页

81.

发明申请
Methods, Systems, and Products for Recalling and Retrieving Documentary Evidence 审中-公开

公开(公告)号：US20190278801A1

公开(公告)日：2019-09-12

申请号：US16423239

申请日：2019-05-28

申请人： AT&T Intellectual Property I, L.P.

发明人： Kevin A. Li , Troy C. Meuninck , Robert Raymond Miller, II , James H. Pratt , Horst J. Schroeter , Behzad Shahraray

IPC分类号： G06F16/58 , G06F16/951 , G06F16/73 , G06F16/93

摘要： Methods, systems, and products help users recall memories and search for content of those memories. When a user cannot recall a memory, the user is prompted with questions to help recall the memory. As the user answers the questions, a virtual recollection of the memory is synthesized from the answers to the questions. When the user is satisfied with the virtual recollection of the memory, a database of content may be searched for the virtual recollection of the memory. Video data, for example, may be retrieved that matches the virtual recollection of the memory. The video data is thus historical data documenting past events.

82.

发明授权
Exploiting visual information for enhancing audio signals via source separation and beamforming 有权

公开(公告)号：US10402651B2

公开(公告)日：2019-09-03

申请号：US15905442

申请日：2018-02-26

申请人： AT&T Intellectual Property I, L.P.

发明人： Dimitrios Dimitriadis , Donald J. Bowen , Lusheng Ji , Horst J. Schroeter

IPC分类号： G06F17/00 , G06K9/00 , G10L21/0208 , G06F3/16 , H04R5/04

摘要： A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

83.

发明授权
Acoustic enhancement by leveraging metadata to mitigate the impact of noisy environments 有权

公开(公告)号：US10170133B2

公开(公告)日：2019-01-01

申请号：US15683067

申请日：2017-08-22

申请人： AT&T Intellectual Property I, L.P.

发明人： Donald J. Bowen , Dimitrios B. Dimitriadis , Lusheng Ji , Horst J. Schroeter

IPC分类号： G10L21/00 , G10L21/0208 , G10L15/20

摘要： A system for cloud acoustic enhancement is disclosed. In particular, the system may leverage metadata and cloud-computing network resources to mitigate the impact of noisy environments that may potentially interfere with user communications. In order to do so, the system may receive an audio stream including an audio signal associated with a user, and determine if the audio stream also includes an interference signal. The system may determine that the audio stream includes the interference signal if a portion of the audio stream correlates with metadata that identifies the interference signal. If the audio stream is determined to include the interference signal, the system may cancel the interference signal from the audio stream by utilizing the metadata and the cloud-computing network resources. Once the interference signal is cancelled, the system may transmit the audio stream including the audio signal associated with the user to an intended destination.

84.

发明申请
ACOUSTIC ENVIRONMENT RECOGNIZER FOR OPTIMAL SPEECH PROCESSING 审中-公开

公开(公告)号：US20180197558A1

公开(公告)日：2018-07-12

申请号：US15912230

申请日：2018-03-05

申请人： AT&T Intellectual Property I, L.P.

发明人： Horst J. Schroeter , Donald J. Bowen , Dimitrios B. Dimitriadis , Lusheng Ji

IPC分类号： G10L21/028 , G10L25/72 , G10L15/20 , G10L21/0208 , G10L21/0216

CPC分类号： G10L21/028 , G10L15/20 , G10L21/0208 , G10L21/0216 , G10L25/72

摘要： A system for providing an acoustic environment recognizer for optimal speech processing is disclosed. In particular, the system may utilize metadata obtained from various acoustic environments to assist in suppressing ambient noise interfering with a desired audio signal. In order to do so, the system may receive an audio stream including an audio signal associated with a user and including ambient noise obtained from an acoustic environment of the user. The system may obtain first metadata associated with the ambient noise, and may determine if the first metadata corresponds to second metadata in a profile for the acoustic environment. If the first metadata corresponds to the second metadata, the system may select a processing scheme for suppressing the ambient noise from the audio stream, and process the audio stream using the processing scheme. Once the audio stream is processed, the system may provide the audio stream to a destination.

85.

发明授权
Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment 有权

公开(公告)号：US10002612B2

公开(公告)日：2018-06-19

申请号：US15350339

申请日：2016-11-14

申请人： AT&T Intellectual Property I, L.P.

发明人： Yeon-Jun Kim , David C. Gibbon , Horst J. Schroeter

IPC分类号： G10L15/00 , G10L15/26 , G11B27/10 , G10L21/06 , G10L13/08 , G10L21/055 , H04N21/44 , H04N21/488 , G10L25/51

CPC分类号： G10L15/265 , G10L13/08 , G10L15/26 , G10L21/055 , G10L21/06 , G10L25/51 , G11B27/10 , H04M3/42391 , H04M2201/14 , H04M2201/22 , H04M2201/40 , H04M2203/305 , H04N21/44004 , H04N21/4884

摘要： Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.

86.

发明申请
SENSOR ENHANCED SPEECH RECOGNITION 审中-公开

公开(公告)号：US20180137348A1

公开(公告)日：2018-05-17

申请号：US15868546

申请日：2018-01-11

申请人： AT&T Intellectual Property I, L.P.

发明人： Dimitrios Dimitriadis , Donald J. Bowen , Mazin E. Gilbert , Horst J. Schroeter

IPC分类号： G06K9/00

CPC分类号： G06K9/00335 , G06K9/00664 , G10L15/065 , G10L2015/227 , G10L2015/228

摘要： A system for sensor enhanced speech recognition is disclosed. The system may obtain visual content or other content associated with a user and an environment of the user. Additionally, the system may obtain, from the visual content, metadata associated with the user and the environment of the user. The system may also include determining, based on the visual content and metadata, if the user is speaking. If the user is determined to be speaking, the system may obtain audio content associated with the user and the environment. The system may then adapt, based on the visual content, audio content, and metadata, one or more acoustic models that match the user and the environment. Once the one or more acoustic models are adapted and loaded, the system may enhance a speech recognition process or other process associated with the user.

87.

发明授权
Pre-distortion system for cancellation of nonlinear distortion in mobile devices 有权

公开(公告)号：US09973633B2

公开(公告)日：2018-05-15

申请号：US14543261

申请日：2014-11-17

申请人： AT&T Intellectual Property I, L.P.

发明人： Horst J. Schroeter , Donald J. Bowen , Dimitrios Dimitriadis , Lusheng Ji

IPC分类号： H04M9/08 , G10L21/0208

CPC分类号： H04M9/082 , G10L2021/02082

摘要： A pre-distortion system for improved mobile device communications via cancellation of nonlinear distortion is disclosed. The pre-distortion system may transmit an acoustic signal from a network to a device, wherein the acoustic signal includes a linear signal and a nonlinear cancellation signal that cancels at least a portion of nonlinear distortions created once a loudspeaker in the device emits the linear signal. Thus, when a loudspeaker of a mobile device is operating and nonlinear distortions are generated by the loudspeaker or adjacent components of the mobile device in close proximity to the loudspeaker, the pre-distortion system may create one or more nonlinear cancellation signals in the network. The nonlinear cancellation signal may be combined with the linear signal sent to the loudspeaker to cancel the nonlinear distortion signal created by the loudspeaker emitting acoustic sounds from the linear signal. Thus, the nonlinear cancellation signal becomes a pre-distortion signal.

88.

发明授权
Exploiting visual information for enhancing audio signals via source separation and beamforming 有权

公开(公告)号：US09904851B2

公开(公告)日：2018-02-27

申请号：US14302110

申请日：2014-06-11

申请人： AT&T Intellectual Property I, L.P.

发明人： Dimitrios Dimitriadis , Donald J. Bowen , Lusheng Ji , Horst J. Schroeter

IPC分类号： G06F17/00 , G06K9/00 , G10L21/0208 , H04R5/04

CPC分类号： G06K9/00684 , G10L21/0208 , H04R5/04 , H04R2430/20 , H04R2460/07

摘要： A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

89.

发明授权
Sensor enhanced speech recognition 有权

公开(公告)号：US09870500B2

公开(公告)日：2018-01-16

申请号：US14302137

申请日：2014-06-11

申请人： AT&T Intellectual Property I, L.P.

发明人： Dimitrios Dimitriadis , Donald J. Bowen , Mazin E. Gilbert , Horst J. Schroeter

IPC分类号： G10L15/25 , G06K9/00 , G10L15/065 , G10L15/22

CPC分类号： G06K9/00335 , G06K9/00664 , G10L15/065 , G10L2015/227 , G10L2015/228

摘要： A system for sensor enhanced speech recognition is disclosed. The system may obtain visual content or other content associated with a user and an environment of the user. Additionally, the system may obtain, from the visual content, metadata associated with the user and the environment of the user. The system may also include determining, based on the visual content and metadata, if the user is speaking. If the user is determined to be speaking, the system may obtain audio content associated with the user and the environment. The system may then adapt, based on the visual content, audio content, and metadata, one or more acoustic models that match the user and the environment. Once the one or more acoustic models are adapted and loaded, the system may enhance a speech recognition process or other process associated with the user.

90.

发明申请
ACOUSTIC ENVIRONMENT RECOGNIZER FOR OPTIMAL SPEECH PROCESSING 有权
标题翻译：声音环境识别器进行最佳语音处理

公开(公告)号：US20170076736A1

公开(公告)日：2017-03-16

申请号：US15362372

申请日：2016-11-28

申请人： AT&T Intellectual Property I, L.P.

发明人： Horst J. Schroeter , Donald J. Bowen , Dimitrios B. Dimitriadis , Lusheng Ji

IPC分类号： G10L21/028 , G10L25/72 , G10L21/0216

CPC分类号： G10L21/028 , G10L15/20 , G10L21/0208 , G10L21/0216 , G10L25/72

摘要： A system for providing an acoustic environment recognizer for optimal speech processing is disclosed. In particular, the system may utilize metadata obtained from various acoustic environments to assist in suppressing ambient noise interfering with a desired audio signal. In order to do so, the system may receive an audio stream including an audio signal associated with a user and including ambient noise obtained from an acoustic environment of the user. The system may obtain first metadata associated with the ambient noise, and may determine if the first metadata corresponds to second metadata in a profile for the acoustic environment. If the first metadata corresponds to the second metadata, the system may select a processing scheme for suppressing the ambient noise from the audio stream, and process the audio stream using the processing scheme. Once the audio stream is processed, the system may provide the audio stream to a destination.

摘要翻译： 公开了一种用于提供用于最佳语音处理的声学环境识别器的系统。特别地，该系统可以利用从各种声学环境获得的元数据来帮助抑制干扰所需音频信号的环境噪声。为了这样做，系统可以接收包括与用户相关联的音频信号的音频流，并且包括从用户的声学环境获得的环境噪声。系统可以获得与环境噪声相关联的第一元数据，并且可以确定第一元数据是否对应于用于声学环境的简档中的第二元数据。如果第一元数据对应于第二元数据，则系统可以选择用于从音频流抑制环境噪声的处理方案，并且使用处理方案处理音频流。一旦音频流被处理，系统可以将音频流提供给目的地。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类