-
公开(公告)号:US11809965B2
公开(公告)日:2023-11-07
申请号:US15992013
申请日:2018-05-29
Applicant: Cisco Technology, Inc.
Inventor: Keith Griffin , Eric Chen
IPC: G10L21/00 , G10L25/00 , G06N20/00 , G10L15/06 , G10L15/02 , G06F18/23 , G06F18/40 , G06F18/25 , G06V10/98 , G06V10/774
CPC classification number: G06N20/00 , G06F18/23 , G06F18/253 , G06F18/41 , G06V10/774 , G06V10/987 , G10L15/02 , G10L15/06 , G10L15/063 , G10L2015/0631
Abstract: Systems, methods, and devices are disclosed for training a model. Media data is separated into one or more clusters, each cluster based on a feature from a first model. The media data of each cluster is sampled and, based on an analysis of the sampled media data, an accuracy of the media data of each cluster is determined. The accuracy is associated with the feature from the first model. Based on a subset dataset of the media data being outside a threshold accuracy, the subset dataset is automatically forwarded to a crowd source service. Verification of the subset dataset is received from the crowd source service, and the verified subset dataset is added to the first model.
-
公开(公告)号:US11019308B2
公开(公告)日:2021-05-25
申请号:US16678729
申请日:2019-11-08
Applicant: Cisco Technology, Inc.
Inventor: Paul Bright-Thomas , Nathan Buckles , Keith Griffin , Eric Chen , Manikandan Kesavan , Plamen Nedeltchev , Hugo Mike Latapie , Enzo Fenoglio
Abstract: Systems and methods are disclosed for anticipating a video switch to accommodate a new speaker in a video conference comprising a real time video stream captured by a camera local to a first videoconference endpoint is analyzed according to at least one speaker anticipation model. The speaker anticipation model predicts that a new speaker is about to speak. Video of the anticipated new speaker is sent to the conferencing server in response to a request for the video on the anticipated new speaker from the conferencing server. Video of the anticipated new speaker is distributed to at least a second videoconference endpoint.
-
公开(公告)号:US20200043509A1
公开(公告)日:2020-02-06
申请号:US16598059
申请日:2019-10-10
Applicant: Cisco Technology, Inc.
Inventor: Eric Chen , Asbjørn Therkelsen , Espen Moberg , Wei-Lien Hsu
IPC: G10L21/0216 , G10L25/30 , G06F17/18 , G06N7/00 , G06N20/00
Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.
-
公开(公告)号:US10477148B2
公开(公告)日:2019-11-12
申请号:US15646470
申请日:2017-07-11
Applicant: Cisco Technology, Inc.
Inventor: Paul Bright-Thomas , Nathan Buckles , Keith Griffin , Eric Chen , Manikandan Kesavan , Plamen Nedeltchev , Hugo Mike Latapie , Enzo Fenoglio
Abstract: Systems and methods are disclosed for anticipating a video switch to accommodate a new speaker in a video conference comprising a real time video stream captured by a camera local to a first videoconference endpoint is analyzed according to at least one speaker anticipation model. The speaker anticipation model predicts that a new speaker is about to speak. Video of the anticipated new speaker is sent to the conferencing server in response to a request for the video on the anticipated new speaker from the conferencing server. Video of the anticipated new speaker is distributed to at least a second videoconference endpoint.
-
公开(公告)号:US20190037002A1
公开(公告)日:2019-01-31
申请号:US15663658
申请日:2017-07-28
Applicant: Cisco Technology, Inc.
Inventor: Chidambaram Arunachalam , Gonzalo Salgueiro , Nagendra Kumar Nainar , Eric Chen , Keith Griffin
Abstract: Disclosed is a system, method and computer readable medium enabling collaboration service providers to more accurately predict packet loss, jitter and delay based on current session, historical session and user location parameters. The prediction can be used to forecast the occurrence of poor media quality at the current location and potential future locations.
-
公开(公告)号:US20180376108A1
公开(公告)日:2018-12-27
申请号:US15646470
申请日:2017-07-11
Applicant: Cisco Technology, Inc.
Inventor: Paul Bright-Thomas , Nathan Buckles , Keith Griffin , Eric Chen , Manikandan Kesavan , Plamen Nedeltchev , Hugo Mike Latapie , Enzo Fenoglio
CPC classification number: H04N7/152 , G06K9/00302 , G06K9/00711 , G06K9/00718 , G06K9/4628 , G06K9/6274 , G06K9/66 , G10L15/1815 , G10L25/57 , H04N7/147 , H04N7/15
Abstract: Systems and methods are disclosed for anticipating a video switch to accommodate a new speaker in a video conference comprising a real time video stream captured by a camera local to a first videoconference endpoint is analyzed according to at least one speaker anticipation model. The speaker anticipation model predicts that a new speaker is about to speak. Video of the anticipated new speaker is sent to the conferencing server in response to a request for the video on the anticipated new speaker from the conferencing server. Video of the anticipated new speaker is distributed to at least a second videoconference endpoint.
-
公开(公告)号:US10091348B1
公开(公告)日:2018-10-02
申请号:US15659356
申请日:2017-07-25
Applicant: Cisco Technology, Inc.
Inventor: Chidambaram Arunachalam , Gonzalo Salgueiro , Nagendra Kumar Nainar , Eric Chen , Keith Griffin
Abstract: Disclosed is a system and method for forecasting the expected quality of a call. In some examples, a system or method can generate a plurality of scenarios from network metrics, retrieve historical ratings for the network metrics from users, and assign the historical ratings for the network metrics to the plurality of scenarios. The system or method can also filter one or more users based on similarities of the historical ratings for the plurality of scenarios with current network metrics, and forecast an expected call quality based on the historical ratings of the one or more filtered users.
-
公开(公告)号:US11245788B2
公开(公告)日:2022-02-08
申请号:US17003696
申请日:2020-08-26
Applicant: Cisco Technology, Inc.
Inventor: Fuling Liu , Eric Chen , Wei Li , Wei-Lien Hsu
IPC: H04M3/00 , H04M9/08 , G10L21/0208 , H04M3/56 , G10L25/48
Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo canceller model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.
-
公开(公告)号:US10771621B2
公开(公告)日:2020-09-08
申请号:US15943336
申请日:2018-04-02
Applicant: Cisco Technology, Inc.
Inventor: Fuling Liu , Eric Chen , Wei Li , Wei-Lien Hsu
IPC: H04M3/00 , H04M9/08 , G10L21/0208 , H04M3/56 , G10L25/48
Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.
-
10.
公开(公告)号:US20190132452A1
公开(公告)日:2019-05-02
申请号:US15943336
申请日:2018-04-02
Applicant: Cisco Technology, Inc.
Inventor: Fuling Liu , Eric Chen , Wei Li , Wei-Lien Hsu
IPC: H04M9/08 , H04M3/56 , G10L21/0208
Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.
-
-
-
-
-
-
-
-
-