-
公开(公告)号:US20210409645A1
公开(公告)日:2021-12-30
申请号:US17294573
申请日:2018-11-22
Applicant: Polycom, Inc.
Inventor: Hai Xu , Xi Lu , Yongkang Fan , Wenxue He
Abstract: A videoconferencing endpoint is described that uses a cascading sequence of convolutional neural networks to perform face detection and upper body detection of participants in a videoconference at the endpoint, where at least one member of the sequence of neural networks performs upper body detection, and where the final member of the sequence of neural networks performs face detection based on the results of the upper body detection. The models of the neural networks are trained on both large datasets of faces well as images that have been distorted by a wide-angle camera of the videoconferencing endpoint.