Publication No.: US20240284011A1
Publication Date: 2024-08-22
Application No.: US18440024
Filing Date: 2024-02-13
Applicant: Sony Interactive Entertainment Inc.
Inventor: Ryan Spick, Timothy Edward Bradley, Guy David Moss, Ayush Raina, Pierluigi Amadori
IPC: H04N21/488, G06T7/20, G10L13/08, H04N5/278
CPC classification number: H04N21/4884, G06T7/20, G10L13/08, H04N5/278, G06T2207/20084
Abstract: A data processing apparatus for determining description data for describing content includes: a video captioning model to receive an input comprising at least video images associated with the content, wherein the video captioning model is trained to detect one or more predetermined motions of one or more animated objects in the video images and determine one or more captions in dependence on one or more of the predetermined motions, one or more of the captions comprising respective caption data comprising one or more words for describing one or more of the predetermined motions, the respective caption data comprising one or more of audio data, text data and image data; and output circuitry to output description data in dependence on one or more of the captions.
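Illustrative sketch (not part of the published record): the data flow described in the abstract could be outlined roughly as follows, assuming text-only caption data. The trained video captioning model is replaced here by pre-labelled per-frame detections, and the names `CAPTION_TEMPLATES`, `Caption`, `caption_video` and `output_description` are hypothetical and do not come from the application.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple

# Hypothetical set of predetermined motions and the wording used to describe them.
CAPTION_TEMPLATES = {"jump": "{obj} jumps", "run": "{obj} runs"}


@dataclass(frozen=True)
class Caption:
    motion: str       # the predetermined motion that triggered the caption
    text: str         # caption data, here as text only
    start_frame: int  # index of the frame where the motion was first seen


def caption_video(detections: Iterable[Tuple[str, Optional[str]]]) -> List[Caption]:
    """Turn per-frame (object, motion) detections into captions.

    The detections stand in for the output of a trained model (assumption).
    A caption is emitted each time a predetermined motion starts for an object;
    motions outside CAPTION_TEMPLATES are ignored.
    """
    captions: List[Caption] = []
    previous = None
    for index, (obj, motion) in enumerate(detections):
        current = (obj, motion)
        if motion in CAPTION_TEMPLATES and current != previous:
            text = CAPTION_TEMPLATES[motion].format(obj=obj)
            captions.append(Caption(motion, text, index))
        previous = current
    return captions


def output_description(captions: List[Caption]) -> str:
    """Combine the captions into description data (a single text string)."""
    return " ".join(caption.text + "." for caption in captions)


# Example: five frames of an animated character named "Knight".
frames = [("Knight", None), ("Knight", "jump"), ("Knight", "jump"),
          ("Knight", "run"), ("Knight", "wave")]
description = output_description(caption_video(frames))
```

In this sketch a repeated motion across consecutive frames yields one caption rather than one per frame, which is a design choice of the example and not something the abstract specifies. Audio or image caption data, which the abstract also allows, is not modelled.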