Automatic personalized story generation for visual media

    Publication number: US12008318B2

    Publication date: 2024-06-11

    Application number: US17069263

    Application date: 2020-10-13

    Abstract: Exemplary embodiments relate to the automatic generation of captions for visual media, including photos, photo albums, non-live video, and live video. The visual media may be analyzed to determine contextual information (such as location information, people and objects in the video, time, etc.). A system may integrate this information with information from the user's social network and a personalized language model built using public-facing language from the user. The personalized language model captures the user's way of speaking to make the generated captions more detailed and personalized. The language model may account for the context in which the video was generated. The captions may be used to simplify and encourage content generation, and may also be used to index visual media, rank the media, and recommend the media to users likely to engage with the media.

    Labeling video files using acoustic vectors

    Publication number: US11372917B2

    Publication date: 2022-06-28

    Application number: US15855521

    Application date: 2017-12-27

    Inventor: Ying Zhang, Yun Lei

    Abstract: In one embodiment, a method includes receiving a video file. The video file includes a corresponding audio stream. The method further includes accessing the audio stream, and generating, based on the audio stream, a representative vector. The vector has a particular number of dimensions. The method further includes accessing a label-embedding space, which has the same particular number of dimensions, and includes a number of regions that each correspond to a respective label. The method further includes determining a region of the label-embedding space that corresponds to the vector, the determined region corresponding to a particular label. The method further includes associating the particular label with the video file.
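The labeling step described in this abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes the representative vector is a simple mean of per-frame audio features, and that each region of the label-embedding space is the set of points closest to a label centroid. The names `mean_pool`, `nearest_label`, and the example centroids are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def mean_pool(frames: Sequence[Sequence[float]]) -> List[float]:
    """Average per-frame feature vectors into one representative vector.

    Assumption: mean pooling stands in for whatever model the patent
    uses to turn an audio stream into a fixed-size vector.
    """
    if not frames:
        raise ValueError("at least one frame is required")
    dims = len(frames[0])
    return [sum(frame[d] for frame in frames) / len(frames) for d in range(dims)]


def nearest_label(vector: Sequence[float],
                  centroids: Dict[str, Sequence[float]]) -> str:
    """Return the label whose centroid is closest to the vector.

    Assumption: regions are modeled as nearest-centroid cells. The vector
    and every centroid must share the same number of dimensions.
    """
    for label, centroid in centroids.items():
        if len(centroid) != len(vector):
            raise ValueError(f"dimension mismatch for label {label!r}")
    return min(centroids, key=lambda label: math.dist(vector, centroids[label]))


# Hypothetical two-dimensional label-embedding space.
centroids = {"music": [1.0, 0.0], "speech": [0.0, 1.0]}

# Hypothetical per-frame features extracted from a video's audio stream.
frames = [[0.9, 0.1], [0.7, 0.3]]

vector = mean_pool(frames)
label = nearest_label(vector, centroids)
print(label)
```

Here the pooled vector lies closer to the "music" centroid, so that label would be associated with the video file.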

    SYSTEMS AND METHODS FOR TRAINING A MACHINE LEARNING MODEL FOR A SECOND LANGUAGE BASED ON A MACHINE LEARNING MODEL FOR A FIRST LANGUAGE

    Publication number: US20230169388A1

    Publication date: 2023-06-01

    Application number: US16016321

    Application date: 2018-06-22

    CPC classification number: G06N99/005 G06F17/28 G06Q50/01

    Abstract: Systems, methods, and non-transitory computer readable media can train a machine learning model for a first language to determine a classification for a content item in the first language. Machine translation can be performed to generate respective machine translations of a plurality of content items in a second language into the first language. Respective classifications for the plurality of content items in the second language can be determined based on the machine translations of the plurality of content items in the second language and the machine learning model for the first language. Training data in the second language can be automatically generated, where the training data in the second language includes the plurality of content items in the second language and the respective classifications.
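The data-generation flow in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the translator and the first-language classifier are toy stand-ins (a lookup table and a keyword rule), and the names `pseudo_label`, `toy_translate`, and `toy_classify_english` are hypothetical, not taken from the patent.

```python
from typing import Callable, List, Tuple


def pseudo_label(items_second_lang: List[str],
                 translate: Callable[[str], str],
                 classify_first_lang: Callable[[str], str]) -> List[Tuple[str, str]]:
    """Build training pairs for the second language.

    Each second-language item is machine-translated into the first
    language, classified by the first-language model, and the resulting
    classification is attached to the ORIGINAL second-language item.
    """
    return [(item, classify_first_lang(translate(item)))
            for item in items_second_lang]


# Toy stand-in for a machine translation system (Spanish -> English).
_TOY_TRANSLATIONS = {
    "me encanta esta foto": "i love this photo",
    "reunion a las tres": "meeting at three",
}


def toy_translate(text: str) -> str:
    return _TOY_TRANSLATIONS[text]


# Toy stand-in for a trained first-language (English) classifier.
def toy_classify_english(text: str) -> str:
    return "positive" if "love" in text.split() else "neutral"


training_data = pseudo_label(
    ["me encanta esta foto", "reunion a las tres"],
    toy_translate,
    toy_classify_english,
)
print(training_data)
```

The resulting pairs keep the second-language text, so they could serve as training data for a second-language model without manual annotation.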
