Labeling video files using acoustic vectors

    公开(公告)号:US11372917B2

    公开(公告)日:2022-06-28

    申请号:US15855521

    申请日:2017-12-27

    Inventor: Ying Zhang Yun Lei

    Abstract: In one embodiment, a method includes receiving a video file. The video file includes a corresponding audio stream. The method further includes accessing the audio stream, and generating, based on the audio stream, a representative vector. The vector has a particular number of dimensions. The method further includes accessing a label-embedding space, which has the same particular number of dimensions, and includes a number of regions that each correspond to a respective label. The method further includes determining a region of the label-embedding space that corresponds to the vector, the determined region corresponding to a particular label. The method further includes associating the particular label with the video file.

Patent Agency Ranking