Creating a video for an audio file

    公开(公告)号:US11166000B1

    公开(公告)日:2021-11-02

    申请号:US16180449

    申请日:2018-11-05

    Applicant: Google LLC

    Abstract: A processor determines metadata associated with an audio track. The processor identifies categories that are related to the audio track based on the metadata. The processor determines rankings for the categories that are related to the audio track. The ranking is indicative of a relevance of a particular category to the audio track. The processor performs a query to identify visual media for one or more of ranked categories. The visual media is related to the audio track. The processor generates a visual presentation for the audio track by selecting at least some of the visual media to include in the visual presentation.

    SEQUENCE PROCESSING USING ONLINE ATTENTION
    2.
    发明申请

    公开(公告)号:US20190332919A1

    公开(公告)日:2019-10-31

    申请号:US16504924

    申请日:2019-07-08

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a target sequence including a respective output at each of multiple output time steps from respective encoded representations of inputs in an input sequence. The method includes, for each output time step, starting from the position, in the input order, of the encoded representation that was selected as a preceding context vector at a preceding output time step, traversing the encoded representations until an encoded representation is selected as a current context vector at the output time step. A decoder neural network processes the current context vector and a preceding output at the preceding output time step to generate a respective output score for each possible output and to update the hidden state of the decoder recurrent neural network. An output is selected for the output time step using the output scores.

    Sequence processing using online attention

    公开(公告)号:US11080589B2

    公开(公告)日:2021-08-03

    申请号:US16504924

    申请日:2019-07-08

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a target sequence including a respective output at each of multiple output time steps from respective encoded representations of inputs in an input sequence. The method includes, for each output time step, starting from the position, in the input order, of the encoded representation that was selected as a preceding context vector at a preceding output time step, traversing the encoded representations until an encoded representation is selected as a current context vector at the output time step. A decoder neural network processes the current context vector and a preceding output at the preceding output time step to generate a respective output score for each possible output and to update the hidden state of the decoder recurrent neural network. An output is selected for the output time step using the output scores.

Patent Agency Ranking