ARTIFICIAL LATENCY FOR MODERATING VOICE COMMUNICATION

    Publication number: US20240304210A1

    Publication date: 2024-09-12

    Application number: US18670422

    Application date: 2024-05-21

    CPC classification number: G10L25/57 G06V20/41 G10L21/043 G10L25/63 H04N21/4542

    Abstract: A computer-implemented method to determine whether to introduce latency into an audio stream from a particular speaker includes receiving an audio stream from a sender device. The method further includes providing, as input to a trained machine-learning model, the audio stream and a speech analysis score, information about one or more voice emotion parameters, and one or more voice emotion scores for a first user associated with the sender device, wherein the trained machine-learning model is iteratively applied to the audio stream and wherein each iteration corresponds to a respective portion of the audio stream. The method further includes generating as output, with the trained machine-learning model, a level of toxicity in the audio stream. The method further includes transmitting the audio stream to a recipient device, wherein the transmitting is performed to introduce a time delay in the audio stream based on the level of toxicity.
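
The flow in this abstract (score each portion of the stream, then delay transmission according to the toxicity level) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: `SpeakerContext`, `delay_for_toxicity`, `moderate_stream`, the 0.5 threshold, the 2000 ms ceiling, the linear mapping, and the stand-in scoring function used in place of a trained machine-learning model.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List, Tuple


@dataclass
class SpeakerContext:
    """Hypothetical per-user inputs named in the abstract."""
    speech_analysis_score: float
    voice_emotion_scores: List[float]


def delay_for_toxicity(level: float, threshold: float = 0.5,
                       max_delay_ms: int = 2000) -> int:
    """Map a toxicity level in [0, 1] to a delay in milliseconds.

    Assumption: no delay below the threshold, then a linear ramp up to
    max_delay_ms. The patent abstract does not specify the mapping.
    """
    level = min(max(level, 0.0), 1.0)
    if level < threshold:
        return 0
    return int(round(max_delay_ms * (level - threshold) / (1.0 - threshold)))


def moderate_stream(
    portions: Iterable[bytes],
    context: SpeakerContext,
    model: Callable[[bytes, SpeakerContext], float],
) -> Iterator[Tuple[bytes, int]]:
    """Apply the model once per portion and pair each portion with a delay."""
    for portion in portions:
        level = model(portion, context)
        yield portion, delay_for_toxicity(level)


def stub_model(portion: bytes, context: SpeakerContext) -> float:
    """Placeholder for the trained model: returns the highest emotion score."""
    return max(context.voice_emotion_scores, default=0.0)


if __name__ == "__main__":
    ctx = SpeakerContext(speech_analysis_score=0.4,
                         voice_emotion_scores=[0.2, 0.75])
    for portion, delay_ms in moderate_stream([b"chunk-1", b"chunk-2"],
                                             ctx, stub_model):
        print(portion, delay_ms)
```

In a real system the delay would be applied by the transmitting component, for example by buffering each portion for `delay_ms` before sending it to the recipient device; the sketch only computes the delay.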

    SYNTHESIZING AUDIO FOR SYNCHRONOUS COMMUNICATION

    Publication number: US20240112689A1

    Publication date: 2024-04-04

    Application number: US17959937

    Application date: 2022-10-04

    CPC classification number: G10L19/167 G10L21/055

    Abstract: A computer-implemented method includes receiving, at a server, a first audio stream of a performance associated with a first client device. The method further includes receiving, at the server, a second audio stream of the performance associated with a second client device. The method further includes during a time window of the performance, where the time window is less than a total time of the performance: generating a synthesized first audio stream that predicts a future of the performance based on audio features of the first audio stream and mixing the synthesized first audio stream and the second audio stream to form a combined audio stream that synchronizes the synthesized first audio stream and the second audio stream, where the time window is advanced and the generating and the mixing are repeated until the performance is complete. The method further includes transmitting the combined audio stream to the second client device.
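
The windowed loop in this abstract (synthesize a prediction of the first stream, mix it with the second stream, advance the window, repeat) can be sketched as follows. This is only an illustration under stated assumptions: `naive_predictor` and `mix_with_prediction` are hypothetical names, audio is modelled as plain lists of float samples, the first stream is assumed to arrive exactly one window late, the "prediction" merely repeats the last received sample, and mixing is a simple average. The patent abstract does not specify any of these choices.

```python
from typing import Callable, List, Sequence


def naive_predictor(history: Sequence[float], n: int) -> List[float]:
    """Stand-in for the synthesis step: repeat the last known sample n times.

    A real system would predict from audio features of the first stream;
    this placeholder only keeps the sketch runnable.
    """
    last = history[-1] if history else 0.0
    return [last] * n


def mix_with_prediction(
    first: Sequence[float],
    second: Sequence[float],
    window: int,
    predictor: Callable[[Sequence[float], int], List[float]] = naive_predictor,
) -> List[float]:
    """Build a combined stream window by window.

    For each window, only the samples of `first` that precede the window are
    treated as received (an assumed one-window latency), so the current
    window of `first` has to be synthesized before mixing.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    combined: List[float] = []
    for start in range(0, len(second), window):
        chunk = second[start:start + window]
        history = first[:start]
        synthesized = predictor(history, len(chunk))
        # Mix by averaging aligned samples of the two streams.
        combined.extend(0.5 * (a + b) for a, b in zip(synthesized, chunk))
    return combined


if __name__ == "__main__":
    first_stream = [1.0, 1.0, 3.0, 3.0]
    second_stream = [0.0, 0.0, 2.0, 2.0]
    print(mix_with_prediction(first_stream, second_stream, window=2))
```

The combined list stands in for the stream that the server would transmit to the second client device once each window has been mixed.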

    SYNTHESIZING AUDIO FOR SYNCHRONOUS COMMUNICATION

    Publication number: US20240112691A1

    Publication date: 2024-04-04

    Application number: US17959736

    Application date: 2022-10-04

    CPC classification number: G10L21/055 G10L13/047

    Abstract: A computer-implemented method includes receiving a first audio stream of a performance associated with a first client device. The method further includes during a time window of the performance, wherein the time window is less than a total time of the performance: generating a synthesized first audio stream that predicts a future of the performance based on audio features of the first audio stream and mixing the synthesized first audio stream and a second audio stream associated with a second client device to form a combined audio stream that synchronizes the synthesized first audio stream and the second audio stream, where the time window is advanced and the generating and the mixing are repeated until the performance is complete.

    ARTIFICIAL LATENCY FOR MODERATING VOICE COMMUNICATION

    Publication number: US20240087596A1

    Publication date: 2024-03-14

    Application number: US17940749

    Application date: 2022-09-08

    CPC classification number: G10L25/57 G06V20/41 G10L21/043 G10L25/63 H04N21/4542

    Abstract: A computer-implemented method to determine whether to introduce latency into an audio stream from a particular speaker includes receiving an audio stream from a sender device. The method further includes providing, as input to a trained machine-learning model, the audio stream and a speech analysis score, information about one or more voice emotion parameters, and one or more voice emotion scores for a first user associated with the sender device, wherein the trained machine-learning model is iteratively applied to the audio stream and wherein each iteration corresponds to a respective portion of the audio stream. The method further includes generating as output, with the trained machine-learning model, a level of toxicity in the audio stream. The method further includes transmitting the audio stream to a recipient device, wherein the transmitting is performed to introduce a time delay in the audio stream based on the level of toxicity.
