PATCHED MULTI-CONDITION TRAINING FOR ROBUST SPEECH RECOGNITION

    Publication number: US20240013775A1

    Publication date: 2024-01-11

    Application number: US18371233

    Filing date: 2023-09-21

    CPC classification number: G10L15/063 G10L21/0216

    Abstract: A method of obtaining a patched signal for training a model for use in at least one of speech recognition and audio recognition is disclosed. The method comprises: obtaining a first signal, wherein the first signal is at least one of a speech signal and an audio signal; modifying the first signal to obtain at least one second signal; dividing the first signal and the at least one second signal respectively into a plurality of first patches and a plurality of second patches, wherein each one of the plurality of first patches comprises a respective part of the first signal and each one of the plurality of second patches comprises a respective part of the at least one second signal; and mixing selected ones of the plurality of first patches and the plurality of second patches to obtain a patched signal.
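The four steps in the abstract (obtain, modify, divide, mix) can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not say how the signal is modified, how patches are shaped, or how they are selected, so everything specific here is an assumption made for illustration: a 1-D time-domain signal, fixed-length non-overlapping patches, an independent random choice per patch, and the hypothetical names `make_patched_signal`, `modify`, `patch_len`, and `p_second`.

```python
import numpy as np


def make_patched_signal(first, modify, patch_len, rng, p_second=0.5):
    """Mix patches of a signal with patches of a modified copy of it.

    Assumptions (not from the source): patches are contiguous,
    non-overlapping, fixed-length segments of a 1-D signal, and each
    patch is taken from the second signal with probability p_second.
    """
    first = np.asarray(first, dtype=float)

    # Step 2: modify the first signal to obtain a second signal.
    second = np.asarray(modify(first), dtype=float)
    if second.shape != first.shape:
        raise ValueError("modified signal must have the same shape")

    # Steps 3 and 4: walk over aligned patches and pick one source per patch.
    patched = first.copy()
    for start in range(0, len(first), patch_len):
        if rng.random() < p_second:
            stop = start + patch_len
            patched[start:stop] = second[start:stop]
    return patched


# Usage: the modification here is additive Gaussian noise, chosen only
# as an example of a "condition" applied to the clean signal.
rng = np.random.default_rng(0)
clean = np.sin(np.linspace(0.0, 20.0, 1600))
noisy_copy = lambda x: x + 0.1 * rng.standard_normal(x.shape)
patched = make_patched_signal(clean, noisy_copy, patch_len=160, rng=rng)
```

Each patch of the result comes entirely from one source, so a single training example can contain both clean and modified regions. A time-frequency variant would apply the same selection to rectangular regions of a spectrogram instead of 1-D segments.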
