Caption timestamp predictor
    2.
    发明授权

    公开(公告)号:US11342002B1

    公开(公告)日:2022-05-24

    申请号:US16210545

    申请日:2018-12-05

    Abstract: An automated solution to determine suitable time ranges or timestamps for captions is described. In one example, a content file includes subtitle data with captions for display over respective timeframes of video. Audio data is extracted from the video, and the audio data is compared against a sound threshold to identify auditory timeframes in which sound is above the threshold. The subtitle data is also parsed to identify subtitle-free timeframes in the video. A series of candidate time ranges is then identified based on overlapping ranges of the auditory timeframes and the subtitle-free timeframes. In some cases, one or more of the candidate time ranges can be merged together or omitted, and a final series of time ranges or timestamps for captions is obtained. The time ranges or timestamps can be used to add additional non-verbal and contextual captions and indicators, for example, or for other purposes.

    Context-aware spell checker
    3.
    发明授权

    公开(公告)号:US10936813B1

    公开(公告)日:2021-03-02

    申请号:US16427793

    申请日:2019-05-31

    Inventor: Prabhakar Gupta

    Abstract: A context-aware spell checker to detect non-word spelling errors and/or suggest corrections. The context-aware spell checker may utilize n-gram conditional probabilities to suggest corrections based on a context of the non-word spelling error. The suggested corrections may be presented as a prioritized list of words based on calculated scores of the n-gram conditional probabilities. Utilizing n-gram conditional probabilities may permit the context-aware spell checker to be integrated across a multitude of languages or configured according to a particular language. The context-aware spell checker may perform spell checking and suggest corrections in real-time, or may be at least partially automated, to reduce user perceived latency and delay.

    Automated quality assessment of translations

    公开(公告)号:US11551013B1

    公开(公告)日:2023-01-10

    申请号:US16806705

    申请日:2020-03-02

    Abstract: Technologies are provided for automated quality assessment of translations. In some embodiments, quality of a translation can be assessed by generating a machine-learning (ML) model that classifies the translation as pertaining to one of three quality categories. A first quality category can include, for example, translations that are deemed satisfactory. A second quality category can include, for example, translations that are deemed subject to edition prior to being deemed satisfactory. A third quality category can include, for example, translations that are deemed unsatisfactory. The generated ML model can then be applied to the translation and a corresponding sentence in a source language in order to classify the translation as pertaining to one of the three categories.

    Song generation using a pre-trained audio neural network

    公开(公告)号:US12189683B1

    公开(公告)日:2025-01-07

    申请号:US17547727

    申请日:2021-12-10

    Abstract: Described herein is a computer-implemented method for extracting and identifying an audio song. An audio file can be accessed by a computing device. A set of audio categories and a set of probabilities associated with the set of audio categories can be determined for a first audio clip. A subset of the set of audio categories can be determined based on a subset of the set of probabilities. Each audio category of the subset of the set of audio categories can correspond to an audio class label. Whether the first audio clip is part of a song can be determined. The song can be defined by combining the first audio clip with other audio clips.

Patent Agency Ranking