SYSTEM AND METHOD FOR OUT-OF-VOCABULARY PHRASE SUPPORT IN AUTOMATIC SPEECH RECOGNITION

    Publication No.: US20210343277A1

    Publication date: 2021-11-04

    Application No.: US17160278

    Filing date: 2021-01-27

    IPC classification: G10L15/14 G10L25/27

    Abstract: An electronic device includes an audio sensor, a memory, and at least one processor coupled to the audio sensor and the memory. The at least one processor is configured to receive, via the audio sensor, an audio input. The at least one processor is further configured to perform, using an automatic speech recognition (ASR) model and an entity prediction model, out-of-vocabulary prediction of an entity. The at least one processor is further configured to receive an ASR hypothesis including the predicted entity. The at least one processor is further configured to output text including the predicted entity.

    Unsupervised learning system and method for performing weighting for improvement in speech recognition performance and recording medium for performing the method

    Publication No.: US11164565B2

    Publication date: 2021-11-02

    Application No.: US16561651

    Filing date: 2019-09-05

    Inventor: Jeehye Lee

    Abstract: A learning system and method for updating recognition performance by assigning weights according to a confidence level of data are discussed. The unsupervised learning system includes a memory configured to store speech data received from a server that performs speech recognition; and a processor configured to measure confidence levels of pieces of learnable data stored in the memory and classify the pieces of learnable data into learning data and adaptation data, generate a learning model by performing unsupervised learning on the learning data, generate an adaptation model using the adaptation data, and evaluate speech recognition performance for the learning model and the adaptation model, wherein the processor is configured to assign weights by applying the measured confidence levels to the learning model and the adaptation model and update recognition performance with the learning model and the adaptation model to which the weights are applied.

    Building system with an entity graph storing software logic

    Publication No.: US11151983B2

    Publication date: 2021-10-19

    Application No.: US17105263

    Filing date: 2020-11-25

    Abstract: One or more non-transitory computer readable media contain program instructions that, when executed, cause one or more processors to: receive first raw data including one or more first data points generated by a first object of a plurality of objects associated with one or more buildings; generate first input timeseries according to the one or more data points; access a database of interconnected smart entities, the smart entities including object entities representing each of the plurality of objects and data entities representing stored data, the smart entities being interconnected by relational objects indicating relationships between the smart entities; identify a first object entity representing the first object from a first identifier in the first input timeseries; identify a first data entity from a first relational object indicating a relationship between the first object entity and the first data entity; and store the first input timeseries in the first data entity.
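
The lookup described in this abstract (object entity, then relational object, then data entity) can be illustrated with a small in-memory graph. This is only a sketch under stated assumptions: the class names, the `store` method, the relation label `hasTemperature`, and the identifiers are all hypothetical and do not come from the patent, which describes a database rather than Python objects.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Point = Tuple[float, float]  # (timestamp in seconds, value); assumed layout


@dataclass
class ObjectEntity:
    identifier: str  # e.g. a device id (hypothetical)


@dataclass
class DataEntity:
    name: str
    timeseries: List[Point] = field(default_factory=list)


class EntityGraph:
    """Toy stand-in for the 'database of interconnected smart entities'."""

    def __init__(self) -> None:
        self.objects: Dict[str, ObjectEntity] = {}
        self.data: Dict[str, DataEntity] = {}
        # Relational objects as (object id, relation label, data entity name).
        self.relations: List[Tuple[str, str, str]] = []

    def add_object(self, identifier: str) -> None:
        self.objects[identifier] = ObjectEntity(identifier)

    def relate(self, object_id: str, relation: str, data_name: str) -> None:
        self.data.setdefault(data_name, DataEntity(data_name))
        self.relations.append((object_id, relation, data_name))

    def store(self, object_id: str, relation: str, points: List[Point]) -> DataEntity:
        # 1) identify the object entity from the identifier in the timeseries
        obj = self.objects[object_id]
        # 2) follow the relational object to the data entity
        for src, rel, dst in self.relations:
            if src == obj.identifier and rel == relation:
                # 3) store the input timeseries in that data entity
                self.data[dst].timeseries.extend(points)
                return self.data[dst]
        raise KeyError(f"no '{relation}' relation for {object_id}")


graph = EntityGraph()
graph.add_object("thermostat-1")
graph.relate("thermostat-1", "hasTemperature", "thermostat-1/temperature")
target = graph.store("thermostat-1", "hasTemperature", [(0.0, 21.5), (60.0, 21.7)])
```

Keeping relations as separate records, rather than as fields on the object entity, mirrors the abstract's description of relational objects that connect otherwise independent entities.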

    MASKING SYSTEMS AND METHODS
    Patent application

    Publication No.: US20210183372A1

    Publication date: 2021-06-17

    Application No.: US16717507

    Filing date: 2019-12-17

    Applicant: Spotify AB

    Abstract: Term masking is performed by generating a time-alignment value for a plurality of identifiable units of sound in vocal audio content contained in a mixed audio track, force-aligning each of the plurality of identifiable units of sound to the vocal audio content based on the time-alignment value, thereby generating a plurality of force-aligned identifiable units of sound, identifying from the plurality of force-aligned identifiable units of sound a force-aligned identifiable unit of sound to be muddled, and audio muddling the force-aligned identifiable unit of sound to be muddled.
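
The last two steps of this abstract (selecting a force-aligned unit and muddling it) can be sketched as follows. The sketch assumes forced alignment has already produced start and end times for each unit, and it interprets "muddling" as scaling the samples by a gain, which is an assumption; the abstract does not define the operation. The names `AlignedUnit`, `muddle`, and `blocked` are hypothetical.

```python
from typing import List, NamedTuple, Set


class AlignedUnit(NamedTuple):
    text: str     # identifiable unit of sound, here a word
    start: float  # start time in seconds (from forced alignment)
    end: float    # end time in seconds


def muddle(
    samples: List[float],
    sample_rate: int,
    units: List[AlignedUnit],
    blocked: Set[str],
    gain: float = 0.0,
) -> List[float]:
    """Return a copy of `samples` with every blocked unit scaled by `gain`."""
    out = list(samples)
    for unit in units:
        if unit.text.lower() not in blocked:
            continue
        # Convert the aligned time span to a clamped sample range.
        first = max(0, int(unit.start * sample_rate))
        last = min(len(out), int(unit.end * sample_rate))
        for i in range(first, last):
            out[i] *= gain
    return out


# Tiny synthetic example: 8 samples at 8 Hz, i.e. one second of "audio".
samples = [1.0] * 8
units = [AlignedUnit("hello", 0.0, 0.5), AlignedUnit("darn", 0.5, 0.75)]
masked = muddle(samples, sample_rate=8, units=units, blocked={"darn"})
```

With a gain of 0.0 the blocked span is silenced; a real system would more likely apply the operation to the vocal content only and then remix, which this sketch does not attempt.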

    SYSTEM AND METHOD FOR AUTOMATIC SPEECH ANALYSIS

    Publication No.: US20210134277A1

    Publication date: 2021-05-06

    Application No.: US16605025

    Filing date: 2018-04-17

    Abstract: A computer implemented method for automatic speech analysis comprising setting a target phrase, the target phrase comprising a target phoneme, the target phoneme having corresponding target values of a set of phonological features associated therewith, wherein each target value is one of: an indication that the phonological feature should be present, an indication that the phonological feature should be absent, or an indication that the presence of the phonological feature is not specified, receiving a speech signal, wherein the speech signal comprises a user's attempt to say the target phrase, analysing the speech signal to determine the phonological features present within a portion of the speech signal corresponding to the target phoneme and assigning a probability to each of the set of phonological features, comparing the probability assigned to each phonological feature based on the speech signal to the expected target value of that phonological feature within the phrase, determining a deviation from the comparison, and outputting the deviation to provide feedback on the closeness of the speech signal to the phrase.
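
The comparison step in this abstract can be illustrated with a three-valued target per phonological feature. This is a minimal sketch, assuming the deviation is the absolute difference between the assigned probability and an expected value of 1 (present) or 0 (absent), with unspecified features skipped; the patent abstract does not state the actual deviation measure, and the feature names and function names are hypothetical.

```python
from typing import Dict, Optional

# Target value per feature: True = should be present, False = should be
# absent, None = presence not specified.
Target = Optional[bool]


def feature_deviation(
    target: Dict[str, Target], probs: Dict[str, float]
) -> Dict[str, float]:
    """Per-feature deviation in [0, 1]; unspecified features are skipped."""
    deviations: Dict[str, float] = {}
    for name, want in target.items():
        if want is None:
            continue
        expected = 1.0 if want else 0.0
        deviations[name] = abs(expected - probs[name])
    return deviations


def overall_deviation(deviations: Dict[str, float]) -> float:
    """Mean deviation across the specified features (0.0 if there are none)."""
    return sum(deviations.values()) / len(deviations) if deviations else 0.0


# Hypothetical target phoneme: voiced, not nasal, rounding unspecified.
target = {"voiced": True, "nasal": False, "rounded": None}
# Probabilities a classifier might assign for the matching speech segment.
probs = {"voiced": 0.75, "nasal": 0.25, "rounded": 0.5}

dev = feature_deviation(target, probs)
score = overall_deviation(dev)
```

Returning the per-feature deviations as well as the mean lets feedback point at the specific feature that was off, rather than reporting only an overall closeness score.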