SEMIAUTOMATED RELAY METHOD AND APPARATUS

    公开(公告)号:US20250069601A1

    公开(公告)日:2025-02-27

    申请号:US18943527

    申请日:2024-11-11

    Applicant: Ultratec, Inc.

    Abstract: A method to transcribe communications includes the steps of obtaining a plurality of hypothesis transcriptions of a voice signal generated by a speech recognition system, determining consistent words that are included in at least first and second of the plurality of hypothesis transcriptions, in response to determining the consistent words, providing the consistent words to a device for presentation of the consistent words to an assisted user, and presenting the consistent words via a display screen on the device, wherein a rate of the presentation of the words on the display screen is variable.

    INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM

    公开(公告)号:US20250069377A1

    公开(公告)日:2025-02-27

    申请号:US18727106

    申请日:2023-01-25

    Abstract: An information processing apparatus according to an embodiment of the present technology includes a generation unit, an evaluation unit, and an update unit. The generation unit generates input data on the basis of a predetermined parameter. The evaluation unit generates evaluation data on the basis of first output data that includes evaluation target data and is output by inputting first input data generated by the generation unit to a first recognition model, and second output data that includes a pseudo label as a pseudo correct answer of the evaluation target data and is output by inputting second input data generated by the generation unit to a second recognition model. The update unit updates the predetermined parameter on the basis of the evaluation data.

    METHOD AND APPARATUS FOR TRANSCRIBING AUDIO

    公开(公告)号:US20250006187A1

    公开(公告)日:2025-01-02

    申请号:US18885132

    申请日:2024-09-13

    Inventor: Hongtao ZOU Si CHEN

    Abstract: The present disclosure provides a method and apparatus for transcribing audio, relates to the field of artificial intelligence technology. A specific embodiment of the method includes: receiving audio information uploaded through a scenario entry of a storage service application installed on a client; determining, based on the scenario entry, a scenario type of the audio information; performing speech recognition on the audio information to obtain text information corresponding to the audio information; and inputting the text information and a prompt corresponding to the scenario type into a language model to obtain summary information, where the language model is obtained by performing supervised fine-tuning on a pre-trained model using samples corresponding to various scenario types, and the prompts corresponding to the various scenario types are obtained by tuning initial prompts corresponding to the various scenario types using the language model.

    Electronic device and control method thereof

    公开(公告)号:US12112745B2

    公开(公告)日:2024-10-08

    申请号:US17292116

    申请日:2019-09-09

    CPC classification number: G10L15/22 G10L15/01 G10L2015/223

    Abstract: An electronic device is disclosed. The present electronic device comprises: a voice receiving unit; and a processor, wherein the processor: when a user's voice is received through the voice receiving unit, determines an accumulation level of utterance history information corresponding to the characteristics of the user's voice; when the accumulation level of utterance history information is below a predetermined threshold level, provides response information corresponding to the user's voice on the basis of user information related to the characteristics of the user's voice; and when the accumulation level of utterance history information is equal to or higher than the predetermined threshold level, provides response information corresponding to the user's voice on the basis of the user information and the utterance history information.

    Creative work systems and methods thereof

    公开(公告)号:US12112740B2

    公开(公告)日:2024-10-08

    申请号:US17545815

    申请日:2021-12-08

    Applicant: SOCIETE BIC

    Abstract: A computer-implemented method for measuring cognitive load of a user creating a creative work in a creative work system, may include generating at least one verbal statement capable of provoking at least one verbal response from the user, prompting the user to vocally interact with the creative work system by vocalizing the at least one generated verbal statement to the user via an audio interface of the creative work system, and obtaining the at least one verbal response from the user via the audio interface, and determining the cognitive load of the user based on the at least one verbal response obtained from the user, wherein generating the at least one verbal statement is based on at least one predicted verbal response suitable for determining the cognitive load of the user.

    NON-SPEECH INPUT TO SPEECH PROCESSING SYSTEM
    10.
    发明公开

    公开(公告)号:US20240296829A1

    公开(公告)日:2024-09-05

    申请号:US18663831

    申请日:2024-05-14

    Inventor: Travis Grizzel

    Abstract: A system and method for associating motion data with utterance audio data for use with a speech processing system. A device, such as a wearable device, may be capable of capturing utterance audio data and sending it to a remote server for speech processing, for example for execution of a command represented in the utterance. The device may also capture motion data using motion sensors of the device. The motion data may correspond to gestures, such as head gestures, that may be interpreted by the speech processing system to determine and execute commands. The device may associate the motion data with the audio data so the remote server knows what motion data corresponds to what portion of audio data for purposes of interpreting and executing commands. Metadata sent with the audio data and/or motion data may include association data such as timestamps, session identifiers, message identifiers, etc.

Patent Agency Ranking