ELECTRONIC DEVICE AND CONTROL METHOD THEREOF

    公开(公告)号:US20240274128A1

    公开(公告)日:2024-08-15

    申请号:US18420338

    申请日:2024-01-23

    Abstract: An electronic device includes a microphone; at least one memory storing a wake-up word detection model; and at least one processor configured to: obtain a sound signal received through the microphone, input the sound signal into the wake-up word detection model, obtain, as an output of the wake-up word detection model, one or more first probability scores corresponding to one or more sections of the sound signal, wherein each first probability score of the one or more first probability scores represents a probability that a corresponding section of the one or more sections of the sound signal corresponds to a wake-up word, identify a first section of the sound signal, among the one or more sections of the sound signal, that corresponds to a first probability score, among the one or more first probability scores, that exceeds a first threshold value, and based on identifying a predetermined acoustic signal in the sound signal, reduce the first threshold value.

    ELECTRONIC DEVICE FOR UPDATING TARGET SPEAKER USING VOICE SIGNAL INCLUDED IN AUDIO SIGNAL AND TARGET SPEAKER UPDATING METHOD THEREFOR

    公开(公告)号:US20250149044A1

    公开(公告)日:2025-05-08

    申请号:US19013349

    申请日:2025-01-08

    Abstract: An electronic device is provided. The electronic device includes: a voice reception unit comprising circuitry, memory storing an artificial intelligence model configured to acquire a voice signal of a user from an audio signal and information on characteristics of a plurality of users, and at least one processor, comprising processing circuitry, individually and/or collectively, configured to: based on an audio signal being received through the voice reception unit, obtain a first audio signal by inputting information on a characteristic of a first user set as a target speaker among the plurality of users and the received audio signal to the artificial intelligence model, based on voice recognition based on the first audio signal failing, identify a similarity between information on a characteristic of a second audio signal excluding the first audio signal among the received audio signals and information on characteristics of remaining users excluding the first user among the plurality of users, and change the target speaker to a second user among the plurality of users.

Patent Agency Ranking