-
公开(公告)号:US20240274128A1
公开(公告)日:2024-08-15
申请号:US18420338
申请日:2024-01-23
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Changwoo HAN , Dokyun LEE , Jungwook HWANG , Jaeyoung ROH , Jonguk YOO , Youngmoon JUNG , Youngo HAN
CPC classification number: G10L15/20 , G06F3/167 , G10L15/083 , G10L15/22 , G10L2015/088 , G10L2015/223
Abstract: An electronic device includes a microphone; at least one memory storing a wake-up word detection model; and at least one processor configured to: obtain a sound signal received through the microphone, input the sound signal into the wake-up word detection model, obtain, as an output of the wake-up word detection model, one or more first probability scores corresponding to one or more sections of the sound signal, wherein each first probability score of the one or more first probability scores represents a probability that a corresponding section of the one or more sections of the sound signal corresponds to a wake-up word, identify a first section of the sound signal, among the one or more sections of the sound signal, that corresponds to a first probability score, among the one or more first probability scores, that exceeds a first threshold value, and based on identifying a predetermined acoustic signal in the sound signal, reduce the first threshold value.
-
公开(公告)号:US20250149044A1
公开(公告)日:2025-05-08
申请号:US19013349
申请日:2025-01-08
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jonguk YOO , Dokyun LEE , Jaeyoung ROH , Youngmoon JUNG , Changwoo HAN , Jungwook HWANG
IPC: G10L17/06
Abstract: An electronic device is provided. The electronic device includes: a voice reception unit comprising circuitry, memory storing an artificial intelligence model configured to acquire a voice signal of a user from an audio signal and information on characteristics of a plurality of users, and at least one processor, comprising processing circuitry, individually and/or collectively, configured to: based on an audio signal being received through the voice reception unit, obtain a first audio signal by inputting information on a characteristic of a first user set as a target speaker among the plurality of users and the received audio signal to the artificial intelligence model, based on voice recognition based on the first audio signal failing, identify a similarity between information on a characteristic of a second audio signal excluding the first audio signal among the received audio signals and information on characteristics of remaining users excluding the first user among the plurality of users, and change the target speaker to a second user among the plurality of users.
-