-
公开(公告)号:US20250061873A1
公开(公告)日:2025-02-20
申请号:US18721380
申请日:2022-12-02
Applicant: CASIO COMPUTER CO., LTD.
Inventor: Makoto DANJYO , Fumiaki OTA , Atsushi NAKAMURA
IPC: G10H5/00 , G10L21/043 , G10L25/00
Abstract: An onset length changing device includes at least one processor configured to: perform a process of advancing a first vocal generation start timing of a first vowel in a certain syllable including a first consonant and a second vocal generation start timing of a second vowel in a syllable including a second consonant that is different from the certain syllable, based on a parameter designated in response to one user operation; and not to change a vocal generation start timing of a vowel in a syllable without including a consonant.
-
公开(公告)号:US12119004B2
公开(公告)日:2024-10-15
申请号:US17447068
申请日:2021-09-08
Inventor: Yichen Yu , Yunsan Guo
CPC classification number: G10L15/26 , G06F40/109 , G06N20/00 , G10L25/27 , G10L25/63
Abstract: The present disclosure may provide a voice audio data processing system. The voice audio data processing system may obtain voice audio data, which includes one or more voices, each being respectively associated with one of one or more subjects. For one of the one or more voices and the subject associated with the voice, the voice audio processing system may generate a text based on the voice audio data. The text may have one or more sizes, each size corresponding to one of one or more volumes of the voice. The text may have one or more colors, each color corresponding to one of one or more emotion types of the voice.
-
公开(公告)号:US11967339B2
公开(公告)日:2024-04-23
申请号:US17861158
申请日:2022-07-08
Applicant: Feel Therapeutics Inc.
Inventor: Georgios Eleftheriou , Panagiotis Fatouros , Charalampos Tsirmpas
CPC classification number: G10L25/63 , G06F11/327 , G06V40/20
Abstract: A method includes: prompting a user to recite a story associated with a first target emotion; recording the user reciting the story and recording a first timeseries of biosignal data via a set of sensors integrated into a wearable device worn by the user; accessing a first timeseries of emotion markers extracted from the voice recording; labeling the first timeseries of biosignal data according to the first timeseries of emotion markers; generating an emotion model linking biosignals to emotion markers for the user based on the first emotion-labeled timeseries of biosignal data; detecting a second instance of the first target emotion exhibited by the user based on a second timeseries of biosignal data and the emotion model; and notifying the user of the second instance of the first target emotion.
-
公开(公告)号:US11935527B2
公开(公告)日:2024-03-19
申请号:US17078779
申请日:2020-10-23
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L21/00 , G06F16/9032 , G06F16/9035 , G06F21/12 , G10L15/22 , G10L15/26 , G10L25/00 , G06F3/04812
CPC classification number: G10L15/22 , G06F16/90332 , G06F16/9035 , G06F21/121 , G10L15/26 , G06F3/04812
Abstract: Implementations relate to generating a proficiency measure, and utilizing the proficiency measure to adapt one or more automated assistant functionalities. The generated proficiency measure is for a particular class of automated assistant actions, and is specific to an assistant device and/or is specific to a particular user. A generated proficiency measure for a class can reflect a degree of proficiency, of a user and/or of an assistant device, for that class. Various automated assistant functionalities can be adapted, for a particular class, responsive to determining the proficiency measure satisfies a threshold, or fails to satisfy the threshold (or an alternate threshold). The adaptation(s) can make automated assistant processing more efficient and/or improve (e.g., shorten the duration of) user-assistant interaction(s).
-
公开(公告)号:US11908467B1
公开(公告)日:2024-02-20
申请号:US17000886
申请日:2020-08-24
Applicant: Amazon Technologies, Inc.
Inventor: Rohit Prasad , Anna Santos , David Sanchez , Jared Strawderman , Sarah Castle , Kerry Hammil , Christopher Schindler , Timothy Twerdahl , Joseph Tavares , Bartosz Gulik
IPC: G10L21/00 , G10L25/00 , G10L15/22 , H04N21/422 , H04N21/478 , H04N21/482
CPC classification number: G10L15/22 , H04N21/42225 , H04N21/478 , H04N21/4828 , G10L2015/223
Abstract: Systems, methods, and computer-readable media are disclosed for dynamic voice search transitioning. Example methods may include receiving, by a computer system in communication with a display, a first incoming voice data indication, initiating a first user interface theme for presentation at a display, wherein the first user interface theme is a default user interface theme, and receiving first voice data. Example methods may include sending the first voice data to a remote server for processing, receiving an indication from the remote server to initiate a second user interface theme, and initiating the second user interface theme for presentation at the display.
-
公开(公告)号:US11842724B2
公开(公告)日:2023-12-12
申请号:US17457854
申请日:2021-12-06
Applicant: Microsoft Technology Licensing, LLC
Inventor: Percy Shuo Liang , David Leo Wright Hall , Joshua James Clausman
IPC: G10L15/22 , G10L15/06 , G10L25/00 , G06F16/332 , G06F40/35 , G06F3/16 , G06N5/025 , G10L15/065
CPC classification number: G10L15/063 , G06F3/167 , G06F16/3329 , G06F40/35 , G06N5/025 , G10L15/065 , G10L15/22 , G10L25/00
Abstract: A method for training a dialogue learning model includes presenting, via a user interface of a computing device, an utterance and a list of actions based on the utterance. A selection of an action from the list of actions is received via the user interface. A designated span of the utterance is received via the user interface. The selected action and the designated span of the utterance is provided to a computing system for training the dialogue learning model.
-
公开(公告)号:US11796423B2
公开(公告)日:2023-10-24
申请号:US17986025
申请日:2022-11-14
Applicant: Kyutaek Cho
Inventor: Myung Ki Kim
IPC: G10L25/00 , G03B43/00 , G01M99/00 , G01R31/28 , G06F11/273 , G10K11/16 , G10L25/51 , H04R29/00 , B65G61/00 , G06F3/041 , H04N17/00 , B25J9/16 , G06F11/263 , G06F11/267 , G01L25/00
CPC classification number: G01M99/005 , B25J9/161 , B25J9/1674 , B65G61/00 , G01L25/00 , G01M99/008 , G01R31/2834 , G03B43/00 , G06F3/0416 , G06F11/263 , G06F11/267 , G06F11/273 , G06F11/2733 , G10K11/16 , G10L25/51 , H04N17/002 , H04R29/001 , H04R29/004
Abstract: An automatic robot control system and methods relating thereto are described. These systems include components such as a touch screen panel (“TSP”) robot controller for controlling a TSP robot, a camera robot controller for controlling a camera robot and an audio robot controller for controlling an audio robot. The TSP robot operates inside a TSP testing subsystem, the camera robot operates inside a camera testing subsystem, and the audio robot operates inside an audio testing subsystem. Inside the audio testing subsystem, an audio signals measurement system, using a bi-directional coupling, controls the operation of the audio robot controller. In this control scheme, a test application controller is designed to control the different types of subsystem robots.
Methods relating to TSP, camera, and audio robots, and their controllers, taken individually or in combination, for automatic testing of device functionalities are also described.-
公开(公告)号:US11741980B2
公开(公告)日:2023-08-29
申请号:US17232807
申请日:2021-04-16
Applicant: Huawei Technologies Co., Ltd.
Inventor: Fengyan Qi , Lei Miao
IPC: G10L21/013 , G10L25/00 , G10L19/00 , G10L21/028 , G10L25/90
CPC classification number: G10L21/013 , G10L19/00 , G10L21/028 , G10L25/00 , G10L25/90
Abstract: A method and an apparatus for detecting correctness of a pitch period, where the method for detecting correctness of a pitch period includes determining, according to an initial pitch period of an input signal in a time domain, a pitch frequency bin of the input signal, where the initial pitch period is obtained by performing open-loop detection on the input signal, determining, based on an amplitude spectrum of the input signal in a frequency domain, a pitch period correctness decision parameter, associated with the pitch frequency bin, of the input signal, and determining correctness of the initial pitch period according to the pitch period correctness decision parameter.
-
公开(公告)号:US11727949B2
公开(公告)日:2023-08-15
申请号:US16983974
申请日:2020-08-03
Inventor: Rebecca Kleinberger , Michael Erkkinen , George Stefanakis , Akito van Troyer , Satrajit Ghosh , Janet Baker , Tod Machover
IPC: G06F15/00 , G10L25/00 , G10L21/013 , G10L21/034 , G06F3/16 , H04R3/04 , G10L17/00 , G10L21/003 , G10L21/02
CPC classification number: G10L21/013 , G06F3/16 , G10L17/00 , G10L21/003 , G10L21/02 , G10L21/034 , H04R3/04
Abstract: A feedback system may play back, to a user, an altered version of the user's voice in real time, in order to reduce stuttering by the user. The system may operate in different feedback modes at different times. For instance, the system may detect when the severity of a user's stuttering increases, which is indicative of the user habituating to the current feedback mode. The system may then switch to a different feedback mode. In some cases, the feedback modes include at least a Whisper mode, a Reverb mode, and a Harmony mode. In Whisper mode, the user's voice may be transformed to sound as if it were whispering in the user's ears. In Harmony mode, the user's voice may be altered as if the user were harmonizing with himself or herself. In Reverb mode, the user's voice may be altered so that it reverberates.
-
10.
公开(公告)号:US11568887B1
公开(公告)日:2023-01-31
申请号:US17234186
申请日:2021-04-19
Applicant: AgLogica Holding, Inc.
Inventor: Marcel Joseph Sarzen , Christopher George Rosati
Abstract: Various examples are provided for surveillance of an audio stream. In one example, a method includes identifying presence or absence of a sound type of interest at a location during a time period; selecting the sound type from a library of sound type information to provide a collection of sound type information; incorporating the collection on a device proximate to the location; acquiring an audio stream from the location by the device to provide a locational audio stream; analyzing the locational audio stream to determine whether a sound type in the collection is present in the audio stream; and generating a notification to a user or computer if a sound type in the collection is present. The device can acquire and process the audio stream. In another example, a bulk sound type information library can be generated by identifying sound types of interest including them based upon a confidence level.
-
-
-
-
-
-
-
-
-