-
Publication No.: US12119004B2
Publication date: 2024-10-15
Application No.: US17447068
Filing date: 2021-09-08
Inventors: Yichen Yu, Yunsan Guo
CPC classification: G10L15/26, G06F40/109, G06N20/00, G10L25/27, G10L25/63
Abstract: The present disclosure may provide a voice audio data processing system. The voice audio data processing system may obtain voice audio data, which includes one or more voices, each being respectively associated with one of one or more subjects. For one of the one or more voices and the subject associated with the voice, the voice audio data processing system may generate a text based on the voice audio data. The text may have one or more sizes, each size corresponding to one of one or more volumes of the voice. The text may have one or more colors, each color corresponding to one of one or more emotion types of the voice.
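The size and color mapping described in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the `Segment` type, the decibel range, the point sizes, the emotion labels, and the color palette are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical palette: one color per emotion type (not from the patent).
EMOTION_COLORS = {
    "neutral": "#000000",
    "happy": "#e6a700",
    "angry": "#cc0000",
    "sad": "#1f5fbf",
}


@dataclass
class Segment:
    subject: str      # speaker the voice is associated with
    text: str         # transcribed words
    volume_db: float  # measured loudness of the segment
    emotion: str      # detected emotion type


def font_size(volume_db, quiet_db=40.0, loud_db=80.0, min_pt=10, max_pt=28):
    """Map loudness linearly onto a font size, clamping outside the range."""
    clamped = min(max(volume_db, quiet_db), loud_db)
    fraction = (clamped - quiet_db) / (loud_db - quiet_db)
    return round(min_pt + fraction * (max_pt - min_pt))


def style(segment):
    """Return display attributes for one transcribed segment."""
    return {
        "subject": segment.subject,
        "text": segment.text,
        "size_pt": font_size(segment.volume_db),
        # Unknown emotion labels fall back to the neutral color.
        "color": EMOTION_COLORS.get(segment.emotion, EMOTION_COLORS["neutral"]),
    }
```

A linear, clamped mapping is only one possible choice; a real system might quantize volume into a few discrete sizes instead.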
-
Publication No.: US11967339B2
Publication date: 2024-04-23
Application No.: US17861158
Filing date: 2022-07-08
CPC classification: G10L25/63, G06F11/327, G06V40/20
Abstract: A method includes: prompting a user to recite a story associated with a first target emotion; recording the user reciting the story and recording a first timeseries of biosignal data via a set of sensors integrated into a wearable device worn by the user; accessing a first timeseries of emotion markers extracted from the voice recording; labeling the first timeseries of biosignal data according to the first timeseries of emotion markers; generating an emotion model linking biosignals to emotion markers for the user based on the first emotion-labeled timeseries of biosignal data; detecting a second instance of the first target emotion exhibited by the user based on a second timeseries of biosignal data and the emotion model; and notifying the user of the second instance of the first target emotion.
-
Publication No.: US11935527B2
Publication date: 2024-03-19
Application No.: US17078779
Filing date: 2020-10-23
Applicant: Google LLC
Inventors: Matthew Sharifi, Victor Carbune
IPC classification: G10L21/00, G06F16/9032, G06F16/9035, G06F21/12, G10L15/22, G10L15/26, G10L25/00, G06F3/04812
CPC classification: G10L15/22, G06F16/90332, G06F16/9035, G06F21/121, G10L15/26, G06F3/04812
Abstract: Implementations relate to generating a proficiency measure, and utilizing the proficiency measure to adapt one or more automated assistant functionalities. The generated proficiency measure is for a particular class of automated assistant actions, and is specific to an assistant device and/or is specific to a particular user. A generated proficiency measure for a class can reflect a degree of proficiency, of a user and/or of an assistant device, for that class. Various automated assistant functionalities can be adapted, for a particular class, responsive to determining the proficiency measure satisfies a threshold, or fails to satisfy the threshold (or an alternate threshold). The adaptation(s) can make automated assistant processing more efficient and/or improve (e.g., shorten the duration of) user-assistant interaction(s).
-
Publication No.: US11908467B1
Publication date: 2024-02-20
Application No.: US17000886
Filing date: 2020-08-24
Inventors: Rohit Prasad, Anna Santos, David Sanchez, Jared Strawderman, Sarah Castle, Kerry Hammil, Christopher Schindler, Timothy Twerdahl, Joseph Tavares, Bartosz Gulik
IPC classification: G10L21/00, G10L25/00, G10L15/22, H04N21/422, H04N21/478, H04N21/482
CPC classification: G10L15/22, H04N21/42225, H04N21/478, H04N21/4828, G10L2015/223
Abstract: Systems, methods, and computer-readable media are disclosed for dynamic voice search transitioning. Example methods may include receiving, by a computer system in communication with a display, a first incoming voice data indication, initiating a first user interface theme for presentation at a display, wherein the first user interface theme is a default user interface theme, and receiving first voice data. Example methods may include sending the first voice data to a remote server for processing, receiving an indication from the remote server to initiate a second user interface theme, and initiating the second user interface theme for presentation at the display.
-
Publication No.: US11842724B2
Publication date: 2023-12-12
Application No.: US17457854
Filing date: 2021-12-06
IPC classification: G10L15/22, G10L15/06, G10L25/00, G06F16/332, G06F40/35, G06F3/16, G06N5/025, G10L15/065
CPC classification: G10L15/063, G06F3/167, G06F16/3329, G06F40/35, G06N5/025, G10L15/065, G10L15/22, G10L25/00
Abstract: A method for training a dialogue learning model includes presenting, via a user interface of a computing device, an utterance and a list of actions based on the utterance. A selection of an action from the list of actions is received via the user interface. A designated span of the utterance is received via the user interface. The selected action and the designated span of the utterance are provided to a computing system for training the dialogue learning model.
-
Publication No.: US11796423B2
Publication date: 2023-10-24
Application No.: US17986025
Filing date: 2022-11-14
Applicant: Kyutaek Cho
Inventors: Myung Ki Kim
IPC classification: G10L25/00, G03B43/00, G01M99/00, G01R31/28, G06F11/273, G10K11/16, G10L25/51, H04R29/00, B65G61/00, G06F3/041, H04N17/00, B25J9/16, G06F11/263, G06F11/267, G01L25/00
CPC classification: G01M99/005, B25J9/161, B25J9/1674, B65G61/00, G01L25/00, G01M99/008, G01R31/2834, G03B43/00, G06F3/0416, G06F11/263, G06F11/267, G06F11/273, G06F11/2733, G10K11/16, G10L25/51, H04N17/002, H04R29/001, H04R29/004
Abstract: An automatic robot control system and methods relating thereto are described. These systems include components such as a touch screen panel ("TSP") robot controller for controlling a TSP robot, a camera robot controller for controlling a camera robot and an audio robot controller for controlling an audio robot. The TSP robot operates inside a TSP testing subsystem, the camera robot operates inside a camera testing subsystem, and the audio robot operates inside an audio testing subsystem. Inside the audio testing subsystem, an audio signals measurement system, using a bi-directional coupling, controls the operation of the audio robot controller. In this control scheme, a test application controller is designed to control the different types of subsystem robots. Methods relating to TSP, camera, and audio robots, and their controllers, taken individually or in combination, for automatic testing of device functionalities are also described.
-
Publication No.: US11741980B2
Publication date: 2023-08-29
Application No.: US17232807
Filing date: 2021-04-16
Inventors: Fengyan Qi, Lei Miao
IPC classification: G10L21/013, G10L25/00, G10L19/00, G10L21/028, G10L25/90
CPC classification: G10L21/013, G10L19/00, G10L21/028, G10L25/00, G10L25/90
Abstract: A method and an apparatus for detecting correctness of a pitch period, where the method for detecting correctness of a pitch period includes determining, according to an initial pitch period of an input signal in a time domain, a pitch frequency bin of the input signal, where the initial pitch period is obtained by performing open-loop detection on the input signal, determining, based on an amplitude spectrum of the input signal in a frequency domain, a pitch period correctness decision parameter, associated with the pitch frequency bin, of the input signal, and determining correctness of the initial pitch period according to the pitch period correctness decision parameter.
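The three steps in this abstract (time-domain period to frequency bin, a decision parameter computed from the amplitude spectrum, then a correctness decision) can be sketched roughly as follows. The abstract does not define the decision parameter, so the ratio used here, the 0.5 threshold, and all function names are hypothetical stand-ins and not the patented method.

```python
import cmath
import math


def amplitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for a short demo frame)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def pitch_bin(period_samples, n_fft):
    """Frequency bin nearest the fundamental implied by a time-domain period."""
    return round(n_fft / period_samples)


def pitch_period_is_correct(frame, period_samples, threshold=0.5):
    """Assumed rule: accept the period if the amplitude around its pitch bin
    is at least `threshold` times the largest non-DC amplitude."""
    spectrum = amplitude_spectrum(frame)
    k = pitch_bin(period_samples, len(frame))
    if k < 1 or k >= len(spectrum):
        return False
    lo, hi = max(1, k - 1), min(len(spectrum) - 1, k + 1)
    local_peak = max(spectrum[lo:hi + 1])   # tolerate one bin of rounding error
    global_peak = max(spectrum[1:])         # skip the DC bin
    return global_peak > 0 and local_peak / global_peak >= threshold
```

For a pure tone with a 16-sample period in a 128-sample frame, the true period maps to bin 8 and is accepted, while a doubled estimate of 32 samples maps to bin 4, where there is almost no energy, and is rejected.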
-
Publication No.: US11727949B2
Publication date: 2023-08-15
Application No.: US16983974
Filing date: 2020-08-03
Inventors: Rebecca Kleinberger, Michael Erkkinen, George Stefanakis, Akito van Troyer, Satrajit Ghosh, Janet Baker, Tod Machover
IPC classification: G06F15/00, G10L25/00, G10L21/013, G10L21/034, G06F3/16, H04R3/04, G10L17/00, G10L21/003, G10L21/02
CPC classification: G10L21/013, G06F3/16, G10L17/00, G10L21/003, G10L21/02, G10L21/034, H04R3/04
Abstract: A feedback system may play back, to a user, an altered version of the user's voice in real time, in order to reduce stuttering by the user. The system may operate in different feedback modes at different times. For instance, the system may detect when the severity of a user's stuttering increases, which is indicative of the user habituating to the current feedback mode. The system may then switch to a different feedback mode. In some cases, the feedback modes include at least a Whisper mode, a Reverb mode, and a Harmony mode. In Whisper mode, the user's voice may be transformed to sound as if it were whispering in the user's ears. In Harmony mode, the user's voice may be altered as if the user were harmonizing with himself or herself. In Reverb mode, the user's voice may be altered so that it reverberates.
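The mode-switching behavior in this abstract can be pictured as a small state machine. This is only a sketch under assumptions: the severity score, the 0.2 rise threshold, the cyclic mode order, and the class name are hypothetical and are not specified in the abstract.

```python
MODES = ("whisper", "reverb", "harmony")


class FeedbackModeSwitcher:
    """Switch feedback mode when stuttering severity rises above a baseline."""

    def __init__(self, rise_threshold=0.2):
        self.mode_index = 0
        self.baseline = None  # lowest severity seen in the current mode
        self.rise_threshold = rise_threshold

    @property
    def mode(self):
        return MODES[self.mode_index]

    def update(self, severity):
        """Feed one severity measurement and return the mode to use next."""
        if self.baseline is None:
            self.baseline = severity
        elif severity - self.baseline > self.rise_threshold:
            # Rising severity is treated as habituation: move to the next mode
            # (assumed cyclic order) and relearn the baseline there.
            self.mode_index = (self.mode_index + 1) % len(MODES)
            self.baseline = None
        else:
            self.baseline = min(self.baseline, severity)
        return self.mode
```

Tracking the lowest severity seen in the current mode means a slow upward drift still triggers a switch once it exceeds the threshold.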
-
Publication No.: US11568887B1
Publication date: 2023-01-31
Application No.: US17234186
Filing date: 2021-04-19
Abstract: Various examples are provided for surveillance of an audio stream. In one example, a method includes identifying presence or absence of a sound type of interest at a location during a time period; selecting the sound type from a library of sound type information to provide a collection of sound type information; incorporating the collection on a device proximate to the location; acquiring an audio stream from the location by the device to provide a locational audio stream; analyzing the locational audio stream to determine whether a sound type in the collection is present in the audio stream; and generating a notification to a user or computer if a sound type in the collection is present. The device can acquire and process the audio stream. In another example, a bulk sound type information library can be generated by identifying sound types of interest and including them based upon a confidence level.
-
Publication No.: US11348475B2
Publication date: 2022-05-31
Application No.: US15649382
Filing date: 2017-07-13
Applicant: The Boeing Company
IPC classification: G10L21/00, G10L25/00, G09B5/02, G05B19/042, G06T19/00, G09B19/24, G10L15/22, G06F16/242, H04L67/12
Abstract: A cognitive assistant that allows a maintainer to speak to an application using natural language is disclosed. The maintainer can quickly interact with an application hands-free without the need to use complex user interfaces or memorized voice commands. The assistant provides instructions to the maintainer using augmented reality audio and visual cues. The assistant will walk the maintainer through maintenance tasks and verify proper execution using IoT sensors. If, after completing a step, the IoT sensors are not as expected, the maintainer is notified on how to resolve the situation.