-
公开(公告)号:US11381534B2
公开(公告)日:2022-07-05
申请号:US16973746
申请日:2019-06-07
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventor: Ryosuke Aoki , Yusuke Kameyama , Naoki Ohshima , Naoki Mukawa
Abstract: A communication server device that mediates communication information transmitted and received between a plurality of communication terminals provides people who are participating in a conversation using real-time chat with information that can create a trigger to prompt them to end the conversation. Included are a chat log update unit (13) that causes a chat log DB (15) to update and store communication information transmitted or received between a plurality of communication terminals in association with information of transmission or reception time; a conversation duration update unit (14) that causes a conversation duration DB (16) to update and store a duration from the time when the information is stored in the chat log DB (15); and an ongoing conversation determination unit (12) that presents, based on the duration stored in the conversation duration DB (16), interruption information related to the duration to the plurality of communication terminals in response to receiving a request to end transmission and reception of the communication information from at least one of the plurality of communication terminals.
-
公开(公告)号:US20220208208A1
公开(公告)日:2022-06-30
申请号:US17579875
申请日:2022-01-20
Applicant: Samsung Electronics Co., Ltd.
Inventor: Seungyoon HEO , Jungyeol AN , Taewoo KIM , Sunghwa WOO , Gangyoul KIM
Abstract: An electronic device is provided. The electronic device includes a communication module, a sensor module, a sound input module, a sound output module, a memory including a buffer, and a processor. The processor is configured to determine whether an external electronic device has at least one speaker and/or microphone, select, a speech reception device from among the electronic device and the external electronic device based on sensing information received when a call is connected, select a speech transmission device, from among the electronic device and the external electronic device based on a comparison of a speech signal received from the electronic device with a speech signal received from the external electronic device when a call is connected, and if the speech transmission device and the speech reception device are different devices, enable at least one microphone included in the speech transmission device, disable at least one speaker included in the speech transmission device, disable at least one microphone included in the speech reception device, and enable at least one speaker included in the speech reception device, acquire an echo reference signal related to reception speech received from the speech reception device and store the echo reference signal in the buffer, determine, based on comparison of the stored echo reference signal to transmission speech received from the speech transmission device, an EPD value relating to an echo signal included in the transmission speech, and cancel the echo signal by using the echo reference signal and the determined EPD value.
-
公开(公告)号:US11360791B2
公开(公告)日:2022-06-14
申请号:US16499197
申请日:2018-03-27
Applicant: Samsung Electronics Co., Ltd
Inventor: Jihyun Kim , Dongho Jang , Minkyung Hwang , Kyungtae Kim , Inwook Song , Yongjoon Jeon
IPC: G06F9/451 , G06F3/16 , G10L15/22 , G06F3/04817 , G06F3/04883 , G10L15/28
Abstract: Various embodiments of the present invention relate to an electronic device and a screen control method for processing a user input by using the same, and according to the various embodiments of the present invention, the electronic device comprises: a housing; a touchscreen display located inside the housing and exposed through a first part of the housing; a microphone located inside the housing and exposed through a second part of the housing; at least one speaker located inside the housing and exposed through a third part of the housing; a wireless communication circuit located inside the housing; a processor located inside the housing and electrically connected to the touchscreen display, the microphone, the at least one speaker, and the wireless communication circuit; and a memory located inside the housing and electrically connected to the processor, wherein the memory stores a first application program including a first user interface and a second application program including a second user interface, wherein the memory stores instructions, and when the memory is executed, cause the processor to: display the first user interface on the display, while displaying the first user interface, receive a user input through at least one of the display or the microphone, wherein the user input includes a request for performing a task using the second application program, transmit data associated with the user input to an external server via the communication circuit, receive a response from the external server via the communication circuit, wherein the response includes information on a sequence of states of the electronic device to perform the task, and after receiving the response, display the second user interface on a first region of the display, based on the sequence of the states, while displaying a portion of the first user interface on a second region of the display. Other various embodiments, in addition to the various embodiments disclosed in the present invention, are possible.
-
公开(公告)号:US20220172724A1
公开(公告)日:2022-06-02
申请号:US17650468
申请日:2022-02-09
Applicant: Sorenson IP Holdings, LLC
Inventor: Michael Holm , Jasper C. Pan
IPC: G10L15/26 , G10L15/28 , H04M7/00 , G10L15/01 , G10L15/18 , G10L15/30 , H04M3/42 , H04M1/247 , H04M1/253
Abstract: A system is provided that includes a first network interface for a first network type and a second network interface for a second network type that is different from the first network type. The system also includes at least one processor configured to cause the system to perform operations. The operations may include obtaining, from the first network interface, audio from a communication session with a remote device established over the first network and obtaining an indication of a communication device available to participate in the communication session and direct audio obtained from the communication session to a remote transcription system. The operations may also include directing the audio to the second network interface for transmission to the communication device, obtaining transcript data from the remote transcription system based on the audio, and directing the transcript data to the second network interface for transmission to the communication device.
-
公开(公告)号:US20220167082A1
公开(公告)日:2022-05-26
申请号:US17453633
申请日:2021-11-04
Applicant: Sonos, Inc.
Inventor: Eric Frank
Abstract: Systems and methods are disclosed in which a playback device transmits a first sound signal including a predetermined waveform. In one example, the playback device receives a second sound signal including at least one reflection of the first sound signal. The second sound signal is processed to determine a location of a person relative to the playback device, and a characteristic of audio reproduction by the playback device is selected, based on the determined location of the person.
-
公开(公告)号:US20220157297A1
公开(公告)日:2022-05-19
申请号:US17361408
申请日:2021-06-29
Applicant: Tata Consultancy Services Limited
Inventor: Swarnava Dey , Jeet Dutta
Abstract: Automatic speech recognition techniques are implemented in resource constrained devices such as edge devices in internet of things where on-device speech recognition is required for low latency and privacy preservation. Existing neural network models for speech recognition have a large size and are not suitable for deployment in such devices. The present disclosure provides an architecture of a size constrained neural network and a method of training the size constrained neural network. The architecture of the size constrained neural network provides a way of increasing or decreasing number of feature blocks to achieve an accuracy-model size trade off. The method of training the size constrained neural network comprises creating a training dataset with short utterances and training the size constrained neural network with the training dataset to learn short term dependencies in the utterances. The trained size constrained neural network model is suitable for deployment in resource constrained devices.
-
公开(公告)号:US11323800B2
公开(公告)日:2022-05-03
申请号:US16842241
申请日:2020-04-07
Applicant: X Development LLC
Inventor: Thomas Peter Hunt
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for analyzing ultrasonic signals to recognize mouthed articulations. One of the methods includes generating an ultrasonic carrier signal and coupling the ultrasonic carrier signal to a person's vocal tract. The method includes detecting a modulated ultrasonic signal, the modulated ultrasonic signal corresponding to the ultrasonic carrier signal modulated by the person's vocal tract to include information about articulations mouthed by the person; analyzing, using a data processing apparatus, the modulated ultrasonic signal to recognize the articulations mouthed by the person from the information in the modulated ultrasonic signal; and generating, using the data processing apparatus, an output in response to the recognized articulations.
-
公开(公告)号:US11322145B2
公开(公告)日:2022-05-03
申请号:US16802149
申请日:2020-02-26
Applicant: SHARP KABUSHIKI KAISHA
Inventor: Keiko Hirukawa , Satoshi Terada
IPC: G10L15/22 , G10L15/28 , G10L15/183
Abstract: A voice processing device includes a voice receiver that receives a voice, an imager, an image acquirer that acquires a captured image captured by the imager, an utterer identifier that identifies an utterer based on the voice received by the voice receiver and the captured image acquired by the image acquirer, a voice determiner that determines whether the voice is a specific word based on the voice received by the voice receiver and an image of the utterer identified by the utterer identifier, the image being included in the captured image, and a voice transmitter that switches a transmission destination of the voice received by the voice receiver based on a determination result by the voice determiner.
-
公开(公告)号:US11302331B2
公开(公告)日:2022-04-12
申请号:US16750274
申请日:2020-01-23
Applicant: Samsung Electronics Co., Ltd.
Inventor: Dhananjaya N. Gowda , Kwangyoun Kim , Abhinav Garg , Chanwoo Kim
Abstract: Provided are an electronic device for recognizing speech of a user, and a method, performed by the electronic device, of recognizing speech. The method includes obtaining an audio signal based on a speech input based on the audio signal being input, obtaining an output value of a first automatic speech recognition (ASR) model that outputs a character string at a first level; obtaining an output value of a second ASR model that outputs a character string at a second level corresponding to the audio signal based on the output value of the first ASR model based on the audio signal being input; and recognizing the speech from the output value of the second ASR model.
-
公开(公告)号:US20220101841A1
公开(公告)日:2022-03-31
申请号:US17549528
申请日:2021-12-13
Inventor: John Paul LESSO , Robert James HATFIELD
Abstract: A speech recognition system comprises: an input, for receiving an input signal from at least one microphone; a first buffer, for storing the input signal; a noise reduction block, for receiving the input signal and generating a noise reduced input signal; a speech recognition engine, for receiving either the input signal output from the first buffer or the noise reduced input signal from the noise reduction block; and a selection circuit for directing either the input signal output from the first buffer or the noise reduced input signal from the noise reduction block to the speech recognition engine.
-
-
-
-
-
-
-
-
-