Abstract:
A voice registration device includes an acquisition unit configured to acquire a voice signal of an utterance voice of a speaker, an emotion identification unit configured to identify at least one type of emotion of the speaker included in the voice signal, and a registration unit configured to register a voice signal for each type of emotion in a database based on an identification result by the emotion identification unit.
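The per-emotion registration described above can be illustrated with a minimal sketch. It assumes the voice signal has already been split into segments and that a classifier is supplied by the caller; the names `register_by_emotion` and `identify_emotion`, and the use of a plain dictionary as the database, are hypothetical and not taken from the abstract.

```python
from typing import Callable, Dict, List, Optional, Sequence

Signal = Sequence[float]  # hypothetical: one voice segment as raw samples


def register_by_emotion(
    segments: List[Signal],
    identify_emotion: Callable[[Signal], str],
    database: Optional[Dict[str, List[Signal]]] = None,
) -> Dict[str, List[Signal]]:
    """Store each segment under the emotion label the classifier returns."""
    if database is None:
        database = {}
    for segment in segments:
        label = identify_emotion(segment)  # assumed classifier, supplied by caller
        database.setdefault(label, []).append(segment)
    return database
```

With a stand-in classifier such as `lambda s: "angry" if s[0] > 0.5 else "calm"`, the returned dictionary holds one list of segments per identified emotion.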
Abstract:
A sound and video processing system includes: a display that displays a video image captured by a camera; a sound collector that collects sound; and an input device that receives designation of at least one designated location in the video image displayed on the display. A processor generates emphasized audio data, in which sound is emphasized in at least one direction from a position of the sound collector toward at least one position corresponding to the at least one designated location. The processor displays at least one identification shape at the at least one designated location. In response to receiving re-designation of one of the at least one designated location by the input device, the processor outputs audio data in which emphasis of sound stops in a direction from the position of the sound collector toward the position corresponding to the re-designated location.
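The designation and re-designation behaviour can be sketched as a simple toggle. This is an illustration under assumptions: locations are treated as exact pixel coordinates, whereas a real system would presumably hit-test against the displayed identification shape, and the class name `EmphasisTargets` is hypothetical.

```python
from typing import List, Set, Tuple

Location = Tuple[int, int]  # hypothetical: pixel coordinates in the video image


class EmphasisTargets:
    """Tracks which designated locations currently have sound emphasis."""

    def __init__(self) -> None:
        self._active: Set[Location] = set()

    def designate(self, location: Location) -> bool:
        """Toggle a location; return True if emphasis is now on for it."""
        if location in self._active:
            # Re-designation: stop emphasizing sound toward this location.
            self._active.remove(location)
            return False
        self._active.add(location)
        return True

    def active(self) -> List[Location]:
        """Locations toward which sound is still emphasized, in sorted order."""
        return sorted(self._active)
```

Designating a location a second time removes it from the active set, so the emphasized audio would then be generated only for the locations that remain.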
Abstract:
An authentication device includes: an acquisition unit configured to acquire and detect a voice signal of an utterance voice of a speaker; an authentication unit configured to authenticate whether the speaker is the person himself/herself based on collation between the voice signal detected by the acquisition unit and a database; a storage unit configured to store, as question example sentences, a plurality of questions for acquiring a voice signal used for authentication of the speaker by the authentication unit; a display interface configured to display the question example sentence for the speaker on a terminal device; and an example sentence selection unit configured to select a question example sentence to be displayed on the terminal device from the plurality of question example sentences stored in the storage unit.
Abstract:
A sound and video processing system includes: a display, having a rectangular display region, that displays a video image in a circular video-image display region smaller than the rectangular display region; and a sound collector that collects sound. A processor generates emphasized audio data, in which sound is emphasized in at least one direction from a position of the sound collector toward at least one position corresponding to at least one designated location in the video image. In response to receiving designation outside the video-image display region, the processor displays a state display area or an adjustment operation area for the sound to be output from a speaker in a rectangular region which has a diagonal line extending from one of four corners of the rectangular display region to a center of the video-image display region and intersecting with a boundary line of the video-image display region.
Abstract:
Sound collection directionality is formed toward a location corresponding to a position designated in a video of a predetermined region which is imaged by a camera apparatus with a microphone array apparatus as a reference, and audio data is collected with high accuracy. In a directionality control system (10), a signal processing unit (33) derives a sound collection direction (θMAh,θMAv) which is directed from an installation position of a microphone array apparatus (2) toward a sound position corresponding to a position designated in video data on a screen of the display device (36) in response to a user's designation of any position in the video data displayed on the screen. The signal processing unit (33) forms sound collection directionality of audio data in the derived sound collection direction (θMAh,θMAv).
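As a rough illustration of deriving the sound collection direction (θMAh, θMAv), the sketch below assumes that the microphone array position and the sound position are both already known in one shared Cartesian coordinate system, with z pointing up. The mapping from a designated screen position to that 3D sound position is not shown, and the function name and angle conventions are assumptions rather than the method of the abstract.

```python
import math
from typing import Tuple

Point = Tuple[float, float, float]  # hypothetical (x, y, z) in metres


def sound_collection_direction(array_pos: Point, sound_pos: Point) -> Tuple[float, float]:
    """Return (theta_h, theta_v) in radians from the array toward the sound position.

    theta_h is the horizontal angle measured from the x axis in the x-y plane;
    theta_v is the elevation above (positive) or below (negative) that plane.
    """
    dx = sound_pos[0] - array_pos[0]
    dy = sound_pos[1] - array_pos[1]
    dz = sound_pos[2] - array_pos[2]
    theta_h = math.atan2(dy, dx)
    theta_v = math.atan2(dz, math.hypot(dx, dy))
    return theta_h, theta_v
```

For a ceiling-mounted array the vertical angle is typically negative, because the sound position lies below the installation position.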
Abstract:
A voice registration device includes an acquisition unit that acquires a voice signal of an utterance voice of a speaker, a detection unit that detects, from the voice signal, a first utterance section of the speaker and a second utterance section different from the first utterance section, a sensing unit that compares a voice signal of the first utterance section with a voice signal of the second utterance section and senses switching from the speaker to another speaker different from the speaker, and a registration unit that registers the voice signal of the speaker in a database based on the sensing of the switching by the sensing unit.
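One way to picture the comparison between the two utterance sections is a similarity test on per-section feature vectors. The sketch assumes each section has already been reduced to a fixed-length feature vector by some extractor that is not shown; cosine similarity and the threshold value of 0.7 are illustrative choices, not details from the abstract.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("feature vectors must be non-zero")
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def speaker_switched(
    first_features: Sequence[float],
    second_features: Sequence[float],
    threshold: float = 0.7,  # hypothetical value; would need tuning on real data
) -> bool:
    """Sense a speaker switch when the two sections are too dissimilar."""
    return cosine_similarity(first_features, second_features) < threshold
```

When `speaker_switched` returns True, a registration step could then store only the sections attributed to the original speaker.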
Abstract:
An authentication device includes an acquisition unit configured to acquire and detect a voice signal of an utterance voice of a speaker, an authentication unit configured to authenticate whether the speaker is the person himself/herself based on collation between the voice signal detected by the acquisition unit and a database, and a display interface configured to display, on a terminal device, an authentication status indicating whether the speaker is the person himself/herself based on an authentication result of the authentication unit, in which the display interface updates a display content of the authentication status of the speaker by the authentication unit every time the authentication status changes.
Abstract:
A voice registration device includes an acquisition unit that acquires a voice signal of an utterance voice of a speaker and speaker information capable of identifying the speaker, a registration unit that registers the acquired voice signal and speaker information in an associated manner in a database, a progress degree determination unit that repeatedly determines a registration progress degree of the voice signal registered in the database with respect to a registration target amount of the voice signal registered in the database, and a notification unit that notifies the determined registration progress degree.
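The registration progress degree can be read as the ratio of the registered amount to the registration target amount. The sketch below assumes both amounts are measured in seconds of speech and caps the result at 1.0; the unit, the cap, and the function name are assumptions made for illustration.

```python
def registration_progress(registered_seconds: float, target_seconds: float) -> float:
    """Fraction of the target amount already registered, clamped to [0.0, 1.0]."""
    if target_seconds <= 0:
        raise ValueError("target_seconds must be positive")
    return min(1.0, max(0.0, registered_seconds) / target_seconds)
```

Calling this after each new registration and passing the result to a notification routine would correspond to the repeated determination described above; for example, 15 seconds registered against a 60 second target gives 0.25.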
Abstract:
An authentication device includes an acquisition unit configured to acquire a voice signal of a speaker; a detection unit configured to detect a first utterance period during which the speaker is speaking; and an authentication unit configured to authenticate the speaker based on a comparison between a voice signal of the first utterance period and a database. When the authentication unit determines that the speaker authentication is impossible, the detection unit detects a second utterance period different from the first utterance period, and the authentication unit authenticates the speaker based on a comparison between the database and both the voice signal of the first utterance period and a voice signal of the second utterance period.
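The fallback from one utterance period to two can be sketched as follows. The sketch assumes a caller-supplied `score` function that compares a signal with the database and returns None when it cannot reach a decision (for instance, because the signal is too short), and it combines the two periods by simple concatenation. Both choices, along with the names and the threshold, are hypothetical.

```python
from typing import Callable, List, Optional, Sequence

Signal = Sequence[float]


def authenticate_with_fallback(
    first_period: Signal,
    next_period: Callable[[], Signal],
    score: Callable[[List[float]], Optional[float]],
    threshold: float = 0.8,  # hypothetical acceptance threshold
) -> Optional[bool]:
    """Authenticate on the first period, adding a second one only if needed.

    Returns True or False for a decision, or None if still undecidable.
    """
    result = score(list(first_period))
    if result is None:
        # Authentication was impossible: detect a second utterance period
        # and compare both periods together against the database.
        second_period = next_period()
        result = score(list(first_period) + list(second_period))
    if result is None:
        return None
    return result >= threshold
```

The second period is requested lazily through `next_period`, so no further speech is collected when the first period alone is sufficient.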
Abstract:
A camera and a microphone array constituting a monitoring system are capable of receiving electric power from a PoE apparatus through a LAN cable. In a case where a first switching operation is performed on a microphone array side, an output terminal of an input switch is connected to an input terminal of a PoE electric power reception circuit side. An input terminal of an output switch is connected to an output terminal of a PoE electric power transmission circuit side. On a camera side, an output terminal of the input switch is connected to an input terminal of a PoE electric power reception circuit side. The microphone array receives electric power that is supplied from the PoE apparatus for operation and transmits the electric power toward the camera. The camera receives the supplied electric power from the PoE apparatus through the microphone array and the LAN cable for operation.