Abstract:
In an embodiment, a transcription support system includes a first storage, a playback unit, a second storage, a text generating unit, an estimating unit, and a setting unit. The first storage stores voice data; the playback unit plays back the voice data; and the second storage stores voice indices, each of which associates a character string obtained from a voice recognition process with voice positional information that indicates the temporal position in the voice data corresponding to that character string. The text generating unit creates text; the estimating unit estimates already-transcribed voice positional information based on the voice indices; and the setting unit sets a playback starting position, which indicates the position in the voice data at which playback starts, based on the already-transcribed voice positional information.
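The estimating and setting steps can be illustrated with a minimal sketch. It is not the claimed method: the `VoiceIndex` record, the millisecond units, and the in-order substring matching rule are all assumptions made for illustration. It walks the voice indices in temporal order, looks for each recognized string in the text typed so far, and takes the end time of the last match as the already-transcribed position.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceIndex:
    """Hypothetical voice index: a recognized string plus its time span."""
    text: str
    start_ms: int
    end_ms: int


def estimate_transcribed_position(indices, typed_text):
    """Estimate how far into the voice data the typed text reaches.

    Assumed rule: scan the indices in order and search for each recognized
    string in typed_text, starting after the previous match. Strings that
    are not found (recognition errors, or not yet typed) are skipped.
    Returns the end time in ms of the last matched index, or 0 if none.
    """
    cursor = 0
    position_ms = 0
    for index in indices:
        found = typed_text.find(index.text, cursor)
        if found == -1:
            continue
        cursor = found + len(index.text)
        position_ms = index.end_ms
    return position_ms


indices = [
    VoiceIndex("the", 0, 100),
    VoiceIndex("quick", 100, 300),
    VoiceIndex("brown", 300, 500),
    VoiceIndex("fox", 500, 700),
]
# The playback starting position is set from the estimate.
playback_start_ms = estimate_transcribed_position(indices, "the quick")
```

In this sketch the playback starting position is the estimate itself; a real system might rewind slightly before it.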
Abstract:
An information processing apparatus is provided that, for video/audio data to be recorded and stored, can determine division points and control points suitable for viewing and listening and can assign relevant information without requiring a manual operation each time. The information processing apparatus includes a recording medium 90, a video data acquisition unit 48, a video data specification unit 47, an audio data separation unit 25, a key creation unit 31, a key relevant data acquisition unit 55, and a key data management unit 10. When a key is created by specifying a section in first audio data, a name and attribute information based on a nearby division point and control point are stored. When an audio section similar to the audio pattern of the key is detected in second audio data, a division point and a control point are determined, in accordance with the stored attribute information, on the basis of the start and end of the detected section, and the stored name, or a name given in accordance with a naming method, is set for the divided section, the control point, or the whole audio data.
Abstract:
A media data memory stores media data including at least one of speech data and image data both playable in time series. A play control display unit displays a plurality of time figures. Each time figure corresponds to a play time of a part of the media data in time series order. A data selection unit selects at least one time figure from the plurality of time figures through the play control display unit. A play control unit moves a play position to a part of the play time corresponding to the at least one time figure in the media data. A play unit plays the media data from the play position moved by the play control unit.
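As a rough illustration of the relationship between time figures and play positions, the sketch below assumes that the media data is split into equal-length parts and that each time figure is represented by the (start, end) time of its part. The abstract does not state either detail, and the function names are hypothetical.

```python
def build_time_figures(duration_s, count):
    """Return one (start_s, end_s) pair per time figure, in time-series order.

    Assumption: the media data is divided into `count` equal parts.
    """
    step = duration_s / count
    return [(i * step, (i + 1) * step) for i in range(count)]


def seek_to_figure(figures, selected):
    """Return the play position for the selected time figure.

    Assumption: playback resumes at the start of the corresponding part.
    """
    start_s, _end_s = figures[selected]
    return start_s


figures = build_time_figures(60.0, 4)
play_position_s = seek_to_figure(figures, 2)
```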
Abstract:
States included in a deterministic finite automaton are classified into state sets whose states have the same input symbols associated with outgoing transitions and the same finality. An intersection set is calculated between each of the state sets and a set of transition destination states, which is obtained by transitioning from each of the states included in the state sets. This is repeated until the number of states included in the intersection set becomes equal to one, while regarding the set of transition destination states for each input symbol included in the intersection set as new state sets. When the number of states has become equal to one, plural indistinguishable states are merged into one state by tracing the route in the direction opposite to the transition direction.
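Only the initial classification step is sketched here: states are grouped by the set of input symbols on their outgoing transitions together with their finality. The dictionary-based automaton encoding and the function name are assumptions for illustration, and the later intersection and reverse-tracing steps are not implemented.

```python
from collections import defaultdict


def classify_states(transitions, finals):
    """Group DFA states by (outgoing input symbols, finality).

    transitions: dict mapping state -> dict mapping input symbol -> next state
    finals: set of final (accepting) states
    Returns a list of state sets; states in different sets are
    distinguishable from this first check alone.
    """
    groups = defaultdict(set)
    for state, outgoing in transitions.items():
        key = (frozenset(outgoing), state in finals)
        groups[key].add(state)
    return list(groups.values())


# Hypothetical four-state automaton: states 1 and 2 share the symbol set
# {"a"} and are both non-final, so they land in the same state set.
transitions = {
    0: {"a": 1, "b": 2},
    1: {"a": 3},
    2: {"a": 3},
    3: {},
}
state_sets = classify_states(transitions, finals={3})
```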
Abstract:
According to one embodiment, a finite state transducer determinizing device includes a symbol determination unit, a state merging unit, and a single-value processing unit. The symbol determination unit generates an identification symbol different from an input symbol assigned to each transition of a finite state transducer. The state merging unit extracts one or more states at a transition destination by the same input symbol from among the states of the finite state transducer and generates states having the extracted states as sub-states. The single-value processing unit applies the input symbol assigned to each transition of the finite state transducer or the identification symbol as an input symbol of a transition between the states generated by the state merging unit to perform determinizing.
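The state merging step resembles one step of the classical subset construction, which the sketch below shows in isolation. This is a swapped-in textbook technique, not the device's procedure: the tuple-based transition format and the function name are assumptions, and the generation and use of identification symbols are not shown.

```python
def merge_destinations(transitions, sub_states, symbol):
    """Collect every state reachable from sub_states on one input symbol.

    transitions: iterable of (source, input_symbol, output_symbol, destination)
    sub_states: set of states making up the current merged state
    Returns a frozenset that serves as the new merged state, with the
    collected destinations as its sub-states.
    """
    return frozenset(
        dst
        for (src, inp, _out, dst) in transitions
        if src in sub_states and inp == symbol
    )


# Hypothetical transducer: state 0 has two transitions on input "a" with
# different outputs, so it is nondeterministic with respect to input.
fst = [
    (0, "a", "x", 1),
    (0, "a", "y", 2),
    (1, "b", "z", 3),
]
merged = merge_destinations(fst, {0}, "a")
```

Here the two transitions on "a" carry different output symbols, which plain subset construction cannot reconcile on its own; per the abstract, that is where the identification symbol comes in.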
Abstract:
According to one embodiment, a speaker clustering apparatus includes a clustering unit, an extraction unit, and an error detection unit. The clustering unit is configured to extract acoustic features for speakers from an acoustic signal, and to cluster utterances included in the acoustic signal into the speakers by using the acoustic features. The extraction unit is configured to acquire character strings representing contents of the utterances, and to extract linguistic features of the speakers by using the character strings. The error detection unit is configured to decide, when a character string does not fit the linguistic feature of the speaker into which its utterance is clustered, that the utterance has been erroneously clustered by the clustering unit.
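The error detection idea can be sketched under strong simplifying assumptions. Here a speaker's linguistic feature is a hand-supplied set of marker words, whereas the apparatus extracts features from the character strings. The "does not fit" test is also an assumed rule: the utterance contains none of its own speaker's markers but at least one marker of another speaker. All names are hypothetical.

```python
def detect_clustering_errors(utterances, speaker_markers):
    """Return indices of utterances that look erroneously clustered.

    utterances: list of (speaker_id, text) pairs from acoustic clustering
    speaker_markers: dict mapping speaker_id -> set of marker words
                     (assumed stand-in for extracted linguistic features)
    """
    suspects = []
    for i, (speaker, text) in enumerate(utterances):
        words = set(text.split())
        fits_own = bool(words & speaker_markers.get(speaker, set()))
        fits_other = any(
            words & markers
            for other, markers in speaker_markers.items()
            if other != speaker
        )
        if not fits_own and fits_other:
            suspects.append(i)
    return suspects


utterances = [
    ("A", "I think so"),
    ("B", "we agree"),
    ("A", "we disagree"),  # uses speaker B's marker word
]
markers = {"A": {"I"}, "B": {"we"}}
suspect_indices = detect_clustering_errors(utterances, markers)
```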