-
公开(公告)号:US20210158836A1
公开(公告)日:2021-05-27
申请号:US16633161
申请日:2018-04-24
Applicant: Sony Corporation
Inventor: Hiro IWASE , Shinichi KAWANO , Mari SAITO , Yuhei TAKI
IPC: G10L25/54 , G10L21/028 , G10L15/22 , G10L25/84
Abstract: Provided is an information processing device including an output control unit that controls presentation of content to a user, and when a non-viewing/listening period is detected in a viewing and listening behavior of the user corresponding to the content, causes a summary of the content to be output. The output control unit determines an amount of information in the summary of the content, based on the length of the non-viewing/listening period. Moreover, provided is an information processing method including: by a processor, controlling presentation of content to a user; and when a non-viewing/listening period is detected in a viewing and listening behavior of the user corresponding to the content, causing a summary of the content to be output. The causing the summary of the content to be output further includes determining an amount of information in the summary of the content, based on the length of the non-viewing/listening period.
-
公开(公告)号:US20210064640A1
公开(公告)日:2021-03-04
申请号:US16961273
申请日:2018-11-05
Applicant: Sony Corporation
Inventor: Yuhei TAKI , Hiro IWASE , Shinichi KAWANO , Kunihito SAWAI
IPC: G06F16/28 , G06F16/22 , G06F16/2457
Abstract: To enable more appropriate assistance for input to an information processing apparatus.
Provided is an information processing apparatus including: an acquisition unit that acquires text information in the middle of performance of input; and an input-candidate extraction unit that extracts a candidate for the input on the basis of attribute information that is extracted on the basis of the text information.-
公开(公告)号:US20200051545A1
公开(公告)日:2020-02-13
申请号:US16478602
申请日:2018-02-27
Applicant: SONY CORPORATION
Inventor: Hiro IWASE , Mari SAITO , Shinichi KAWANO
IPC: G10L13/047 , G10L15/25 , G10L15/22 , G10L25/84 , G10L15/16
Abstract: The present technology relates to a learning device, a learning method, a voice synthesis device, and a voice synthesis method configured so that information can be provided via voice allowing easy understanding of contents by a user as a speech destination. A learning device according to one embodiment of the present technology performs voice recognition of speech voice of a plurality of users, estimates statuses when a speech is made, and learns, on the basis of speech voice data, a voice recognition result, and the statuses when the speech is made, voice synthesis data to be used for generation of synthesized voice according to statuses upon voice synthesis. Moreover, a voice synthesis device estimates statuses, and uses the voice synthesis data to generate synthesized voice indicating the contents of predetermined text data and obtained according to the estimated statuses. The present technology can be applied to an agent device.
-
4.
公开(公告)号:US20210398517A1
公开(公告)日:2021-12-23
申请号:US17283957
申请日:2019-10-09
Applicant: SONY CORPORATION
Inventor: Hiro IWASE , Mari SAITO
IPC: G10L13/027 , G10L15/22 , G10L15/183
Abstract: A response generating apparatus (10) includes an acquiring unit (40) that acquires input information that is a trigger for generating a response with respect to a user and context information that is information indicating a situation of the user and a response generating unit (50) that generates, based on the context information acquired from the user, a response associated with the input information.
-
公开(公告)号:US20210134278A1
公开(公告)日:2021-05-06
申请号:US16472544
申请日:2018-11-01
Applicant: SONY CORPORATION
Inventor: Hiro IWASE , Shinichi KAWANO , Yuhei TAKI , Kunihito SAWAI
Abstract: There is provided an information processing device and an information processing method that enable speeding up of a responsivity of a system response to a speech of a user. The information processing device includes a processing unit configured to determine, on the basis of a result of semantic analysis that is to be obtained from an interim result of speech recognition of a speech of a user, presence or absence of a response to the speech of the user. It thereby becomes possible to speed up a responsivity of a system response to the speech of the user. The present technology can be applied to a speech dialogue system, for example.
-
公开(公告)号:US20210110814A1
公开(公告)日:2021-04-15
申请号:US16464494
申请日:2018-10-19
Applicant: SONY CORPORATION
Inventor: Hiro IWASE , Shinichi KAWANO , Yuhei TAKI , Kunihito SAWAI
Abstract: There is provided an information processing device and an information processing method that enable the intention of a speech of a user to be estimated more accurately. The information processing device includes: a detection unit configured to detect a breakpoint of a speech of a user on the basis of a result of recognition that is to be obtained during the speech of the user; and an estimation unit configured to estimate an intention of the speech of the user on the basis of a result of semantic analysis of a divided speech sentence obtained by dividing a speech sentence at the detected breakpoint of the speech. The present technology can be applied, for example, to a speech dialogue system.
-
7.
公开(公告)号:US20200272407A1
公开(公告)日:2020-08-27
申请号:US16647018
申请日:2018-06-19
Applicant: Sony Corporation
Inventor: Mari SAITO , Hiro IWASE , Shinichi KAWANO , Yuhei TAKI
Abstract: [Problem] The problem of the present disclosure relates to proposing an information processing device, an information processing terminal, an information processing method, and a program, which are capable of controlling the output of a voice so as to be adaptive to an action purpose of a user.[Solution] An information processing device including: an inference unit that infers an action purpose of a user on the basis of a result of sensing by one or more sensors; and an output control unit that controls, on the basis of a result of inference by the inference unit, output of a voice to the user performed by an audio output unit.
-
公开(公告)号:US20200051586A1
公开(公告)日:2020-02-13
申请号:US16485789
申请日:2018-04-12
Applicant: Sony Corporation
Inventor: Mari SAITO , Hiro IWASE
Abstract: A sound state estimating unit detects surrounding sound at a timing at which a notification to a destination user occurs. A user state estimating unit detects a position of the destination user and positions of users other than the destination user at the timing at which the notification occurs. An output control unit controls output of the notification to the destination user at a timing at which it is determined that the surrounding sound detected by the sound state estimating unit is masking possible sound which can be used for masking in a case where the position of the destination user detected by the user state estimating unit is within a predetermined area.
-
公开(公告)号:US20210035554A1
公开(公告)日:2021-02-04
申请号:US16959680
申请日:2018-10-26
Applicant: Sony Corporation
Inventor: Hiro IWASE , Shinichi KAWANO , Yuhei TAKI , Kunihito SAWAI , Masaki TAKASE , Akira MIYASHITA
Abstract: An apparatus and method are capable of controlling the output of the system utterance upon the occurrence of barge-in utterance and enabling a smooth interactive between a user and the system. Fade processing is applied to lower at least one of volume, a speech rate, or a pitch (voice pitch) of system utterance from a starting time of the barge-in utterance acting as the user interruption utterance during executing the system utterance. Even after the completion of the fade processing, the output state upon completing the fade processing is maintained. In a case where the system utterance level is equal to or less than the predefined threshold during the fade processing, the system utterance is displayed on a display unit. One of stop, continuation, and rephrasing is executed based on an intention of the barge-in utterance and whether an important word is included in in the system utterance.
-
公开(公告)号:US20200243074A1
公开(公告)日:2020-07-30
申请号:US16637763
申请日:2018-08-03
Applicant: Sony Corporation
Inventor: Yuhei TAKI , Shinichi KAWANO , Hiro IWASE
Abstract: The present technology relates to an information processor, an information processing method, and a program that allow a user to obtain a speech recognition result that the user expects. A search unit retrieves a second word that is a candidate for replacement of a first word with a predetermined attribute. The predetermined attribute is identified by a semantic analysis in a text including character strings obtained by speech recognition. The present technology is applicable to an agent apparatus of a user interaction type, for example.
-
-
-
-
-
-
-
-
-