Patent search ap:("Apple Inc.") AND inv:"Ladan Golipour" Page 1

1.

发明授权
Text normalization based on a data-driven learning network 有权

公开(公告)号：US10395654B2

公开(公告)日：2019-08-27

申请号：US15673574

申请日：2017-08-10

Applicant: Apple Inc.

Inventor： Ladan Golipour , Matthias Neeracher , Ramya Rasipuram

IPC: G10L15/00 , G10L15/22 , G10L15/18 , G10L15/16 , G10L15/26 , G10L15/30 , G10L13/08

Abstract: Systems and processes for operating an intelligent automated assistant to perform text-to-speech conversion are provided. An example method includes, at an electronic device having one or more processors, receiving a text corpus comprising unstructured natural language text. The method further includes generating a sequence of normalized text based on the received text corpus; and generating a pronunciation sequence representing the sequence of the normalized text. The method further includes causing an audio output to be provided to the user based on the pronunciation sequence. At least one of the sequence of normalized text and the pronunciation sequence is generated based on a data-driven learning network.

2.

发明授权
Unit-selection text-to-speech synthesis based on predicted concatenation parameters 有权

公开(公告)号：US09934775B2

公开(公告)日：2018-04-03

申请号：US15266930

申请日：2016-09-15

Applicant: Apple Inc.

Inventor： Tuomo J. Raitio , Kishore Sunkeswari Prahallad , Alistair D. Conkie , Ladan Golipour , David A. Winarsky

IPC: G10L13/10 , G10L13/033 , G10L13/06

CPC classification number: G10L13/10 , G10L13/0335 , G10L13/06 , G10L13/07

Abstract: Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units are selected. Predicted statistical parameters of acoustic features associated with the sequence of target units are determined. The predicted statistical parameters of acoustic features are used to determine target costs and concatenation costs associated with the plurality of candidate speech segments. Based on a combined cost determined from the target costs and concatenation costs, a subset of candidate speech segments is selected from the plurality of candidate speech segments. Speech corresponding to the received text is generated using the subset of candidate speech segments.

Patent Agency Ranking