-
Publication No.: US10446143B2
Publication date: 2019-10-15
Application No.: US15268360
Filing date: 2016-09-16
Applicant: Apple Inc.
Inventors: Murat Akbacak, Bryan Hansen, Gunnar Evermann
IPC classification: G10L15/26, G10L15/22, G06F17/24, G06F17/27, G06F21/00, G10L15/08, G10L15/24, G10L15/30
Abstract: Systems and processes for identifying a voice input that provides one or more user credentials are provided. In one example process, a voice input can be received. A first character, a phrase identifying a second character, and a word can be identified based on the voice input. In response to the identification, the first character, the second character, and the word can be converted to text. The text can then be shown on a display in a sequence corresponding to the order of the first character, the second character, and the word in the voice input.
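The conversion step described in this abstract can be sketched roughly as follows. This is an illustration only, not the patented method: it assumes the voice input has already been recognized into lowercase tokens, and the phrase table `PHRASE_TO_CHAR` and the function `transcribe_credential` are hypothetical names introduced here.

```python
# Hypothetical table of spoken phrases that each identify one character.
PHRASE_TO_CHAR = {
    ("dollar", "sign"): "$",
    ("at", "sign"): "@",
    ("capital", "a"): "A",
    ("underscore",): "_",
}


def transcribe_credential(tokens):
    """Convert recognized tokens to text, preserving their spoken order.

    A token sequence found in PHRASE_TO_CHAR becomes the character it names;
    any other token (a single character or a whole word) is kept literally.
    """
    out = []
    i = 0
    while i < len(tokens):
        # Try the longer phrase first so "dollar sign" wins over "dollar".
        for length in (2, 1):
            phrase = tuple(tokens[i:i + length])
            if len(phrase) == length and phrase in PHRASE_TO_CHAR:
                out.append(PHRASE_TO_CHAR[phrase])
                i += length
                break
        else:
            # No phrase matched: keep the token as spoken.
            out.append(tokens[i])
            i += 1
    return "".join(out)


if __name__ == "__main__":
    # A first character, a phrase naming a second character, then a word.
    print(transcribe_credential(["j", "dollar", "sign", "smith"]))
```

Here `"j"` plays the role of the first character, `"dollar sign"` is the phrase identifying the second character, and `"smith"` is the word; the output keeps them in the order they were spoken.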
-
Publication No.: US11727219B2
Publication date: 2023-08-15
Application No.: US16984714
Filing date: 2020-08-04
Applicant: Apple Inc.
Inventor: Gunnar Evermann
CPC classification: G06F40/35, G10L15/1822, G10L15/26
Abstract: A text string with a first and a second portion is provided. A domain of the text string is determined by applying a first word-matching process to the first portion of the text string. It is then determined whether the second portion of the text string matches a word of a set of words associated with the domain by applying a second word-matching process to the second portion of the text string. Upon determining that the second portion of the text string matches the word of the set of words, a user intent is determined from the text string based at least in part on the domain and the word of the set of words.
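The two-stage matching in this abstract can be illustrated with a minimal sketch. The abstract does not say which word-matching processes are used, so the choices here are assumptions: an exact lookup for the first portion and a fuzzy match (Python's `difflib.get_close_matches`) for the second. The tables `DOMAIN_TRIGGERS` and `DOMAIN_WORDS` and the function `infer_intent` are hypothetical.

```python
import difflib

# Hypothetical data: trigger words that select a domain, and the
# set of words associated with each domain.
DOMAIN_TRIGGERS = {"call": "phone", "play": "music"}
DOMAIN_WORDS = {
    "phone": ["alice", "bob", "charlotte"],
    "music": ["thriller", "yesterday"],
}


def infer_intent(text, cutoff=0.75):
    """Return {'domain', 'argument'} or None if either stage fails."""
    first, _, second = text.lower().partition(" ")

    # First word-matching process (assumed): exact lookup on the first portion.
    domain = DOMAIN_TRIGGERS.get(first)
    if domain is None or not second:
        return None

    # Second word-matching process (assumed): fuzzy match of the second
    # portion against the words associated with the chosen domain.
    matches = difflib.get_close_matches(
        second, DOMAIN_WORDS[domain], n=1, cutoff=cutoff
    )
    if not matches:
        return None
    return {"domain": domain, "argument": matches[0]}


if __name__ == "__main__":
    print(infer_intent("call charlote"))
    print(infer_intent("play charlote"))
```

Restricting the fuzzy match to the domain's own word set is what lets a misrecognized name resolve in the phone domain while the same string is rejected in the music domain.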
-
Publication No.: US10403278B2
Publication date: 2019-09-03
Application No.: US15703013
Filing date: 2017-09-13
Applicant: Apple Inc.
Inventors: Adrian Skilling, Melvyn J. Hunt, Gunnar Evermann
IPC classification: G10L15/22, G10L15/187, G06F3/16, G10L15/10, G10L15/26, G06F16/432, G10L15/02, G10L15/08
Abstract: Systems and processes for operating an intelligent automated assistant to provide media items based on phonetic matching techniques are provided. An example method includes receiving a speech input from a user and determining whether the speech input includes a user request for a media item. The method further includes, in accordance with a determination that the speech input includes a user request for obtaining a media item, determining a candidate media item from a plurality of media items. The method further includes determining, based on a difference between a phonetic representation of the candidate media item and a phonetic representation of the speech input, whether the candidate media item is to be provided to the user. The method further includes, in accordance with a determination that the candidate media item is to be provided to the user, providing the candidate media item to the user.
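One plausible way to quantify the "difference between phonetic representations" mentioned in this abstract is an edit distance over phoneme sequences. The sketch below makes that assumption; the abstract does not name a specific measure. The phoneme symbols, the normalization, and the threshold `max_ratio` are all hypothetical.

```python
def phoneme_distance(a, b):
    """Levenshtein distance between two phoneme sequences (lists of symbols)."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, 1):
        cur = [i]
        for j, pb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete a phoneme from a
                cur[j - 1] + 1,            # insert a phoneme into a
                prev[j - 1] + (pa != pb),  # substitute, or match at no cost
            ))
        prev = cur
    return prev[-1]


def should_provide(candidate, spoken, max_ratio=0.4):
    """Accept the candidate when the length-normalized distance is small."""
    longest = max(len(candidate), len(spoken), 1)
    return phoneme_distance(candidate, spoken) / longest <= max_ratio


if __name__ == "__main__":
    title = ["K", "AE", "T"]    # phonemes of a candidate media item title
    heard = ["K", "AH", "T"]    # phonemes recognized from the speech input
    print(phoneme_distance(title, heard), should_provide(title, heard))
```

Normalizing by the longer sequence keeps the threshold comparable across short and long titles; a production system would more likely weight substitutions by acoustic similarity.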
-
Publication No.: US09966060B2
Publication date: 2018-05-08
Application No.: US15445863
Filing date: 2017-02-28
Applicant: Apple Inc.
Inventors: Devang K. Naik, Thomas R. Gruber, Liam Weiner, Justin G. Binder, Charles Srisuwananukorn, Gunnar Evermann, Shaun Eric Williams, Hong Chen, Lia T. Napolitano
CPC classification: G10L13/027, G10L13/04, G10L13/08, G10L15/063, G10L15/22, G10L15/26, G10L15/265, G10L2015/0631, G10L2015/0638
Abstract: The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A first speech input including at least one word is received. A first phonetic representation of the at least one word is determined, the first phonetic representation comprising a first set of phonemes selected from a speech recognition phonetic alphabet. The first set of phonemes is mapped to a second set of phonemes to generate a second phonetic representation, where the second set of phonemes is selected from a speech synthesis phonetic alphabet. The second phonetic representation is stored in association with a text string corresponding to the at least one word.
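The mapping and storage steps in this abstract can be pictured with a small sketch. Both phonetic alphabets below are invented for illustration, as are the names `ASR_TO_TTS`, `map_pronunciation`, and `store_pronunciation`; the abstract does not specify the alphabets or the mapping rules.

```python
# Hypothetical mapping from recognition-alphabet symbols to
# synthesis-alphabet symbols. One recognition phoneme may expand
# to several synthesis phonemes, so every value is a tuple.
ASR_TO_TTS = {
    "k": ("K",),
    "ae": ("AE1",),
    "t": ("T",),
    "ay": ("AA1", "IY0"),
}


def map_pronunciation(asr_phonemes):
    """Map a recognition-alphabet sequence to the synthesis alphabet."""
    tts = []
    for p in asr_phonemes:
        if p not in ASR_TO_TTS:
            raise KeyError(f"no synthesis mapping for phoneme {p!r}")
        tts.extend(ASR_TO_TTS[p])
    return tts


def store_pronunciation(lexicon, text, asr_phonemes):
    """Store the synthesis-alphabet pronunciation under the text string."""
    lexicon[text] = map_pronunciation(asr_phonemes)
    return lexicon[text]


if __name__ == "__main__":
    lexicon = {}
    store_pronunciation(lexicon, "kite", ["k", "ay", "t"])
    print(lexicon)
```

Keying the stored pronunciation by the text string is what would let a synthesizer later look up how the user said the word.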
-
Publication No.: US10431204B2
Publication date: 2019-10-01
Application No.: US15803584
Filing date: 2017-11-03
Applicant: Apple Inc.
IPC classification: G06F17/27, G10L15/06, G10L15/02, G10L25/33, G10L15/197, G06F16/9535, G10L15/18, G10L15/183
Abstract: Systems and processes are disclosed for discovering trending terms in automatic speech recognition. Candidate terms (e.g., words, phrases, etc.) not yet found in a speech recognizer vocabulary or having low language model probability can be identified based on trending usage in a variety of electronic data sources (e.g., social network feeds, news sources, search queries, etc.). When candidate terms are identified, archives of live or recent speech traffic can be searched to determine whether users are uttering the candidate terms in dictation or speech requests. Such searching can be done using open vocabulary spoken term detection to find phonetic matches in the audio archives. As the candidate terms are found in the speech traffic, notifications can be generated that identify the candidate terms, provide relevant usage statistics, identify the context in which the terms are used, and the like.
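The first stage of this abstract, identifying out-of-vocabulary terms with trending usage, might look like the sketch below. It covers candidate discovery only; the spoken term detection over audio archives is not modeled. The growth measure (recent count over smoothed baseline count), the thresholds, and the function name `trending_candidates` are assumptions made for illustration.

```python
from collections import Counter


def trending_candidates(recent_docs, baseline_docs, vocabulary,
                        min_count=3, min_growth=2.0):
    """Return (term, recent_count, growth) for trending terms not in vocabulary."""
    recent = Counter(w for d in recent_docs for w in d.lower().split())
    baseline = Counter(w for d in baseline_docs for w in d.lower().split())

    out = []
    for term, count in recent.items():
        # Skip terms the recognizer already knows and terms seen too rarely.
        if term in vocabulary or count < min_count:
            continue
        # Add-one smoothing so unseen baseline terms do not divide by zero.
        growth = count / (baseline[term] + 1)
        if growth >= min_growth:
            out.append((term, count, growth))

    # Strongest growth first; ties broken alphabetically.
    return sorted(out, key=lambda item: (-item[2], item[0]))


if __name__ == "__main__":
    recent = ["zorbit launch", "zorbit news", "zorbit price news"]
    baseline = ["news today"]
    vocab = {"news", "launch", "price", "today"}
    print(trending_candidates(recent, baseline, vocab))
```

The returned counts and growth values correspond loosely to the "usage statistics" a notification could carry once a term is also confirmed in speech traffic.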
-
Publication No.: US20180336197A1
Publication date: 2018-11-22
Application No.: US15703013
Filing date: 2017-09-13
Applicant: Apple Inc.
Inventors: Adrian Skilling, Melvyn J. Hunt, Gunnar Evermann
CPC classification: G10L15/22, G06F3/167, G06F16/433, G10L15/02, G10L15/10, G10L15/187, G10L15/265, G10L2015/025, G10L2015/088
Abstract: Systems and processes for operating an intelligent automated assistant to provide media items based on phonetic matching techniques are provided. An example method includes receiving a speech input from a user and determining whether the speech input includes a user request for a media item. The method further includes, in accordance with a determination that the speech input includes a user request for obtaining a media item, determining a candidate media item from a plurality of media items. The method further includes determining, based on a difference between a phonetic representation of the candidate media item and a phonetic representation of the speech input, whether the candidate media item is to be provided to the user. The method further includes, in accordance with a determination that the candidate media item is to be provided to the user, providing the candidate media item to the user.
-
Publication No.: US09620104B2
Publication date: 2017-04-11
Application No.: US14298690
Filing date: 2014-06-06
Applicant: Apple Inc.
Inventors: Devang K. Naik, Thomas R. Gruber, Liam Weiner, Justin G. Binder, Charles Srisuwananukorn, Gunnar Evermann, Shaun Eric Williams, Hong Chen, Lia T. Napolitano
CPC classification: G10L13/027, G10L13/04, G10L13/08, G10L15/063, G10L15/22, G10L15/26, G10L15/265, G10L2015/0631, G10L2015/0638
Abstract: The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A first speech input including at least one word is received. A first phonetic representation of the at least one word is determined, the first phonetic representation comprising a first set of phonemes selected from a speech recognition phonetic alphabet. The first set of phonemes is mapped to a second set of phonemes to generate a second phonetic representation, where the second set of phonemes is selected from a speech synthesis phonetic alphabet. The second phonetic representation is stored in association with a text string corresponding to the at least one word.
-
Publication No.: US10769385B2
Publication date: 2020-09-08
Application No.: US16204467
Filing date: 2018-11-29
Applicant: Apple Inc.
Inventor: Gunnar Evermann
Abstract: A text string with a first and a second portion is provided. A domain of the text string is determined by applying a first word-matching process to the first portion of the text string. It is then determined whether the second portion of the text string matches a word of a set of words associated with the domain by applying a second word-matching process to the second portion of the text string. Upon determining that the second portion of the text string matches the word of the set of words, a user intent is determined from the text string based at least in part on the domain and the word of the set of words.
-
Publication No.: US10176167B2
Publication date: 2019-01-08
Application No.: US14298725
Filing date: 2014-06-06
Applicant: Apple Inc.
Inventor: Gunnar Evermann
Abstract: A text string with a first and a second portion is provided. A domain of the text string is determined by applying a first word-matching process to the first portion of the text string. It is then determined whether the second portion of the text string matches a word of a set of words associated with the domain by applying a second word-matching process to the second portion of the text string. Upon determining that the second portion of the text string matches the word of the set of words, a user intent is determined from the text string based at least in part on the domain and the word of the set of words.
-
Publication No.: US09818400B2
Publication date: 2017-11-14
Application No.: US14839835
Filing date: 2015-08-28
Applicant: Apple Inc.
CPC classification: G10L15/063, G06F17/30867, G10L15/02, G10L15/1815, G10L15/183, G10L15/197, G10L25/33
Abstract: Systems and processes are disclosed for discovering trending terms in automatic speech recognition. Candidate terms (e.g., words, phrases, etc.) not yet found in a speech recognizer vocabulary or having low language model probability can be identified based on trending usage in a variety of electronic data sources (e.g., social network feeds, news sources, search queries, etc.). When candidate terms are identified, archives of live or recent speech traffic can be searched to determine whether users are uttering the candidate terms in dictation or speech requests. Such searching can be done using open vocabulary spoken term detection to find phonetic matches in the audio archives. As the candidate terms are found in the speech traffic, notifications can be generated that identify the candidate terms, provide relevant usage statistics, identify the context in which the terms are used, and the like.