-
公开(公告)号:US09489371B2
公开(公告)日:2016-11-08
申请号:US13941195
申请日:2013-07-12
Applicant: Apple Inc.
Inventor: Olivier Bonnet , Frederick de Jaeger , Romain Goyet , Jean-Pierre Ciudad
CPC classification number: G06F17/2705 , G06F17/2715 , G06K9/723 , G06K9/726 , G06K2209/01
Abstract: A method for detecting data in a sequence of characters or text using both a statistical engine and a pattern engine. The statistical engine is trained to recognize certain types of data and the pattern engine is programmed to recognize the grammatical pattern of certain types of data. The statistical engine may scan the sequence of characters to output first data, and the pattern engine may break down the first data into subsets of data. Alternatively, the statistical engine may output items that have a predetermined probability or greater of being a certain type of data and the pattern engine may then detect the data from the output items and/or remove incorrect information from the output items.
Abstract translation: 使用统计引擎和模式引擎来检测字符或文本序列中的数据的方法。 训练统计引擎来识别某些类型的数据,并且模式引擎被编程以识别某些类型的数据的语法模式。 统计引擎可以扫描字符序列以输出第一数据,并且模式引擎可以将第一数据分解成数据子集。 或者,统计引擎可以输出具有作为特定类型的数据的预定概率或更大的项目,并且模式引擎然后可以从输出项检测数据和/或从输出项中移除不正确的信息。
-
公开(公告)号:US20140025370A1
公开(公告)日:2014-01-23
申请号:US13941195
申请日:2013-07-12
Applicant: Apple Inc.
Inventor: Olivier Bonnet , Frederick de Jaeger , Romain Goyet , Jean-Pierre Ciudad
IPC: G06F17/27
CPC classification number: G06F17/2705 , G06F17/2715 , G06K9/723 , G06K9/726 , G06K2209/01
Abstract: A method for detecting data in a sequence of characters or text using both a statistical engine and a pattern engine. The statistical engine is trained to recognize certain types of data and the pattern engine is programmed to recognize the grammatical pattern of certain types of data. The statistical engine may scan the sequence of characters to output first data, and the pattern engine may break down the first data into subsets of data. Alternatively, the statistical engine may output items that have a predetermined probability or greater of being a certain type of data and the pattern engine may then detect the data from the output items and/or remove incorrect information from the output items.
Abstract translation: 使用统计引擎和模式引擎来检测字符或文本序列中的数据的方法。 训练统计引擎来识别某些类型的数据,并且模式引擎被编程以识别某些类型的数据的语法模式。 统计引擎可以扫描字符序列以输出第一数据,并且模式引擎可以将第一数据分解成数据子集。 或者,统计引擎可以输出具有作为特定类型的数据的预定概率或更大的项目,并且模式引擎然后可以从输出项检测数据和/或从输出项中移除不正确的信息。
-
公开(公告)号:US09275169B2
公开(公告)日:2016-03-01
申请号:US13670052
申请日:2012-11-06
Applicant: Apple Inc.
Inventor: Olivier Bonnet , Frederic de Jaeger , Romain Goyet
CPC classification number: G06F17/30961 , G06F17/27 , G06F17/2705 , G06F17/271
Abstract: An apparatus for processing a sequence of tokens to detect predetermined data, wherein each said token has a token type, and the predetermined data has a structure that comprises a predetermined sequence of token types, including at least one optional token type. The apparatus comprises a processor arranged to: provide a tree for detecting the predetermined data, the tree comprising a plurality of states, each said state being linked with at least one other state by a respective condition, the arrangement of linked states forming a plurality of paths; and compare the token types of the sequence of tokens to respective conditions in the tree to match the sequence of tokens to one or more paths in the tree, wherein the predetermined data can be detected without using an epsilon reduction to take account of said at least one optional token type.
Abstract translation: 一种用于处理令牌序列以检测预定数据的装置,其中每个所述令牌具有令牌类型,并且所述预定数据具有包括至少一个可选令牌类型的令牌类型的预定序列的结构。 该装置包括:处理器,被配置为:提供用于检测预定数据的树,该树包括多个状态,每个所述状态通过相应条件与至少一个其他状态链接,形成多个状态的链接状态的布置 路径 并将所述令牌序列的令牌类型与所述树中的相应条件进行比较,以将所述令牌序列与所述树中的一个或多个路径匹配,其中,可以在不使用ε减少的情况下检测所述预定数据以考虑所述至少 一种可选的令牌类型。
-
公开(公告)号:US20140372103A1
公开(公告)日:2014-12-18
申请号:US14286838
申请日:2014-05-23
Applicant: Apple Inc.
Inventor: Olivier Bonnet , Frederic de Jaeger , Romain Goyet
CPC classification number: G06F17/2705 , G06F8/427 , G06F17/271 , G06F17/277 , G06F17/2775 , G06F17/28 , G06F17/30595
Abstract: A method of processing a sequence of characters, the method comprising converting the sequence of characters into a sequence of tokens so that each token comprises a lexeme and one of a plurality of token types. Each of the plurality of token types relates to at least one of a plurality of predetermined functions, wherein at least one said token type relates to multiple functions of the plurality of predetermined functions.
Abstract translation: 一种处理字符序列的方法,所述方法包括将字符序列转换成令牌序列,使得每个令牌包括词法和多个令牌类型之一。 多个令牌类型中的每一个与多个预定功能中的至少一个相关,其中至少一个所述令牌类型涉及多个预定功能的多个功能。
-
公开(公告)号:US09454522B2
公开(公告)日:2016-09-27
申请号:US14286838
申请日:2014-05-23
Applicant: Apple Inc.
Inventor: Olivier Bonnet , Frederic de Jaeger , Romain Goyet
CPC classification number: G06F17/2705 , G06F8/427 , G06F17/271 , G06F17/277 , G06F17/2775 , G06F17/28 , G06F17/30595
Abstract: A method of processing a sequence of characters, the method comprising converting the sequence of characters into a sequence of tokens so that each token comprises a lexeme and one of a plurality of token types. Each of the plurality of token types relates to at least one of a plurality of predetermined functions, wherein at least one said token type relates to multiple functions of the plurality of predetermined functions.
Abstract translation: 一种处理字符序列的方法,所述方法包括将字符序列转换成令牌序列,使得每个令牌包括词法和多个令牌类型之一。 多个令牌类型中的每一个与多个预定功能中的至少一个相关,其中至少一个所述令牌类型涉及多个预定功能的多个功能。
-
公开(公告)号:US20130091174A1
公开(公告)日:2013-04-11
申请号:US13670052
申请日:2012-11-06
Applicant: Apple Inc.
Inventor: Olivier Bonnet , Frederic de Jaeger , Romain Goyet
IPC: G06F17/30
CPC classification number: G06F17/30961 , G06F17/27 , G06F17/2705 , G06F17/271
Abstract: An apparatus for processing a sequence of tokens to detect predetermined data, wherein each said token has a token type, and the predetermined data has a structure that comprises a predetermined sequence of token types, including at least one optional token type. The apparatus comprises a processor arranged to: provide a tree for detecting the predetermined data, the tree comprising a plurality of states, each said state being linked with at least one other state by a respective condition, the arrangement of linked states forming a plurality of paths; and compare the token types of the sequence of tokens to respective conditions in the tree to match the sequence of tokens to one or more paths in the tree, wherein the predetermined data can be detected without using an epsilon reduction to take account of said at least one optional token type.
-
-
-
-
-