-
公开(公告)号:US08832549B2
公开(公告)日:2014-09-09
申请号:US12479850
申请日:2009-06-07
Applicant: Philip Andrew Mansfield , Michael Robert Levy
Inventor: Philip Andrew Mansfield , Michael Robert Levy
CPC classification number: G06F17/2294 , G06F17/21 , G06F17/211 , G06F17/212 , G06F17/218 , G06F17/2217 , G06F17/2247 , G06F17/243 , G06F17/248 , G06F17/2705 , G06F17/28 , G06F17/30011 , G06K9/00456 , G06K9/00463
Abstract: Some embodiments provide a for analyzing a document that includes a number of primitive elements. The method identifies boundaries between sets of primitive elements and identifies regions bounded by the boundaries. The method uses the identified regions to define structural elements for the document. The method defines a structured document based on the primitive elements and the structural elements.
Abstract translation: 一些实施例提供用于分析包括多个基元的文档。 该方法识别原始元素组之间的边界,并识别由边界界定的区域。 该方法使用识别的区域来定义文档的结构元素。 该方法基于原始元素和结构元素定义结构化文档。
-
2.
公开(公告)号:US08549399B2
公开(公告)日:2013-10-01
申请号:US13109918
申请日:2011-05-17
Applicant: Philip Andrew Mansfield , Michael Robert Levy , Derek B. Clegg
Inventor: Philip Andrew Mansfield , Michael Robert Levy , Derek B. Clegg
IPC: G06F17/27
CPC classification number: G06F17/2229 , G06F17/212 , G06F17/2241
Abstract: For a document with content that has been structured into a set primitive areas, a novel method for performing contiguous selection of document content across different primitive areas in the document is disclosed. The method defines a contiguous section in the ordered list by identifying the first and last primitive elements of the contiguous selection. The first primitive element is identified as the primitive element that is closest in reading flow to a start selection point on the page, while the last primitive element is identified as the primitive element that is closest in reading flow to an end selection point on the page.
Abstract translation: 对于具有已经被构造成集合原始区域的内容的文档,公开了一种用于在文档中的不同原始区域执行连续选择文档内容的新颖方法。 该方法通过识别连续选择的第一个和最后一个原始元素来定义有序列表中的连续部分。 第一个原始元素被识别为在页面上的开始选择点读取流中最接近的原始元素,而最后一个元素被识别为在页面中的最终选择点的读取流中最接近的元素元素 。
-
公开(公告)号:US20120185491A1
公开(公告)日:2012-07-19
申请号:US13106806
申请日:2011-05-12
Applicant: Philip Andrew Mansfield , Michael Robert Levy
Inventor: Philip Andrew Mansfield , Michael Robert Levy
IPC: G06F17/30
CPC classification number: G06F17/2241 , G06F17/2745 , G06F17/30
Abstract: Some embodiments provide a method for analyzing a document that includes several primitive elements. The method identifies that a set of primitive elements include an implicit list in the document based on location and appearance of the set of primitive elements. The method defines the identified implicit list as an explicit list. The method stores the explicit list as a structure associated with the document.
Abstract translation: 一些实施例提供了一种用于分析包含若干基元的文档的方法。 该方法基于原始元素集合的位置和外观来识别一组原始元素包括文档中的隐式列表。 该方法将识别的隐式列表定义为显式列表。 该方法将显式列表存储为与文档相关联的结构。
-
公开(公告)号:US20120182317A1
公开(公告)日:2012-07-19
申请号:US13106803
申请日:2011-05-12
Applicant: Philip Andrew Mansfield , Michael Robert Levy
Inventor: Philip Andrew Mansfield , Michael Robert Levy
IPC: G09G5/00
CPC classification number: G06T3/00 , G06T3/0006
Abstract: Some embodiments provide a method that defines a group of associated graphic objects for display on a display device. The method defines a set of operations to perform on the associated graphic objects in a particular order. The operations include one or more transforms applied to at least one of the graphic objects. For each particular transform applied to a set of the graphic objects, each graphic object in the set has a set of parameters indicating whether the graphic object is affected by each of a set of primitive transforms of the particular transform. The method stores the set of associated graphic objects and set of operations as a single graphic object.
Abstract translation: 一些实施例提供了定义用于在显示设备上显示的一组相关联的图形对象的方法。 该方法定义了以特定顺序对关联的图形对象执行的一组操作。 操作包括应用于至少一个图形对象的一个或多个变换。 对于应用于一组图形对象的每个特定变换,集合中的每个图形对象具有指示图形对象是否受特定变换的一组原始变换中的每一个影响的一组参数。 该方法将一组关联的图形对象和一组操作存储为单个图形对象。
-
公开(公告)号:US20100174978A1
公开(公告)日:2010-07-08
申请号:US12479847
申请日:2009-06-07
Applicant: Philip Andrew Mansfield , Michael Robert Levy
Inventor: Philip Andrew Mansfield , Michael Robert Levy
IPC: G06F17/00
CPC classification number: G06F17/2294 , G06F17/21 , G06F17/211 , G06F17/212 , G06F17/218 , G06F17/2217 , G06F17/2247 , G06F17/243 , G06F17/248 , G06F17/2705 , G06F17/28 , G06F17/30011 , G06K9/00456 , G06K9/00463
Abstract: Some embodiments provide a method for analyzing an unstructured document that includes a number of words. Each word is an associated set of glyphs and each glyph has location coordinates. The method identifies clusters of words based on the location coordinates. Based on the identified clusters, the method defines a set of boundary elements for the glyphs that identify a set of borders for the glyphs. The method defines a structured document for the unstructured document based on the glyphs and the defined boundary elements. To identify clusters of words, the method orders the location coordinates and identifies several partitions of the location coordinates. Each partition specifies a particular grouping of the coordinates into subsets. For each partition, the method identifies a particular set of subsets of location values that satisfy a particular set of constraints and determines a set of subsets of location values that optimizes a particular measure.
Abstract translation: 一些实施例提供了一种用于分析包括多个单词的非结构化文档的方法。 每个单词都是一组关联的字形,每个字形都具有位置坐标。 该方法基于位置坐标来识别词群。 基于所识别的集群,该方法定义了用于标识字形的一组边框的字形的一组边界元素。 该方法基于字形和定义的边界元素定义非结构化文档的结构化文档。 为了识别单词群集,该方法命令位置坐标并标识位置坐标的几个分区。 每个分区将坐标的特定分组指定为子集。 对于每个分区,该方法识别满足特定的约束集合的位置值子集的特定集合,并且确定优化特定度量的位置值子集的集合。
-
公开(公告)号:US20100174975A1
公开(公告)日:2010-07-08
申请号:US12479848
申请日:2009-06-07
Applicant: Philip Andrew Mansfield , Michael Robert Levy
Inventor: Philip Andrew Mansfield , Michael Robert Levy
CPC classification number: G06F17/2294 , G06F17/21 , G06F17/211 , G06F17/212 , G06F17/218 , G06F17/2217 , G06F17/2247 , G06F17/243 , G06F17/248 , G06F17/2705 , G06F17/28 , G06F17/30011 , G06K9/00456 , G06K9/00463
Abstract: Some embodiments provide a method for analyzing an unstructured document that includes a number of glyphs. The method identifies boundaries between sets of glyphs. The method identifies that several of the boundaries form a table. The method defines a tabular structural element based on the table. The tabular structural element includes several cells arranged in a plurality of rows and columns, each of which includes an associated set of glyphs.
Abstract translation: 一些实施例提供了一种用于分析包括多个字形的非结构化文档的方法。 该方法识别字形集之间的边界。 该方法识别出几个边界形成一个表。 该方法基于该表定义了一个表格结构元素。 表格结构元素包括布置在多个行和列中的几个单元格,每个列和列都包括一组关联的字形。
-
公开(公告)号:US20080238927A1
公开(公告)日:2008-10-02
申请号:US11728814
申请日:2007-03-26
Applicant: Philip Andrew Mansfield
Inventor: Philip Andrew Mansfield
IPC: G06T11/00
CPC classification number: G06T11/60
Abstract: Rendering glyphs is disclosed. A set of glyphs to be flowed along a nonlinear path are received. A first glyph included in the set is placed at a corresponding location along the nonlinear path such that the first glyph is spaced from a second glyph, at a point nearest the second glyph, by at least a prescribed distance.
Abstract translation: 公开了渲染字形。 接收沿非线性路径流动的一组字形。 包括在集合中的第一字形被放置在沿着非线性路径的对应位置处,使得第一字形在距离第二字形最接近的点处与第二字形间隔至少规定的距离。
-
公开(公告)号:US08892992B2
公开(公告)日:2014-11-18
申请号:US13555053
申请日:2012-07-20
Applicant: Philip Andrew Mansfield , Michael Robert Levy
Inventor: Philip Andrew Mansfield , Michael Robert Levy
CPC classification number: G06F17/2294 , G06F17/21 , G06F17/211 , G06F17/212 , G06F17/218 , G06F17/2217 , G06F17/2247 , G06F17/243 , G06F17/248 , G06F17/2705 , G06F17/28 , G06F17/30011 , G06K9/00456 , G06K9/00463
Abstract: Some embodiments provide a method for defining structure for an unstructured document that includes a number of primitive elements that are defined in terms of their position in the document. The method identifies a pairwise grouping of nearest primitive elements. The method sorts the pairwise primitive elements based on an order from the closest to the furthest pairs. The method stores a single value that identifies which of the pairwise primitive elements are sufficiently far apart to form a partition. The method uses the stored value to identify and analyze the partitions in order to define structural elements for the document.
Abstract translation: 一些实施例提供了一种用于定义非结构化文档的结构的方法,该结构化文档包括根据其在文档中的位置定义的多个基元元素。 该方法识别最近的原始元素的成对分组。 该方法根据最接近最远对的顺序对成对的原始元素进行排序。 该方法存储一个单一的值,它识别成对的原始元素中的哪一个足够远以形成分区。 该方法使用存储的值来标识和分析分区,以便定义文档的结构元素。
-
公开(公告)号:US08719701B2
公开(公告)日:2014-05-06
申请号:US12479847
申请日:2009-06-07
Applicant: Philip Andrew Mansfield , Michael Robert Levy
Inventor: Philip Andrew Mansfield , Michael Robert Levy
IPC: G06F17/00
CPC classification number: G06F17/2294 , G06F17/21 , G06F17/211 , G06F17/212 , G06F17/218 , G06F17/2217 , G06F17/2247 , G06F17/243 , G06F17/248 , G06F17/2705 , G06F17/28 , G06F17/30011 , G06K9/00456 , G06K9/00463
Abstract: Some embodiments provide a method for analyzing an unstructured document that includes a number of words. Each word is an associated set of glyphs and each glyph has location coordinates. The method identifies clusters of words based on the location coordinates. Based on the identified clusters, the method defines a set of boundary elements for the glyphs that identify a set of borders for the glyphs. The method defines a structured document for the unstructured document based on the glyphs and the defined boundary elements. To identify clusters of words, the method orders the location coordinates and identifies several partitions of the location coordinates. Each partition specifies a particular grouping of the coordinates into subsets. For each partition, the method identifies a particular set of subsets of location values that satisfy a particular set of constraints and determines a set of subsets of location values that optimizes a particular measure.
Abstract translation: 一些实施例提供了一种用于分析包括多个单词的非结构化文档的方法。 每个单词都是一组关联的字形,每个字形都具有位置坐标。 该方法基于位置坐标来识别词群。 基于所识别的集群,该方法定义了用于标识字形的一组边框的字形的一组边界元素。 该方法基于字形和定义的边界元素定义非结构化文档的结构化文档。 为了识别单词群集,该方法命令位置坐标并标识位置坐标的几个分区。 每个分区将坐标的特定分组指定为子集。 对于每个分区,该方法识别满足特定的约束集合的位置值子集的特定集合,并且确定优化特定度量的位置值子集的集合。
-
公开(公告)号:US08380753B2
公开(公告)日:2013-02-19
申请号:US13106806
申请日:2011-05-12
Applicant: Philip Andrew Mansfield , Michael Robert Levy
Inventor: Philip Andrew Mansfield , Michael Robert Levy
IPC: G06F17/00
CPC classification number: G06F17/2241 , G06F17/2745 , G06F17/30
Abstract: Some embodiments provide a method for analyzing a document that includes several primitive elements. The method identifies that a set of primitive elements include an implicit list in the document based on location and appearance of the set of primitive elements. The method defines the identified implicit list as an explicit list. The method stores the explicit list as a structure associated with the document.
Abstract translation: 一些实施例提供了一种用于分析包含若干基元的文档的方法。 该方法基于原始元素集合的位置和外观来识别一组原始元素包括文档中的隐式列表。 该方法将识别的隐式列表定义为显式列表。 该方法将显式列表存储为与文档相关联的结构。
-
-
-
-
-
-
-
-
-