SYSTEMS AND METHODS FOR SCALABLE HIERARCHICAL POLYHEDRAL COMPILATION

    公开(公告)号:US20170097815A1

    公开(公告)日:2017-04-06

    申请号:US15285810

    申请日:2016-10-05

    IPC分类号: G06F9/45

    CPC分类号: G06F8/4436 G06F8/41 G06F8/452

    摘要: A system for compiling programs for execution thereof using a hierarchical processing system having two or more levels of memory hierarchy can perform memory-level-specific optimizations, without exceeding a specified maximum compilation time. To this end, the compiler system employs a polyhedral model and limits the dimensions of a polyhedral program representation that is processed by the compiler at each level using a focalization operator that temporarily reduces one or more dimensions of the polyhedral representation. Semantic correctness is provided via a defocalization operator that can restore all polyhedral dimensions that had been temporarily removed.

    Methods and apparatus for joint scheduling and layout optimization to enable multi-level vectorization
    34.
    发明授权
    Methods and apparatus for joint scheduling and layout optimization to enable multi-level vectorization 有权
    联合调度和布局优化的方法和装置,以实现多级向量化

    公开(公告)号:US09489180B1

    公开(公告)日:2016-11-08

    申请号:US13679861

    申请日:2012-11-16

    IPC分类号: G06F9/44 G06F9/45

    CPC分类号: G06F8/443 G06F8/447

    摘要: Methods, apparatus and computer software product for source code optimization are provided. In an exemplary embodiment, a first custom computing apparatus is used to optimize the execution of source code on a second computing apparatus. In this embodiment, the first custom computing apparatus contains a memory, a storage medium and at least one processor with at least one multi-stage execution unit. The second computing apparatus contains at least one vector execution unit that allow for parallel execution of tasks on constant-strided memory locations. The first custom computing apparatus optimizes the code for parallelism, locality of operations, constant-strided memory accesses and vectorized execution on the second computing apparatus. This Abstract is provided for the sole purpose of complying with the Abstract requirement rules. This Abstract is submitted with the explicit understanding that it will not be used to interpret or to limit the scope or the meaning of the claims.

    摘要翻译: 提供了用于源代码优化的方法,设备和计算机软件产品。 在示例性实施例中,使用第一定制计算装置来优化第二计算装置上的源代码的执行。 在该实施例中,第一定制计算装置包含存储器,存储介质和具有至少一个多级执行单元的至少一个处理器。 第二计算装置包含至少一个向量执行单元,其允许并行执行恒定跨度存储器位置上的任务。 第一定制计算装置优化用于并行性的代码,操作的局部性,在第二计算装置上的恒定帧存储器访问和向量化执行。 本摘要仅用于遵守抽象要求规则。 本摘要以明确的理解提交,不会用于解释或限制权利要求的范围或含义。

    SYSTEMS AND METHODS FOR MULTIRESOLUTION PARSING
    35.
    发明申请
    SYSTEMS AND METHODS FOR MULTIRESOLUTION PARSING 审中-公开
    用于多分辨率分析的系统和方法

    公开(公告)号:US20160171077A1

    公开(公告)日:2016-06-16

    申请号:US14883294

    申请日:2015-10-14

    IPC分类号: G06F17/30

    CPC分类号: H04L63/1408 G06F17/30985

    摘要: A multiresolution parser (MRP) can selectively extract one or more information units from a dataset based on the available processing capacity and/or the arrival rate of the dataset. Should any of these parameters change, the MRP can adaptively change the information units to be extracted such that the benefit or value of the extracted information is maximized while minimizing the cost of extraction. This tradeoff is facilitated, at least in part, by an analysis of the spectral energy of the datasets expected to be processed by the MRP. The MRP can also determine its state after a processing iteration and use that state information in subsequent iterations to minimize the required computations in such subsequent iterations, so as to improve processing efficiency.

    摘要翻译: 多分辨率解析器(MRP)可以基于数据集的可用处理能力和/或到达速率从数据集中选择性地提取一个或多个信息单元。 如果这些参数中的任何一个变化,则MRP可以自适应地改变要提取的信息单元,使得提取的信息的利益或价值最大化,同时最小化提取的成本。 至少部分地通过对期望由MRP处理的数据集的光谱能量的分析来促进这种折衷。 MRP还可以在处理迭代之后确定其状态,并在随后的迭代中使用该状态信息以最小化在这样的后续迭代中所需的计算,以便提高处理效率。

    SYSTEM AND METHOD FOR GENERATION OF EVENT DRIVEN, TUPLE-SPACE BASED PROGRAMS
    36.
    发明申请
    SYSTEM AND METHOD FOR GENERATION OF EVENT DRIVEN, TUPLE-SPACE BASED PROGRAMS 审中-公开
    用于生成事件驱动,基于空间的程序的系统和方法

    公开(公告)号:US20150089485A1

    公开(公告)日:2015-03-26

    申请号:US14492899

    申请日:2014-09-22

    IPC分类号: G06F9/45

    摘要: In a system for automatic generation of event-driven, tuple-space based programs from a sequential specification, a hierarchical mapping solution can target different runtimes relying on event-driven tasks (EDTs). The solution uses loop types to encode short, transitive relations among EDTs that can be evaluated efficiently at runtime. Specifically, permutable loops translate immediately into conservative point-to-point synchronizations of distance one. A runtime-agnostic which can be used to target the transformed code to different runtimes.

    摘要翻译: 在一个从顺序规范自动生成基于元组空间的程序的系统中,分层映射解决方案可以针对不依赖于事件驱动任务(EDT)的运行时间。 该解决方案使用循环类型来编码EDT之间的短期,传递关系,可以在运行时有效地评估。 具体来说,可置换循环立即转换为距离1的保守点对点同步。 与运行时无关的,可用于将转换后的代码定位到不同的运行时。

    METHODS AND APPARATUS FOR DATA TRANSFER OPTIMIZATION
    37.
    发明申请
    METHODS AND APPARATUS FOR DATA TRANSFER OPTIMIZATION 有权
    数据传输优化的方法和设备

    公开(公告)号:US20140165047A1

    公开(公告)日:2014-06-12

    申请号:US14181201

    申请日:2014-02-14

    IPC分类号: G06F9/45

    CPC分类号: G06F8/443 G06F12/023

    摘要: Methods, apparatus and computer software product for optimization of data transfer between two memories includes determining access to master data stored in one memory and/or to local data stored in another memory such that either or both of the size of total data transferred and the number of data transfers required to transfer the total data can be minimized. The master and/or local accesses are based on, at least in part, respective structures of the master and local data.

    摘要翻译: 用于优化两个存储器之间的数据传输的方法,装置和计算机软件产品包括确定对存储在一个存储器中的主数据的访问和/或存储在另一存储器中的本地数据,使得传送的总数据的大小和数量 可以最小化传输总数据所需的数据传输。 主和/或本地访问至少部分地基于主数据和本地数据的相应结构。