Patent search ap:"Grigorios Magklis" Page 1

1.

发明申请
INSTRUCTION AND LOGIC FOR OPTIMIZATION LEVEL AWARE BRANCH PREDICTION 审中-公开
Title translation: 指导和逻辑优化水平分支预测

公开(公告)号：US20140095849A1

公开(公告)日：2014-04-03

申请号：US13631402

申请日：2012-09-28

Applicant: Polychronis Xekalakis , Pedro Marcuello , Alejandro Vicente Martinez , Christos E. Kotselidis , Grigorios Magklis , Fernando Latorre , Raul Martinez , Josep M. Codina , Enric Gibert Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Pedro Lopez , Marc Lupon , Carlos Madriles , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis

Inventor： Polychronis Xekalakis , Pedro Marcuello , Alejandro Vicente Martinez , Christos E. Kotselidis , Grigorios Magklis , Fernando Latorre , Raul Martinez , Josep M. Codina , Enric Gibert Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Pedro Lopez , Marc Lupon , Carlos Madriles , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis

IPC: G06F9/38

CPC classification number: G06F9/38 , G06F9/30036 , G06F9/3848 , G06F9/3877 , G06F9/3887 , G06F9/3889

Abstract: A computer-readable storage medium, method and system for optimization-level aware branch prediction is described. A gear level is assigned to a set of application instructions that have been optimized. The gear level is also stored in a register of a branch prediction unit of a processor. Branch prediction is then performed by the processor based upon the gear level.

Abstract translation: 描述了用于优化级感知分支预测的计算机可读存储介质，方法和系统。齿轮级别被分配给已优化的一组应用指令。齿轮级也存储在处理器的分支预测单元的寄存器中。然后由处理器基于齿轮级执行分支预测。

2.

发明申请
PROFILING ASYNCHRONOUS EVENTS RESULTING FROM THE EXECUTION OF SOFTWARE AT CODE REGION GRANULARITY 有权
Title translation: 根据代码区域格式的软件执行情况分析异常事件

公开(公告)号：US20130332705A1

公开(公告)日：2013-12-12

申请号：US13993054

申请日：2011-12-29

Applicant: Raul Martinez , Enric Gibert Codina , Pedro Lopez , Marti Torrents Lapuerta , Polychronis Xekalakis , Georgios Tournavitis , Kyriakos A. Stavrou , Demos Pavlou , Daniel Ortega , Alejandro Martinez Vicente , Pedro Marcuello , Grigorios Magklis , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos Kotselidis , Fernando Latorre , Marc Lupon , Carlos Madriles

Inventor： Raul Martinez , Enric Gibert Codina , Pedro Lopez , Marti Torrents Lapuerta , Polychronis Xekalakis , Georgios Tournavitis , Kyriakos A. Stavrou , Demos Pavlou , Daniel Ortega , Alejandro Martinez Vicente , Pedro Marcuello , Grigorios Magklis , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos Kotselidis , Fernando Latorre , Marc Lupon , Carlos Madriles

IPC: G06F9/30

CPC classification number: G06F11/3037 , G06F11/3409 , G06F11/3466 , G06F12/0862 , G06F2201/86 , G06F2201/865 , G06F2201/88 , G06F2201/885

Abstract: A combination of hardware and software collect profile data for asynchronous events, at code region granularity. An exemplary embodiment is directed to collecting metrics for prefetching events, which are asynchronous in nature. Instructions that belong to a code region are identified using one of several alternative techniques, causing a profile bit to be set for the instruction, as a marker. Each line of a data block that is prefetched is similarly marked. Events corresponding to the profile data being collected and resulting from instructions within the code region are then identified. Each time that one of the different types of events is identified, a corresponding counter is incremented. Following execution of the instructions within the code region, the profile data accumulated in the counters are collected, and the counters are reset for use with a new code region.

Abstract translation: 硬件和软件的组合以代码区域粒度收集用于异步事件的简档数据。示例性实施例涉及收集在本质上是异步的预取事件的度量。使用几种替代技术之一来标识属于代码区域的指令，使得为指令设置配置文件位作为标记。预取数据块的每一行都被类似地标记。然后识别与正在收集并且由代码区域内的指令产生的简档数据相对应的事件。每次识别出不同类型的事件之一时，相应的计数器将被递增。在代码区域中执行指令之后，收集在计数器中累积的简档数据，并且重置计数器以与新的代码区域一起使用。

3.

发明授权
Compressing address communications between processors 有权
Title translation: 压缩处理器之间的地址通信

公开(公告)号：US07698512B2

公开(公告)日：2010-04-13

申请号：US11827904

申请日：2007-07-13

Applicant: Grigorios Magklis , Jose Gonzalez , Pedro Chaparro , Qiong Cai , Antonio Gonzalez

Inventor： Grigorios Magklis , Jose Gonzalez , Pedro Chaparro , Qiong Cai , Antonio Gonzalez

IPC: G06F12/00

CPC classification number: G06F12/084 , G06F12/0833 , Y02D10/13

Abstract: In one embodiment, the present invention includes a method for determining if data of a memory request by a first agent is in a memory region represented by a region indicator of a region table of the first agent, and transmitting a compressed address for the memory request to other agents of a system if the memory region is represented by the region indicator, otherwise transmitting a full address. Other embodiments are described and claimed.

Abstract translation: 在一个实施例中，本发明包括一种方法，用于确定由第一代理的存储器请求的数据是否在由第一代理的区域表的区域指示符表示的存储器区域中，并且发送用于存储器请求的压缩地址如果存储器区域由区域指示符表示，则传送到系统的其它代理，否则发送完整地址。描述和要求保护其他实施例。

4.

发明授权
Meeting point thread characterization 有权
Title translation: 汇点线程表征

公开(公告)号：US07665000B2

公开(公告)日：2010-02-16

申请号：US11714938

申请日：2007-03-07

Applicant: Antonio Gonzalez , Qiong Cai , Jose Gonzalez , Pedro Chaparro , Grigorios Magklis , Ryan Rakvic

Inventor： Antonio Gonzalez , Qiong Cai , Jose Gonzalez , Pedro Chaparro , Grigorios Magklis , Ryan Rakvic

IPC: G01R31/28

CPC classification number: G06F9/52 , G06F11/3404 , G06F11/3495

Abstract: An apparatus associated with identifying a critical thread based on information gathered during meeting point processing is provided. One embodiment of the apparatus may include logic to selectively update meeting point counts for threads upon determining that they have arrived at a meeting point. The embodiment may also include logic to periodically identify which thread in a set of threads is a critical thread. The critical thread may be the slowest thread and criticality may be determined by examining meeting point counts. The embodiment may also include logic to selectively manipulate a configurable attribute of the critical thread and/or core upon which the critical thread will run.

Abstract translation: 提供了一种基于在会议点处理期间收集的信息来识别关键线程的装置。设备的一个实施例可以包括用于在确定它们已经到达会议点时选择性地更新线程的会议点计数的逻辑。该实施例还可以包括用于周期性地识别一组线程中的哪个线程是关键线程的逻辑。关键线程可能是最慢的线程，可以通过检查会议点数来确定关键性。该实施例还可以包括选择性地操纵关键线程和/或核心的可配置属性的逻辑，关键线程将在其上运行。

5.

发明申请
Leakage Power Estimation 有权
Title translation: 泄漏功率估计

公开(公告)号：US20080244278A1

公开(公告)日：2008-10-02

申请号：US10594827

申请日：2006-06-30

Applicant: Pedro Chaparro Monferrer , Grigorios Magklis , Jose Gonzalez , Antonio Gonzalez

Inventor： Pedro Chaparro Monferrer , Grigorios Magklis , Jose Gonzalez , Antonio Gonzalez

IPC: G06F1/26 , G01K13/00

CPC classification number: G01R31/025

Abstract: Methods and apparatus to provide leakage power estimation are described. In one embodiment, one or more sensed temperature values (108) and one or more voltage values (110) are utilized to determine the leakage power of an integrated circuit (IC) component. Other embodiments are also described.

Abstract translation: 描述提供泄漏功率估计的方法和装置。在一个实施例中，利用一个或多个感测温度值（108）和一个或多个电压值（110）来确定集成电路（IC）组件的泄漏功率。还描述了其它实施例。

6.

发明授权
Propagating a prefetching profile bit from a prefetch queue to a data cache to indicate that a line was prefetched in response to an instruction within a code region 有权

公开(公告)号：US10013326B2

公开(公告)日：2018-07-03

申请号：US13993054

申请日：2011-12-29

Applicant: Raul Martinez , Enric Gibert Codina , Pedro Lopez , Marti Torrents Lapuerta , Polychronis Xekalakis , Georgios Tournavitis , Kyriakos A. Stavrou , Demos Pavlou , Daniel Ortega , Alejandro Martinez Vicente , Pedro Marcuello , Grigorios Magklis , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos Kotselidis , Fernando Latorre , Marc Lupon , Carlos Madriles

Inventor： Raul Martinez , Enric Gibert Codina , Pedro Lopez , Marti Torrents Lapuerta , Polychronis Xekalakis , Georgios Tournavitis , Kyriakos A. Stavrou , Demos Pavlou , Daniel Ortega , Alejandro Martinez Vicente , Pedro Marcuello , Grigorios Magklis , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Mirem Hyuseinova , Christos Kotselidis , Fernando Latorre , Marc Lupon , Carlos Madriles

IPC: G06F11/30 , G06F11/34 , G06F12/0862

CPC classification number: G06F11/3037 , G06F11/3409 , G06F11/3466 , G06F12/0862 , G06F2201/86 , G06F2201/865 , G06F2201/88 , G06F2201/885

Abstract: A combination of hardware and software collect profile data for asynchronous events, at code region granularity. An exemplary embodiment is directed to collecting metrics for prefetching events, which are asynchronous in nature. Instructions that belong to a code region are identified using one of several alternative techniques, causing a profile bit to be set for the instruction, as a marker. Each line of a data block that is prefetched is similarly marked. Events corresponding to the profile data being collected and resulting from instructions within the code region are then identified. Each time that one of the different types of events is identified, a corresponding counter is incremented. Following execution of the instructions within the code region, the profile data accumulated in the counters are collected, and the counters are reset for use with a new code region.

7.

发明授权
Double rounded combined floating-point multiply and add 有权
Title translation: 双圆形组合浮点乘法和加法

公开(公告)号：US09213523B2

公开(公告)日：2015-12-15

申请号：US13539198

申请日：2012-06-29

Applicant: Sridhar Samudrala , Grigorios Magklis , Marc Lupon , David R. Ditzel

Inventor： Sridhar Samudrala , Grigorios Magklis , Marc Lupon , David R. Ditzel

IPC: G06F7/38 , G06F7/483 , G06F7/544

CPC classification number: G06F7/4876 , G06F7/483 , G06F7/485 , G06F7/4991 , G06F7/49915 , G06F7/5443 , G06F2207/4802

Abstract: Methods, apparatus, instructions and logic are disclosed providing double rounded combined floating-point multiply and add functionality as scalar or vector SIMD instructions or as fused micro-operations. Embodiments include detecting floating-point (FP) multiplication operations and subsequent FP operations specifying as source operands results of the FP multiplications. The FP multiplications and the subsequent FP operations are encoded as combined FP operations including rounding of the results of FP multiplication followed by the subsequent FP operations. The encoding of said combined FP operations may be stored and executed as part of an executable thread portion using fused-multiply-add hardware that includes overflow detection for the product of FP multipliers, first and second FP adders to add third operand addend mantissas and the products of the FP multipliers with different rounding inputs based on overflow, or no overflow, in the products of the FP multiplier. Final results are selected respectively using overflow detection.

Abstract translation: 公开了提供双向组合浮点乘法和附加功能作为标量或向量SIMD指令或作为融合微操作的方法，装置，指令和逻辑。实施例包括检测浮点（FP）乘法运算和指定作为FP乘法的源操作数结果的后续FP操作。 FP乘法和随后的FP操作被编码为组合FP操作，包括对FP乘法的结果进行舍入，随后是随后的FP操作。所述组合FP操作的编码可以作为可执行线程部分的一部分使用融合乘法硬件来存储和执行，所述融合乘法加法器包括用于FP乘法器的乘积的溢出检测，第一和第二FP加法器来添加第三操作数加法尾数，基于FP乘法器产品中溢出或不溢出的FP乘法器的不同舍入输入的产品。分别使用溢出检测选择最终结果。

8.

发明授权
Thread migration to improve power efficiency in a parallel processing environment 有权
Title translation: 线程迁移以提高并行处理环境中的功率效率

公开(公告)号：US08806491B2

公开(公告)日：2014-08-12

申请号：US13453904

申请日：2012-04-23

Applicant: Qiong Cai , José González , Pedro Chappero Monferrer , Grigorios Magklis , Antonio González

Inventor： Qiong Cai , José González , Pedro Chappero Monferrer , Grigorios Magklis , Antonio González

IPC: G06F9/46 , G06F1/00

CPC classification number: G06F9/5094 , G06F1/329 , Y02D10/22 , Y02D10/24

Abstract: A method and system to selectively move one or more of a plurality threads which are executing in parallel by a plurality of processing cores. In one embodiment, a thread may be moved from executing in one of the plurality of processing cores to executing in another of the plurality of processing cores, the moving based on a performance characteristic associated with the plurality of threads. In another embodiment of the invention, a power state of the plurality of processing cores may be changed to improve a power efficiency associated with the executing of the multiple threads.

Abstract translation: 一种选择性地移动由多个处理核并行执行的多个线程中的一个或多个的方法和系统。在一个实施例中，可以将线程从执行多个处理核心中的一个移动到在多个处理核心中的另一个处理核心中执行，基于与多个线程相关联的性能特性来移动线程。在本发明的另一实施例中，可以改变多个处理核心的功率状态，以提高与执行多个线程相关联的功率效率。

9.

发明申请
SUPPORT FOR SPECULATIVE OWNERSHIP WITHOUT DATA 审中-公开
Title translation: 支持无数据的统计所有权

公开(公告)号：US20130268735A1

公开(公告)日：2013-10-10

申请号：US13994686

申请日：2011-12-29

Applicant: Enric Gibert Codina , Fernando Latorre , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Meyrem Hyuseinova , Christos E. Kotselidis , Pedro Lopez , Marc Lupon , Carlos Madriles , Grigorios Magklis , Pedro Marcuello , Alejandro Martinez Vicente , Raul Martinez , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis , Polychronis Xekalakis

Inventor： Enric Gibert Codina , Fernando Latorre , Josep M. Codina , Crispin Gomez Requena , Antonio Gonzalez , Meyrem Hyuseinova , Christos E. Kotselidis , Pedro Lopez , Marc Lupon , Carlos Madriles , Grigorios Magklis , Pedro Marcuello , Alejandro Martinez Vicente , Raul Martinez , Daniel Ortega , Demos Pavlou , Kyriakos A. Stavrou , Georgios Tournavitis , Polychronis Xekalakis

IPC: G06F12/08

CPC classification number: G06F12/0815 , G06F2212/507

Abstract: Techniques are described for providing an enhanced cache coherency protocol for a multi-core processor that includes a Speculative Request For Ownership Without Data (SRFOWD) for a portion of cache memory. With a SRFOWD, only an acknowledgement message may be provided as an answer to a requesting core. The contents of the affected cache line are not required to be a part of the answer. The enhanced cache coherency protocol may assure that a valid copy of the current cache line exists in case of misspeculation by the requesting core. Thus, an owner of the current copy of the cache line may maintain a copy of the old contents of the cache line. The old contents of the cache line may be discarded if speculation by the requesting core turns out to be correct. Otherwise, in case of misspeculation by the requesting core, the old contents of the cache line may be set back to a valid state.

Abstract translation: 描述了用于为多核处理器提供增强的高速缓存一致性协议的技术，所述多核处理器包括用于高速缓冲存储器的一部分的无数据的投机请求（SRFOWD）。使用SRFOWD，可以仅提供确认消息作为请求内核的答案。受影响的缓存行的内容不需要作为答案的一部分。增强的高速缓存一致性协议可以确保在请求核心的错误配置的情况下存在当前高速缓存行的有效副本。因此，高速缓存行的当前副本的所有者可以维护高速缓存行的旧内容的副本。如果请求的核心的推测证明是正确的，则可以丢弃该高速缓存行的旧内容。否则，在请求核心发生错误的情况下，可以将高速缓存行的旧内容设置回有效状态。

10.

发明申请
Frequency And Voltage Scaling Architecture 有权
Title translation: 频率和电压调节架构

公开(公告)号：US20130173948A1

公开(公告)日：2013-07-04

申请号：US13780023

申请日：2013-02-28

Applicant: Grigorios Magklis , Jose Gonzalez , Antonio Gonzalez

Inventor： Grigorios Magklis , Jose Gonzalez , Antonio Gonzalez

IPC: G06F1/06

CPC classification number: G06F1/06 , G06F1/3203 , G06F1/324 , G06F1/3296 , G06F9/3869 , Y02D10/126 , Y02D10/172

Abstract: A method and apparatus for scaling frequency and operating voltage of at least one clock domain of a microprocessor. More particularly, embodiments of the invention relate to techniques to divide a microprocessor into clock domains and control the frequency and operating voltage of each clock domain independently of the others.

Abstract translation: 一种用于缩放微处理器的至少一个时钟域的频率和工作电压的方法和装置。更具体地，本发明的实施例涉及将微处理器划分为时钟域并且独立于其他时钟域来控制每个时钟域的频率和工作电压的技术。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification