专利检索 ap:"Markus Kaltenbach" 第 1 页

1.

发明授权
Reversing processing order in half-pumped SIMD execution units to achieve K cycle issue-to-issue latency 有权

公开(公告)号：US08949575B2

公开(公告)日：2015-02-03

申请号：US13326249

申请日：2011-12-14

申请人： Maarten J. Boersma , Markus Kaltenbach , Christophe J. Layer , Jens Leenstra , Silvia M. Mueller

发明人： Maarten J. Boersma , Markus Kaltenbach , Christophe J. Layer , Jens Leenstra , Silvia M. Mueller

IPC分类号： G06F9/38 , G06F9/30 , G06F15/80

CPC分类号： G06F9/30036 , G06F9/345 , G06F9/38 , G06F9/3826 , G06F9/3867 , G06F9/3869 , G06F15/76 , G06F15/8053

摘要： Techniques for reducing issue-to-issue latency by reversing processing order in half-pumped single instruction multiple data (SIMD) execution units are described. In one embodiment a processor functional unit is provided comprising a frontend unit, and execution core unit, a backend unit, an execution order control signal unit, a first interconnect coupled between and output and an input of the execution core unit and a second interconnect coupled between an output of the backend unit and an input of the frontend unit. In operation, the execution order control signal unit generates a forwarding order control signal based on the parity of an applied clock signal on reception of a first vector instruction. This control signal is in turn used to selectively forward first and second portions of an execution result of the first vector instruction via the interconnects for use in the execution of a dependent second vector instruction.

2.

发明授权
Reuse of rounder for fixed conversion of log instructions 有权
标题翻译：重复使用圆形固定转换日志指令

公开(公告)号：US08626807B2

公开(公告)日：2014-01-07

申请号：US12350680

申请日：2009-01-08

申请人： Maarten Boersma , Markus Kaltenbach , Michael Klein , Silvia Melitta Mueller , Jochen Preiss

发明人： Maarten Boersma , Markus Kaltenbach , Michael Klein , Silvia Melitta Mueller , Jochen Preiss

IPC分类号： G06F7/00

CPC分类号： H03M7/24

摘要： A method for converting a signed fixed point number into a floating point number that includes reading an input number corresponding to a signed fixed point number to be converted, determining whether the input number is less than zero, setting a sign bit based upon whether the input number is less than zero or greater than or equal to zero, computing a first intermediate result by exclusive-ORing the input number with the sign bit, computing leading zeros of the first intermediate result, padding the first intermediate result based upon the sign bit, computing a second intermediate result by shifting the padded first intermediate result to the left by the leading zeros, computing an exponent portion and a fraction portion, conditionally incrementing the fraction portion based on the sign bit, correcting the exponent portion and the fraction portion if the incremented fraction portion overflows, and returning the floating point number.

摘要翻译： 一种用于将有符号固定点数转换为浮点数的方法，该浮点数包括读取与要转换的有符号固定点数相对应的输入数，确定输入数是否小于零，根据输入数量小于零或大于或等于零，通过将输入数字与符号位进行异或运算来计算第一中间结果，计算第一中间结果的前导零，基于符号位填充第一中间结果，通过将填充的第一中间结果向左移动前导零来计算第二中间结果，计算指数部分和分数部分，基于符号位有条件地增加分数部分，校正指数部分和分数部分，如果递增分数部分溢出，返回浮点数。

3.

发明授权
Instruction set architecture with instruction characteristic bit indicating a result is not of architectural importance 失效
标题翻译：具有指示结果的指令特征位的指令集架构不具有架构重要性

公开(公告)号：US08266411B2

公开(公告)日：2012-09-11

申请号：US12366169

申请日：2009-02-05

申请人： Tobias Gemmeke , Markus Kaltenbach , Nicolas Maeding

发明人： Tobias Gemmeke , Markus Kaltenbach , Nicolas Maeding

IPC分类号： G06F9/30

CPC分类号： G06F9/30076 , G06F9/30185 , G06F9/30192 , G06F9/3832 , G06F9/3838 , G06F9/384 , G06F9/3861

摘要： Instead of having a processor with an instruction set architecture (ISA) that includes fixed architected operands, an improved processor supports additional characteristic bits for computing instructions (e.g., a multiply-add, load/store instructions). Such additional bits for the certain instructions influence the processing of these instructions by the processor. Also, a new instruction is introduced for further usage of the proposed method. Typically these additional characteristic bits as well as the instruction can be automatically generated by compilers to provide relatively well-suited instruction sequences for the processor.

摘要翻译： 改进的处理器代替具有包括固定架构操作数的指令集体系结构（ISA）的处理器，而不是用于计算指令（例如，乘法加载/存储指令）的附加特征位。这些特定指令的附加位影响处理器对这些指令的处理。另外，引入了新的指令来进一步使用所提出的方法。通常，这些附加特征位以及指令可以由编译器自动生成，以为处理器提供相对适合的指令序列。

4.

发明授权
Method and system for changing a description for a state transition function of a state machine engine 失效
标题翻译：用于改变状态机引擎的状态转换功能的描述的方法和系统

公开(公告)号：US07703058B2

公开(公告)日：2010-04-20

申请号：US11740487

申请日：2007-04-26

申请人： Rolf Fritz , Markus Kaltenbach , Ulrich Mayer , Thomas Pflueger , Cordt Starke , Jan Van Lunteren

发明人： Rolf Fritz , Markus Kaltenbach , Ulrich Mayer , Thomas Pflueger , Cordt Starke , Jan Van Lunteren

IPC分类号： G06F17/50

CPC分类号： G06F17/5045

摘要： The invention relates to a method and system for the design and implementation of state machine engines. A first constraints checking step checks a state transition function created by a designer against constraints imposed by the implementation technology in order to detect all portions of the state transition function that are in conflict with the constraints. A subsequent conflict resolution step tries to determine one or more suggested ways to meet the conflicting constraints, by investigating how the original state transition function can be modified such that all constraints are met. A final presentation and selection step provides the designer textual and/or graphically results of the constraints check and suggested modifications. The modifications can be accepted interactively, or the state transition function can be changed manually. In the latter case, the modified state transition function will be processed starting again with the constraints checking step.

摘要翻译： 本发明涉及一种用于设计和实施状态机发动机的方法和系统。第一约束检查步骤检查由设计者创建的状态转换函数，以抵制由实施技术施加的约束，以便检测与约束相冲突的状态转换函数的所有部分。随后的冲突解决步骤尝试通过调查如何修改原始状态转换功能以满足所有约束来确定满足冲突约束的一种或多种建议方式。最后的演示和选择步骤为设计者提供了约束检查和建议修改的文本和/或图形结果。可以交互接受修改，也可以手动更改状态转换功能。在后一种情况下，将通过约束检查步骤再次开始处理修改的状态转换功能。

5.

发明申请
Method for a Hash Table Lookup and Processor Cache 审中-公开
标题翻译：哈希表查找和处理器缓存的方法

公开(公告)号：US20080052488A1

公开(公告)日：2008-02-28

申请号：US11742718

申请日：2007-05-01

申请人： Rolf Fritz , Markus Kaltenbach , Ulrich Mayer , Thomas Pflueger , Cordt Starke , Jan Van Lunteren

发明人： Rolf Fritz , Markus Kaltenbach , Ulrich Mayer , Thomas Pflueger , Cordt Starke , Jan Van Lunteren

IPC分类号： G06F12/00

CPC分类号： G06F12/0864 , G06F12/0897

摘要： The present, invention improves the hash table lookup operation by using a new processor cache architecture. A speculative processing of entries stored in the cache is combined with a delayed evaluation of cache entries. The speculative processing means that for each cache entry retrieved from main memory in a step of the hash table lookup operation it is assumed that it already contains the selected hash table entry. The delayed evaluation means that certain steps of the lookup operation are performed in parallel with others. In advantageous embodiments the invention can also be used in conjunction with a hierarchy of inclusive caches. The preferred embodiments of the invention involve a new approach for a transition rule cache of a BaRT-FSM controller.

摘要翻译： 本发明通过使用新的处理器高速缓存架构来改进哈希表查找操作。存储在缓存中的条目的推测性处理与缓存条目的延迟评估相结合。推测处理意味着对于在哈希表查找操作的步骤中从主存储器检索的每个高速缓存条目，假设它已经包含所选择的哈希表条目。延迟评估意味着查找操作的某些步骤与其他操作并行执行。在有利的实施例中，本发明还可以与包含缓存的层次结构一起使用。本发明的优选实施例涉及用于BaRT-FSM控制器的转换规则高速缓存的新方法。

6.

发明申请
Method and System for Changing a Description for a State Transition Function of a State Machine Engine 失效
标题翻译：用于更改状态机引擎的状态转换功能的描述的方法和系统

公开(公告)号：US20070282573A1

公开(公告)日：2007-12-06

申请号：US11740558

申请日：2007-04-26

申请人： Rolf Fritz , Markus Kaltenbach , Ulrich Mayer , Thomas Pflueger , Cordt Starke , Jan Van Lunteren

发明人： Rolf Fritz , Markus Kaltenbach , Ulrich Mayer , Thomas Pflueger , Cordt Starke , Jan Van Lunteren

IPC分类号： G06F17/18

CPC分类号： G06F17/2247

摘要： The invention relates to a method of optimizing a state transition function specification for a state machine engine based on a probability distribution for the state transitions. For the preferred embodiment of the invention, a B-FSM state machine engine accesses a transition rule memory using a processor cache. The invention allows improving the cache hit rate by exploiting the probability distribution. The N transition rules that comprise a hash table entry will be loaded in a burst mode from the main memory, from which the N transition rules are transferred to the processor cache. Because the comparison of the actual state and input values against each of the transition rules can immediately start after each of these rules has been received, the overall performance is improved as the transition rule that is most likely to be selected is the first to be transferred as part of the burst access.

摘要翻译： 本发明涉及一种基于状态转换的概率分布优化状态机引擎的状态转换功能规范的方法。对于本发明的优选实施例，B-FSM状态机引擎使用处理器高速缓存来访问转换规则存储器。本发明允许通过利用概率分布来提高缓存命中率。构成哈希表条目的N个转换规则将以从主存储器的突发模式加载，N个转换规则从该存储器传送到处理器高速缓存。因为实际状态和输入值与每个转换规则的比较可以在接收到每个这些规则之后立即开始，因为最有可能被选择的转换规则是首先被转移的整体性能得到改善作为突发访问的一部分。

7.

发明授权
Active power dissipation detection based on erroneus clock gating equations 有权
标题翻译：基于错误时钟门控方程的有功功耗检测

公开(公告)号：US09495490B2

公开(公告)日：2016-11-15

申请号：US13550207

申请日：2012-07-16

申请人： Christopher M Abernathy , Maarten J. Boersma , Markus Kaltenbach , Ulrike Schmidt

发明人： Christopher M Abernathy , Maarten J. Boersma , Markus Kaltenbach , Ulrike Schmidt

IPC分类号： G06F17/50

CPC分类号： G06F17/5022 , G06F2217/78

摘要： A method detects active power dissipation in an integrated circuit. The method includes receiving a hardware design for the integrated circuit having one or more clock domains, wherein the hardware design comprises a local clock buffer for a clock domain, wherein the local clock buffer is configured to receive a clock signal and an actuation signal. The method includes adding instrumentation logic to the design for the clock domain, wherein the instrumentation logic is configured to compare a first value of the actuation signal determined at a beginning point of a test period to a second value of the actuation signal determined at a time when the clock domain is in an idle condition. The method includes detecting the clock domain includes unintended active power dissipation, in response to the first value of the actuation signal not being equal to the second value of the actuation signal.

摘要翻译： 一种方法可以检测集成电路中的有功功率。该方法包括接收具有一个或多个时钟域的集成电路的硬件设计，其中硬件设计包括用于时钟域的本地时钟缓冲器，其中本地时钟缓冲器被配置为接收时钟信号和致动信号。该方法包括将仪器逻辑添加到时钟域的设计中，其中仪器逻辑被配置为将在测试周期的起始点确定的致动信号的第一值与每次确定的致动信号的第二值进行比较当时钟域处于空闲状态时。响应于致动信号的第一值不等于致动信号的第二值，该方法包括检测时钟域包括非预期的有功功耗。

8.

发明授权
Mechanism to speed-up multithreaded execution by register file write port reallocation 有权
标题翻译：通过注册文件写入端口重新分配来加快多线程执行的机制

公开(公告)号：US09207995B2

公开(公告)日：2015-12-08

申请号：US13170003

申请日：2011-06-27

申请人： Maarten J. Boersma , Markus Kaltenbach , Jens Leenstra , Tim Niggemeier , Philipp Oehler , Philipp Panitz

发明人： Maarten J. Boersma , Markus Kaltenbach , Jens Leenstra , Tim Niggemeier , Philipp Oehler , Philipp Panitz

IPC分类号： G06F9/30 , G06F9/52 , G06F9/38

CPC分类号： G06F9/52 , G06F9/3012 , G06F9/30123 , G06F9/30141 , G06F9/30189 , G06F9/3891

摘要： Various systems and processes may be used to speed up multi-threaded execution. In certain implementations, a system and process may include the ability to write results of a first group of execution units associated with a first register file into the first register file using a first write port of the first register file and write results of a second group of execution units associated with a second register file into the second register file using a first write port of the second register file. The system, apparatus, and process may also include the ability to connect, in a shared register file mode, results of the second group of execution units to a second write port of the first register file and connect, in a split register file mode, results of a part of the first group of execution units to the second write port of the first register file.

摘要翻译： 可以使用各种系统和过程来加速多线程执行。在某些实现中，系统和过程可以包括使用第一寄存器堆的第一写入端口将与第一寄存器堆相关联的第一组执行单元的结果写入第一寄存器堆的能力，以及第二组的写入结果使用第二寄存器文件的第一写入端口将与第二寄存器文件相关联的执行单元分配到第二寄存器堆中。系统，装置和过程还可以包括以共享寄存器文件模式将第二组执行单元的结果连接到第一寄存器堆的第二写入端口并以分割寄存器文件模式连接的能力，将第一组执行单元的一部分的结果提供给第一注册文件的第二写入端口。

9.

发明授权
Apparatus and method for calculating an SHA-2 hash function in a general purpose processor 有权
标题翻译：用于在通用处理器中计算SHA-2哈希函数的装置和方法

公开(公告)号：US09164725B2

公开(公告)日：2015-10-20

申请号：US13181678

申请日：2011-07-13

申请人： Maarten J. Boersma , Markus Kaltenbach , Jens Leenstra , Tim Niggemeier , Philipp Oehler , Philipp Panitz

发明人： Maarten J. Boersma , Markus Kaltenbach , Jens Leenstra , Tim Niggemeier , Philipp Oehler , Philipp Panitz

IPC分类号： G06F7/00 , H04L9/32

CPC分类号： G06F7/00 , H04L9/3239 , H04L2209/125

摘要： Various systems, apparatuses, processes, and/or products may be used to calculate an SHA-2 hash function in a general-purpose processor. In some implementations, a system, apparatus, process, and/or product may include the ability to calculate at least one SHA-2 sigma function by using an execution unit adapted for performing a processor instruction, the execution unit including an integrated circuit primarily designed for calculating the SHA-2 sigma function(s), and calculating the SHA-2 hash function with general-purpose hardware processing components of the processor based on the sigma function(s). In certain implementations, the calculation of the SHA-2 sigma function(s) can be performed by the integrated circuit within a single instruction, allowing for a faster calculation of the SHA-2 hash function.

摘要翻译： 可以使用各种系统，装置，处理和/或产品来计算通用处理器中的SHA-2哈希函数。在一些实现中，系统，装置，过程和/或产品可以包括通过使用适于执行处理器指令的执行单元来计算至少一个SHA-2σ功能的能力，所述执行单元包括主要设计的集成电路用于计算SHA-2西格玛函数，以及基于σ函数计算具有处理器的通用硬件处理组件的SHA-2哈希函数。在某些实现中，SHA-2西格玛函数的计算可以由单个指令中的集成电路执行，从而可以更快地计算SHA-2哈希函数。

10.

发明申请
SPLITABLE AND SCALABLE NORMALIZER FOR VECTOR DATA 有权
标题翻译：用于矢量数据的可分离和可扩展的标准化

公开(公告)号：US20150067298A1

公开(公告)日：2015-03-05

申请号：US14016607

申请日：2013-09-03

申请人： Maarten J. Boersma , Markus Kaltenbach , Christophe J. Layer , Silvia M. Mueller

发明人： Maarten J. Boersma , Markus Kaltenbach , Christophe J. Layer , Silvia M. Mueller

IPC分类号： G06F15/78

CPC分类号： G06F7/49936 , G06F7/00 , G06F7/483 , G06F7/49915 , G06F9/30014 , G06F9/30036 , G06F9/30189 , G06F9/3887 , G06F15/78 , G06F15/8053 , G06F2207/382

摘要： A hardware circuit component configured to support vector operations in a scalar data path. The hardware circuit component configured to operate in a vector mode configuration and in a scalar mode configuration. The hardware circuit component configured to split the scalar mode configuration into a left half and a right half of the vector mode configuration. The hardware circuit component configured to perform one or more bit shifts over one or more stages of interconnected multiplexers in the vector mode configuration. The hardware circuit component configured to include duplicated coarse shift multiplexers at bit positions that receive data from both the left half and the right half of the vector mode configuration, resulting in one or more coarse shift multiplexers sharing the bit position.

摘要翻译： 配置为支持标量数据路径中的向量操作的硬件电路组件。硬件电路组件被配置为以矢量模式配置和标量模式配置操作。硬件电路组件被配置为将标量模式配置分解为向量模式配置的左半部分和右半部分。硬件电路组件被配置为在矢量模式配置中的一个或多个互连多路复用器级上执行一个或多个位移位。硬件电路组件被配置为在位向量模式配置的左半部分和右半部分接收数据的比特位置包括复制的粗略移位复用器，从而产生一个或多个共享比特位置的粗移位复用器。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类