-
Publication No.: US09886384B2
Publication date: 2018-02-06
Application No.: US14931019
Filing date: 2015-11-03
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jun-Kyoung Kim , Dong-Hoon Yoo , Jeong-Wook Kim , Soo-Jung Ryu
IPC: G06F12/00 , G06F12/0862 , G06F12/0875 , G06F9/38 , G06F9/45
CPC classification number: G06F12/0862 , G06F8/4442 , G06F9/383 , G06F12/0875 , G06F2212/452 , G06F2212/6026 , G06F2212/6028
Abstract: The present examples relate to prefetching, and to a cache control device for prefetching and a prefetching method using the cache control device, wherein the cache control device analyzes a memory access pattern of program code, inserts, into the program code, a prefetching command generated by encoding the analyzed access pattern, and executes the prefetching command inserted into the program code in order to prefetch data into a cache, thereby maximizing prefetching efficiency.
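The analyze, encode, insert, and execute flow described in this abstract can be sketched as a toy simulation. This is only an illustration under stated assumptions: the abstract does not give the actual command encoding, so the `PrefetchCommand` fields, the constant-stride detection, the tuple-based "program", and the unbounded set-based cache are all hypothetical stand-ins rather than the patented design.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PrefetchCommand:
    """Hypothetical encoding of a strided access pattern (not from the patent)."""
    base: int
    stride: int
    count: int


def analyze_pattern(addresses):
    """Return a PrefetchCommand if the trace has one constant stride, else None."""
    if len(addresses) < 2:
        return None
    stride = addresses[1] - addresses[0]
    if stride == 0:
        return None
    for prev, cur in zip(addresses, addresses[1:]):
        if cur - prev != stride:
            return None
    return PrefetchCommand(addresses[0], stride, len(addresses))


def insert_prefetch(program, command):
    """Return a copy of the program with the command placed before its loads."""
    return [("prefetch", command)] + list(program)


def run(program):
    """Execute the program against a toy unbounded cache; return (hits, misses)."""
    cache = set()
    hits = misses = 0
    for op, arg in program:
        if op == "prefetch":
            # Executing the command brings every encoded address into the cache.
            for i in range(arg.count):
                cache.add(arg.base + i * arg.stride)
        elif op == "load":
            if arg in cache:
                hits += 1
            else:
                misses += 1
                cache.add(arg)
    return hits, misses


# Toy program: eight loads, one per assumed 64-byte cache line.
loads = [("load", 0x1000 + 64 * i) for i in range(8)]
command = analyze_pattern([addr for _, addr in loads])
baseline = run(loads)
prefetched = run(insert_prefetch(loads, command))
```

In this model every load misses without the inserted command and hits with it. A real cache has limited capacity and latency, so prefetch timing and distance would matter in ways this sketch ignores.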
-
Publication No.: US09367291B2
Publication date: 2016-06-14
Application No.: US14228868
Filing date: 2014-03-28
Inventor: Jin-Seok Lee , Seong-Gun Kim , Dong-Hoon Yoo , Seok-Joong Hwang , Jeongho Nah , Jaejin Lee , Jun Lee
IPC: G06F9/45
CPC classification number: G06F8/40
Abstract: An apparatus and method for generating vector code are provided. The apparatus and method generate vector code using scalar-type kernel code, without user's changing a code type or modifying data layout, thereby enhancing user's convenience of use and retaining the portability of OpenCL.
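The abstract does not say how the vector code is produced, so the following is only a loose illustration of the idea that a scalar kernel written for one work-item can be executed several work-items at a time without changing the data layout. The kernel body, the vector width `WIDTH`, and all function names are assumptions, and the "vector" version is written by hand here rather than generated.

```python
WIDTH = 4  # assumed vector width (work-items handled per vector operation)


def scalar_kernel(gid, a, x, y, out):
    """Scalar-type kernel: one work-item computes one element."""
    out[gid] = a * x[gid] + y[gid]


def vector_kernel(group, a, x, y, out):
    """Stand-in for generated vector code: one call covers up to WIDTH work-items."""
    lo = group * WIDTH
    hi = min(lo + WIDTH, len(out))
    # The slice operations model one vector load, multiply-add, and store.
    out[lo:hi] = [a * xv + yv for xv, yv in zip(x[lo:hi], y[lo:hi])]


def run_scalar(a, x, y):
    """Launch the scalar kernel once per work-item."""
    out = [0.0] * len(x)
    for gid in range(len(x)):
        scalar_kernel(gid, a, x, y, out)
    return out


def run_vector(a, x, y):
    """Launch the vector kernel once per group of WIDTH work-items."""
    out = [0.0] * len(x)
    groups = (len(x) + WIDTH - 1) // WIDTH
    for group in range(groups):
        vector_kernel(group, a, x, y, out)
    return out


x = [float(i) for i in range(10)]
y = [1.0] * 10
```

Both launchers read the same plain arrays `x` and `y`, which mirrors the abstract's point that the data layout stays as the user wrote it.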
-
Publication No.: US09280330B2
Publication date: 2016-03-08
Application No.: US14230462
Filing date: 2014-03-31
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jin-Seok Lee , Seong-Gun Kim , Dong-Hoon Yoo , Seok-Joong Hwang
IPC: G06F9/45
Abstract: An apparatus and method for executing code are provided. The apparatus includes a memory manager that allocates a stack in memory to store processed data that needs to be retained; a loop generator that divides program code programmed to be processed in parallel into regions based on a barrier function, transforms a region that includes the processed data that needs to be retained in the stack into a first coalescing loop, and transforms a region that uses the processed data stored in the stack into a second coalescing loop such that the transformed program code may be serially processed; and a loop changer that reverses a processing order of the second coalescing loop in comparison to a processing order of the first coalescing loop.
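The barrier-based transformation in this abstract can be sketched as follows. This is a minimal model, assuming a kernel with a single barrier and one value per work-item that must survive across it; the kernel body (squaring, then adding one) and the function name are invented for illustration. It shows why the second loop runs in reverse: values pushed onto a stack in order 0..n-1 come back off in order n-1..0.

```python
def run_serialized(inputs):
    """Serially execute a two-region kernel for len(inputs) work-items."""
    n = len(inputs)
    stack = []            # stack allocated for data that must be retained
    outputs = [None] * n

    # First coalescing loop: the region before the barrier, work-items 0..n-1.
    for wid in range(n):
        private = inputs[wid] * inputs[wid]   # value live across the barrier
        stack.append(private)

    # The barrier sits here: every work-item has finished the first region.

    # Second coalescing loop: reversed order n-1..0, so each pop returns
    # the value that the same work-item pushed.
    for wid in reversed(range(n)):
        private = stack.pop()
        outputs[wid] = private + 1

    return outputs
```

For example, `run_serialized([1, 2, 3, 4])` returns `[2, 5, 10, 17]`. Running the second loop in forward order would pair each work-item with another work-item's saved value.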
-