-
公开(公告)号:US20190018798A1
公开(公告)日:2019-01-17
申请号:US16134933
申请日:2018-09-18
Applicant: QUALCOMM Incorporated
Inventor: Rami Mohammad AL SHEIKH , Shivam PRIYADARSHI , Harold Wade CAIN III
IPC: G06F12/123 , G06F12/128 , G06F12/127
Abstract: Systems and methods relate to cost-aware cache management policies. In a cost-aware least recently used (LRU) replacement policy, temporal locality as well as miss cost is taken into account in selecting a cache line for replacement, wherein the miss cost is based on an associated operation type including instruction cache read, data cache read, data cache write, prefetch, and write back. In a cost-aware dynamic re-reference interval prediction (DRRIP) based cache management policy, miss costs associated with operation types pertaining to a cache line are considered for assigning re-reference interval prediction values (RRPV) for inserting the cache line, pursuant to a cache miss and for updating the RRPV upon a hit for the cache line. The operation types comprise instruction cache access, data cache access, prefetch, and write back. These policies improve victim selection, while minimizing cache thrashing and scans.
-
公开(公告)号:US20170293561A1
公开(公告)日:2017-10-12
申请号:US15270331
申请日:2016-09-20
Applicant: QUALCOMM Incorporated
IPC: G06F12/0862 , G06F13/16 , G06F12/0891 , G06F12/0811 , G06F12/084
CPC classification number: G06F12/0862 , G06F12/0811 , G06F12/084 , G06F12/0886 , G06F12/0891 , G06F12/0897 , G06F13/1668 , G06F2212/1021 , G06F2212/602 , G06F2212/6024 , G06F2212/62
Abstract: Systems and methods for managing memory access bandwidth include a spatial locality predictor. The spatial locality predictor includes a memory region table with prediction counters associated with memory regions of a memory. When cache lines are evicted from a cache, the sizes of the cache lines which were accessed by a processor are used for updating the prediction counters. Depending on values of the prediction counters, the sizes of cache lines which are likely to be used the processor predicted for the corresponding memory regions. Correspondingly, the memory access bandwidth between the processor and the memory may be reduced to fetch a smaller size data than a full cache line if the size of the cache line likely to be used is predicted to be less than that of the full cache line.
-
3.
公开(公告)号:US20180074568A1
公开(公告)日:2018-03-15
申请号:US15814361
申请日:2017-11-15
Applicant: QUALCOMM Incorporated
Inventor: Shivam PRIYADARSHI , Anil KRISHNA , Raguram DAMODARAN , Jeffrey Todd BRIDGES , Ryan WELLS , Norman GARGASH , Rodney Wayne SMITH
IPC: G06F1/32
CPC classification number: G06F1/3228 , G06F1/3206 , G06F1/324 , G06F1/3296 , Y02D10/126 , Y02D10/172
Abstract: The disclosure generally relates to dynamic clock and voltage scaling (DCVS) based on program phase. For example, during each program phase, a first hardware counter may count each cycle where a dispatch stall occurs and an oldest instruction in a load queue is a last-level cache miss, a second hardware counter may count total cycles, and a third hardware counter may count committed instructions. Accordingly, a software/firmware mechanism may read the various hardware counters once the committed instruction counter reaches a threshold value and divide a value of the first hardware counter by a value of the second hardware counter to measure a stall fraction during a current program execution phase. The measured stall fraction can then be used to predict a stall fraction in a next program execution phase such that optimal voltage and frequency settings can be applied in the next phase based on the predicted stall fraction.
-
公开(公告)号:US20170293571A1
公开(公告)日:2017-10-12
申请号:US15271167
申请日:2016-09-20
Applicant: QUALCOMM Incorporated
IPC: G06F12/123 , G06F12/127
CPC classification number: G06F12/124 , G06F12/123 , G06F12/127 , G06F12/128
Abstract: Systems and methods relate to cost-aware cache management policies. In a cost-aware least recently used (LRU) replacement policy, temporal locality as well as miss cost is taken into account in selecting a cache line for replacement, wherein the miss cost is based on an associated operation type including instruction cache read, data cache read, data cache write, prefetch, and write back. In a cost-aware dynamic re-reference interval prediction (DRRIP) based cache management policy, miss costs associated with operation types pertaining to a cache line are considered for assigning re-reference interval prediction values (RRPV) for inserting the cache line, pursuant to a cache miss and for updating the RRPV upon a hit for the cache line. The operation types comprise instruction cache access, data cache access, prefetch, and write back. These policies improve victim selection, while minimizing cache thrashing and scans.
-
公开(公告)号:US20170060593A1
公开(公告)日:2017-03-02
申请号:US14843921
申请日:2015-09-02
Applicant: QUALCOMM Incorporated
Inventor: Anil KRISHNA , Rodney Wayne SMITH , Sandeep Suresh NAVADA , Shivam PRIYADARSHI , Niket Kumar CHOUDHARY , Raguram DAMODARAN
CPC classification number: G06F9/30105 , G06F9/30138 , G06F9/384 , G06F9/3867
Abstract: Systems and methods relate to a hierarchical register file system including a level 1 physical register file (L1 PRF) and a backing physical register file (PRF). A subset of productions of instructions executed in an instruction pipeline of a processor which have a high likelihood of use for one or more future instructions are identified. The subset of productions are stored in the L1 PRF, while all productions are stored in the backing PRF.
Abstract translation: 系统和方法涉及包括1级物理寄存器文件(L1 PRF)和后置物理寄存器文件(PRF)的分级寄存器文件系统。 识别在处理器的指令流水线中执行的指令的生成的子集,其具有用于一个或多个未来指令的高似然性。 生产的子集存储在L1 PRF中,而所有生产都存储在后备PRF中。
-
公开(公告)号:US20190013062A1
公开(公告)日:2019-01-10
申请号:US15644737
申请日:2017-07-07
Applicant: QUALCOMM Incorporated
Inventor: Francois Ibrahim ATALLAH , Gregory Michael WRIGHT , Shivam PRIYADARSHI , Garrett Michael DRAPALA , Harold Wade CAIN, III , Erik HEDBERG
IPC: G11C11/406 , G06F12/128 , G06F12/122 , G06F12/0871
Abstract: Systems and methods for selective refresh of a cache, such as a last-level cache implemented as an embedded DRAM (eDRAM). A refresh bit and a reuse bit are associated with each way of at least one set of the cache. A least recently used (LRU) stack tracks positions of the ways, with positions towards a most recently used position of a threshold comprising more recently used positions and positions towards a least recently used position of the threshold comprise less recently used positions. A line in a way is selectively refreshed if the position of the way is one of the more recently used positions and if the refresh bit associated with the way is set, or the position of the way is one of the less recently used positions and if the refresh bit and the reuse bit associated with the way are both set.
-
7.
公开(公告)号:US20190332385A1
公开(公告)日:2019-10-31
申请号:US15963126
申请日:2018-04-26
Applicant: QUALCOMM Incorporated
Inventor: Rodney Wayne SMITH , Raghavan MADHAVAN , Luke YEN , Shivam PRIYADARSHI , Yusuf Cagatay TEKMEN
IPC: G06F9/38
Abstract: In certain aspects of the disclosure, an apparatus comprises a first scheduling pool associated with a first minimum scheduling latency and a second scheduling pool associated with a second minimum scheduling latency, the second minimum scheduling latency greater than the first minimum scheduling latency. A common instruction picker is coupled to both the first scheduling pool and the second scheduling pool. The common instruction picker may be configured to select a first instruction from the first scheduling pool and a second instruction from the second scheduling pool, and then choose either the first instruction or second instruction for dispatch according to a picking policy.
-
公开(公告)号:US20190065384A1
公开(公告)日:2019-02-28
申请号:US15683350
申请日:2017-08-22
Applicant: QUALCOMM Incorporated
Inventor: Rami Mohammad AL SHEIKH , Shivam PRIYADARSHI , Brandon DWIEL , David John PALFRAMAN , Derek HOWER
IPC: G06F12/0875 , G06F12/084 , G06F12/0811
Abstract: A request to access data at a first physical address misses in a private cache of a processor. A confidence value is received for the first physical address based on a hash value of the first physical address. A determination is made that the received confidence value exceeds a threshold value. In response, a speculative read request specifying the first physical address is issued to a memory controller of a main memory to expedite a miss for the data at the first physical address in a shared cache.
-
公开(公告)号:US20190065375A1
公开(公告)日:2019-02-28
申请号:US15683391
申请日:2017-08-22
Applicant: QUALCOMM Incorporated
Inventor: Rami Mohammad AL SHEIKH , Shivam PRIYADARSHI , Brandon DWIEL , David John PALFRAMAN , Derek HOWER , Muntaquim Faruk CHOWDHURY
IPC: G06F12/0862 , G06F12/0875 , G06F12/109 , G06F12/1045
Abstract: A first load instruction specifying a first virtual address misses in a data cache. A delta value is received based on a program counter value of the first load instruction. A second virtual address is computed based on the delta value and the first virtual address. Data associated with the second virtual address is then prefetched from a main memory to the data cache prior to a second load instruction specifying the second virtual address missing in the data cache.
-
公开(公告)号:US20170293565A1
公开(公告)日:2017-10-12
申请号:US15273270
申请日:2016-09-22
Applicant: QUALCOMM Incorporated
Inventor: Shivam PRIYADARSHI , Brandon Harley Anthony DWIEL , Rami Mohammad A. AL SHEIKH , Harold Wade CAIN III
IPC: G06F12/0888 , G06F12/128 , G06F12/0811
CPC classification number: G06F12/0888 , G06F12/0811 , G06F12/084 , G06F12/12 , G06F12/126 , G06F12/128 , G06F2212/1021 , G06F2212/1028 , G06F2212/283 , G06F2212/502 , G06F2212/6046 , Y02D10/13
Abstract: Systems and methods are directed to selectively bypassing allocation of cache lines in a cache. A bypass predictor table is provided with reuse counters to track reuse characteristics of cache lines, based on memory regions to which the cache lines belong in memory. A contender reuse counter provides an indication of a likelihood of reuse of a contender cache line in the cache pursuant to a miss in the cache for the contender cache line, and a victim reuse counter provides an indication of a likelihood of reuse for a victim cache line that will be evicted if the contender cache line is allocated in the cache. A decision whether to allocate the contender cache line in the cache or bypass allocation of the contender cache line in the cache is based on the contender reuse counter value and the victim reuse counter value.
-
-
-
-
-
-
-
-
-