-
公开(公告)号:US12197374B2
公开(公告)日:2025-01-14
申请号:US17359321
申请日:2021-06-25
Applicant: Intel Corporation
Inventor: Rahul Pal , Nayan Amrutlal Suthar , David M. Puffer , Ashok Jagannathan
IPC: G06F13/42 , G06F12/0815 , G06T1/20
Abstract: A processor unit comprising a first controller to couple to a host processing unit over a first link; a second controller to couple to a second processor unit over a second link, wherein the second processor unit is to couple to the host central processing unit via a third link; and circuitry to determine whether to send a cache coherent request to the host central processing unit over the first link or over the second link via the second processing unit.
-
公开(公告)号:US20230114164A1
公开(公告)日:2023-04-13
申请号:US17551681
申请日:2021-12-15
Applicant: Intel Corporation
Inventor: Rahul Pal , Aravindh Anantaraman , Lakshminarayana Pappu , Dongsheng Bi , Guadalupe J. Garcia , Altug Koker , Joydeep Ray , Rahul Joshi , Shrikul Atulkumar Joshi , Mahak Gupta
IPC: G06F12/0871 , G06F12/0891 , G06F13/16 , G06F13/28 , G06F15/78
Abstract: In a further embodiment, a system on a chip integrated circuit (SoC) is provided that includes an active base die including a first cache memory, a first die mounted on and coupled with the active base die, and a second die mounted on the active base die and coupled with the active base die and the first die. The first die includes an interconnect fabric, an input/output interface, and an atomic operation handler. The second die includes an array of graphics processing elements and an interface to the first cache memory of the active base die. At least one of the graphics processing elements are configured to perform, via the atomic operation handler, an atomic operation to a memory device.
-
公开(公告)号:US20190114243A1
公开(公告)日:2019-04-18
申请号:US16218078
申请日:2018-12-12
Applicant: Intel Corporation
Inventor: Umberto Santoni , Rahul Pal , Philip Abraham , Mahesh Mamidipaka , C. Santhosh
IPC: G06F11/273 , G06F11/16
CPC classification number: G06F11/273 , G06F11/1004 , G06F11/1616 , G06F11/1629 , G06F11/1633 , G06F11/1641 , G06F11/1666
Abstract: A method to check for redundancy in two or more data lines comprises receiving data on a first data line, computing a first cyclic redundancy check (CRC) value on the data of the first data line, performing an exclusive OR (XOR) function on the first CRC value with a stored memory value, and updating the stored memory value with a result of the XOR function, and repeating on additional data lines until a last line is processed such that an error is indicated if a final stored memory value is not zero. An apparatus to check that two cores are operating in lockstep comprises a first core comprising a first data checker, a second core comprising a second data checker, and a lockstep checker to compare an output of the first data checker with an output of the second data checker.
-
4.
公开(公告)号:US20180349144A1
公开(公告)日:2018-12-06
申请号:US15614757
申请日:2017-06-06
Applicant: Intel Corporation
Inventor: Rahul Pal , Ragavendra Natarajan , Niranjan K. Soundararajan , Sreenivas Subramoney , Daniel Deng , Jared Warner Stark, IV , Hong Wang , Ronak Singhal
IPC: G06F9/38
Abstract: In one embodiment, a processor comprises a branch predictor to generate, in association with a program loop, a frozen history vector comprising a snapshot of a branch history vector; track a current iteration of the program loop; and provide a prediction for a branch instruction associated with the program loop, the prediction based on the frozen history vector and the current iteration of the program loop.
-
公开(公告)号:US20180285115A1
公开(公告)日:2018-10-04
申请号:US15477064
申请日:2017-04-01
Applicant: Intel Corporation
Inventor: Niranjan K. Soundararajan , Saurabh Gupta , Sreenivas Subramoney , Rahul Pal , Ragavendra Natarajan , Daniel Deng , Jared W. Stark , Ronak Singhal , Hong Wang
CPC classification number: G06F9/46 , G06F9/3848
Abstract: Embodiments of apparatuses, methods, and systems for misprediction-triggered local history-based branch prediction are described. In one embodiments, an apparatus includes a current pattern table and a local pattern table. The current pattern table has a plurality of entries, each entry in which to store a plurality of pattern lengths of a current pattern of one of a plurality of branch instructions. The local pattern table is to provide a first branch prediction based on the current pattern.
-
公开(公告)号:US20210318980A1
公开(公告)日:2021-10-14
申请号:US17359321
申请日:2021-06-25
Applicant: Intel Corporation
Inventor: Rahul Pal , Nayan Amrutlal Suthar , David M. Puffer , Ashok Jagannathan
IPC: G06F13/42 , G06F12/0815 , G06T1/20
Abstract: A processor unit comprising a first controller to couple to a host processing unit over a first link; a second controller to couple to a second processor unit over a second link, wherein the second processor unit is to couple to the host central processing unit via a third link; and circuitry to determine whether to send a cache coherent request to the host central processing unit over the first link or over the second link via the second processing unit.
-
公开(公告)号:US20190220284A1
公开(公告)日:2019-07-18
申请号:US15870595
申请日:2018-01-12
Applicant: Intel Corporation
Inventor: Saurabh Gupta , Rahul Pal , Niranjan Soundararajan , Ragavendra Natarajan , Sreenivas Subramoney
IPC: G06F9/38
CPC classification number: G06F9/3844 , G06F9/3806 , G06F9/3859 , G06F9/3861
Abstract: One embodiment provides an apparatus. The apparatus includes a store direct dependent (SDD) branch prediction circuitry and an SDD management circuitry. The store direct dependent (SDD) branch prediction circuitry is to store an SDD branch table. The SDD branch table is to store at least one record. Each record includes a branch instruction pointer (IP) field, a load IP field, a store IP field, a comparison info field and at least one of a store value field and/or a predicted outcome field. The SDD management circuitry is to populate the SDD branch table at runtime and to override a baseline branch prediction associated with an incoming branch IP with an SDD branch prediction, if the SDD branch table contains a first record populated with the incoming branch IP and at least one of a store value and/or an SDD predicted outcome.
-
公开(公告)号:US20170269959A1
公开(公告)日:2017-09-21
申请号:US15070146
申请日:2016-03-15
Applicant: Intel Corporation
Inventor: Ishwar Agarwal , Eric R. Wehage , David M. Lee , Swadesh Choudhary , Rahul Pal
CPC classification number: G06F13/4068 , G06F13/16
Abstract: In one embodiment, an apparatus comprises: an encoder to receive a non-posted transaction from a requester and encode information of the non-posted transaction into an encoded transaction identifier having a predetermined root bus identifier reserved for non-posted transactions; and a first transmitter to send the non-posted transaction including the encoded transaction identifier to a fabric, to enable the non-posted transaction to be routed to a destination. Other embodiments are described and claimed.
-
公开(公告)号:US09690706B2
公开(公告)日:2017-06-27
申请号:US14668831
申请日:2015-03-25
Applicant: Intel Corporation
Inventor: Rahul Pal , Ishwar Agarwal , Manoj K. Arora
IPC: G06F12/08 , G06F12/084
CPC classification number: G06F12/084 , G06F2212/2542 , G06F2212/271
Abstract: Resolving coherency issues inherent in sharing distributed cache is described. A chip multiprocessor may include at least first and second processing clusters, each having multiple cores of a processor, multiple cache slices co-located with the multiple cores, and a memory controller (MC). The processor stores directory information in a memory coupled to the processor to indicate cluster cache ownership of a first address space to the first cluster. In response to a request to change the cluster cache ownership of the first address space, the processor may remap first lines of first cache slices, corresponding to the first address space, to second lines in second cache slices of the second cluster, and update the directory information (e.g., a state of the first cache lines) to change the cluster cache ownership of the first address space to the second cluster. One of the MCs may manage such updating of the directory.
-
公开(公告)号:US09552308B2
公开(公告)日:2017-01-24
申请号:US14126919
申请日:2013-09-30
Applicant: Intel Corporation
Inventor: Suresh Sugumar , Mahesh K. Kumashikar , Rahul Pal , Sridhar Muthrasanallur
CPC classification number: G06F13/00 , G06F1/3206 , G06F9/46 , G06F13/4273 , Y02D10/14 , Y02D10/151
Abstract: A request associated with a particular cache record is generated to be sent to a system component associated with a cache bank over an interconnect. A wake-warn signal is sent over a dedicated wake-warn channel to indicate to the system component that the request is to arrive. Wake-warn signals cause a disabled clock to be ungated to an enabled state. The request is then sent to the system component.
Abstract translation: 生成与特定高速缓存记录相关联的请求,以通过互连发送到与高速缓存组相关联的系统组件。 唤醒警告信号通过专用的唤醒通道发送,以向系统组件指示请求到达。 唤醒信号导致禁用的时钟被禁止到启用状态。 然后将请求发送到系统组件。
-
-
-
-
-
-
-
-
-