-
公开(公告)号:US11194583B2
公开(公告)日:2021-12-07
申请号:US16658688
申请日:2019-10-21
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Krishnan V. Ramani
Abstract: Speculative execution using a page-level tracked load order queue includes: determining that a first load instruction targets a determined memory region; and in response to the first load instruction targeting the determined memory region, adding an entry to a page-level tracked load order queue instead of a load order queue, where the entry indicates a page address of a target of the first load instruction.
-
2.
公开(公告)号:US20200319889A1
公开(公告)日:2020-10-08
申请号:US16671097
申请日:2019-10-31
Applicant: Advanced Micro Devices, Inc.
Inventor: John Kalamatianos , Susumu Mashimo , Krishnan V. Ramani , Scott Thomas Bingham
Abstract: A technique for speculatively executing load-dependent instructions includes detecting that a memory ordering consistency queue is full for a completed load instruction. The technique also includes storing data loaded by the completed load instruction into a storage location for storing data when the memory ordering consistency queue is full. The technique further includes speculatively executing instructions that are dependent on the completed load instruction. The technique also includes in response to a slot becoming available in the memory ordering consistency queue, replaying the load instruction. The technique further includes in response to receiving loaded data for the replayed load instruction, testing for a data mis-speculation by comparing the loaded data for the replayed load instruction with the data loaded by the completed load instruction that is stored in the storage location.
-
公开(公告)号:US10949201B2
公开(公告)日:2021-03-16
申请号:US16286702
申请日:2019-02-27
Applicant: Advanced Micro Devices, Inc.
Inventor: Scott Thomas Bingham , Marius Evers , Krishnan V. Ramani , Thomas Kunjan
IPC: G06F9/30 , G06F12/0875 , G06F12/0815
Abstract: A processor and method for handling lock instructions identifies which of a plurality of older store instructions relative to a current lock instruction are able to be locked. The method and processor lock the identified older store instructions as an atomic group with the current lock instruction. The method and processor negatively acknowledge probes until all of the older store instructions in the atomic group have written to cache memory. In some implementations, an atomic grouping unit issues an indication to lock identified older store instructions that are retired and lockable, and in some implementations, also issues an indication to lock older stores that are determined to be lockable that are non-retired.
-
公开(公告)号:US20190310845A1
公开(公告)日:2019-10-10
申请号:US16450897
申请日:2019-06-24
Applicant: Advanced Micro Devices, Inc.
Inventor: Krishnan V. Ramani , Kai Troester , Frank C. Galloway , David N. Suggs , Michael D. Achenbach , Betty Ann McDaniel , Marius Evers
Abstract: A system and method for tracking stores and loads to reduce load latency when forming the same memory address by bypassing a load store unit within an execution unit is disclosed. Store-load pairs which have a strong history of store-to-load forwarding are identified. Once identified, the load is memory renamed to the register stored by the store. The memory dependency predictor may also be used to detect loads that are dependent on a store but cannot be renamed. In such a configuration, the dependence is signaled to the load store unit and the load store unit uses the information to issue the load after the identified store has its physical address.
-
5.
公开(公告)号:US11099846B2
公开(公告)日:2021-08-24
申请号:US16013069
申请日:2018-06-20
Applicant: Advanced Micro Devices, Inc.
Inventor: Krishnan V. Ramani , Chetana N. Keltcher
Abstract: A method and apparatus generates control information that indicates whether to change a counter value associated with a particular load instruction. In response to the control information, the method and apparatus causes a hysteresis effect for operating between a speculative mode and a non-speculative mode based on the counter value. The hysteresis effect is in favor of the non-speculative mode. The method and apparatus causes the hysteresis effect by incrementing the counter value associated with the particular load instruction by a first value or decrementing the counter value by a second value. The first value is greater than the second value.
-
公开(公告)号:US20200272463A1
公开(公告)日:2020-08-27
申请号:US16286702
申请日:2019-02-27
Applicant: Advanced Micro Devices, Inc.
Inventor: Scott Thomas Bingham , Marius Evers , Krishnan V. Ramani , Thomas Kunjan
IPC: G06F9/30 , G06F12/0875
Abstract: A processor and method for handling lock instructions identifies which of a plurality of older store instructions relative to a current lock instruction are able to be locked. The method and processor lock the identified older store instructions as an atomic group with the current lock instruction. The method and processor negatively acknowledge probes until all of the older store instructions in the atomic group have written to cache memory. In some implementations, an atomic grouping unit issues an indication to lock identified older store instructions that are retired and lockable, and in some implementations, also issues an indication to lock older stores that are determined to be lockable that are non-retired.
-
公开(公告)号:US10331357B2
公开(公告)日:2019-06-25
申请号:US15380778
申请日:2016-12-15
Applicant: Advanced Micro Devices, Inc.
Inventor: Betty Ann McDaniel , Michael D. Achenbach , David N. Suggs , Frank C. Galloway , Kai Troester , Krishnan V. Ramani
IPC: G06F3/06 , G06F12/0871 , G06F12/0897 , G06F9/30
Abstract: A system and method for tracking stores and loads to reduce load latency when forming the same memory address by bypassing a load store unit within an execution unit is disclosed. The system and method include storing data in one or more memory dependent architectural register numbers (MdArns), allocating the one or more MdArns to a MEMFILE, writing the allocated one or more MdArns to a map file, wherein the map file contains a MdArn map to enable subsequent access to an entry in the MEMFILE, upon receipt of a load request, checking a base, an index, a displacement and a match/hit via the map file to identify an entry in the MEMFILE and an associated store, and on a hit, providing the entry responsive to the load request from the one or more MdArns.
-
公开(公告)号:US20180052613A1
公开(公告)日:2018-02-22
申请号:US15380778
申请日:2016-12-15
Applicant: Advanced Micro Devices, Inc.
Inventor: Betty Ann McDaniel , Michael D. Achenbach , David N. Suggs , Frank C. Galloway , Kai Troester , Krishnan V. Ramani
IPC: G06F3/06 , G06F12/0871 , G06F12/0897
CPC classification number: G06F3/0611 , G06F3/0631 , G06F3/0643 , G06F3/0659 , G06F3/0673 , G06F9/30 , G06F12/0871 , G06F12/0897 , G06F2212/1024 , G06F2212/304 , G06F2212/463 , G06F2212/604
Abstract: A system and method for tracking stores and loads to reduce load latency when forming the same memory address by bypassing a load store unit within an execution unit is disclosed. The system and method include storing data in one or more memory dependent architectural register numbers (MdArns), allocating the one or more MdArns to a MEMFILE, writing the allocated one or more MdArns to a map file, wherein the map file contains a MdArn map to enable subsequent access to an entry in the MEMFILE, upon receipt of a load request, checking a base, an index, a displacement and a match/hit via the map file to identify an entry in the MEMFILE and an associated store, and on a hit, providing the entry responsive to the load request from the one or more MdArns.
-
公开(公告)号:US11645073B2
公开(公告)日:2023-05-09
申请号:US17238844
申请日:2021-04-23
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: John Kalamatianos , Krishnan V. Ramani , Susumu Mashimo
CPC classification number: G06F9/30043 , G06F9/35 , G06F9/3842
Abstract: Address-based filtering for load/store speculation includes maintaining a filtering table including table entries associated with ranges of addresses; in response to receiving an ordering check triggering transaction, querying the filtering table using a target address of the ordering check triggering transaction to determine if an instruction dependent upon the ordering check triggering transaction has previously been generated a physical address; and in response to determining that the filtering table lacks an indication that the instruction dependent upon the ordering check triggering transaction has previously been generated a physical address, bypassing a lookup operation in an ordering violation memory structure to determine whether the instruction dependent upon the ordering check triggering transaction is currently in-flight.
-
10.
公开(公告)号:US11113065B2
公开(公告)日:2021-09-07
申请号:US16671097
申请日:2019-10-31
Applicant: Advanced Micro Devices, Inc.
Inventor: John Kalamatianos , Susumu Mashimo , Krishnan V. Ramani , Scott Thomas Bingham
Abstract: A technique for speculatively executing load-dependent instructions includes detecting that a memory ordering consistency queue is full for a completed load instruction. The technique also includes storing data loaded by the completed load instruction into a storage location for storing data when the memory ordering consistency queue is full. The technique further includes speculatively executing instructions that are dependent on the completed load instruction. The technique also includes in response to a slot becoming available in the memory ordering consistency queue, replaying the load instruction. The technique further includes in response to receiving loaded data for the replayed load instruction, testing for a data mis-speculation by comparing the loaded data for the replayed load instruction with the data loaded by the completed load instruction that is stored in the storage location.
-
-
-
-
-
-
-
-
-