RASTERIZATION OF COMPUTE WORKLOADS
    1.
    发明公开

    公开(公告)号:US20230394738A1

    公开(公告)日:2023-12-07

    申请号:US18035507

    申请日:2020-11-09

    CPC classification number: G06T15/005

    Abstract: The present disclosure relates to methods and apparatus for graphics processing, e.g., a GPU. The apparatus may receive an image including a plurality of pixels associated with one or more workgroups and one or more pixel tiles, each of the workgroups and the pixel tiles including one or more pixels of the plurality of pixels. The apparatus may determine whether the one or more workgroups are misaligned with the one or more pixel tiles. The apparatus may determine a conversion order of the one or more workgroups when the one or more workgroups are misaligned with the one or more pixel tiles, the conversion order corresponding to a common multiple of one of the one or more workgroups and one of the one or more pixel tiles. The apparatus may convert each of the one or more workgroups based on the conversion order of the one or more workgroups.

    METADATA UPDATING
    2.
    发明公开
    METADATA UPDATING 审中-公开

    公开(公告)号:US20240296153A1

    公开(公告)日:2024-09-05

    申请号:US18177390

    申请日:2023-03-02

    CPC classification number: G06F16/23 G06F16/2282

    Abstract: Aspects of the disclosure are directed to metadata updating. In accordance with one aspect, an apparatus includes an external memory unit configured for storing an original descriptor tag; a descriptor loading block coupled to the external memory, the descriptor loading block configured to fetch the original descriptor tag from the external memory for storage in an internal cache memory and further configured to compare the original descriptor tag stored in the internal cache memory to each of a plurality of original base values; and a remap table database coupled to the descriptor loading block, the remap table database configured to store the plurality of original base values, a plurality of updated base values and a plurality of updated miscellaneous base values.

    METHODS AND APPARATUS FOR WAVE SLOT RETIREMENT PROCEDURES

    公开(公告)号:US20220357983A1

    公开(公告)日:2022-11-10

    申请号:US17315205

    申请日:2021-05-07

    Abstract: The present disclosure relates to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may receive a plurality of workloads based on a workload order, each of the plurality of workloads being received in the workload order including at least a first workload and a second workload. The apparatus may also allocate one or more workloads of the plurality of workloads to one or more wave slots. Additionally, the apparatus may execute the one or more allocated workloads at the one or more wave slots, such that at least the first workload is executed at the first wave slot and the second workload is executed at the second wave slot. The apparatus may also allocate at least one other workload of the plurality of workloads to at least one previously-allocated wave slot of the one or more wave slots.

    ELIMINATION CACHE
    6.
    发明公开
    ELIMINATION CACHE 审中-公开

    公开(公告)号:US20240289912A1

    公开(公告)日:2024-08-29

    申请号:US18175480

    申请日:2023-02-27

    CPC classification number: G06T1/20 G06T1/60

    Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for an elimination cache. A graphics processor may obtain an indication of at least one state update from at least one CP associated with a graphics processor, where the at least one state update corresponds to one or more states in a set of states associated with the graphics processor. The graphics processor may determine whether the one or more states are stored in a cache associated with the graphics processor. The graphics processor may discard the at least one state update based on a determination that the one or more states are stored in the cache or update the cache based on a determination that the one or more states are not stored in the cache.

    OUT OF ORDER WAVE SLOT RELEASE FOR A TERMINATED WAVE

    公开(公告)号:US20210209717A1

    公开(公告)日:2021-07-08

    申请号:US16734252

    申请日:2020-01-03

    Abstract: Methods, systems, and devices for image processing are described. A device may determine, based on a test operation, to terminate a first wave associated with a first slot of a set of slots. The device may update a terminated wave bit associated with the first slot based on the determination to terminate the first wave. In some aspects, the device may update a number of invocations field associated with the first wave based on the determination to terminate the first wave. The device may release the first slot based on updating the terminated wave bit and the number of invocations field. In some examples, the device may output the number of invocations field to a rendering backend of the device based on the terminated wave bit.

    GENERAL PURPOSE REGISTER AND WAVE SLOT ALLOCATION IN GRAPHICS PROCESSING

    公开(公告)号:US20200312006A1

    公开(公告)日:2020-10-01

    申请号:US16364829

    申请日:2019-03-26

    Abstract: Example techniques are described for generating graphics content by obtaining texture operation instructions corresponding to a texture operation, in response to determining at least one of insufficient general purpose register space is available for the texture operation or insufficient wave slots are available for the texture operation, generating an indication that the texture operation corresponds to a deferred wave, executing the texture operation, sending, to a texture processor, initial texture sample instructions corresponding to the texture operation that was executed, and receiving texture mapped data corresponding to the initial texture sample instructions.

    RUNTIME MECHANISM TO OPTIMIZE SHADER EXECUTION FLOW

    公开(公告)号:US20240046543A1

    公开(公告)日:2024-02-08

    申请号:US17817815

    申请日:2022-08-05

    CPC classification number: G06T15/005 G06T15/80

    Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for runtime optimization of the shader execution flow. A graphics processor may obtain instruction execution data associated with a graphics workload, the instruction execution data including graphics data for a set of shader operations. The graphics processor may configure, at a first iteration, at least one predication value based on the instruction execution data including the graphics data for the set of shader operations. The graphics processor may adjust, at a second iteration, an execution flow of the graphics workload based on the configured at least one predication value, the execution flow of the graphics workload including the set of shader operations. The graphics processor may execute or refrain from executing, at the second iteration, each of the set of shader operations based on the adjusted execution flow of the graphics workload.

    DYNAMIC WAVE PAIRING
    10.
    发明公开

    公开(公告)号:US20230267567A1

    公开(公告)日:2023-08-24

    申请号:US17652478

    申请日:2022-02-24

    CPC classification number: G06T1/20 G06F9/505

    Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for dynamic wave pairing. A graphics processor may allocate one or more GPU workloads to one or more wave slots of a plurality of wave slots. The graphics processor may select a first execution slot of a plurality of execution slots for executing the one or more GPU workloads. The selection may be based on one of a plurality of granularities. The graphics processor may execute, at the selected first execution slot, the one or more GPU workloads at the one of the plurality of granularities.

Patent Agency Ranking