RASTERIZATION OF COMPUTE WORKLOADS
    11.
    发明公开

    公开(公告)号:US20230394738A1

    公开(公告)日:2023-12-07

    申请号:US18035507

    申请日:2020-11-09

    CPC classification number: G06T15/005

    Abstract: The present disclosure relates to methods and apparatus for graphics processing, e.g., a GPU. The apparatus may receive an image including a plurality of pixels associated with one or more workgroups and one or more pixel tiles, each of the workgroups and the pixel tiles including one or more pixels of the plurality of pixels. The apparatus may determine whether the one or more workgroups are misaligned with the one or more pixel tiles. The apparatus may determine a conversion order of the one or more workgroups when the one or more workgroups are misaligned with the one or more pixel tiles, the conversion order corresponding to a common multiple of one of the one or more workgroups and one of the one or more pixel tiles. The apparatus may convert each of the one or more workgroups based on the conversion order of the one or more workgroups.

    GPR OPTIMIZATION IN A GPU BASED ON A GPR RELEASE MECHANISM

    公开(公告)号:US20230113415A1

    公开(公告)日:2023-04-13

    申请号:US18046901

    申请日:2022-10-14

    Abstract: This disclosure provides systems, devices, apparatus and methods, including computer programs encoded on storage media, for GPR optimization in a GPU based on a GPR release mechanism. More specifically, a GPU may determine at least one unutilized branch within an executable shader based on constants defined for the executable shader. Based on the at least one unutilized branch, the GPU may further determine a number of GPRs that can be deallocated from previously allocated GPRs. The GPU may deallocate, for a subsequent thread within a draw call, the number of GPRs from the previously allocated GPRs during execution of the executable shader based on the determined number of GPRs to be deallocated.

    METHODS AND APPARATUS FOR CONSTANT DATA STORAGE

    公开(公告)号:US20220414814A1

    公开(公告)日:2022-12-29

    申请号:US17356434

    申请日:2021-06-23

    Abstract: The present disclosure relates to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may generate a table including a plurality of entries to store data associated with at least one of a constant value or an immediate value. The apparatus may also process, upon generating the table, first data including at least one of a constant value or an immediate value. Further, the apparatus may store, in the generated table, at least one of the constant value or the immediate value of the first data. The apparatus may also transmit, upon storing at least one of the constant value or the immediate value in the table, the table including the stored at least one of the constant value or the immediate value of the first data.

    OUT OF ORDER WAVE SLOT RELEASE FOR A TERMINATED WAVE

    公开(公告)号:US20210209717A1

    公开(公告)日:2021-07-08

    申请号:US16734252

    申请日:2020-01-03

    Abstract: Methods, systems, and devices for image processing are described. A device may determine, based on a test operation, to terminate a first wave associated with a first slot of a set of slots. The device may update a terminated wave bit associated with the first slot based on the determination to terminate the first wave. In some aspects, the device may update a number of invocations field associated with the first wave based on the determination to terminate the first wave. The device may release the first slot based on updating the terminated wave bit and the number of invocations field. In some examples, the device may output the number of invocations field to a rendering backend of the device based on the terminated wave bit.

    GENERAL PURPOSE REGISTER AND WAVE SLOT ALLOCATION IN GRAPHICS PROCESSING

    公开(公告)号:US20200312006A1

    公开(公告)日:2020-10-01

    申请号:US16364829

    申请日:2019-03-26

    Abstract: Example techniques are described for generating graphics content by obtaining texture operation instructions corresponding to a texture operation, in response to determining at least one of insufficient general purpose register space is available for the texture operation or insufficient wave slots are available for the texture operation, generating an indication that the texture operation corresponds to a deferred wave, executing the texture operation, sending, to a texture processor, initial texture sample instructions corresponding to the texture operation that was executed, and receiving texture mapped data corresponding to the initial texture sample instructions.

Patent Agency Ranking