FIXED-STRIDE DRAW TABLES FOR TILED RENDERING

    公开(公告)号:US20200013137A1

    公开(公告)日:2020-01-09

    申请号:US16028151

    申请日:2018-07-05

    Abstract: Methods, systems, and devices for rendering are described. A device may divide a frame into a plurality of bins. The device may generate a command stream containing multiple repetitions of a fixed-stride draw table (FSDT), where each repetition of the FSDT includes a respective state vector for one or more hardware registers of a set of hardware registers. The device may identify, for each bin, a subset of the multiple repetitions of the FSDT in the command stream that include a live draw call. The device may execute, using the set of hardware registers, one or more rendering commands for each bin based at least in part on the corresponding subset of the multiple repetitions of the FSDT.

    Methods and apparatus for GPU context register management

    公开(公告)号:US10748239B1

    公开(公告)日:2020-08-18

    申请号:US16290761

    申请日:2019-03-01

    Abstract: The present disclosure relates to methods and apparatus of operation of a processing unit. The apparatus can update a first context register of one or more context registers based on a first programming state. In some aspects, the one or more context registers can be associated with at least one processing unit cluster in a graphics processing pipeline of the processing unit. The apparatus can execute a first draw call function corresponding to the first programming state. The apparatus can determine whether at least one additional first draw call function corresponds to the first programming state. In some aspects, the at least one additional first draw call function can follow the first draw call function in the graphics processing pipeline. Also, the apparatus can execute the at least one additional first draw call function when the at least one additional first draw call function corresponds to the first programming state.

    Deferred command execution
    4.
    发明授权

    公开(公告)号:US11372645B2

    公开(公告)日:2022-06-28

    申请号:US16900814

    申请日:2020-06-12

    Abstract: Deferred command execution by a command processor (CP) may be performed based on a determination that at least one command of a primary buffer is located between a first link of the primary buffer and a second link of the primary buffer. The first link and the second link may be to one or more secondary buffers that includes a set of commands. The CP may initiate, before executing, a fetch of a first set of commands in the set of commands based on the first link, a fetch of the at least one command of the primary buffer, and a fetch of a second set of commands in the set of commands based on the second link. After initiating the fetch of the second set of commands, the CP may execute the first set of commands, the at least one command of the primary buffer, and the second set of commands.

    SELECTIVE PREFETCH ENGINE FOR RESOURCE EFFICIENT IMAGE RENDERING

    公开(公告)号:US20240428364A1

    公开(公告)日:2024-12-26

    申请号:US18341233

    申请日:2023-06-26

    Abstract: Disclosed is a method for rendering a frame. The method is executed by a graphics processor and comprises dividing the frame into a plurality of bins. For each bin of the plurality of bins, the method comprises fetching a subset of entries corresponding to visible draw calls of a fixed stride draw table (FSDT). The FSDT comprises a plurality of entries corresponding to visible and invisible draw calls for the bin, wherein a visible draw call comprises instructions for drawing one or more pixels which are visible within the bin and wherein an invisible draw call only comprises instructions for drawing pixels which are invisible within the bin. The disclosed method further comprises executing the fetched subset of entries corresponding to visible draw calls.

    Performance overhead optimization in GPU scoping

    公开(公告)号:US12002142B2

    公开(公告)日:2024-06-04

    申请号:US17373718

    申请日:2021-07-12

    CPC classification number: G06T15/005 G06F11/3485 G06T1/20

    Abstract: The present disclosure relates to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may process a first workload of a plurality of workloads at each of multiple clusters in a GPU pipeline. The apparatus may also increment a plurality of performance counters during the processing of the first workload at each of the multiple clusters. Further, the apparatus may determine, at each of the multiple clusters, whether the first workload is finished processing. The apparatus may also read, upon determining that the first workload is finished processing, a value of each of the multiple clusters for each of the plurality of performance counters. Additionally, the apparatus may transmit an indication of the read value of each of the multiple clusters for all of the plurality of performance counters.

Patent Agency Ranking