Patent search ap:("QUALCOMM Incorporated") AND inv:"Nigel Poole" Page 1

1.

发明公开
SLICED GRAPHICS PROCESSING UNIT (GPU) ARCHITECTURE IN PROCESSOR-BASED DEVICES 审中-公开

公开(公告)号：US20240078735A1

公开(公告)日：2024-03-07

申请号：US18067837

申请日：2022-12-19

Applicant: QUALCOMM Incorporated

Inventor： Jian Liang , Andrew Evan Gruber , Tao Wang , Xuefeng Tang , Vishwanath Shashikant Nikam , Nigel Poole , Kalyan Kumar Bhiravabhatla , Fei Xu , Zilin Ying

IPC: G06T15/00

CPC classification number: G06T15/005

Abstract: A sliced graphics processing unit (GPU) architecture in processor-based devices is disclosed. In some aspects, a GPU based on a sliced GPU architecture includes multiple hardware slices. The GPU further includes a command processor (CP) circuit and an unslice primitive controller (PC_US). Upon receiving a graphics instruction from a central processing unit (CPU), the CP circuit determines a graphics workload, and transmits the graphics workload to the PC_US. The PC_US then partitions the graphics workload into multiple subbatches and distributes each subbatch to a PC_S of a hardware slice for processing.

2.

发明申请
FIXED-STRIDE DRAW TABLES FOR TILED RENDERING 审中-公开

公开(公告)号：US20200013137A1

公开(公告)日：2020-01-09

申请号：US16028151

申请日：2018-07-05

Applicant: QUALCOMM Incorporated

Inventor： Richard Hammerstone , Nigel Poole , Thomas Edwin Frisinger , Andrew Evan Gruber , Anisha Datla

IPC: G06T1/20 , G06T15/00 , G06T11/20

Abstract: Methods, systems, and devices for rendering are described. A device may divide a frame into a plurality of bins. The device may generate a command stream containing multiple repetitions of a fixed-stride draw table (FSDT), where each repetition of the FSDT includes a respective state vector for one or more hardware registers of a set of hardware registers. The device may identify, for each bin, a subset of the multiple repetitions of the FSDT in the command stream that include a live draw call. The device may execute, using the set of hardware registers, one or more rendering commands for each bin based at least in part on the corresponding subset of the multiple repetitions of the FSDT.

3.

发明授权
Methods and apparatus for GPU context register management 有权

公开(公告)号：US10748239B1

公开(公告)日：2020-08-18

申请号：US16290761

申请日：2019-03-01

Applicant: QUALCOMM Incorporated

Inventor： Nigel Poole , Xuefeng Tang , Jian Liang

IPC: G06T1/20 , G06T1/60

Abstract: The present disclosure relates to methods and apparatus of operation of a processing unit. The apparatus can update a first context register of one or more context registers based on a first programming state. In some aspects, the one or more context registers can be associated with at least one processing unit cluster in a graphics processing pipeline of the processing unit. The apparatus can execute a first draw call function corresponding to the first programming state. The apparatus can determine whether at least one additional first draw call function corresponds to the first programming state. In some aspects, the at least one additional first draw call function can follow the first draw call function in the graphics processing pipeline. Also, the apparatus can execute the at least one additional first draw call function when the at least one additional first draw call function corresponds to the first programming state.

4.

发明授权
Deferred command execution 有权

公开(公告)号：US11372645B2

公开(公告)日：2022-06-28

申请号：US16900814

申请日：2020-06-12

Applicant: QUALCOMM Incorporated

Inventor： Nigel Poole , Joohi Mittal

IPC: G06F9/38 , G06F13/28

Abstract: Deferred command execution by a command processor (CP) may be performed based on a determination that at least one command of a primary buffer is located between a first link of the primary buffer and a second link of the primary buffer. The first link and the second link may be to one or more secondary buffers that includes a set of commands. The CP may initiate, before executing, a fetch of a first set of commands in the set of commands based on the first link, a fetch of the at least one command of the primary buffer, and a fetch of a second set of commands in the set of commands based on the second link. After initiating the fetch of the second set of commands, the CP may execute the first set of commands, the at least one command of the primary buffer, and the second set of commands.

5.

发明申请
SELECTIVE PREFETCH ENGINE FOR RESOURCE EFFICIENT IMAGE RENDERING 有权

公开(公告)号：US20240428364A1

公开(公告)日：2024-12-26

申请号：US18341233

申请日：2023-06-26

Applicant: QUALCOMM Incorporated

Inventor： Nigel Poole , Edwin Jose Africano

IPC: G06T1/20 , G06F12/02 , G06T1/60

Abstract: Disclosed is a method for rendering a frame. The method is executed by a graphics processor and comprises dividing the frame into a plurality of bins. For each bin of the plurality of bins, the method comprises fetching a subset of entries corresponding to visible draw calls of a fixed stride draw table (FSDT). The FSDT comprises a plurality of entries corresponding to visible and invisible draw calls for the bin, wherein a visible draw call comprises instructions for drawing one or more pixels which are visible within the bin and wherein an invisible draw call only comprises instructions for drawing pixels which are invisible within the bin. The disclosed method further comprises executing the fetched subset of entries corresponding to visible draw calls.

6.

发明授权
Performance overhead optimization in GPU scoping 有权

公开(公告)号：US12002142B2

公开(公告)日：2024-06-04

申请号：US17373718

申请日：2021-07-12

Applicant: QUALCOMM Incorporated

Inventor： Tushar Garg , Thomas Edwin Frisinger , Nigel Poole , Vishwanath Shashikant Nikam , Vijay Kumar Donthireddy

IPC: G06T15/00 , G06F11/34 , G06T1/20

CPC classification number: G06T15/005 , G06F11/3485 , G06T1/20

Abstract: The present disclosure relates to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may process a first workload of a plurality of workloads at each of multiple clusters in a GPU pipeline. The apparatus may also increment a plurality of performance counters during the processing of the first workload at each of the multiple clusters. Further, the apparatus may determine, at each of the multiple clusters, whether the first workload is finished processing. The apparatus may also read, upon determining that the first workload is finished processing, a value of each of the multiple clusters for each of the plurality of performance counters. Additionally, the apparatus may transmit an indication of the read value of each of the multiple clusters for all of the plurality of performance counters.

Patent Agency Ranking