RESOURCE BASED WORKLOAD ALLOCATION FOR MACHINE LEARNING WORKLOADS

    公开(公告)号:US20210103852A1

    公开(公告)日:2021-04-08

    申请号:US16591353

    申请日:2019-10-02

    Abstract: Methods, systems, and devices for workload balancing for machine learning are described. Generally, a device may determine a size of a level one cache of a texture processor, identify a portion of input activation data for an iterative machine-learning process, and load the portion of input activation data into the level one cache. The device may allocate, based at least in part on a texture processor to shading processor arithmetic logic unit (ALU) resource ratio, a first set of one or more weight batches and a second set of one or more weight batches associated with the loaded portion of input activation data to the shading processor, and process the portion of input activation data based at least in part on the first set of one or more weight batches and the second set of one or more weight batches using the texture processor and the shading processor in parallel.

    Patched shading in graphics processing

    公开(公告)号:US10559123B2

    公开(公告)日:2020-02-11

    申请号:US13829900

    申请日:2013-03-14

    Abstract: Aspects of this disclosure relate to a process for rendering graphics that includes designating a hardware shading unit of a graphics processing unit (GPU) to perform first shading operations associated with a first shader stage of a rendering pipeline. The process also includes switching operational modes of the hardware shading unit upon completion of the first shading operations. The process also includes performing, with the hardware shading unit of the GPU designated to perform the first shading operations, second shading operations associated with a second, different shader stage of the rendering pipeline.

    Vertex shaders for binning based graphics processing

    公开(公告)号:US10062139B2

    公开(公告)日:2018-08-28

    申请号:US15218808

    申请日:2016-07-25

    CPC classification number: G06T1/20 G06T15/005

    Abstract: This disclosure describes examples of using two vertex shaders each one during different graphics processing passes in a binning architecture for graphics processing. A first vertex shader processes subset of attributes of a vertex in a binning pass, where the subset of attributes include those that contribute to visibility determination and attributes that may benefit from being processed with a vertex shader that provides functional flexibility. A second, different vertex shader processes another subset of attributes of the vertex in the rendering pass.

    VERTEX SHADERS FOR BINNING BASED GRAPHICS PROCESSING

    公开(公告)号:US20180025463A1

    公开(公告)日:2018-01-25

    申请号:US15218808

    申请日:2016-07-25

    CPC classification number: G06T1/20 G06T15/005 G06T15/80

    Abstract: This disclosure describes examples of using two vertex shaders each one during different graphics processing passes in a binning architecture for graphics processing. A first vertex shader processes subset of attributes of a vertex in a binning pass, where the subset of attributes include those that contribute to visibility determination and attributes that may benefit from being processed with a vertex shader that provides functional flexibility. A second, different vertex shader processes another subset of attributes of the vertex in the rendering pass.

    Utilizing pipeline registers as intermediate storage

    公开(公告)号:US09747104B2

    公开(公告)日:2017-08-29

    申请号:US14275047

    申请日:2014-05-12

    Abstract: In one example, a method includes responsive to receiving, by a processing unit, one or more instructions requesting that a first value be moved from a first general purpose register (GPR) to a third GPR and that a second value be moved from a second GPR to a fourth GPR, copying, by an initial logic unit and during a first clock cycle, the first value to an initial pipeline register, copying, by the initial logic and during a second clock cycle, the second value to the initial pipeline register, copying, by a final logic unit and during a third clock cycle, the first value from a final pipeline register to the third GPR, and copying, by the final logic unit and during a fourth clock cycle, the second value from the final pipeline register to the fourth GPR.

    Dynamic pipeline for graphics processing

    公开(公告)号:US09697580B2

    公开(公告)日:2017-07-04

    申请号:US14537589

    申请日:2014-11-10

    CPC classification number: G06T1/20 G09G5/18 G09G2330/022

    Abstract: This disclosure describes an apparatus configured to process graphics data. The apparatus may include a fixed hardware pipeline configured to execute one or more functions on a current set of graphics data. The fixed hardware pipeline may include a plurality of stages including a bypassable portion of the plurality of stages. The apparatus may further include a shortcut circuit configured to route the current set of graphics data around the bypassable portion of the plurality of stages, and a controller positioned before the bypassable portion of the plurality of stages, the controller configured to selectively route the current set of graphics data to one of the shortcut circuit or the bypassable portion of the plurality of stages.

Patent Agency Ranking