-
公开(公告)号:US11768687B2
公开(公告)日:2023-09-26
申请号:US17848559
申请日:2022-06-24
Applicant: Intel Corporation
Inventor: Balaji Vembu , Abhishek R. Appu , Joydeep Ray , Altug Koker
IPC: G06T1/20 , G06F9/50 , G06F9/48 , G06F9/38 , G06F9/46 , G06F9/52 , G06F9/54 , G06F15/16 , G06F15/76 , G06F12/0897 , G06F12/0866 , G06F12/0842 , G06T1/60
CPC classification number: G06F9/3851 , G06F9/46 , G06F9/4843 , G06F9/4881 , G06F9/5027 , G06F9/522 , G06F9/545 , G06F12/0842 , G06F12/0866 , G06F12/0897 , G06F15/16 , G06F15/76 , G06T1/20 , G06T1/60 , G06F2209/5018 , G06T2200/28
Abstract: An apparatus to facilitate thread scheduling is disclosed. The apparatus includes logic to store barrier usage data based on a magnitude of barrier messages in an application kernel and a scheduler to schedule execution of threads across a plurality of multiprocessors based on the barrier usage data.
-
公开(公告)号:US11762696B2
公开(公告)日:2023-09-19
申请号:US17520583
申请日:2021-11-05
Applicant: Intel Corporation
Inventor: Abhishek R Appu , Altug Koker , Balaji Vembu , Joydeep Ray , Kamal Sinha , Prasoonkumar Surti , Kiran C. Veernapu , Subramaniam Maiyuran , Sanjeev S. Jahagirdar , Eric J. Asperheim , Guei-Yuan Lueh , David Puffer , Wenyin Fu , Nikos Kaburlasos , Bhushan M. Borole , Josh B. Mastronarde , Linda L. Hurd , Travis T. Schluessler , Tomasz Janczak , Abhishek Venkatesh , Kai Xiao , Slawomir Grajewski
CPC classification number: G06F9/5016 , G06F1/329 , G06F9/4893 , G06F9/5044 , G06T1/20 , G06T1/60 , G06T15/005 , G06T2200/28 , Y02D10/00
Abstract: In an example, an apparatus comprises a plurality of execution units comprising at least a first type of execution unit and a second type of execution unit and logic, at least partially including hardware logic, to analyze a workload and assign the workload to one of the first type of execution unit or the second type of execution unit. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US11748302B2
公开(公告)日:2023-09-05
申请号:US17561427
申请日:2021-12-23
Applicant: Intel Corporation
Inventor: Altug Koker , Prasoonkumar Surti , David Puffer , Subramaniam Maiyuran , Guei-Yuan Lueh , Abhishek R. Appu , Joydeep Ray , Balaji Vembu , Tomer Bar-On , Andrew T. Lauritzen , Hugues Labbe , John G. Gierach , Gabor Liktor
IPC: G06F16/13 , G06F9/38 , G06F9/30 , G06F16/11 , G06F16/172 , G06F9/46 , G06F12/1036 , G06F12/1045 , G06F12/0831
CPC classification number: G06F16/13 , G06F9/30 , G06F9/38 , G06F9/3836 , G06F9/461 , G06F16/113 , G06F16/172 , G06F12/0831 , G06F12/1036 , G06F12/1045 , G06F2201/84
Abstract: In an example, an apparatus comprises a plurality of execution units, and a first memory communicatively couple to the plurality of execution units, wherein the first shared memory is shared by the plurality of execution units and a copy engine to copy context state data from at least a first of the plurality of execution units to the first shared memory. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US11675597B2
公开(公告)日:2023-06-13
申请号:US17900230
申请日:2022-08-31
Applicant: Intel Corporation
Inventor: Balaji Vembu , Abhishek R. Appu , Joydeep Ray , Altug Koker
IPC: G06F9/38 , G06F12/0842 , G06F9/52 , G06F9/46 , G06T1/20 , G06F9/48 , G06F9/54 , G06F15/16 , G06F9/50 , G06F15/76 , G06F12/0897 , G06F12/0866 , G06T1/60
CPC classification number: G06F9/3851 , G06F9/46 , G06F9/4843 , G06F9/4881 , G06F9/5027 , G06F9/522 , G06F9/545 , G06F12/0842 , G06F12/0866 , G06F12/0897 , G06F15/16 , G06F15/76 , G06T1/20 , G06T1/60 , G06F2209/5018 , G06T2200/28
Abstract: An apparatus to facilitate thread scheduling is disclosed. In one embodiment the apparatus includes a processor comprising a plurality of multiprocessors comprising single-instruction multiple thread (SIMT) execution circuitry to simultaneously execute multiple threads, a shared local memory to be shared by the multiple threads, and scheduling hardware logic to schedule the multiple threads in a thread group for execution across the plurality of multiprocessors in accordance with barrier data. The instructions of the multiple threads are to produce shared data to be stored in the shared local memory when executed by the plurality of multiprocessors, wherein additional instructions of at least a first thread of the multiple threads are to use the shared data, and wherein, in accordance with the barrier data, the first thread is to wait for other threads of the multiple threads to finish producing the shared data before executing the additional instructions.
-
公开(公告)号:US11650928B2
公开(公告)日:2023-05-16
申请号:US17715734
申请日:2022-04-07
Applicant: Intel Corporation
Inventor: Altug Koker , Balaji Vembu , Joydeep Ray , Abhishek R. Appu
IPC: G06F12/0895 , G06F12/126 , G06F12/02 , G06T1/60
CPC classification number: G06F12/0895 , G06F12/023 , G06F12/126 , G06T1/60 , G06F2212/1044 , G06F2212/1048 , G06F2212/455 , G06F2212/604 , G06F2212/608
Abstract: A mechanism is described for facilitating optimization of cache associated with graphics processors at computing devices. A method of embodiments, as described herein, includes introducing coloring bits to contents of a cache associated with a processor including a graphics processor, wherein the coloring bits to represent a signal identifying one or more caches available for use, while avoiding explicit invalidations and flushes.
-
公开(公告)号:US11586548B2
公开(公告)日:2023-02-21
申请号:US17191473
申请日:2021-03-03
Applicant: Intel Corporation
Inventor: Abhishek R. Appu , Altug Koker , Joydeep Ray , David Puffer , Prasoonkumar Surti , Lakshminarayanan Striramassarma , Vasanth Ranganathan , Kiran C. Veernapu , Balaji Vembu , Pattabhiraman K
IPC: G06F12/0877 , G06F12/0802 , G06F12/0855 , G06F12/0806 , G06F12/0846 , G06F12/0868 , G06T1/60 , G06F12/126 , G06F12/0893
Abstract: In an example, an apparatus comprises a plurality of execution units, and a cache memory communicatively coupled to the plurality of execution units, wherein the cache memory is structured into a plurality of sectors, wherein each sector in the plurality of sectors comprises at least two cache lines. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US11521294B2
公开(公告)日:2022-12-06
申请号:US17036950
申请日:2020-09-29
Applicant: Intel Corporation
Inventor: Joydeep Ray , Altug Koker , Abhishek R. Appu , Balaji Vembu
Abstract: A mechanism is described for facilitating dynamic merging of atomic operations in computing devices. A method of embodiments, as described herein, includes facilitating detecting atomic messages and a plurality of slot addresses. The method further includes comparing one or more slot addresses of the plurality of slot addresses with other slot addresses of the plurality of slot addresses to seek one or more matched slot addresses, where the one or more matched slot addresses are merged into one or more merged groups. The method may further include generating one or more merged atomic operations based on and corresponding to the one or more merged groups.
-
公开(公告)号:US11520723B2
公开(公告)日:2022-12-06
申请号:US17367169
申请日:2021-07-02
Applicant: Intel Corporation
Inventor: Abhishek R. Appu , Altug Koker , Joydeep Ray , Kiran C. Veernapu , Balaji Vembu , Vasanth Ranganathan , Prasoonkumar Surti
IPC: G06F13/40 , G06F9/54 , G06F13/42 , G06T1/60 , G06F12/084 , G06F12/0811 , G06F12/0846 , G06F12/0831
Abstract: Methods and apparatus relating to techniques for avoiding cache lookup for cold cache. In an example, an apparatus comprises logic, at least partially comprising hardware logic, to monitor a thread switching overhead parameter for an application executing in a processing system and in response to a determination that the thread switching overhead parameter exceeds a threshold, to activate a thread management algorithm to reduce thread switching in the processing system. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US11520555B2
公开(公告)日:2022-12-06
申请号:US17162864
申请日:2021-01-29
Applicant: Intel Corporation
Inventor: Deepak S. Vembar , Atsuo Kuwahara , Chandrasekaran Sakthivel , Radhakrishnan Venkataraman , Brent E. Insko , Anupreet S. Kalra , Hugues Labbe , Altug Koker , Michael Apodaca , Kai Xiao , Jeffery S. Boles , Adam T. Lake , David M. Cimini , Balaji Vembu , Elmoustapha Ould-Ahmed-Vall , Jacek Kwiatkowski , Philip R. Laws , Ankur N. Shah , Abhishek R. Appu , Joydeep Ray , Wenyin Fu , Nikos Kaburlasos , Prasoonkumar Surti , Bhushan M. Borole
IPC: G06F3/14 , G06F3/147 , G06T1/20 , G09G5/36 , H04L65/402 , G09G3/00 , H04L67/131
Abstract: An embodiment of a graphics apparatus may include a processor, memory communicatively coupled to the processor, and a collaboration engine communicatively coupled to the processor to identify a shared graphics component between two or more users in an environment, and share the shared graphics components with the two or more users in the environment. Embodiments of the collaboration engine may include one or more of a centralized sharer, a depth sharer, a shared preprocessor, a multi-port graphics subsystem, and a decode sharer. Other embodiments are disclosed and claimed.
-
100.
公开(公告)号:US11514550B2
公开(公告)日:2022-11-29
申请号:US16916670
申请日:2020-06-30
Applicant: INTEL CORPORATION
Inventor: Yunbiao Lin , Changliang Wang , Satyanantha Ramagopal Musunuri , David Puffer , David J. Cowperthwaite , Bryan R. White , Balaji Vembu
Abstract: An apparatus and method for managing pipes and planes within a virtual graphics processing engine. For example, one embodiment of a graphics processing apparatus comprises: a graphics processor comprising one or more display pipes to render one or more display planes, each of the one or more display pipes comprising a set of graphics processing hardware resources for executing graphics commands and rendering graphics images in the one or more display planes; and pipe and plane management hardware logic to manage pipe and plane assignment, the pipe and plane management hardware logic to associate a first virtual machine (VM) with one or more virtual display planes and to maintain a mapping between the one or more virtual display planes and at least one physical display plane.
-
-
-
-
-
-
-
-
-