-
公开(公告)号:US20240168807A1
公开(公告)日:2024-05-23
申请号:US18056949
申请日:2022-11-18
申请人: Intel Corporation
发明人: Jorge Eduardo Parra Osorio , Guei-Yuan Lueh , Maxim Kazakov , Fangwen Fu , Supratim Pal , Kaiyu Chen
CPC分类号: G06F9/5027 , G06F9/48 , G06F9/522 , G06F15/8046
摘要: An apparatus to facilitate cross-thread register sharing for matrix multiplication compute is disclosed. The apparatus includes matrix acceleration hardware comprising a plurality of data processing units, wherein the respective plurality of data processing units are to: receive a decoded instruction for a first thread having a first register space, wherein the decoded instruction is for a matrix multiplication operation and comprises an indication to utilize a second register space of a second thread for an operand of the decoded instruction for the first thread; access the second register space of the second thread to obtain data for the operand of the decoded instruction; and perform the matrix multiplication operation for the first thread using the data for the operand from the second register space of the second thread.
-
公开(公告)号:US20240111534A1
公开(公告)日:2024-04-04
申请号:US17957486
申请日:2022-09-30
申请人: Intel Corporation
发明人: Fangwen Fu , Chunhui Mei , Maxim Kazakov , Biju George , Jorge Parra , Supratim Pal
CPC分类号: G06F9/30047 , G06F9/3009 , G06F9/542
摘要: Embodiments described herein provide a technique enable a broadcast load from an L1 cache or shared local memory to register files associated with hardware threads of a graphics core. One embodiment provides a graphics processor comprising a cache memory and a graphics core coupled with the cache memory. The graphics core includes a plurality of hardware threads and memory access circuitry to facilitate access to memory by the plurality of hardware threads. The graphics core is configurable to process a plurality of load request from the plurality of hardware threads, detect duplicate load requests within the plurality of load requests, perform a single read from the cache memory in response to the duplicate load requests, and transmit data associated with the duplicate load requests to requesting hardware threads.
-
公开(公告)号:US11729403B2
公开(公告)日:2023-08-15
申请号:US16647998
申请日:2017-12-05
申请人: INTEL CORPORATION
发明人: James Holland , Hiu-Fai Chan , Fangwen Fu , Qian Xu , Sang-Hee Lee , Vidhya Krishnan
IPC分类号: H04N11/02 , H04N19/182 , H04N19/423
CPC分类号: H04N19/182 , H04N19/423
摘要: A lossless pixel compressor may include technology to detect a format of a pixel memory region, and compress the pixel memory region together with embedded control information which indicates the detected format of the pixel memory region. Other embodiments are disclosed and claimed.
-
公开(公告)号:US11423507B2
公开(公告)日:2022-08-23
申请号:US17159708
申请日:2021-01-27
申请人: Intel Corporation
发明人: Abhishek R. Appu , Stanley J. Baran , Sang-Hee Lee , Atthar H. Mohammed , Jong Dae Oh , Hiu-Fai R. Chan , Jill M. Boyce , Fangwen Fu , Satya N. Yedidi , Sumit Mohan , James M. Holland , Keith W. Rowe , Altug Koker
IPC分类号: G06T1/20 , G06T1/60 , G09G5/00 , H04N19/156 , G06F1/3206 , G06F1/3234 , G06F1/3212
摘要: An embodiment of an electronic processing system may include an application processor, persistent storage media communicatively coupled to the application processor, a graphics subsystem communicatively coupled to the application processor, a power budget analyzer to identify a power budget for one or more of the application processor, the persistent storage media, and the graphics subsystem, a target analyzer communicatively coupled to the graphics subsystem to identify a target for the graphics subsystem, and a parameter adjuster to adjust one or more parameters of the graphics subsystem based on one or more of the identified power budget and the identified target.
-
公开(公告)号:US10909653B2
公开(公告)日:2021-02-02
申请号:US16515794
申请日:2019-07-18
申请人: Intel Corporation
发明人: Abhishek R. Appu , Stanley J. Baran , Sang-Hee Lee , Atthar H. Mohammed , Jong Dae Oh , Hiu-Fai R. Chan , Jill M. Boyce , Fangwen Fu , Satya N. Yedidi , Sumit Mohan , James M. Holland , Keith W. Rowe , Altug Koker
IPC分类号: G06T1/20 , H04N19/156 , G06T1/60 , G09G5/00 , G06F1/3206 , G06F1/3234 , G06F1/3212
摘要: An embodiment of an electronic processing system may include an application processor, persistent storage media communicatively coupled to the application processor, a graphics subsystem communicatively coupled to the application processor, a power budget analyzer to identify a power budget for one or more of the application processor, the persistent storage media, and the graphics subsystem, a target analyzer communicatively coupled to the graphics subsystem to identify a target for the graphics subsystem, and a parameter adjuster to adjust one or more parameters of the graphics subsystem based on one or more of the identified power budget and the identified target.
-
公开(公告)号:US20200068216A1
公开(公告)日:2020-02-27
申请号:US16666275
申请日:2019-10-28
申请人: Intel Corporation
发明人: Fangwen Fu , Jill M. Boyce
IPC分类号: H04N19/52 , H04N19/70 , H04N19/105
摘要: Temporal motion vector prediction control is described in video coding. In one example, a method includes receiving a plurality of frames representing encoded video, parsing an uncompressed header for each frame, determining whether a temporal motion vector prediction command is included within the parsed uncompressed header of a first frame, selecting a reference frame from a reference list of frames, retrieving motion vector information from the selected reference frame, performing temporal motion vector prediction on the first frame corresponding to the parsed uncompressed header if a temporal motion vector prediction command is included within the parsed header to form a motion predicted frame, applying a loop filter to the motion predicted frame, and rendering the frame as decoded video.
-
公开(公告)号:US20180300839A1
公开(公告)日:2018-10-18
申请号:US15488569
申请日:2017-04-17
申请人: Intel Corporation
发明人: Abhishek R. Appu , Stanley J. Baran , Sang-Hee Lee , Atthar H. Mohammed , Jong Dae Oh , Hiu-Fai R. Chan , Jill M. Boyce , Fangwen Fu , Satya N. Yedidi , Sumit Mohan , James M. Holland , Keith W. Rowe , Altug Koker
CPC分类号: G06T1/20 , G06F1/3206 , G06F1/3212 , G06F1/3265 , G06T1/60 , G06T2200/16 , G06T2210/52 , G09G5/006 , G09G2330/021 , G09G2340/0407 , H04N19/156 , Y02D10/153 , Y02D10/174
摘要: An embodiment of an electronic processing system may include an application processor, persistent storage media communicatively coupled to the application processor, a graphics subsystem communicatively coupled to the application processor, a power budget analyzer to identify a power budget for one or more of the application processor, the persistent storage media, and the graphics subsystem, a target analyzer communicatively coupled to the graphics subsystem to identify a target for the graphics subsystem, and a parameter adjuster to adjust one or more parameters of the graphics subsystem based on one or more of the identified power budget and the identified target.
-
8.
公开(公告)号:US10097833B2
公开(公告)日:2018-10-09
申请号:US14583534
申请日:2014-12-26
申请人: Intel Corporation
发明人: Fangwen Fu , Haihua Wu , Tuyet-Trang Lam Piel
IPC分类号: H04N19/13 , H04N19/146 , H04N19/184 , H04N19/70
摘要: Techniques related to entropy coding with look-up-table based probability updating for video coding including setting a search range for candidate probabilities. This also involves selecting one of the candidate probabilities of the look-up table to update a previous probability for coding of a symbol, and selecting based on, at least in part, the bit-cost associated with updating the previous probability with at least one of the candidate probabilities.
-
9.
公开(公告)号:US20240169021A1
公开(公告)日:2024-05-23
申请号:US18056930
申请日:2022-11-18
申请人: Intel Corporation
发明人: Jorge Eduardo Parra Osorio , Supratim Pal , Fangwen Fu , Guei-Yuan Lueh , Po-Yu Chen , Jiasheng Chen
CPC分类号: G06F17/16 , G06F7/5443
摘要: An apparatus to facilitate enhancements for accumulator usage and instruction forwarding in matrix multiply pipeline in graphics environment is disclosed. The apparatus includes matrix acceleration hardware comprising a plurality of data processing units, wherein the respective plurality of data processing units comprise: multiply-accumulate hardware to generate intermediate results of a matrix multiplication operation; intermediate accumulation hardware to store the intermediate results of the matrix multiplication operation and accumulate with other intermediate results generated by the multiply-accumulate hardware; a bypass data structure to cause a source operand to bypass the multiply-accumulate hardware; and an adder circuit to add an output from the multiply-accumulate hardware with at least one of the source operand or an output of the intermediate accumulation hardware to generate a final output.
-
公开(公告)号:US20240134719A1
公开(公告)日:2024-04-25
申请号:US17973234
申请日:2022-10-24
申请人: Intel Corporation
发明人: Fangwen Fu , Chunhui Mei , John A. Wiegert , Yongsheng Liu , Ben J. Ashbaugh
CPC分类号: G06F9/522 , G06F9/4881
摘要: Embodiments described herein provide a technique to facilitate the synchronization of workgroups executed on multiple graphics cores of a graphics core cluster. One embodiment provides a graphics core including a cache memory and a graphics core coupled with the cache memory. The graphics core includes execution resources to execute an instruction via a plurality of hardware threads and barrier circuitry to synchronize execution of the plurality of hardware threads, wherein the barrier circuitry is configured to provide a plurality of re-usable named barriers.
-
-
-
-
-
-
-
-
-