-
公开(公告)号:US11562461B2
公开(公告)日:2023-01-24
申请号:US17529862
申请日:2021-11-18
Applicant: Intel Corporation
Inventor: Prasoonkumar Surti , Narayan Srinivasa , Feng Chen , Joydeep Ray , Ben J. Ashbaugh , Nicolas C. Galoppo Von Borries , Eriko Nurvitadhi , Balaji Vembu , Tsung-Han Lin , Kamal Sinha , Rajkishore Barik , Sara S. Baghsorkhi , Justin E. Gottschlich , Altug Koker , Nadathur Rajagopalan Satish , Farshad Akhbari , Dukhwan Kim , Wenyin Fu , Travis T. Schluessler , Josh B. Mastronarde , Linda L. Hurd , John H. Feit , Jeffery S. Boles , Adam T. Lake , Karthik Vaidyanathan , Devan Burke , Subramaniam Maiyuran , Abhishek R. Appu
IPC: G06T1/20 , G06T15/80 , G06F3/14 , G06T1/60 , G09G5/36 , G06F3/06 , G06N3/08 , G06N3/04 , G06N3/063 , G09G5/00
Abstract: An apparatus to facilitate compute optimization is disclosed. The apparatus includes one or more processing units to provide a first set of shader operations associated with a shader stage of a graphics pipeline, a scheduler to schedule shader threads for processing, and a field-programmable gate array (FPGA) dynamically configured to provide a second set of shader operations associated with the shader stage of the graphics pipeline.
-
公开(公告)号:US20230017304A1
公开(公告)日:2023-01-19
申请号:US17874876
申请日:2022-07-27
Applicant: Intel Corporation
Inventor: Rajkishore Barik , Brian T. Lewis , Murali Sundaresan , Jeffrey Jackson , Feng Chen , Xiaoming Chen , Mike Macpherson
Abstract: A mechanism is described for facilitating smart distribution of resources for deep learning autonomous machines. A method of embodiments, as described herein, includes detecting one or more sets of data from one or more sources over one or more networks, and introducing a library to a neural network application to determine optimal point at which to apply frequency scaling without degrading performance of the neural network application at a computing device.
-
公开(公告)号:US11550600B2
公开(公告)日:2023-01-10
申请号:US17090295
申请日:2020-11-05
Applicant: Intel Corporation
Inventor: Li Xu , Haihao Xiang , Feng Chen , Travis Schluessler , Yuheng Zhang , Sen Lin
IPC: G06F9/448 , G06F16/215 , G06T1/60 , G06T1/20
Abstract: Embodiments are generally directed to a system and method for adapting executable object to a processing unit. An embodiment of a method to adapt an executable object from a first processing unit to a second processing unit, comprises: adapting the executable object optimized for the first processing unit of a first architecture, to the second processing unit of a second architecture, wherein the second architecture is different from the first architecture, wherein the executable object is adapted to perform on the second processing unit based on a plurality of performance metrics collected while the executable object is performed on the first processing unit and the second processing unit.
-
公开(公告)号:US11488005B2
公开(公告)日:2022-11-01
申请号:US16518828
申请日:2019-07-22
Applicant: Intel Corporation
Inventor: Brian T. Lewis , Feng Chen , Jeffrey R. Jackson , Justin E. Gottschlich , Rajkishore Barik , Xiaoming Chen , Prasoonkumar Surti , Mike B. Macpherson , Murali Sundaresan
Abstract: A mechanism is described for facilitating smart collection of data and smart management of autonomous machines. A method of embodiments, as described herein, includes detecting one or more sets of data from one or more sources over one or more networks, and combining a first computation directed to be performed locally at a local computing device with a second computation directed to be performed remotely at a remote computing device in communication with the local computing device over the one or more networks, where the first computation consumes low power, wherein the second computation consumes high power.
-
公开(公告)号:US20220147837A1
公开(公告)日:2022-05-12
申请号:US17587407
申请日:2022-01-28
Applicant: Intel Corporation
Inventor: Feng Chen , Yan Hao , Yi Yang , Xiaoming Chen
IPC: G06N5/02 , G06N20/00 , G06F16/2458 , G06F21/00 , G06F16/903 , G06Q20/12 , G06F16/9535 , G06Q20/40
Abstract: A disclosed example includes selecting, by a mobile computing device, a model description for a predictive analytics model in response to a user-level application request including input data from an application of the mobile computing device, the model description created with a predictive analytics model description language, the model description received from a predictive analytics provider; comparing, by the mobile computing device, first data associated with the user-level application request with second data indicative of digital rights permissions associated with the model description; and executing, by the mobile computing device, an executable associated with the model description without providing the processor circuitry access to the executable and without providing the input data to the predictive analytics provider.
-
公开(公告)号:US20220084252A1
公开(公告)日:2022-03-17
申请号:US17355271
申请日:2021-06-23
Applicant: Intel Corporation
Inventor: Abhishek Appu , Altug Koker , Joydeep Ray , Balaji Vembu , Prasoonkumar Surti , Kamal Sinha , Nadathur Rajagoplan Satish , Narayan Srinivasa , Feng Chen , Dukhwan Kim , Farshad Akhbari
Abstract: An apparatus to facilitate compute compression is disclosed. The apparatus includes a graphics processing unit including mapping logic to map a first block of integer pixel data to a compression block and compression logic to compress the compression block.
-
公开(公告)号:US20210255857A1
公开(公告)日:2021-08-19
申请号:US17128972
申请日:2020-12-21
Applicant: Intel Corporation
Inventor: Feng Chen , Narayan Srinivasa , Abhishek R. Appu , Altug Koker , Kamal Sinha , Balaji Vembu , Joydeep Ray , Nicolas C. Galoppo Von Borries , Prasoonkumar Surti , Ben J. Ashbaugh , Sanjeev Jahagirdar , Vasanth Ranganathan
Abstract: A mechanism is described for facilitating intelligent dispatching and vectorizing at autonomous machines. A method of embodiments, as described herein, includes detecting a plurality of threads corresponding to a plurality of workloads associated with tasks relating to a graphics processor. The method may further include determining a first set of threads of the plurality of threads that are similar to each other or have adjacent surfaces, and physically clustering the first set of threads close together using a first set of adjacent compute blocks.
-
公开(公告)号:US11080811B2
公开(公告)日:2021-08-03
申请号:US16446398
申请日:2019-06-19
Applicant: Intel Corporation
Inventor: Abhishek R. Appu , Altug Koker , Linda L. Hurd , Dukhwan Kim , Mike B. Macpherson , John C. Weast , Feng Chen , Farshad Akhbari , Narayan Srinivasa , Nadathur Rajagopalan Satish , Joydeep Ray , Ping T. Tang , Michael S. Strickland , Xiaoming Chen , Anbang Yao , Tatiana Shpeisman
IPC: G06T1/20 , G06F3/14 , G06F9/30 , G06F9/38 , G06N3/04 , G06N3/063 , G06N3/08 , G06T15/00 , G09G5/36 , G06T15/04
Abstract: An apparatus to facilitate compute optimization is disclosed. The apparatus includes a mixed precision core to perform a mixed precision multi-dimensional matrix multiply and accumulate operation on 16-bit and/or 32 bit floating-point elements.
-
公开(公告)号:US20210232605A1
公开(公告)日:2021-07-29
申请号:US17229651
申请日:2021-04-13
Applicant: Intel Corporation
Inventor: Michael P. Mesnier , Tian Luo , Feng Chen
Abstract: In accordance with some embodiments, classification of input/output requests from a database to a storage system may be performed. Each input/output request may be associated with a database class, and each database class may be mapped to a quality of service policy. Thus, quality of service may be enforced such that different data blocks within the storage system of the database may be afforded appropriate quality of service.
-
公开(公告)号:US11017291B2
公开(公告)日:2021-05-25
申请号:US15581031
申请日:2017-04-28
Applicant: Intel Corporation
Inventor: Brian T. Lewis , Rajkishore Barik , Murali Sundaresan , Leonard Truong , Feng Chen , Xiaoming Chen , Mike B. Macpherson
Abstract: A mechanism is described for facilitating efficient training of neural networks at computing devices. A method of embodiments, as described herein, includes detecting one or more inputs for training of a neural network, and introducing randomness in floating point (FP) numbers to prevent overtraining of the neural network, where introducing randomness includes replacing less-significant low-order bits of operand and result values with new low-order bits during the training of the neural network.
-
-
-
-
-
-
-
-
-