Patent search ap:("QUALCOMM INCORPORATED") AND inv:"Colin Beaton Verrilli" Page 1

1.

发明授权
Method, apparatus, and system for an architecture for machine learning acceleration 有权

公开(公告)号：US11010313B2

公开(公告)日：2021-05-18

申请号：US16556094

申请日：2019-08-29

Applicant: QUALCOMM Incorporated

Inventor： Colin Beaton Verrilli , Natarajan Vaidhyanathan , Rexford Alan Hill

IPC: G06F13/16 , G06F17/16 , G06N3/10 , H04L12/18 , H04L12/46

Abstract: A method, apparatus, and system for an architecture for machine learning acceleration is presented. An apparatus includes a plurality of processing elements, each including a tightly-coupled memory, and a memory system coupled to the processing elements. A global synchronization manager is coupled to the plurality of the processing elements and to the memory system. The processing elements do not implement a coherency protocol with respect to the memory system. The processing elements implement direct memory access with respect to the memory system, and the global synchronization manager is configured to synchronize operations of the plurality of processing elements through the TCMs.

2.

发明授权
Providing space-efficient storage for dynamic random access memory (DRAM) cache tags 有权

公开(公告)号：US10467092B2

公开(公告)日：2019-11-05

申请号：US15085350

申请日：2016-03-30

Applicant: QUALCOMM Incorporated

Inventor： Natarajan Vaidhyanathan , Mattheus Cornelis Antonius Adrianus Heddes , Colin Beaton Verrilli

IPC: G11C29/00 , G06F11/10 , G11C7/10 , G06F12/0806 , G06F12/0895

Abstract: Providing space-efficient storage for dynamic random access memory (DRAM) cache tags is provided. In one aspect, a DRAM cache management circuit provides a plurality of cache entries, each of which contains a tag storage region, a data storage region, and an error protection region. The DRAM cache management circuit is configured to store data to be cached in the data storage region of each cache entry. The DRAM cache management circuit is also configured to use an error detection code (EDC) instead of an error correcting code (ECC), and to store a tag and the EDC for each cache entry in the error protection region of the cache entry. In this manner, the capacity of a DRAM cache can be increased by avoiding the need for the tag storage region for each cache entry, while still providing error detection for the cache entry.

3.

发明申请
PROVIDING FLEXIBLE MATRIX PROCESSORS FOR PERFORMING NEURAL NETWORK CONVOLUTION IN MATRIX-PROCESSOR-BASED DEVICES 审中-公开

公开(公告)号：US20190065942A1

公开(公告)日：2019-02-28

申请号：US16117952

申请日：2018-08-30

Applicant: QUALCOMM Incorporated

Inventor： Colin Beaton Verrilli , Mattheus Cornelis Antonius Adrianus Heddes , Natarajan Vaidhyanathan , Koustav Bhattacharya , Robert Dreyer

IPC: G06N3/063 , G06N3/04 , G06F17/16 , G06F15/80

Abstract: Providing flexible matrix processors for performing neural network convolution in matrix-processor-based devices is disclosed. In this regard, a matrix-processor-based device provides a central processing unit (CPU) and a matrix processor. The matrix processor reorganizes a plurality of weight matrices and a plurality of input matrices into swizzled weight matrices and swizzled input matrices, respectively, that have regular dimensions natively supported by the matrix processor. The matrix-processor-based device then performs a convolution operation using the matrix processor to perform matrix multiplication/accumulation operations for the regular dimensions of the weight matrices and the input matrices, and further uses the CPU to execute instructions for handling the irregular dimensions of the weight matrices and the input matrices (e.g., by executing a series of nested loops, as a non-limiting example). The matrix-processor-based device thus provides efficient hardware acceleration by taking advantage of dimensional regularity, while maintaining the flexibility to handle different variations of convolution.

4.

发明申请
PROVIDING EFFICIENT MULTIPLICATION OF SPARSE MATRICES IN MATRIX-PROCESSOR-BASED DEVICES 审中-公开

公开(公告)号：US20190065150A1

公开(公告)日：2019-02-28

申请号：US16118162

申请日：2018-08-30

Applicant: QUALCOMM Incorporated

Inventor： Mattheus Cornelis Antonius Adrianus Heddes , Robert Dreyer , Colin Beaton Verrilli , Natarajan Vaidhyanathan , Koustav Bhattacharya

IPC: G06F7/544 , G06F15/80

Abstract: Providing efficient multiplication of sparse matrices in matrix-processor-based devices is disclosed herein. In one aspect, a matrix processor of a matrix-processor-based device includes a plurality of sequencers coupled to a plurality of multiply/accumulate (MAC) units for performing multiplication and accumulation operations. Each sequencer determines whether a product of an element of a first input matrix to be multiplied with an element of a second input matrix has a value of zero (e.g., by determining whether the element of the first input matrix has a value of zero, or by determining whether either the element of the first input matrix or that of the second input matrix has a value of zero). If the product of the elements of the first input matrix and the second input matrix does not have a value of zero, the sequencer provides the elements to a MAC unit to perform a multiplication and accumulation operation.

5.

发明授权
Providing scalable dynamic random access memory (DRAM) cache management using DRAM cache indicator caches 有权

公开(公告)号：US10176096B2

公开(公告)日：2019-01-08

申请号：US15228320

申请日：2016-08-04

Applicant: QUALCOMM Incorporated

Inventor： Natarajan Vaidhyanathan , Mattheus Cornelis Antonius Adrianus Heddes , Colin Beaton Verrilli

IPC: G06F12/00 , G06F12/0804 , G06F12/0893

Abstract: Providing scalable dynamic random access memory (DRAM) cache management using DRAM cache indicator caches is provided. In one aspect, a DRAM cache management circuit is provided to manage access to a DRAM cache in high-bandwidth memory. The DRAM cache management circuit comprises a DRAM cache indicator cache, which stores master table entries that are read from a master table in a system memory DRAM and that contain DRAM cache indicators. The DRAM cache indicators enable the DRAM cache management circuit to determine whether a memory line in the system memory DRAM is cached in the DRAM cache of high-bandwidth memory, and, if so, in which way of the DRAM cache the memory line is stored. Based on the DRAM cache indicator cache, the DRAM cache management circuit may determine whether to employ the DRAM cache and/or the system memory DRAM to perform a memory access operation in an optimal manner.

6.

发明授权
Providing memory bandwidth compression using multiple last-level cache (LLC) lines in a central processing unit (CPU)-based system 审中-公开

公开(公告)号：US10042771B2

公开(公告)日：2018-08-07

申请号：US15718449

申请日：2017-09-28

Applicant: QUALCOMM Incorporated

Inventor： Colin Beaton Verrilli , Mattheus Cornelis Antonius Adrianus Heddes , Mark Anthony Rinaldi , Natarajan Vaidhyanathan

IPC: G06F12/04 , G06F12/12 , G06F12/0875 , G06F12/0897 , G06F12/084 , G06F12/0811 , G06F12/0862

CPC classification number: G06F12/0875 , G06F12/04 , G06F12/0811 , G06F12/084 , G06F12/0862 , G06F12/0897 , G06F12/12 , G06F2212/1016 , G06F2212/1044 , G06F2212/401

Abstract: Providing memory bandwidth compression using multiple last-level cache (LLC) lines in a central processing unit (CPU)-based system is disclosed. In some aspects, a compressed memory controller (CMC) provides an LLC comprising multiple LLC lines, each providing a plurality of sub-lines the same size as a system cache line. The contents of the system cache line(s) stored within a single LLC line are compressed and stored in system memory within the memory sub-line region corresponding to the LLC line. A master table stores information indicating how the compressed data for an LLC line is stored in system memory by storing an offset value and a length value for each sub-line within each LLC line. By compressing multiple system cache lines together and storing compressed data in a space normally allocated to multiple uncompressed system lines, the CMC enables compression sizes to be smaller than the memory read/write granularity of the system memory.

7.

发明申请
PROVIDING MEMORY BANDWIDTH COMPRESSION USING COMPRESSION INDICATOR (CI) HINT DIRECTORIES IN A CENTRAL PROCESSING UNIT (CPU)-BASED SYSTEM 审中-公开

公开(公告)号：US20170286001A1

公开(公告)日：2017-10-05

申请号：US15086882

申请日：2016-03-31

Applicant: QUALCOMM Incorporated

Inventor： Colin Beaton Verrilli , Mattheus Cornelis Antonius Adrianus Heddes , Natarajan Vaidhyanathan

IPC: G06F3/06 , G06F11/10

CPC classification number: G06F3/0638 , G06F3/0604 , G06F3/0632 , G06F3/0673 , G06F11/1004 , G06F11/1048 , G06F11/1076 , G06F12/0223 , G06F12/0862 , G06F12/0875 , G06F12/12 , G06F2212/1024 , G06F2212/1044 , G06F2212/401 , G06F2212/403 , G06F2212/466 , H03M7/30 , H03M13/6312

Abstract: Providing memory bandwidth compression using compression indicator (CI) hint directories in a central processing unit (CPU)-based system is disclosed. In this regard, a compressed memory controller provides a CI hint directory comprising a plurality of CI hint directory entries, each providing a plurality of CI hints. The compressed memory controller is configured to receive a memory read request comprising a physical address of a memory line, and initiate a memory read transaction comprising a requested read length value. The compressed memory controller is further configured to, in parallel with initiating the memory read transaction, determine whether the physical address corresponds to a CI hint directory entry in the CI hint directory. If so, the compressed memory controller reads a CI hint from the CI hint directory entry of the CI hint directory, and modifies the requested read length value of the memory read transaction based on the CI hint.

8.

发明申请
PROVIDING MEMORY BANDWIDTH COMPRESSION USING COMPRESSED MEMORY CONTROLLERS (CMCs) IN A CENTRAL PROCESSING UNIT (CPU)-BASED SYSTEM 审中-公开
Title translation: 在中央处理单元（CPU）系统中使用压缩存储器控制器（CMC）提供存储带宽压缩

公开(公告)号：US20150339239A1

公开(公告)日：2015-11-26

申请号：US14717552

申请日：2015-05-20

Applicant: QUALCOMM Incorporated

Inventor： Mattheus Cornelis Antonius Adrianus Heddes , Natarajan Vaidhyanathan , Colin Beaton Verrilli

IPC: G06F12/10 , G06F12/08

CPC classification number: G06F12/1081 , G06F12/023 , G06F12/0811 , G06F12/0817 , G06F12/0833 , G06F2212/1024 , G06F2212/1044 , G06F2212/2532 , G06F2212/401 , G06F2212/452 , G06F2212/62 , G06F2212/621 , G06F2212/622 , Y02D10/13

Abstract: Providing memory bandwidth compression using compressed memory controllers (CMCs) in a central processing unit (CPU)-based system is disclosed. In this regard, in some aspects, a CMC is configured to receive a memory read request to a physical address in a system memory, and read a compression indicator (CI) for the physical address from a master directory and/or from error correcting code (ECC) bits of the physical address. Based on the CI, the CMC determines a number of memory blocks to be read for the memory read request, and reads the determined number of memory blocks. In some aspects, a CMC is configured to receive a memory write request to a physical address in the system memory, and generate a CI for write data based on a compression pattern of the write data. The CMC updates the master directory and/or the ECC bits of the physical address with the generated CI.

Abstract translation: 在基于中央处理单元（CPU）的系统中，使用压缩存储器控制器（CMC）提供存储器带宽压缩。在这方面，在某些方面，CMC被配置为向系统存储器中的物理地址接收存储器读取请求，并从主目录和/或从纠错码读取物理地址的压缩指示符（CI）（ECC）位的物理地址。基于CI，CMC确定要为存储器读取请求读取的存储器块的数量，并读取确定的存储器块数。在一些方面，CMC被配置为接收对系统存储器中的物理地址的存储器写入请求，并且基于写入数据的压缩模式生成用于写入数据的CI。 CMC使用生成的CI更新主目录和/或物理地址的ECC位。

9.

发明授权
Providing flexible management of heterogeneous memory systems using spatial quality of service (QoS) tagging in processor-based systems 有权

公开(公告)号：US10055158B2

公开(公告)日：2018-08-21

申请号：US15272951

申请日：2016-09-22

Applicant: QUALCOMM Incorporated

Inventor： Colin Beaton Verrilli , Carl Alan Waldspurger , Natarajan Vaidhyanathan , Mattheus Cornelis Antonius Adrianus Heddes , Koustav Bhattacharya

IPC: G06F12/0891 , G06F3/06 , G06F12/0802 , G06F13/16

CPC classification number: G06F3/0629 , G06F3/0604 , G06F3/0685 , G06F12/0802 , G06F12/0804 , G06F12/0891 , G06F12/126 , G06F13/1668 , G06F13/1694 , G06F2212/1016 , G06F2212/20 , G06F2212/60 , G06F2212/601

Abstract: Providing flexible management of heterogeneous memory systems using spatial Quality of Service (QoS) tagging in processor-based systems is disclosed. In one aspect, a heterogeneous memory system of a processor-based system includes a first memory and a second memory. The heterogeneous memory system is divided into a plurality of memory regions, each associated with a QoS identifier (QoSID), which may be set and updated by software. A memory controller of the heterogeneous memory system provides a QoS policy table, which operates to associate each QoSID with a QoS policy state, and which also may be software-configurable. Upon receiving a memory access request including a memory address of a memory region, the memory controller identifies a software-configurable QoSID associated with the memory address, and associates the QoSID with a QoS policy state using the QoS policy table. The memory controller then applies the QoS policy state to perform the memory access operation.

10.

发明申请
PROVIDING SPACE-EFFICIENT STORAGE FOR DYNAMIC RANDOM ACCESS MEMORY (DRAM) CACHE TAGS 审中-公开

公开(公告)号：US20170286214A1

公开(公告)日：2017-10-05

申请号：US15085350

申请日：2016-03-30

Applicant: QUALCOMM Incorporated

Inventor： Natarajan Vaidhyanathan , Mattheus Cornelis Antonius Adrianus Heddes , Colin Beaton Verrilli

IPC: G06F11/10 , G06F12/08 , G11C7/10

CPC classification number: G06F11/1064 , G06F12/0806 , G06F12/0895 , G06F2212/1008 , G06F2212/40 , G06F2212/403 , G06F2212/621 , G06F2212/7209 , G11C7/1072

Abstract: Providing space-efficient storage for dynamic random access memory (DRAM) cache tags is provided. In one aspect, a DRAM cache management circuit provides a plurality of cache entries, each of which contains a tag storage region, a data storage region, and an error protection region. The DRAM cache management circuit is configured to store data to be cached in the data storage region of each cache entry. The DRAM cache management circuit is also configured to use an error detection code (EDC) instead of an error correcting code (ECC), and to store a tag and the EDC for each cache entry in the error protection region of the cache entry. In this manner, the capacity of a DRAM cache can be increased by avoiding the need for the tag storage region for each cache entry, while still providing error detection for the cache entry.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification