Patent search ap:("QUALCOMM Incorporated") AND inv:"Serag GADELRAB" Page 1

1.

发明申请
CONCURRENT OPTIMIZATION OF MACHINE LEARNING MODEL PERFORMANCE 有权

公开(公告)号：US20210019652A1

公开(公告)日：2021-01-21

申请号：US16515711

申请日：2019-07-18

Applicant: QUALCOMM Incorporated

Inventor： Serag GADELRAB , James ESLIGER , Meghal VARIA , Kyle ERNEWEIN , Alwyn DOS REMEDIOS , George LEE

IPC: G06N20/00 , G06N5/04 , G06F11/34

Abstract: Certain aspects of the present disclosure provide techniques for concurrently performing inferences using a machine learning model and optimizing parameters used in executing the machine learning model. An example method generally includes receiving a request to perform inferences on a data set using the machine learning model and performance metric targets for performance of the inferences. At least a first inference is performed on the data set using the machine learning model to meet a latency specified for generation of the first inference from receipt of the request. While performing the at least the first inference, operational parameters resulting in inference performance approaching the performance metric targets are identified based on the machine learning model and operational properties of the computing device. The identified operational parameters are applied to performance of subsequent inferences using the machine learning model.

2.

发明申请
ADAPTIVE QUANTIZATION FOR EXECUTION OF MACHINE LEARNING MODELS 有权

公开(公告)号：US20210279635A1

公开(公告)日：2021-09-09

申请号：US16810123

申请日：2020-03-05

Applicant: QUALCOMM Incorporated

Inventor： Serag GADELRAB , Karamvir CHATHA , Ofer ROSENBERG

IPC: G06N20/00 , G06N5/04 , G06F11/34

Abstract: Certain aspects of the present disclosure provide techniques for adaptively executing machine learning models on a computing device. An example method generally includes receiving weight information for a machine learning model to be executed on a computing device. The received weight information is reduced into quantized weight information having a reduced bit size relative to the received weight information. First inferences using the machine learning model and the received weight information, and second inferences are performed using the machine learning model and the quantized weight information. Results of the first and second inferences are compared, it is determined that results of the second inferences are within a threshold performance level of results of the first inferences, and based on the determination, one or more subsequent inferences are performed using the machine learning model and the quantized weight information.

3.

发明公开
CONCURRENT OPTIMIZATION OF MACHINE LEARNING MODEL PERFORMANCE 审中-公开

公开(公告)号：US20240112090A1

公开(公告)日：2024-04-04

申请号：US18539022

申请日：2023-12-13

Applicant: QUALCOMM Incorporated

Inventor： Serag GADELRAB , James Lyall ESLIGER , Meghal VARIA , Kyle ERNEWEIN , Alwyn DOS REMEDIOS , George LEE

IPC: G06N20/00 , G06F11/34 , G06N5/04

CPC classification number: G06N20/00 , G06F11/3466 , G06N5/04

Abstract: Certain aspects of the present disclosure provide techniques for concurrently performing inferences using a machine learning model and optimizing parameters used in executing the machine learning model. An example method generally includes receiving a request to perform inferences on a data set using the machine learning model and performance metric targets for performance of the inferences. At least a first inference is performed on the data set using the machine learning model to meet a latency specified for generation of the first inference from receipt of the request. While performing the at least the first inference, operational parameters resulting in inference performance approaching the performance metric targets are identified based on the machine learning model and operational properties of the computing device. The identified operational parameters are applied to performance of subsequent inferences using the machine learning model.

4.

发明申请
COMMAND-DRIVEN TRANSLATION PRE-FETCH FOR MEMORY MANAGEMENT UNITS 有权
Title translation: 用于内存管理单元的命令驱动翻译预备电路

公开(公告)号：US20160283384A1

公开(公告)日：2016-09-29

申请号：US14672133

申请日：2015-03-28

Applicant: QUALCOMM Incorporated

Inventor： Jason Edward PODAIMA , Bohuslav RYCHLIK , Paul Christopher John WIERCIENSKI , Kyle John ERNEWEIN , Carlos Javier MOREIRA , Meghal VARIA , Serag GADELRAB

IPC: G06F12/08 , G06F12/10

CPC classification number: G06F12/0862 , G06F12/0875 , G06F12/10 , G06F12/1027 , G06F2212/1021 , G06F2212/452 , G06F2212/602 , G06F2212/6028 , G06F2212/654 , G06F2212/684

Abstract: Methods and systems for pre-fetching address translations in a memory management unit (MMU) of a device are disclosed. In an embodiment, the MMU receives a pre-fetch command from an upstream component of the device, the pre-fetch command including an address of an instruction, pre-fetches a translation of the instruction from a translation table in a memory of the device, and stores the translation of the instruction in a translation cache associated with the MMU.

Abstract translation: 公开了用于在设备的存储器管理单元（MMU）中预取地址转换的方法和系统。在一个实施例中，MMU从设备的上游组件接收预取命令，预取命令包括指令的地址，从设备的存储器中的转换表预取指令的转换并将指令的转换存储在与MMU相关联的转换高速缓存中。

5.

发明申请
Compression Of High Dynamic Ratio Fields For Machine Learning 有权

公开(公告)号：US20210288660A1

公开(公告)日：2021-09-16

申请号：US17333282

申请日：2021-05-28

Applicant: QUALCOMM Incorporated

Inventor： Clara Ka Wah SUNG , Meghal VARIA , Serag GADELRAB , Cheng-Teh HSIEH , Jason Edward PODAIMA , Victor SZETO , Richard BOISJOLY , Milivoje ALEKSIC , Tom LONGO , In-Suk CHONG

IPC: H03M7/30 , G06F17/18 , G06N20/00 , G06N5/04

Abstract: Various embodiments include methods and devices for implementing decompression of compressed high dynamic ratio fields. Various embodiments may include receiving compressed first and second sets of data fields, decompressing the first and second compressed sets of data fields to generate first and second decompressed sets of data fields, receiving a mapping for mapping the first and second decompressed sets of data fields to a set of data units, aggregating the first and second decompressed sets of data fields using the mapping to generate a compression block comprising the set of data units.

6.

发明申请
Compression Of High Dynamic Ratio Fields For Machine Learning 审中-公开

公开(公告)号：US20200274549A1

公开(公告)日：2020-08-27

申请号：US16798186

申请日：2020-02-21

Applicant: QUALCOMM Incorporated

Inventor： Clara Ka Wah SUNG , Meghal VARIA , Serag GADELRAB , Cheng-Teh HSIEH , Jason Edward PODAIMA , Victor SZETO , Richard BOISJOLY , Milivoje ALEKSIC , Tom LONGO , In-Suk CHONG

IPC: H03M7/30 , G06N20/00 , G06N5/04 , G06F17/18

Abstract: Various embodiments include methods and devices for implementing compression of high dynamic ratio fields. Various embodiments may include receiving a compression block having data units, receiving a mapping for the compression block, wherein the mapping is configured to map bits of each data unit to two or more data fields to generate a first set of data fields and a second set of data fields, compressing the first set of data fields together to generate a compressed first set of data fields, and compressing the second set of data fields together to generate a compressed second set of data fields.

7.

发明申请
SPECULATIVE PRE-FETCH OF TRANSLATIONS FOR A MEMORY MANAGEMENT UNIT (MMU) 审中-公开
Title translation: 用于存储管理单元（MMU）的转换的预测预处理

公开(公告)号：US20160350225A1

公开(公告)日：2016-12-01

申请号：US14726454

申请日：2015-05-29

Applicant: QUALCOMM Incorporated

Inventor： Jason Edward PODAIMA , Paul Christopher John WIERCIENSKI , Kyle John ERNEWEIN , Carlos Javier MOREIRA , Meghal VARIA , Serag GADELRAB , Muhammad Umar CHOUDRY

IPC: G06F12/08 , G06F12/10

CPC classification number: G06F12/0862 , G06F12/10 , G06F12/109 , G06F2212/1021 , G06F2212/283 , G06F2212/312 , G06F2212/507 , G06F2212/6026 , G06F2212/608 , G06F2212/65 , G06F2212/654

Abstract: Systems and methods for pre-fetching address translations in a memory management unit (MMU) are disclosed. The MMU detects a triggering condition related to one or more translation caches associated with the MMU, the triggering condition associated with a trigger address, generates a sequence descriptor describing a sequence of address translations to pre-fetch into the one or more translation caches, the sequence of address translations comprising a plurality of address translations corresponding to a plurality of address ranges adjacent to an address range containing the trigger address, and issues an address translation request to the one or more translation caches for each of the plurality of address translations, wherein the one or more translation caches pre-fetch at least one address translation of the plurality of address translations into the one or more translation caches when the at least one address translation is not present in the one or more translation caches.

Abstract translation: 公开了用于在存储器管理单元（MMU）中预取地址转换的系统和方法。 MMU检测与与MMU相关联的一个或多个翻译高速缓存相关联的触发条件，与触发地址相关联的触发条件，生成描述地址转换序列以预取到一个或多个翻译高速缓存中的序列描述符，地址转换序列包括对应于与包含触发地址的地址范围相邻的多个地址范围的多个地址转换，并且向多个地址转换中的每一个的一个或多个翻译高速缓存发出地址转换请求，其中当所述一个或多个翻译高速缓存中不存在所述至少一个地址转换时，所述一个或多个翻译高速缓冲存储器将所述多个地址转换的至少一个地址转换预取到所述一个或多个翻译高速缓存中。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification