-
公开(公告)号:US10484685B2
公开(公告)日:2019-11-19
申请号:US15490625
申请日:2017-04-18
Applicant: QUALCOMM INCORPORATED
Inventor: Serag Gadelrab , Chinchuan Chiu , Moinul Khan , Kyle Ernewein , Tom Longo , Simon Booth , Meghal Varia , Milivoje Aleksic , King-Chung Lai
IPC: H04N19/127 , G06F3/06 , H04N19/426 , H04N19/176 , H04N19/65 , G06T3/40 , G01K13/00
Abstract: An exemplary method for intelligent compression defines a threshold value for a key performance indicator. Based on the key performance indicator value, data blocks generated by a producer component may be scaled down to reduce power and/or bandwidth consumption when being compressed according to a lossless compression module. The compressed data blocks are then stored in a memory component along with metadata that signals the scaling factor used prior to compression. Consumer components later retrieving the compressed data blocks from the memory component may decompress the data blocks and upscale, if required, based on the scaling factor signaled by the metadata.
-
公开(公告)号:US20180253236A1
公开(公告)日:2018-09-06
申请号:US15448095
申请日:2017-03-02
Applicant: QUALCOMM INCORPORATED
Inventor: SERAG GADELRAB , Jason Edward Podaima , Kyle Ernewein , Meghal Varia
IPC: G06F3/06 , G06F12/1009 , G06F9/50
CPC classification number: G06F3/0613 , G06F3/0656 , G06F3/0659 , G06F3/0673 , G06F9/5016 , G06F12/1009 , G06F13/1642 , G06F13/1663 , G06F13/1689 , G06F2212/1024
Abstract: A method and system for dynamic control of shared memory resources within a portable computing device (“PCD”) are disclosed. A limit request of an unacceptable deadline miss (“UDM”) engine of the portable computing device may be determined with a limit request sensor within the UDM element. Next, a memory management unit modifies a shared memory resource arbitration policy in view of the limit request. By modifying the shared memory resource arbitration policy, the memory management unit may smartly allocate resources to service translation requests separately queued based on having emanated from either a flooding engine or a non-flooding engine.
-
公开(公告)号:US12182676B2
公开(公告)日:2024-12-31
申请号:US18539022
申请日:2023-12-13
Applicant: QUALCOMM Incorporated
Inventor: Serag Gadelrab , James Esliger , Meghal Varia , Kyle Ernewein , Alwyn Dos Remedios , George Lee
Abstract: Certain aspects of the present disclosure provide techniques for concurrently performing inferences using a machine learning model and optimizing parameters used in executing the machine learning model. An example method generally includes receiving a request to perform inferences on a data set using the machine learning model and performance metric targets for performance of the inferences. At least a first inference is performed on the data set using the machine learning model to meet a latency specified for generation of the first inference from receipt of the request. While performing the at least the first inference, operational parameters resulting in inference performance approaching the performance metric targets are identified based on the machine learning model and operational properties of the computing device. The identified operational parameters are applied to performance of subsequent inferences using the machine learning model.
-
公开(公告)号:US11029745B2
公开(公告)日:2021-06-08
申请号:US16184934
申请日:2018-11-08
Applicant: QUALCOMM INCORPORATED
Inventor: Kyle Ernewein , Jason Edward Podaima , Francisco Perez , John Daniels , Alex Miler , Jeffrey Gemar , Rexford Alan Hill , Haoping Xu
IPC: G06F1/32 , G06F1/324 , G06F1/3228
Abstract: Systems and methods are disclosed method for controlling instantaneous current changes in parallel processors with arrays of parallel computing elements, such as neural processors. An exemplary method comprises monitoring the array of computing elements and determining a transition from a first activity level of the array to a second activity level of the array, such as an idle-to-active or active-to-idle transition. Once a transition is determined, the array is selectively controlled to minimize the instantaneous current change from the transition from the first activity level to the second activity level.
-
公开(公告)号:US10609418B2
公开(公告)日:2020-03-31
申请号:US15490620
申请日:2017-04-18
Applicant: QUALCOMM INCORPORATED
Inventor: Serag Gadelrab , Chinchuan Chiu , Moinul Khan , Kyle Ernewein , Tom Longo , Simon Booth , Meghal Varia , Milivoje Aleksic
IPC: H04N19/85 , H04N19/103 , H04N19/176 , H04N19/156 , H04N19/172 , H04N19/146 , H04N19/152 , H04N19/117
Abstract: An exemplary method for intelligent compression defines a threshold value for a temperature reading generated by a temperature sensor. Data blocks received into the compression module are compressed according to either a first mode or a second mode, the selection of which is determined based on a comparison of the active level for the temperature reading to the defined threshold value. The first compression mode may be associated with a lossless compression algorithm while the second compression mode is associated with a lossy compression algorithm. Or, both the first compression mode and the second compression mode may be associated with a lossless compression algorithm, however, for the first compression mode the received data blocks are produced at a default high quality level setting while for the second compression mode the received data blocks are produced at a reduced quality level setting.
-
公开(公告)号:US11907810B2
公开(公告)日:2024-02-20
申请号:US16515711
申请日:2019-07-18
Applicant: QUALCOMM Incorporated
Inventor: Serag Gadelrab , James Esliger , Meghal Varia , Kyle Ernewein , Alwyn Dos Remedios , George Lee
CPC classification number: G06N20/00 , G06F11/3466 , G06N5/04
Abstract: Certain aspects of the present disclosure provide techniques for concurrently performing inferences using a machine learning model and optimizing parameters used in executing the machine learning model. An example method generally includes receiving a request to perform inferences on a data set using the machine learning model and performance metric targets for performance of the inferences. At least a first inference is performed on the data set using the machine learning model to meet a latency specified for generation of the first inference from receipt of the request. While performing the at least the first inference, operational parameters resulting in inference performance approaching the performance metric targets are identified based on the machine learning model and operational properties of the computing device. The identified operational parameters are applied to performance of subsequent inferences using the machine learning model.
-
公开(公告)号:US10067691B1
公开(公告)日:2018-09-04
申请号:US15448095
申请日:2017-03-02
Applicant: QUALCOMM INCORPORATED
Inventor: Serag Gadelrab , Jason Edward Podaima , Kyle Ernewein , Meghal Varia
IPC: G06F3/06 , G06F9/50 , G06F12/10 , G06F12/1009
Abstract: A method and system for dynamic control of shared memory resources within a portable computing device (“PCD”) are disclosed. A limit request of an unacceptable deadline miss (“UDM”) engine of the portable computing device may be determined with a limit request sensor within the UDM element. Next, a memory management unit modifies a shared memory resource arbitration policy in view of the limit request. By modifying the shared memory resource arbitration policy, the memory management unit may smartly allocate resources to service translation requests separately queued based on having emanated from either a flooding engine or a non-flooding engine.
-
-
-
-
-
-