-
公开(公告)号:US20250021827A1
公开(公告)日:2025-01-16
申请号:US18748071
申请日:2024-06-19
Applicant: MEDIATEK INC.
Inventor: Chia-Lin Yu
IPC: G06N3/0985 , G06N3/0495
Abstract: A method for finding at least one optimal post-training quantization model includes converting and optimizing a floating-point machine learning model into a converted machine learning model, applying a plurality of PTO settings to generate a plurality of PTO models, and evaluating the plurality of PTO models based on at least one predetermined indirect metric to find at least one optimal PTO model.
-
公开(公告)号:US20220156567A1
公开(公告)日:2022-05-19
申请号:US17505422
申请日:2021-10-19
Applicant: MediaTek Inc.
Inventor: Chien-Hung Lin , Yi-Min Tsai , Chia-Lin Yu , Chi-Wei Yang
Abstract: A neural network (NN) processing unit includes an operation circuit to perform tensor operations of a given layer of a neural network in one of a first number representation and a second number representation. The NN processing unit further includes a conversion circuit coupled to at least one of an input port and an output port of the operation circuit to convert between the first number representation and the second number representation. The first number representation is one of a fixed-point number representation and a floating-point number representation, and the second number representation is the other one of the fixed-point number representation and the floating-point number representation.
-