Invention Publication
- Patent Title: ELECTRONIC DEVICE FOR IMPROVING INFERENCE PERFORMANCE OF PRE-TRAINED LANGUAGE MODEL, METHOD THEREOF AND RECORDING MEDIUM
-
Application No.: US18476598Application Date: 2023-09-28
-
Publication No.: US20240111965A1Publication Date: 2024-04-04
- Inventor: Tae-Sun Chung , Zhen Zhang
- Applicant: AJOU UNIVERSITY INDUSTRY-ACADEMIC COOPERATION FOUNDATION
- Applicant Address: KR Suwon-Si
- Assignee: AJOU UNIVERSITY INDUSTRY-ACADEMIC COOPERATION FOUNDATION
- Current Assignee: AJOU UNIVERSITY INDUSTRY-ACADEMIC COOPERATION FOUNDATION
- Current Assignee Address: KR Suwon-Si
- Priority: KR 20220123189 2022.09.28
- Main IPC: G06F40/40
- IPC: G06F40/40

Abstract:
The electronic device for improving the inference performance of a pre-trained language model according to an exemplary embodiment of the present invention includes a processor for sequentially passing input data through a plurality of transformer layers of the pre-trained language model and obtaining output data, wherein the processor calculates a probability distribution for prediction results received from each transformer layer in each of a plurality of middle layers connected to each rear end of the plurality of transformer layers, and measures a confidence level of the prediction results based on an entropy value calculated by using the probability distribution, and wherein when a confidence level less than a predefined value is measured in a predefined number of consecutive middle layers among the plurality of middle layers, the processor outputs a prediction result of the first middle layer as the output data by taking the last first middle layer among the consecutive middle layers as an exit.
Information query