Compiling models for dedicated hardware

    公开(公告)号:US12020168B2

    公开(公告)日:2024-06-25

    申请号:US16262807

    申请日:2019-01-30

    Applicant: Apple Inc.

    CPC classification number: G06N3/10 G06F9/461 G06F9/4881 G06F9/5038

    Abstract: The subject technology runs a compiled neural network (NN) model on a particular processor with multiple priority queues for executing different processes, the compiled NN model being assigned to a particular priority queue, and the compiled NN model includes context switch instructions that were previously inserted into a neural network (NN) model from which the compiled NN model was compiled. The subject technology determines that a particular context switch instruction has been executed by the particular processor. The subject technology determines that a different process is waiting to be executed, the different process being assigned to a different priority queue and the different process being a higher priority process than the running compiled NN model. In response to executing the particular context switch instruction, the subject technology performs a context switch to the different process assigned to the different priority queue when the different process is waiting to be executed.

    Flexible resolution support for image and video style transfer

    公开(公告)号:US10909657B1

    公开(公告)日:2021-02-02

    申请号:US16032844

    申请日:2018-07-11

    Applicant: Apple Inc.

    Abstract: Artistic styles extracted from one or more source images may be applied to one or more target images, e.g., in the form of stylized images and/or stylized video sequences. The extracted artistic style may be stored as a plurality of layers in a neural network, which neural network may be further optimized, e.g., via the fusion of various elements of the network's architectures. An optimized network architecture may be determined for each processing environment in which the network will be applied. The artistic style may be applied to the obtained images and/or video sequence of images using various optimization methods, such as the use of scalars to control the resolution of the unstylized and stylized images, temporal consistency constraints, as well as the use of dynamically adjustable or selectable versions of Deep Neural Networks (DNN) that are responsive to system performance parameters, such as available processing resources and thermal capacity.

    Real-time adjustment of hybrid DNN style transfer networks

    公开(公告)号:US10664718B1

    公开(公告)日:2020-05-26

    申请号:US16032909

    申请日:2018-07-11

    Applicant: Apple Inc.

    Abstract: Artistic styles extracted from one or more source images may be applied to one or more target images, e.g., in the form of stylized images and/or stylized video sequences. The extracted artistic style may be stored as a plurality of layers in a neural network, which neural network may be further optimized, e.g., via the fusion of various elements of the network's architectures. An optimized network architecture may be determined for each processing environment in which the network will be applied. The artistic style may be applied to the obtained images and/or video sequence of images using various optimization methods, such as the use of scalars to control the resolution of the unstylized and stylized images, temporal consistency constraints, as well as the use of dynamically adjustable or selectable versions of Deep Neural Networks (DNN) that are responsive to system performance parameters, such as available processing resources and thermal capacity.

    Annotation override determination for a neural network

    公开(公告)号:US12182619B2

    公开(公告)日:2024-12-31

    申请号:US18074440

    申请日:2022-12-02

    Applicant: Apple Inc.

    Abstract: The subject technology provides for dynamic task allocation for neural network models. The subject technology determines an operation performed at a node of a neural network model. The subject technology assigns an annotation to indicate whether the operation is better performed on a CPU or a GPU based at least in part on hardware capabilities of a target platform. The subject technology determines whether the neural network model includes a second layer. The subject technology, in response to determining that the neural network model includes a second layer, for each node of the second layer of the neural network model, determines a second operation performed at the node. Further the subject technology assigns a second annotation to indicate whether the second operation is better performed on the CPU or the GPU based at least in part on the hardware capabilities of the target platform.

    SYSTEMS AND METHODS OF MEMORY ALLOCATION FOR NEURAL NETWORKS

    公开(公告)号:US20240403119A1

    公开(公告)日:2024-12-05

    申请号:US18581097

    申请日:2024-02-19

    Applicant: Apple Inc.

    Abstract: A method may include accessing a data processing architecture associated with a neural network to determine dependencies between intermediate data layers of the neural network; obtaining dimensions of the intermediate data layers in the neural network; calculating a minimum number of data storage portions for executing the neural network based on the dependencies; determining a memory allocation size for each respective data storage portion of the data storage portions based on the dimensions and dependencies; allocating memory on a storage device for each data storage portion in accordance with its respective determined memory allocation size.

    Compiling models for dedicated hardware

    公开(公告)号:US11468338B2

    公开(公告)日:2022-10-11

    申请号:US16262809

    申请日:2019-01-30

    Applicant: Apple Inc.

    Abstract: The subject technology provides receiving a neural network (NN) model to be executed on a target platform, the NN model including multiple layers that include operations and some of the operations being executable on multiple processors of the target platform. The subject technology further sorts the operations from the multiple layers in a particular order based at least in part on grouping the operations that are executable by a particular processor of the multiple processors. The subject technology determines, based at least in part on a cost of transferring the operations between the multiple processors, an assignment of one of the multiple processors for each of the sorted operations of each of the layers in a manner that minimizes a total cost of executing the operations. Further, for each layer of the NN model, the subject technology includes an annotation to indicate the processor assigned for each of the operations.

    Device text to speech
    20.
    发明授权

    公开(公告)号:US11289073B2

    公开(公告)日:2022-03-29

    申请号:US16552309

    申请日:2019-08-27

    Applicant: Apple Inc.

    Abstract: Systems and processes for generating speech from text are provided. An example method of generating speech from text includes, at an electronic device having at least one processor and memory, obtaining text; generating a plurality of segments of a spectrogram using a first neural network, each spectrogram segment of the plurality of spectrogram segments representing a portion of the text; generating, based on the plurality of spectrogram segments, a plurality of speech segments using a second neural network; and providing the plurality of speech segments as a speech output.

Patent Agency Ranking