Skip to main content
Version: 0.8.x

Memory Usage

info

Memory usage values represent the peak memory increase observed while the model was loaded and actively running inference, relative to the baseline app memory before model initialization.

Classification

Model / DeviceiPhone 17 Pro [MB]Google Pixel 10 [MB]
EFFICIENTNET_V2_S (XNNPACK FP32)101122
EFFICIENTNET_V2_S (XNNPACK INT8)6278
EFFICIENTNET_V2_S (Core ML FP32)101-
EFFICIENTNET_V2_S (Core ML FP16)87-

Object Detection

note

Data presented for YOLO models is based on inference with forward_640 method.

Model / DeviceiPhone 17 Pro [MB]Google Pixel 10 [MB]
SSDLITE_320_MOBILENET_V3_LARGE (XNNPACK FP32)94104
SSDLITE_320_MOBILENET_V3_LARGE (Core ML FP32)83-
SSDLITE_320_MOBILENET_V3_LARGE (Core ML FP16)62-
RF_DETR_NANO (XNNPACK FP32)145162
YOLO26N (XNNPACK FP32)3644
YOLO26S (XNNPACK FP32)8182
YOLO26M (XNNPACK FP32)123158
YOLO26L (XNNPACK FP32)170172
YOLO26X (XNNPACK FP32)320309

Style Transfer

Model / DeviceiPhone 17 Pro [MB]Google Pixel 10 [MB]
STYLE_TRANSFER_CANDY (XNNPACK FP32)12001200
STYLE_TRANSFER_CANDY (XNNPACK INT8)800800
STYLE_TRANSFER_CANDY (Core ML FP32)400-
STYLE_TRANSFER_CANDY (Core ML FP16)380-
STYLE_TRANSFER_MOSAIC (XNNPACK FP32)12001200
STYLE_TRANSFER_MOSAIC (XNNPACK INT8)800800
STYLE_TRANSFER_MOSAIC (Core ML FP32)400-
STYLE_TRANSFER_MOSAIC (Core ML FP16)380-
STYLE_TRANSFER_UDNIE (XNNPACK FP32)12001200
STYLE_TRANSFER_UDNIE (XNNPACK INT8)800800
STYLE_TRANSFER_UDNIE (Core ML FP32)400-
STYLE_TRANSFER_UDNIE (Core ML FP16)380-
STYLE_TRANSFER_RAIN_PRINCESS (XNNPACK FP32)12001200
STYLE_TRANSFER_RAIN_PRINCESS (XNNPACK INT8)800800
STYLE_TRANSFER_RAIN_PRINCESS (Core ML FP32)400-
STYLE_TRANSFER_RAIN_PRINCESS (Core ML FP16)380-

OCR

Model / DeviceiPhone 17 Pro [MB]OnePlus 12 [MB]
Detector (CRAFT) + Recognizer (CRNN) (XNNPACK FP32)13201400

Vertical OCR

Model / DeviceiPhone 17 Pro [MB]OnePlus 12 [MB]
Detector (CRAFT) + Recognizer (CRNN) (XNNPACK FP32)1000-15001000-1600

LLMs

Model / DeviceiPhone 17 Pro [GB]OnePlus 12 [GB]
LLAMA3_2_1B (XNNPACK)3.13.3
LLAMA3_2_1B_SPINQUANT (XNNPACK)2.41.9
LLAMA3_2_1B_QLORA (XNNPACK)2.82.7
LLAMA3_2_3B (XNNPACK)7.37.1
LLAMA3_2_3B_SPINQUANT (XNNPACK)3.83.7
LLAMA3_2_3B_QLORA (XNNPACK)4.03.9

Speech to Text

Model / DeviceiPhone 17 Pro [MB]OnePlus 12 [MB]
WHISPER_TINY (XNNPACK)375410

Text to Speech

note

The reported memory usage values include the memory footprint of the Phonemis package, which is used for phonemizing input text. Currently, this can range from 100 to 150 MB depending on the device.

Model / DeviceiPhone 17 Pro [MB]OnePlus 12 [MB]
KOKORO_SMALL (XNNPACK)820820
KOKORO_MEDIUM (XNNPACK)11001140

Text Embeddings

Model / DeviceiPhone 17 Pro [MB]OnePlus 12 [MB]
ALL_MINILM_L6_V2 (XNNPACK)11095
ALL_MPNET_BASE_V2 (XNNPACK)455405
MULTI_QA_MINILM_L6_COS_V1 (XNNPACK)140120
MULTI_QA_MPNET_BASE_DOT_V1 (XNNPACK)455435
CLIP_VIT_BASE_PATCH32_TEXT (XNNPACK)280200

Image Embeddings

Model / DeviceiPhone 17 Pro [MB]Google Pixel 10 [MB]
CLIP_VIT_BASE_PATCH32_IMAGE (XNNPACK FP32)340345

Semantic Segmentation

note

Data presented in the following sections is based on inference with non-resized output. When resize is enabled, expect higher memory usage and inference time with higher resolutions.

Model / DeviceiPhone 17 Pro [MB]OnePlus 12 [MB]
DEEPLABV3_RESNET50 (XNNPACK)660930

Instance Segmentation

note

Data presented in the following sections is based on inference with forward_640 method.

Model / DeviceiPhone 17 Pro [MB]OnePlus 12 [MB]
YOLO26N_SEG (XNNPACK)66892
YOLO26S_SEG (XNNPACK)712220
YOLO26M_SEG (XNNPACK)815570
YOLO26L_SEG (XNNPACK)1024680
YOLO26X_SEG (XNNPACK)14501410
RF_DETR_NANO_SEG (XNNPACK)603620

Text to Image

Model / DeviceiPhone 17 Pro [MB]OnePlus 12 [MB]
BK_SDM_TINY_VPRED_256 (XNNPACK)24002400
BK_SDM_TINY_VPRED (XNNPACK)60506210