Skip to main content
Version: Next

Memory Usage

Classification

info

Memory usage values represent the peak memory increase observed while the model was loaded and actively running inference, relative to the baseline app memory before model initialization.

Model / DeviceiPhone 17 Pro [MB]Google Pixel 10 [MB]
EFFICIENTNET_V2_S (XNNPACK FP32)101122
EFFICIENTNET_V2_S (XNNPACK INT8)6278
EFFICIENTNET_V2_S (Core ML FP32)101-
EFFICIENTNET_V2_S (Core ML FP16)87-

Object Detection

info

Memory usage values represent the peak memory increase observed while the model was loaded and actively running inference, relative to the baseline app memory before model initialization.

Model / DeviceiPhone 17 Pro [MB]Google Pixel 10 [MB]
SSDLITE_320_MOBILENET_V3_LARGE (XNNPACK FP32)94104
SSDLITE_320_MOBILENET_V3_LARGE (Core ML FP32)83-
SSDLITE_320_MOBILENET_V3_LARGE (Core ML FP16)62-

Style Transfer

info

Memory usage values represent the peak memory increase observed while the model was loaded and actively running inference, relative to the baseline app memory before model initialization.

Model / DeviceiPhone 17 Pro [MB]Google Pixel 10 [MB]
STYLE_TRANSFER_CANDY (XNNPACK FP32)12001200
STYLE_TRANSFER_CANDY (XNNPACK INT8)800800
STYLE_TRANSFER_CANDY (Core ML FP32)400-
STYLE_TRANSFER_CANDY (Core ML FP16)380-
STYLE_TRANSFER_MOSAIC (XNNPACK FP32)12001200
STYLE_TRANSFER_MOSAIC (XNNPACK INT8)800800
STYLE_TRANSFER_MOSAIC (Core ML FP32)400-
STYLE_TRANSFER_MOSAIC (Core ML FP16)380-
STYLE_TRANSFER_UDNIE (XNNPACK FP32)12001200
STYLE_TRANSFER_UDNIE (XNNPACK INT8)800800
STYLE_TRANSFER_UDNIE (Core ML FP32)400-
STYLE_TRANSFER_UDNIE (Core ML FP16)380-
STYLE_TRANSFER_RAIN_PRINCESS (XNNPACK FP32)12001200
STYLE_TRANSFER_RAIN_PRINCESS (XNNPACK INT8)800800
STYLE_TRANSFER_RAIN_PRINCESS (Core ML FP32)400-
STYLE_TRANSFER_RAIN_PRINCESS (Core ML FP16)380-

OCR

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
Detector (CRAFT) + Recognizer (CRNN)14001320

Vertical OCR

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
Detector (CRAFT) + Recognizer (CRNN)1000-16001000-1500

LLMs

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

ModelAndroid (XNNPACK) [GB]iOS (XNNPACK) [GB]
LLAMA3_2_1B3.33.1
LLAMA3_2_1B_SPINQUANT1.92.4
LLAMA3_2_1B_QLORA2.72.8
LLAMA3_2_3B7.17.3
LLAMA3_2_3B_SPINQUANT3.73.8
LLAMA3_2_3B_QLORA3.94.0

Speech to text

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
WHISPER_TINY410375

Text to speech

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
KOKORO_SMALL820820
KOKORO_MEDIUM11401100
info

The reported memory usage values include the memory footprint of the Phonemis package, which is used for phonemizing input text. Currently, this can range from 100 to 150 MB depending on the device.

Text Embeddings

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
ALL_MINILM_L6_V295110
ALL_MPNET_BASE_V2405455
MULTI_QA_MINILM_L6_COS_V1120140
MULTI_QA_MPNET_BASE_DOT_V1435455
CLIP_VIT_BASE_PATCH32_TEXT200280

Image Embeddings

info

Memory usage values represent the peak memory increase observed while the model was loaded and actively running inference, relative to the baseline app memory before model initialization.

Model / DeviceiPhone 17 Pro [MB]Google Pixel 10 [MB]
CLIP_VIT_BASE_PATCH32_IMAGE (XNNPACK FP32)340345

Semantic Segmentation

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

warning

Data presented in the following sections is based on inference with non-resized output. When resize is enabled, expect higher memory usage and inference time with higher resolutions.

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
DEELABV3_RESNET50930660

Instance Segmentation

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

warning

Data presented in the following sections is based on inference with forward_640 method.

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
YOLO26N_SEG92668
YOLO26S_SEG220712
YOLO26M_SEG570815
YOLO26L_SEG6801024
YOLO26X_SEG14101450
RF_DETR_NANO_SEG620603

Text to image

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
BK_SDM_TINY_VPRED_25624002400
BK_SDM_TINY_VPRED62106050