Skip to main content
Version: Next

Memory Usage

info

All the below benchmarks were performed on iPhone 17 Pro (iOS) and OnePlus 12 (Android).

Classification

ModelAndroid (XNNPACK) [MB]iOS (Core ML) [MB]
EFFICIENTNET_V2_S23087

Object Detection

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
SSDLITE_320_MOBILENET_V3_LARGE164132

Style Transfer

ModelAndroid (XNNPACK) [MB]iOS (Core ML) [MB]
STYLE_TRANSFER_CANDY1200380
STYLE_TRANSFER_MOSAIC1200380
STYLE_TRANSFER_UDNIE1200380
STYLE_TRANSFER_RAIN_PRINCESS1200380

OCR

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
Detector (CRAFT) + Recognizer (CRNN)14001320

Vertical OCR

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
Detector (CRAFT) + Recognizer (CRNN)1000-16001000-1500

LLMs

ModelAndroid (XNNPACK) [GB]iOS (XNNPACK) [GB]
LLAMA3_2_1B3.33.1
LLAMA3_2_1B_SPINQUANT1.92.4
LLAMA3_2_1B_QLORA2.72.8
LLAMA3_2_3B7.17.3
LLAMA3_2_3B_SPINQUANT3.73.8
LLAMA3_2_3B_QLORA3.94.0

Speech to text

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
WHISPER_TINY410375

Text to speech

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
KOKORO_SMALL820820
KOKORO_MEDIUM11401100
info

The reported memory usage values include the memory footprint of the Phonemis package, which is used for phonemizing input text. Currently, this can range from 100 to 150 MB depending on the device.

Text Embeddings

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
ALL_MINILM_L6_V295110
ALL_MPNET_BASE_V2405455
MULTI_QA_MINILM_L6_COS_V1120140
MULTI_QA_MPNET_BASE_DOT_V1435455
CLIP_VIT_BASE_PATCH32_TEXT200280

Image Embeddings

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
CLIP_VIT_BASE_PATCH32_IMAGE345340

Image Segmentation

warning

Data presented in the following sections is based on inference with non-resized output. When resize is enabled, expect higher memory usage and inference time with higher resolutions.

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
DEELABV3_RESNET50930660

Text to image

ModelAndroid (XNNPACK) [MB]iOS (XNNPACK) [MB]
BK_SDM_TINY_VPRED_25624002400
BK_SDM_TINY_VPRED62106050