Model SizeClassificationMemory UsageClassificationInference TimeTimes presented in the tables are measured as consecutive runs of the model. Initial run times may be up to 2x longer due to model loading and initialization.
Inference TimeTimes presented in the tables are measured as consecutive runs of the model. Initial run times may be up to 2x longer due to model loading and initialization.