Inference TimeTimes presented in the tables are measured as consecutive runs of the model. Initial run times may be up to 2x longer due to model loading and initialization.Memory UsageClassificationModel SizeClassification
Inference TimeTimes presented in the tables are measured as consecutive runs of the model. Initial run times may be up to 2x longer due to model loading and initialization.