Slow Loading

Common causes:

  • the model is too large for the device
  • the runtime is falling back to CPU
  • storage is slow
  • the app reloads the model too often

What to try:

  1. Confirm the runtime is using the accelerator path you expect.
  2. Drop down one model size and compare load times.
  3. Use the simplest runtime that exposes loading behavior clearly.
  4. Treat long load times as a hardware-fit signal, not just an annoyance.
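
To make the comparison in step 2 concrete, and to separate slow storage from slow runtime setup, a rough timing sketch like the one below may help. It uses only the Python standard library. All names in it are hypothetical: `load_model_bytes` is a stand-in that just reads the file, and you would swap in whatever load call your runtime actually provides. Note that a second read of the same file is often served from the operating system's cache, so the first run is usually the more honest number.

```python
import sys
import time
from functools import lru_cache
from pathlib import Path


def read_throughput_mb_s(path, chunk_size=1 << 20):
    """Read the file once in chunks; return (size_mb, seconds, mb_per_second)."""
    start = time.perf_counter()
    total = 0
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            total += len(chunk)
    elapsed = time.perf_counter() - start
    size_mb = total / 1e6
    rate = size_mb / elapsed if elapsed > 0 else float("inf")
    return size_mb, elapsed, rate


def time_load(load_fn, *args, **kwargs):
    """Call any loader and return (loaded_object, seconds_taken)."""
    start = time.perf_counter()
    model = load_fn(*args, **kwargs)
    return model, time.perf_counter() - start


def load_model_bytes(path):
    """Hypothetical placeholder loader: replace with your runtime's load call."""
    return Path(path).read_bytes()


@lru_cache(maxsize=1)
def get_model(path):
    """Keep one loaded model in memory so the app does not reload it per request."""
    return load_model_bytes(path)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Pass one or more model files (for example two sizes) to compare them.
    for model_path in sys.argv[1:]:
        size_mb, read_s, rate = read_throughput_mb_s(model_path)
        _, load_s = time_load(load_model_bytes, model_path)
        print(f"{model_path}: {size_mb:.0f} MB, raw read {read_s:.2f} s "
              f"({rate:.0f} MB/s), load {load_s:.2f} s")
```

If the raw read already accounts for most of the load time, storage is the likely bottleneck. If the load takes much longer than the raw read, the time is probably going into runtime setup or a CPU fallback instead. The cached `get_model` is one simple way to address the "reloads too often" cause, assuming the process stays alive between uses.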

If the model takes so long to load that you avoid using it, it is effectively the wrong model for that device.