Re-thinking computation offload for efficient inference on IoT devices with duty-cycled radios