Inference Engine for Voice AI: Lower Latency, More Control