Speech-to-text API: how to evaluate and integrate