We built Datadog’s natural language querying features (variant of text->SQL) using a fine-tuned model, replacing prompted OpenAI models.
We did this explicitly for latency and cost purposes: the feature actually translates as you type in the UI, which required both <500ms latency, and would have been wasteful to do on a pay-per-token model like the hosted providers. We run it on our own pay-per-hour GPUs, allowing real time translation. Any UX that’s trying to feel like a tab-completion model (fast, user accept/reject) likely would benefit from similar approaches
https://docs.datadoghq.com/logs/explorer/search#natural-language-queries…