ā ļø Drop-in replacement for OpenAI with API compatibility š Serve OSS LLMs on CPUs or GPUs āļø Autoscaling with scale from 0 š ļø Zero dependencies (no Istio, Knative, etc.) š¤ Operates OSS model servers (vLLM and Ollama) š Chat UI included