LLM testing made easy with a spreadsheet-like interface. Score tests with natural language, pattern matching, or code. Optimize LLM apps by experimenting with models, parameters, and prompts. Gain insights from test results and analytics.
Hey! I'm Tomas, co-founder and CTO of Langtail. 🚀
Quick update on our LLM provider support!
During beta launch, we only supported OpenAI. Now with Langtail 1.0, we've added all major providers:
- Anthropic Claude
- Google Gemini (includes video support!)
- Open Source models like LLama on Groq
- And more! 🤖
This means more flexibility for you to test and compare different models in one place.
Looking forward to hearing your thoughts! 💬
👋 Hi Product Hunt community! I’m Petr, co-founder and CEO of Langtail.
We launched Langtail into public beta six months ago, and today we're excited to introduce Langtail 1.0 — a major step forward for building and testing AI apps.
Taking an LLM app from an exciting demo to a dependable product is hard work. Over the past year, we’ve heard a lot about the challenges: LLMs are unpredictable, prompt iteration is a mess, and traditional testing tools fall short.
Langtail 1.0 is our answer to these problems. It’s all about control, consistency, and confidence.
Here’s what’s new:
🧪 Spreadsheet-Like Testing Interface: Drop your test cases in with ease, get instant feedback, and iterate quickly—like using Google Sheets, but for LLMs.
🔧 Hosted Tools: Langtail now handles function execution directly within the app, making prototyping easier by running tools without needing to mock responses or set up external infrastructure.
🛠️ Test Configurations: Compare different models side-by-side with just a few clicks, making it easy to see which one works best for your app.
🔗 Shareable AI Apps: Create shareable links to let anyone interact with your LLM—no sign-up required. Perfect for getting feedback or showcasing to colleagues.
🤖 Assistants: Introducing stateful assistants that handle memory and conversation history automatically, reducing the overhead of prompt management. These assistants can be tested, deployed as APIs, and even integrated across models with ease.
✨ Magic Buttons: We've added Magic Buttons to streamline workflows—automatically generate new test cases, adjust prompts, or implement improvements with a click.
🔥 AI Firewall: Real-time protection for your app. Stop prompt injections, denial of service attacks, and data leaks before they happen.
🌟 New Redesign & Light Theme: A complete redesign to improve usability, including a new light theme for those who prefer it.
🚀 Self-Hosting Available: Full control over your data, entirely on your infrastructure. Perfect for larger teams and enterprises.
See more here https://langtail.com/blog/introd....
We’d love for you to give it a try and let us know what you think!
We’ve been using Langtail for a while at Deepnote. It’s helped us improve the reliability of our text-to-SQL system, and let us quickly prototype potential new behaviors for Deepnote AI. The Langtail team’s been super responsive to our feedback, which is great. Excited for what comes next!
Yo, I'm the Product Manager behind Langtail. Thanks for the support so far. We've put a lot into this, really curious what you think of the new serverless function handling and AI assistant support. Check it out and let us know what you think! 🚀
@matt_roskovec Thanks Matt! Glad those metrics stuck out to you 📊 That's just the beginning of what you can do in Langtail though! Check out the tests to make sure that your prompts respond accurately too!
Nice product, solving a real pain point! How does integrate with a product in production? Is there a sdk that we put in our codebase in certain segments and it listens the outputs?
@yigit For integration, we have SDK or OpenAPI. You can also use us as a Proxy - all data goes through us, or asynchronously - you send data from your codebase to Langtail. Additionally, you can use us just for development if you want to develop prompts and test them in the UI environment.
Glad to launch Langtail 1.0 on Product Hunt! One feature I’ve been working hard on is AI Assistants. It allows you to create custom assistants for different tasks and share them publicly—it’s simple to set up and ready to use. Looking forward to hearing your thoughts and seeing how it works for you!
I’m one of the engineers on this project! Very excited to launch 1.0 after months of working on it.
Our team has thought deeply (and talked to other companies much larger than us) about testing LLM-based apps. AMA.
Very nice! I have used it already to polish some of my boring system prompts, so I am looking forward to your progress! Open-source when? I would find it useful (at least to have some parts) self/hosted to integrate into CI/CD pipelines <3
Good luck!
@ryan_hefner Yes. Very neat! The only thing I would love to see is the "diff" between the previous and new version. Now my sys prompt is 2 pages and I dont know of what was really changed.
@raghavendra_devadiga4 Yeah, from our experience, self-hosting is a must-have. Bigger companies need it to keep their data secure and comply with regulations like GDPR and SOC2. It's just the safest option.
We've been using Langtail at Deepnote for over a year now and would highly recommend it to any team serious about shipping AI products. The ROI has been clear from day one!
Awesome product you've built with Langtail, @petrbrzek and Team! I think this has great potential as building apps based on LLMs is not that trivial. I hope this launch boosts you!
Great to see Langtail evolving into a full-fledged testing suite for AI apps! The spreadsheet-style interface is exactly what I've been looking for - been struggling with messy prompt iterations in my recent projects. Love that you've added hosted tools and that AI firewall feature (seriously, prompt injection has been keeping me up at night 😅). The self-hosting option is a huge plus for enterprise teams who need to keep everything in-house. Feels like you guys really listened to the community pain points and delivered. Definitely giving this a spin on my next LLM project! 👍
@_ivan1 Woohoo! Glad to hear that. When you start your next project, hit us up in the chat and we'd be happy to walk you through the setup process and show you around.
Langtail