Agentic IT OperationsStartupAgent Builder
Vellum
Dev platform to build, evaluate, and orchestrate enterprise LLM agents
Mkt Cap / ValPrivate
RevenueEarly Stage
Growth+150% YoY
Jul 2025: Series A $20M
Fuses visual workflow orchestration with rigorous evals and production observability for engineer + domain-expert collaboration
SWOT Analysis
Strengths
- Orchestration, evals, and observability combined in one platform, reducing tool sprawl
- Visual builder lets engineers and non-technical domain experts collaborate on prompts
- Model- and framework-agnostic, with Python/TS code running natively in the graph
- Strong eval discipline tied to versioned, CI-safe deployments and production feedback
- Proven enterprise traction in regulated verticals; claims ~10x faster time-to-market
Opportunities
- Ride the agent wave with the no-code builder aimed at ops teams
- Land-and-expand in enterprises lacking AI strategy and mature data
- Geographic and vertical expansion into more regulated industries
- Differentiate on governance and data isolation for compliance buyers
Weaknesses
- Opaque, sales-led pricing with no public page makes TCO modeling hard
- No native CI/CD eval gating (e.g., PR-blocking) versus some rivals
- UI/UX occasionally clunky; eval UI, annotation queue, and dataset UX need work
- Steep learning curve for advanced flows; agent features are recent
Threats
- Eval-native rivals and LangSmith inside LangChain compete on price and CI
- Foundation-model vendors and frameworks absorbing orchestration and evals natively
- Crowded field (Humanloop, PromptLayer, Langfuse, Arize) compresses differentiation
- No-code agent builders face a fragmenting, hype-exposed category
User Sentiment
Synthesized from G2, Gartner Peer Insights, and analyst review data.
What users love
- Intuitive low-code workflow builder — weeks of work in minutes
- Non-technical team members can iterate on prompts independently
- Side-by-side prompt and model comparison with automated evals
- Function calling plus workflows build complex interactions without custom code
Common complaints
- UI can be clunky or buggy
- Eval UI, annotation queue, and dataset UX want improvement
- Learning curve for advanced features and lack of published pricing
Customer Profile
Who buys this
Typical segments
Mid-market & enterprise engCross-functional product teamsOps teams (emerging)
Typical buyer
VP/Director of Engineering or Head of AI needing production rigor
Top use cases
- 1Conversational agents at scale, evaluated across thousands of cases
- 2Regulated/compliance AI automation with data isolation
- 3Healthcare workflow and document AI with regression testing
Future Focus Areas
1
Maturing the no-code agent builder and MCP Agent Node
2
Foundational AI-stack layer with more deployed use cases
3
Stronger CI/CD eval gating
4
Enterprise governance and data-isolation features