About the team
Engineering builds and runs Vercilio end to end — the chat product, the model-routing layer, billing, and the infrastructure that keeps streaming responses fast and reliable for everyone. We ship in small, autonomous teams and value people who can own a problem from idea to production.
About the role
You'll own the systems that keep Vercilio up: deployments, observability, performance, and cost. When thousands of streams are open at once, you make sure they stay fast and stable.
You like making the invisible parts of a product excellent, and you measure your success in nines and latency percentiles.
In this role, you will
- Own CI/CD, deployments, and environment management.
- Build observability — metrics, logs, traces, and alerting — across the stack.
- Profile and tune latency for streaming AI workloads.
- Keep infrastructure spend efficient as usage grows.
You might thrive here if you have
- Experience operating production systems and taking part in on-call rotations.
- Comfort with cloud infrastructure, edge runtimes, and serverless platforms.
- A pragmatic, automation-first mindset.
- Bonus: experience with high-throughput, low-latency streaming systems.
Sound like you?
There's no formal application — just send us an email telling us a little about yourself, why this role, and anything you've built that you're proud of. We read every message.
Send us an email to join us: contact@vercilio.com

