Posted at: 6 March
Operations Team Lead (Production & Reliability)
Company
Complexio
Complexio is a London-based B2B SaaS platform specializing in AI-driven business process automation through its Event Knowledge Graph for enterprise clients.
Remote Hiring Policy:
Complexio Limited supports remote work for candidates located within 4-5 hours of the CET timezone, fostering collaboration across various regions.
Job Type
Full-time
Allowed Applicant Locations
Europe, Africa, Middle East
Job Description
Complexio is Foundational AI works to automate business activities by ingesting whole company data – both structured and unstructured – and making sense of it. Using proprietary models and algorithms Complexio forms a deep understanding of how humans are interacting and using it. Automation can then replicate and improve these actions independently.
Complexio is a joint venture between Hafnia and Símbolo, in partnership with Marfin Management, C Transport Maritime, Trans Sea Transport and BW Epic Kosan.
Operations Team Lead (Production & Reliability)
We’re looking for an Operations Team Lead to own production.
Not just keep it running, but build a system that scales.
You’ll lead operational excellence across all live customer-facing systems. Your mission: make production reliable, observable, predictable, and continuously improving.
This is a hands-on role. You’ll shape process, lead incidents, build the team, and move us from reactive firefighting to proactive reliability engineering.
What You’ll Own
Production
- Stability and availability of all live systems
- Operational readiness for new releases
- Safe production access and change coordination
Production is a high-discipline environment. You make sure it stays that way.
Incident Management
You own the full lifecycle:
- High-signal alerting and fast detection
- Structured incident response
- Clear internal and customer communication
- Blameless postmortems
- Systemic fixes that prevent repeats
Goal: Fast recovery. Fewer recurring incidents.
On-Call
- Design sustainable rotations
- Clear escalation paths
- Defined severity levels
- Strong runbooks
- No burnout culture
Someone accountable is always reachable. Escalations are fast and predictable.
Monitoring & Reliability
- Define SLIs/SLOs for critical systems
- Improve visibility across availability, latency, errors, and saturation
- Track MTTR, incident frequency, and escalation trends
- Drive reliability roadmap initiatives
We measure reliability, and improve it continuously.
Team Leadership
- Lead and grow the Operations team
- Set clear standards and KPIs
- Build a culture of ownership and accountability
- Raise the bar on operational discipline
You’re responsible for both system performance and team performance.
What We’re Looking For
- Strong experience in SRE, DevOps, Infrastructure, or Production Engineering
- Prior experience leading technical teams
- Deep hands-on incident management experience
- Strong observability and reliability mindset
- Calm under pressure, clear in communication
- Systems thinker, fixes root causes, not symptoms
How We Think
- Production is sacred.
- Clear ownership beats ambiguity.
- Blameless culture, high accountability.
- Fix systems, not people.
- Reliability is a product feature.