Posted at: 12 February

SRE/Incident Response

Company

CompanyXBOW

XBOW is a remote-based B2B SaaS company specializing in AI-powered penetration testing solutions for the cybersecurity industry, targeting organizations in need of advanced security measures.

Remote Hiring Policy:

XBOW operates as a fully remote company, with all team members working remotely. While specific hiring locations are not detailed, the company supports collaboration through regular meetings and travel for in-person interactions.

Job Type

Full-time

Allowed Applicant Locations

Worldwide

Job Description

Build the future of offensive security with XBOW. Attackers are already using AI to move faster than defenders can react—we’re creating the platform that puts security ahead in the arms race. Our AI-powered system autonomously discovers, validates, and even exploits vulnerabilities, giving organizations proof-backed results in hours instead of weeks.

Founded by Oege de Moor, creator of GitHub Copilot, and backed by Sequoia, Altimeter, and other leading investors, XBOW is applying cutting-edge AI to one of the world’s most urgent problems. In just over a year, our AI, built by a world-class AI team and legendary security researchers — has uncovered thousands of real-world zero-days across the software billions rely on, and achieved the #1 ranking on HackerOne’s global leaderboard.

We’re a team of builders, hackers, and researchers who thrive on solving problems others think are impossible. If you want to push the boundaries of AI, reshape how security is done, and join the group defining this new era of defense — we’d love to talk.

Your Role: Site Reliability Engineer (SRE), Automation, and Incident Response

In this role, daily work centers on keeping XBOW’s production systems stable, observable, and resilient as the product scales. You’d be building and maintaining automated reliability tooling, covering monitoring, alerting, and self healing; while defining and tracking service level goals for both production and development environments.

The role involves close collaboration with infrastructure and feature teams to manage cloud systems through IaC, review architectural changes for reliability and capacity impact, and respond to incidents during local working hours as part of a “follow the sun model.”

When issues occur, you’d lead or contribute to root-cause investigations, analyze incident trends across the organization, and turn those insights into improvements that reduce future risk. You’d also help maintain internal and customer-facing status dashboards that clearly communicate system health and uptime.

Responsibilities:

  • Automation of site reliability infrastructure, monitoring, and self-healing systems.

  • Definition and ownership of Service Level Objectives for production and development deployments.

  • Infrastructure-as-code for production and development systems, in collaboration with the infrastructure engineering team.

  • Incident response:

    • Responding to in-hours alerts (we run a follow-the-sun model to avoid out-of-hours paging)

    • Conducting RCAs in collaboration with the feature teams

    • Building resilience to prevent future outages.

  • Incident analysis: Organization-wide analysis of incident cause, frequency, and severity, to guide prioritization of future changes.

  • Design reviews for architectural changes: reviewing for scalability, reliability, and capacity planning.

  • Public and internal status and uptime dashboards.

Skills and Qualifications

Essential:

  • Strong experience with TypeScript

  • Hands-on experience with AWS

  • Solid expertise in Linux, plus experience with infrastructure & DevOps tooling such as Kubernetes, Docker, Terraform, and CI/CD pipelines (especially GitHub Actions)

  • Background in infrastructure automation and/or incident response (depth may vary by candidate)

  • Familiarity with monitoring and observability tools such as OpenTelemetry, Prometheus, VictoriaMetrics, Grafana, and Datadog

Advantageous:

  • Experience with Python and/or Go

  • Experience with additional cloud providers beyond AWS

What we offer

  • Compensation & Equity: Competitive salary and a generous equity package, making you a true owner of the company.

  • Career Growth: Shape your role, lead the function, and grow with the company as we redefine cybersecurity.

  • Meaningful Work: You will tackle technically complex challenges and play a pivotal role in the growth of our business, working alongside an amazing team and some of the world’s experts to shape how AI transforms cybersecurity.

What else you should know

  • Location: Remote (all team members are remote but we meet regularly and you’re supported to travel to collaborate with colleagues in person)

  • Contract: Full-time.

We aren't focused on seniority titles at XBOW—so if you’re worried about “leveling,” don’t be. We care a lot more about mission fit, capability, and impact than what’s on your LinkedIn headline.

We believe in people who are driven by curiosity and a willingness to learn. Even if you don't check every box, we encourage you to apply if you're excited about the role and our mission.