Posted at: 12 March
Cloud Operations Engineer
Company
Lumin Digital is a San Ramon, California-based B2B cloud-native digital banking platform provider, specializing in innovative solutions for financial institutions across the United States.
Remote Hiring Policy:
Lumin Digital operates a remote-first work environment with a hybrid workspace model, supporting remote work from various locations, including the United States. Team members gather twice a year for in-person collaboration.
Job Type
Full-time
Allowed Applicant Locations
United States
Salary
$110,000 to $125,000 per year
Job Description
The Operations Center has two main focuses: removing toil and enhancing platform visibility. The Cloud Operations Engineer is responsible for monitoring cloud infrastructure using observability tools, performing Tier 1 incident triage, and ensuring timely resolution or escalation of production issues. The role also supports CI/CD pipelines, mobile application releases, and SSL/TLS certificate lifecycle management while maintaining accurate documentation and clear cross-functional communication.
Reporting to the Operations Center Manager, the qualified candidate will possess exceptional communication and cross-functional skills and a solid understanding of cloud infrastructure.
Essential Functions and Responsibilities:
Monitor cloud infrastructure and application health using observability tools; respond to alerts and ensure timely triage and resolution of production issues.
Perform Tier 1 incident triage, document findings, and escalate appropriately to Development or SRE teams while maintaining clear communication.
Monitor and support CI/CD pipelines to ensure successful builds and deployments; troubleshoot and coordinate resolution of pipeline failures.
Support and coordinate mobile application release processes.
Manage SSL/TLS certificate lifecycle activities, including renewals and proactive expiration monitoring.
Proactively identify patterns in incidents or alerts and implement improvements that reduce operational toil and increase platform stability.
Contribute to automation and orchestration efforts that improve efficiency, reliability, and repeatability of operational processes.
Maintain accurate documentation, runbooks, and standard operating procedures to improve operational consistency and knowledge sharing.
Collaborate cross-functionally with SRE, Development, Security, Product, and Support to ensure platform health, visibility, and alignment to shared business goals.
Perform other duties as assigned.
Position Specifications
Education:
Bachelor's degree or 3 years of equivalent experience
Certifications are nice, but not required
Requirements:
Cultural fit. Humility. Strong sense of ownership and integrity. Willing to walk in the mud.
Commitment to continually improving yourself.
Detail-oriented.
Exceptional written and verbal communication skills.
Effective collaboration skills with a proven ability to work cross-functionally in order to establish and meet shared business goals.
Knowledge, Skills, & Abilities:
Experience with a monitoring platform (CloudWatch, Grafana, etc.)
Familiarity with automation/orchestration tools
Familiarity with the Atlassian suite or similar tools
Experience with AWS preferred
Travel:
Minimal, generally 12 days or fewer per year, for roughly two team get-togethers a year