Responsibilities
- Manage and enhance the organization's cloud infrastructure and multi-cloud design, including Infrastructure-as-Code tools like Terraform and Terramate, to support global growth.
- Maintain and improve core data and application platforms such as PostgreSQL/Aurora, event streaming systems, asynchronous processing, and database migration tools while isolating complexity from product development teams.
- Ensure platform reliability by overseeing end-to-end uptime and latency service level objectives, advancing observability with Datadog, refining incident response using Incident.io, and managing a shared on-call schedule that remains scalable.
- Lead cross-functional initiatives in governance, compliance, and financial operations, including infrastructure cost management, identity and access policies, backup systems, disaster recovery, and exit planning through automated reporting.
- Integrate AI-driven tools into infrastructure operations to streamline incident analysis, automate runbook workflows, and reduce repetitive tasks, enabling engineers to focus on high-impact work.
Work Arrangement
Hybrid — France, Spain, Belgium, Canada
Other
Remote work: We offer remote work flexibility, but we value in-person collaboration