Mountain View, CA, USA On-site

Gatik AI is hiring a Senior/Staff Site Reliability Engineer

About the Role

Gatik AI is hiring a Senior/Staff Site Reliability Engineer to support the operation, monitoring, and scaling of our growing fleet of autonomous vehicles. You will work closely with our infrastructure and platform teams to manage rollouts of both on-premises and cloud infrastructure in support of expansions to new customer sites.

What You'll Do

  • Upgrade and maintain both physical and cloud infrastructure used for offloading data from our autonomous vehicle fleet.
  • Partner with the infrastructure and platform engineering teams to monitor, maintain, and troubleshoot our on-premises data offload and CI systems.
  • Design, develop, and maintain business intelligence dashboards and ETL pipelines to provide actionable insights into our infrastructure performance and health.
  • Architect and deploy test environments to validate internal and customer-facing infrastructure solutions.
  • Automate deployment, scaling, and upgrading of our remote monitoring software to ensure operational efficiency.
  • Perform ongoing analysis of infrastructure performance, identifying opportunities for optimization in latency, throughput, and reliability.

What We're Looking For

  • 5+ years of experience in a related role such as Site Reliability Engineer, DevOps Engineer, or Infrastructure Engineer.
  • Strong knowledge of networking fundamentals, including protocols, troubleshooting, and optimization.
  • Hands-on experience with Docker and related ecosystem tools.
  • Expertise in Kubernetes deployments and package management via Helm.
  • Proficiency with relational and time-series databases such as Postgres, TimescaleDB, or InfluxDB.
  • Familiarity with workflow orchestration tools such as Argo and Airflow.
  • Proven experience managing upgrades and rollbacks for customer-facing SaaS environments.
  • Scripting experience in Python and Bash for automation and tooling.
  • Experience building and maintaining dashboards with tools like Grafana.

Technical Stack

  • Docker, Kubernetes, Helm
  • Postgres, TimescaleDB, InfluxDB
  • Argo, Airflow
  • Python, Bash, Grafana

Team & Environment

You will work closely with our infrastructure and platform teams. Our culture emphasizes collaboration, respect, and agility, striving to create a diverse and inclusive environment where everyone has opportunities to succeed and grow.

Benefits & Compensation

  • Compensation range: $180,000 - $260,000

Work Mode

This is an onsite role located in Mountain View, CA.

We are committed to an inclusive and diverse team. We do not discriminate based on race, color, ethnicity, ancestry, national origin, religion, sex, gender, gender identity, gender expression, sexual orientation, age, disability, veteran status, genetic information, marital status or any legally protected status.

Required Skills
DockerKubernetesHelmPostgresTimescaleDBInfluxDBArgoAirflowPythonBashSite Reliability EngineeringInfrastructureMonitoringAutomationCloud Platforms DockerKubernetesHelmPostgresTimescaleDBInfluxDBArgoAirflowPythonBashSite Reliability EngineeringInfrastructureMonitoringAutomationCloud Platforms
Got hired remotely?

Get paid like a professional

Remote clients expect company invoices, not personal PayPal requests. Glopay forms an EU partnership that makes you look legitimate while you stay independent.

Professional invoices with EU company details
Compliance handled automatically
Withdraw to any bank account
Income reports for easy tax filing
Create free account
Free signup • 5 min setup
About company
Gatik AI
Gatik is the leader in autonomous middle-mile logistics, revolutionizing the B2B supply chain with its autonomous transportation-as-a-service (ATaaS) solution. The company focuses on short-haul, B2B logistics for Fortune 500 retailers and launched the world’s first fully driverless commercial transportation service with Walmart.
All jobs at Gatik AI Visit website
Job Details
Category infrastructure
Posted 8 months ago