Responsibilities
- Ensure systems and services remain available, performant, and scalable via proactive monitoring, upkeep, and capacity forecasting.
- Respond to, investigate, and resolve system outages while applying corrective actions to reduce future recurrence.
- Build automation tools and scripts to minimize manual tasks, boost efficiency, and strengthen system reliability.
- Track and enhance system performance and resource utilization by detecting bottlenecks and applying performance optimization techniques.
- Partner with engineering teams to embed reliability, scalability, and performance standards into development workflows and coordinate across teams for seamless operations.
- Keep current with emerging technologies and advancements, contributing actively to internal tech communities to promote a culture of technical excellence.
Benefits
- Comprehensive health benefits including medical, dental, and vision plans, 401(k) options, life insurance, and paid leave for qualifying situations.
- Eligible for performance-based incentive compensation.
Other
All employees must uphold the core values of Respect, Integrity, Service, Excellence, and Stewardship as a guide for ethical decision-making, and embody the behavioral framework of Empower, Challenge, and Drive in daily operations.


