As Principal Product Manager for Platform Observability, you will shape the end-to-end strategy for telemetry across a large-scale SaaS platform. Your work will define how engineering teams and customers gain insight into system behavior through metrics, logs, and distributed traces. You'll establish a durable foundation that supports both operational resilience and meaningful customer-facing analytics.
Key Responsibilities
- Own the multi-year roadmap for observability, balancing innovation with platform stability and scale.
- Design integration strategies with external logging and monitoring ecosystems to support enterprise workflows.
- Partner with product leaders across services to deliver actionable insights in areas like traffic analysis, web application security, and API protection.
- Ensure observability investments support SaaS-specific goals including cost efficiency, multi-tenancy, and platform reliability.
Qualifications
- Minimum of 8 years in product management, with at least 3 years focused on B2B SaaS platforms.
- Proven experience with distributed systems and cloud-native infrastructure.
- Familiarity with core observability tools such as Prometheus, Grafana, Loki, Mimir, Tempo, Elastic, Datadog, or Splunk.
- Solid understanding of L4–L7 networking protocols such as TCP, TLS, HTTP, and gRPC, as well as API standards.
- Track record of leading cross-functional initiatives from concept through delivery in fast-moving environments.
Preferred Experience
- Direct experience building observability products or platform services.
- Background in high-volume logging pipelines using Kafka, Event Hubs, or Kinesis.
- Knowledge of schema governance with Avro or OTLP, and telemetry optimization for cost and performance.
- Familiarity with Kubernetes monitoring, eBPF-based data collection, and OpenTelemetry standards.
- Experience with automation in incident response or self-healing systems.
- Exposure to AI-driven observability or SRE tooling.
- Background in networking or security-centric SaaS products such as CDNs, WAFs, Zero Trust, or Edge platforms.
- Experience shaping API governance and programmability strategies for secure, scalable integrations.
Technical Environment
Technologies include Prometheus, Grafana, Loki, Mimir, Tempo, Elastic, Datadog, Splunk, Kafka, Event Hubs, Kinesis, Avro, OTLP, Kubernetes, eBPF, and OpenTelemetry.
Compensation & Benefits
This role offers a competitive package that includes a base salary in the range of $196,000–$294,000, incentive compensation, bonuses, and restricted stock units. Additional benefits include health coverage, paid time off, professional development support, and relocation assistance where applicable. The position operates in a hybrid model, with flexibility based on location and team needs.