Site Reliability Engineer – Observability

Join Wolt’s Observability Engineering Team

Wolt is powered by a cutting-edge platform managed by specialized teams within our Core Group. One of these key teams is the Observability Engineering Team, dedicated to ensuring visibility, reliability, and performance across all Wolt services and infrastructure at scale.

As a Software Engineer in Observability, you will be instrumental in developing scalable observability and reliability tooling, maintaining performance testing frameworks, and improving the overall system health of Wolt’s technology landscape.

This role is ideal for engineers with a solid foundation in software development and experience or interest in Site Reliability Engineering (SRE) practices.

About the Team

Our Observability platform handles billions of metrics, traces, and logs every month. We empower every Wolt engineer to monitor and improve the health of their services through tooling that spans:

  • Application instrumentation
  • Telemetry data collection
  • Visualization
  • Alerting

We are also partnering with DoorDash to build a next-generation observability platform that enhances visibility, scalability, and operational efficiency across both companies.

What You’ll Do

  • Design and develop scalable software solutions to improve observability and reliability across Wolt’s services
  • Build tooling that helps engineering teams monitor and debug efficiently
  • Contribute to the architecture and maintenance of an observability stack capable of handling increasing telemetry data
  • Champion and implement SRE principles to ensure performance and reliability across services
  • Own initiatives to improve system reliability and reduce downtime
  • Develop frameworks for incident management and performance optimization
  • Collaborate closely with engineering teams to resolve production issues and integrate observability best practices
  • Participate in on-call rotations, drive root cause analysis, and create automated resolution tools to reduce MTTR
  • Create documentation, playbooks, and training to improve developer experience and promote self-service tooling

What We’re Looking For

  • Strong software engineering background, especially in designing distributed systems
  • Proficiency in Go (preferred) or Python, with a focus on automation and tooling
  • Experience operating observability platforms at scale
  • Hands-on expertise with tools such as Prometheus, Grafana, Elasticsearch, or similar open-source observability stacks
  • Understanding of SRE principles – including fault-tolerance, SLIs/SLOs, and incident response
  • Experience in cloud-native, distributed environments and container orchestration using Kubernetes
  • Familiarity with AWS (preferred), GCP, or Azure
  • Strong analytical and troubleshooting skills for diagnosing complex systems
  • Excellent collaboration and communication skills

Nice to Have

  • Experience with OpenTelemetry and other modern observability frameworks
  • Background in managing large-scale distributed databases or event streaming systems like Kafka or ClickHouse
  • Contributions to open-source projects in observability, platform engineering, or CNCF projects

Location & Remote Work

You can be based in one of our tech hubs: Helsinki, Berlin, or Stockholm, or work remotely from Finland, Sweden, Germany, Denmark, or Estonia.

Living elsewhere? We offer full relocation support to help you move to Finland, Germany, or Sweden.

Apply Now

The position will be filled as soon as we find the right person. If you’re excited about observability, scale, and reliability—apply today to learn more and potentially join Wolt & DoorDash!

CareerBee Logo

Don't miss out on new jobs!

Signup for weekly updates on new jobs so you can be the first to apply

Contact form for Companies

Are you a talented professional seeking a new opportunity?
Visit our Talents Page.