Wednesday Jun 12
15:40 –
16:30
Effectenbeurszaal
A Field Guide to Reliability Engineering at Zalando
This video is also available in the GOTO Play video app! Download it to enjoy offline access to our conference videos while on the move.
We present Zalando's approach to engineering reliability from a very small to a very large scale, and touch on both technological and human angles.
With over 50M customers across 23 countries, Zalando operates one of the largest eCommerce platforms worldwide. Achieving a reliable customer experience requires the intricate collaboration of over 3000 applications and more than 2000 software engineers who constantly seek to improve and extend product capabilities. In the talk we will walk you through the best practices Zalando has arrived to consistently achieve high levels of reliability.
- We will start with a simple stand-alone application and cover best practices for instrumentation, monitoring and alerting.
- We continue the journey to products that span multiple applications which are operated by different teams. At this scale methods like tracing and incident management become important.
- Finally we will present technologies and processes which are used to steer reliability on the company level. Here WORM Cascades and Risk Management have proven highly effective.
Keynotes
-
A Short Summary of the Last Decades of Data ManagementHannes MühleisenTuesday Jun 11 @ 13:25
-
Lessons From The Pit LaneMarc PriestleyWednesday Jun 12 @ 09:10
-
X Marks the Spot: Navigating Possible Futures with Wardley MapsSimon WardleyWednesday Jun 12 @ 16:30
-
Is It Time To Version Observability? (Signs Point To Yes)Charity MajorsTuesday Jun 11 @ 09:10
-
How a Passion for Oceans Can Utilize Synergies of TechnologySigne SimonsenWednesday Jun 12 @ 13:25
-
There’s no AI in human: Navigating The Intersection of Technology and HumanityImran RashidTuesday Jun 11 @ 17:10