F*ckUp Gracefully

Radim Klaška

Morpht.com & Drupal.cz

Who are you?

NDA

3 incidents

  • background
  • task
  • can you spot the problem?
  • solution
  • damage report
  • retrospective

"Quick" deploy

Background

  • Site rescue project
  • Big site (2+ mil. nodes)
  • Redmine & Jira
  • Few processes to follow
  • External QA team

Task

  • Deploy the whole sprint
    • updates
    • new modules
    • optimization to search queries
    • search index changes
    • theme changes

First clue: this random message

Deploy time

Solution

Damage report

  • Server load
  • Developer time
  • Client trust probably

Retrospective

  • Cloudflare cache saved us
  • Company's first incident report!
  • Communication problems (source of truth)
  • Client was informed about possible downtime

Monthly Membership is not

a subscription

Background

  • D7, Ubercart
  • Site selling access to files
  • Issue repored by client

Report

Solution

Damage report

  • Developer time
  • Client's time
  • ~2 customers of our client affected

IPv6 just in time

+

we need to log everything

Background

  • Site logging every "view" event
  • Big DB (just one table)
  • Czech ISP switching to IPv6

Nope.

All good.

We want that data.

While working on that...

Damage report

  • None?
  • Developer time was paid
  • We actually improved the site

Retrospective

  • Hey, we did pretty good! :)
  • Enjoy the little things

Keep your cool,

but iterate!

Questions?

Thank you!

Made with Slides.com