F*ckUp Gracefully
Radim Klaška
Morpht.com & Drupal.cz


Who are you?

NDA

3 incidents
- background
- task
- can you spot the problem?
- solution
- damage report
- retrospective

"Quick" deploy
Background
- Site rescue project
- Big site (2+ mil. nodes)
- Redmine & Jira
- Few processes to follow
- External QA team
Task
- Deploy the whole sprint
- updates
- new modules
- optimization to search queries
- search index changes
- theme changes
First clue: this random message


Deploy time





Solution
- d.o/project/node_access_rebuild_progressive
- Search API: Mark for reindexing
Damage report
- Server load
- Developer time
- Client trust probably
Retrospective
- Cloudflare cache saved us
- Company's first incident report!
- Communication problems (source of truth)
- Client was informed about possible downtime
Monthly Membership is not
a subscription
Background
- D7, Ubercart
- Site selling access to files
- Issue repored by client
Report





Solution
Damage report
- Developer time
- Client's time
- ~2 customers of our client affected
IPv6 just in time
+
we need to log everything
Background
- Site logging every "view" event
- Big DB (just one table)
- Czech ISP switching to IPv6

Nope.
All good.
We want that data.

While working on that...





Damage report
- None?
- Developer time was paid
- We actually improved the site

Retrospective
- Hey, we did pretty good! :)
- Enjoy the little things
Keep your cool,

but iterate!

Questions?
Thank you!
F*ckUp Gracefully
By Radim Klaška
F*ckUp Gracefully
- 1,042