ProductTank Meetup, Singapore
June 9, 2016
Pavel Kudinov, RedMart
79% of time is data munging
In 2009, data scientist Mike Driscoll popularized the term “data munging,” describing the “painful process of cleaning, parsing, and proofing one’s data”
What was previous name? Who updated this field?
What was previous birthday? Who updated this field?
Not reliable. Owned by DevOps and ALWAYS considered as not important after couple months
Depends on the engineer. After refactoring or due to wrong configuration log might not be written
Unstructured format. Requires parsing and special care
If it is structured (audit trail), what about atomicity and transactions?
...
...
Create cart
Add item #1
Set delivery time
Add item #2
Change Quantity of item #1
Change delivery time
No Persisted State.
Only event journal.
Display audit log: who did what and when
Store the history of changes in the same DB
Track changes for all business entities
Reports that show time distribution between events
Event-driven VS Scheduled dependencies