Problem statement
A project I’m familiar with recently had a series of issues. Each issue was investigated somewhat independently. It was hard to share common code, share data across roles and track progress over time.
Solution
- Capture canonical queries in version control
- Periodically run queries, persist and visualize output (aka ETL)
- At a higher level, invest in tooling to facilitate such dashboard creation
The end result is much more awareness of the underlying data. Folks in different roles can see the data and ask questions, which often improves the quality of analysis. For example, we now review the dashboard weekly and look for changes as we roll out fixes. Because we now have a pipeline, we can also run different data sources through it to check the analysis.