Data Strategie

Embarrassing 90% cost reduction fix

Reddit r/dataengineering

Summary

I'm running and uptime monitoring service. However boring that must sound, it's giving some quite valuable lessons. A few months ago I started noticing the BigQuery bill going up rapidly. Nothing wrong with BigQuery, the service is working fine and very responsive. #1 learning Don't just use BigQuery as a dump of rows, use the tools and methods available. I rebuilt using DATE partitioning with clustering by user_id and website_id, and built in a 90-day partition expiratiton. This dropped my q...

Read the full article