Hey there,

Inspired by a recent meetup I attended, I wrote a series on cleaning your data:
  1. Finding Duplicate Rows in SQL.
  2. Filling Missing Data by Generating a Continuous Series in SQL.
  3. Finding Patterns with Regular Expressions in SQL.

Data cleaning, or "wrangling" as some like to call it, is an important part of the data analytics process. I've seen analysts spin up big Hadoop clusters just to clean their data. Though that's sometimes necessary, you can usually go pretty far with just SQL.

Let me know what you think. Enjoy!

Copyright © 2017 Silota Inc., All rights reserved.
