Duplicates! The scourge of every data engineer. Our sales data today has a couple of them. Your job is to read the data from an external stage, and clean it up. Expunge the duplicates using GROUP BY ALL that shiney new SQL command we’ve all been given.
Start Up Code
create or replace stage week55_stage url='s3://frostyfridaychallenges/challenge_55/';
At the end your data should look like:
Remember, if you want to participate:
- Sign up as a member of Frosty Friday. You can do this by clicking on the sidebar, and then going to ‘REGISTER‘
- Post your code to GitHub and make it publicly available (Check out our guide if you don’t know how to here)
- Post the URL in the comments of the challenge