Duplicates! The scourge of every data engineer. Our sales data today has a couple of them. Your job is to read the data from an external stage, and clean it up. Expunge the duplicates using GROUP BY ALL that shiney new SQL command we’ve all been given.
Start Up Code
create or replace stage week55_stage
url='s3://frostyfridaychallenges/challenge_55/';
At the end your data should look like:
Remember, if you want to participate:
- Sign up as a member of Frosty Friday. You can do this by clicking on the sidebar, and then going to ‘REGISTER‘
- Post your code to GitHub and make it publicly available (Check out our guide if you don’t know how to here)
- Post the URL in the comments of the challenge
6 responses to “Week 55 – Basic”
-
Another nice challenge to demonstrate new functionality, thanks!
- Solution URL – https://github.com/ChrisHastieIW/Frosty-Friday
-
Lovely feature!
- Solution URL – https://github.com/canonicalized/FrostyFriday/blob/main/WEEK55.sql
-
Thanks for the challenge! This will be useful with SELECT ILIKE
- Solution URL – https://github.com/dsmdavid/frostyfridays-sf/blob/main/models/challenge_55.sql
-
picked up on the qualify clause as an added bonus
- Solution URL – https://github.com/NMangera/frosty_friday/blob/main/week%2055%20-%20basic%20/GROUPBYALL
-
Another nice new feature to learn about
- Solution URL – https://github.com/ChrisBo94/FrostyFriday/blob/main/Week_55.sql
-
I was not sure how to infer the schema so I tried to guess it just in case it was different from the one in the result picture here. I didn´t see any duplicates either, so when using the group by all I didn´t notice any changes. Any ideas?
- Solution URL – https://github.com/GerganaAK/FrostyFridays/blob/main/Week%2055%20%E2%80%93%20Basic
Leave a Reply
You must be logged in to post a comment.