Week 3 – Basic

In Week 1 we looked at ingesting S3 data, now it’s time to take that a step further. So this week we’ve got a short list of tasks for you all to do.

The basics aren’t earth-shattering but might cause you to scratch your head a bit once you start building the solution.

Frosty Friday Inc., your benevolent employer, has an S3 bucket that was filled with .csv data dumps. These dumps aren’t very complicated and all have the same style and contents. All of these files should be placed into a single table.

However, it might occur that some important data is uploaded as well, these files have a different naming scheme and need to be tracked. We need to have the metadata stored for reference in a separate table. You can recognize these files because of a file inside of the S3 bucket. This file, keywords.csv, contains all of the keywords that mark a file as important.

Objective:

Create a table that lists all the files in our stage that contain any of the keywords in the keywords.csv file.

The S3 bucket’s URI is: s3://frostyfridaychallenges/challenge_3/

Result:

Your result should look like:

Remember if you want to participate:

Sign up as a member of Frosty Friday. You can do this by clicking on the sidebar, and then going to ‘REGISTER‘ (note joining our mailing list does not give you a Frosty Friday account)
Post your code to GitHub and make it publicly available (Check out our guide if you don’t know how to here)
Post the URL in the comments of the challenge.

If you have any technical questions you’d like to pose to the community, you can ask here on our dedicated thread.

30 responses to “Week 3 – Basic”

mvdvelden

2023-09-26
Nice challenge. Took the chance to create the tables using templates based on the CSV files:

https://github.com/marioveld/frosty_friday/tree/main/ffw3
- Solution URL – https://github.com/marioveld/frosty_friday/tree/main/ffw3
Log in to Reply
dzapp

2024-02-07
🙂
- Solution URL – https://github.com/darylkit/Frosty_Friday/blob/main/Week%203%20-%20Metadata%20Queries/metadata_Queries.sql
Log in to Reply
tomo

2024-04-17
‘LIKE ANY’ is Basic, but Useful.
My code is in Japanese.
- Solution URL – https://app.snowflake.com/oghptkp/et79103/w2bUPQqmFerC/query
Log in to Reply
1. tomo
  
  2024-04-17
  ‘LIKE ANY’ is Basic, but Useful.
  My code is in Japanese.
  
  （One previous comment had the wrong Git URL.）
  
  Solution URL – https://github.com/tomoWakamatsu/FrostyFriday/blob/main/FrostyFriday-Week3.sql
  Log in to Reply
marcoscatassi

2024-05-08
Not sure it is the most straightforward approach, but using scripting you can nicely filter the staged files out! I learned a lot 🙂
- Solution URL – https://github.com/marco-scatassi-nimbus/Frosty-Friday/blob/main/week3/load_filtering_using_metadata.sql
Log in to Reply
marcopastore

2024-05-09
Really quick solution, what do you think about?
- Solution URL – https://github.com/marco-pastore-HH/Frosty-Friday-Challenges/blob/main/ff_challenge_3.sql
Log in to Reply
gergana98

2024-05-13
This is my version of the solution for this task. I hope you find it helpful! ^^
- Solution URL – https://github.com/GerganaAK/FrostyFridays/blob/main/Week%203%20%E2%80%93%20Basic
Log in to Reply

Older Comments

1 2

Comments

mvdvelden says

2023-09-26 at 16:38
Nice challenge. Took the chance to create the tables using templates based on the CSV files:

https://github.com/marioveld/frosty_friday/tree/main/ffw3
- Solution URL - https://github.com/marioveld/frosty_friday/tree/main/ffw3
Log in to Reply
dzapp says

2024-02-07 at 06:59
🙂
- Solution URL - https://github.com/darylkit/Frosty_Friday/blob/main/Week%203%20-%20Metadata%20Queries/metadata_Queries.sql
Log in to Reply
tomo says

2024-04-17 at 11:59
‘LIKE ANY’ is Basic, but Useful.
My code is in Japanese.
- Solution URL - https://app.snowflake.com/oghptkp/et79103/w2bUPQqmFerC/query
Log in to Reply
- tomo says
  
  2024-04-17 at 12:05
  ‘LIKE ANY’ is Basic, but Useful.
  My code is in Japanese.
  
  （One previous comment had the wrong Git URL.）
  - Solution URL - https://github.com/tomoWakamatsu/FrostyFriday/blob/main/FrostyFriday-Week3.sql
  Log in to Reply
marcoscatassi says

2024-05-08 at 13:11
Not sure it is the most straightforward approach, but using scripting you can nicely filter the staged files out! I learned a lot 🙂
- Solution URL - https://github.com/marco-scatassi-nimbus/Frosty-Friday/blob/main/week3/load_filtering_using_metadata.sql
Log in to Reply
marcopastore says

2024-05-09 at 08:09
Really quick solution, what do you think about?
- Solution URL - https://github.com/marco-pastore-HH/Frosty-Friday-Challenges/blob/main/ff_challenge_3.sql
Log in to Reply
gergana98 says

2024-05-13 at 15:00
This is my version of the solution for this task. I hope you find it helpful! ^^
- Solution URL - https://github.com/GerganaAK/FrostyFridays/blob/main/Week%203%20%E2%80%93%20Basic
Log in to Reply

« Older Comments

30 responses to “Week 3 – Basic”

Leave a Reply Cancel reply

Reader Interactions

Comments

Leave a Reply Cancel reply