To celebrate Women in Data and the upcoming Data+Women event, we’ve partnered up with Gaia Pometto , a data consultant in the Netherlands, to release a challenge for the Data+Women event. She’ll go through the challenge, live, at the event and you can find her on LinkedIn, Twitter and YouTube.
She also wanted to give a special mention to colleague Atzmon Ben Binyamin who collaborated with her on this challenge and was a great help throughout the process. You can find Atzmon on LinkedIn.
The Challenge
In order to prepare a future ingestion flow of data, your organization wants to get data from a weather API into Snowflake. At the end of the flow, it is expected to create a fact table to be used by consumers.
Your colleague Atzmon has already pulled a sample of the weather data from the API and momentarily stored it in an s3 bucket. The s3 bucket contains a JSON file with hourly data for 7 days.
Your Flow will consist of 3 layers (schemas) of data:
a. Landing zone (external stage, raw table – weather_raw)
b. Curated zone (raw data parsed table – weather_parsed)
c. Consumption zone (aggregated data table – weather_agg)
Objectives:
1. Create a stage for the landing layer and copy the JSON.
2. Parse the JSON and create a table – weather_parsed (in the curated zone).
3. Create a table for consumption weather_agg (in the consumption zone).
The table should have aggregates per day for:
a. Temperature
b. Wind speed
c. Distinct ‘icon’ definitions for a day (e.g. [“cloudy”, “rain”])
Here is the link to the s3 bucket (hint: bucket = frostyfridaychallenges, path = challenge_25).
Good luck!
The solution should look something like this :

Remember, if you want to participate:
It was a nice surprise seeing this one launch early this week!
Cheating a bit this time since I created the challenge myself haha…but good to share the solution for those interested 🙂
Ah, this was a good one for dbt!
– create stage & landing zone: https://github.com/dsmdavid/frostyfridays-sf/blob/main/models/challenge_25_1.sql
– create curated: https://github.com/dsmdavid/frostyfridays-sf/blob/main/models/challenge_25_2.sql
– create output: https://github.com/dsmdavid/frostyfridays-sf/blob/main/models/challenge_25_3.sql
https://github.com/fengliplatform/Frosty_Friday/blob/main/week25.sql
Here’s my solution 🙂
https://github.com/CSimonini/Frosty_Friday/blob/main/W25_Solution.sql
Really enjoyed this one!
nice one
Difficulty will be viewing the large json file, once separated it’s easier to check structure.
A very nice one!