Welcome to the Data Engineer Intern assessment for Shopify! This assessment is designed to evaluate your skills in SQL, data manipulation, and problem-solving. Please follow the instructions below to complete the assessment.
The assessment is based on a simulated dataset containing sales information from an e-commerce platform. The dataset is provided in the /data directory as a set of CSV files named <table-name>_data.csv. The files include columns such as product_id, sales_amount, and customer_id.
- Fork the Repository: Start by forking this repository to your own account, then clone your fork to your local machine.
- Create a New Branch: Create a new branch to store your work.
- Data Understanding: Review the `schema.sql` file and the `data.csv` files, and understand their structure and columns.
- Write SQL Queries: Create SQL files (`task1.sql`, `task2.sql`, `task3.sql`) in the `/sql` directory to solve each task mentioned above.
- Submit your Work: Once completed, create a pull request with your changes against the Shopify `main` branch and submit the link to your PR.
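
For the Data Understanding step, a quick look at each CSV file's header and size can help before writing any SQL. The snippet below is a minimal sketch, not part of the assessment: the function name `summarize_csv_files` is hypothetical, and it assumes the files sit in a `data` directory relative to where you run it, are UTF-8 encoded, and carry their column names in the first row.

```python
import csv
from pathlib import Path


def summarize_csv_files(data_dir="data"):
    """Return {file name: (column names, data row count)} for each *_data.csv file.

    Hypothetical helper: assumes the first row of every file is a header row.
    """
    summary = {}
    for path in sorted(Path(data_dir).glob("*_data.csv")):
        with path.open(newline="", encoding="utf-8") as handle:
            reader = csv.reader(handle)
            header = next(reader, [])  # empty list if the file has no rows
            row_count = sum(1 for _ in reader)  # remaining rows are data rows
        summary[path.name] = (header, row_count)
    return summary


if __name__ == "__main__":
    for name, (columns, rows) in summarize_csv_files().items():
        print(f"{name}: {rows} rows, columns = {columns}")
```

Comparing the printed column names against `schema.sql` is one way to confirm that each file matches the table it is meant to populate.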
- Ensure your SQL files contain clear and commented queries for each task.
- Use the provided datasets (`<table-name>_data.csv`) for all tasks.
- Feel free to ask any clarifying questions by creating an issue in this repository.
Good luck, and we look forward to reviewing your work!