nortonlyr/DataEngineering.Labs.AirflowProject

 
 

Airflow Project

Question

  • How do you select a good-value Airbnb home when traveling to NYC?

Goal:

  • Apply Apache Airflow directed acyclic graphs (DAGs) to build data pipelines on NYC Open Data (park, shooting, hot_spot, hotel, public housing) and Airbnb housing data, followed by data manipulation, analysis, and visualization.

Flowchart

  • Original sources: NYC Open Data and the Airbnb dataset (from Inside Airbnb)

  • Send GET requests to download the sources

  • Perform preliminary data cleaning and manipulation

  • Import the cleaned data into a SQL database (MySQL/PostgreSQL)

  • Load the data from the database and use Jupyter Notebook for analysis and visualization (run both on localhost and on AWS (EC2, RDS, S3))
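
The steps above can be sketched in plain Python as a minimal, offline illustration. This is not the repository's actual code: the sample CSV, the column names (`id`, `neighbourhood`, `price`), the `listings` table, and all function names are hypothetical, the download step is replaced by an inline sample so the example runs without network access, and SQLite stands in for MySQL/PostgreSQL. In an Airflow DAG, each function would typically become its own task, chained in the same order.

```python
import csv
import io
import sqlite3

# Hypothetical sample standing in for a downloaded Airbnb listings CSV.
SAMPLE_CSV = """id,neighbourhood,price
1,Harlem,$120.00
2,Chelsea,$250.00
3,Harlem,
"""


def extract(raw_text):
    """Parse CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def clean(rows):
    """Drop rows with no price and convert strings like '$120.00' to floats."""
    cleaned = []
    for row in rows:
        price = row["price"].strip().lstrip("$").replace(",", "")
        if not price:
            continue  # skip listings with a missing price
        cleaned.append((int(row["id"]), row["neighbourhood"], float(price)))
    return cleaned


def load(conn, records):
    """Create the (hypothetical) listings table and insert the cleaned records."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS listings "
        "(id INTEGER PRIMARY KEY, neighbourhood TEXT, price REAL)"
    )
    conn.executemany("INSERT OR REPLACE INTO listings VALUES (?, ?, ?)", records)
    conn.commit()


def average_price_by_neighbourhood(conn):
    """Query the database for the mean price per neighbourhood."""
    cursor = conn.execute(
        "SELECT neighbourhood, AVG(price) FROM listings "
        "GROUP BY neighbourhood ORDER BY neighbourhood"
    )
    return cursor.fetchall()


def run_pipeline(raw_text=SAMPLE_CSV):
    """Run extract -> clean -> load -> query against an in-memory database."""
    conn = sqlite3.connect(":memory:")
    try:
        load(conn, clean(extract(raw_text)))
        return average_price_by_neighbourhood(conn)
    finally:
        conn.close()


if __name__ == "__main__":
    for neighbourhood, avg_price in run_pipeline():
        print(f"{neighbourhood}: {avg_price:.2f}")
```

Keeping each stage as a separate function with plain inputs and outputs makes the stages easy to test on their own and to schedule as individual tasks later.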
