A fully automated ETL pipeline for bike share data using Apache Airflow. Extracts live data, transforms it into analytics-ready Parquet files, and loads it into S3/PostgreSQL for analysis.
Deployed on AWS EC2 with Nginx, Gunicorn, PostgreSQL, and media handling. Utilizes affiliate links and gear recommendations.
Utilizing Beautiful Soup to web scrape quotes, convert to JSON and store in AWS S3 bucket.