Skip to content

katiehuangx/Portfolio-Guide

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

67 Commits
Β 
Β 

Repository files navigation

πŸ—Ί Katie's Portfolio

Welcome to my data portfolio! Here, I document a summary of my projects in the data field.

πŸ“š Table of Contents

Data Engineering

Project Link Completion Date Tools Project Description
πŸš— Uber Taxi May 2023 Python, GCP (Storage, Compute Engine, BigQuery), Mage, Looker Studio Developed and implemented an end-to-end ETL pipeline for processinsg NYC Trip Record data. The pipeline encompassed extracting raw data, performing data transformation using Python, applying fact and dimensional data modelling techniques, orchestrating the pipeline on Mage, and ultimately creating a dashboard using Looker Studio.
🐢 Dog Adoption Mar 2023 Python, PostgreSQL, Jupyter Notebook Designed, created, and deployed a custom data model for a dog adoption data set using Python and PostgreSQL on Jupyter Notebook.

SQL

Project Link Area of Analysis Project Description
πŸ’‘ 8-Week SQL Challenges Data analysis, data cleaning, data transformation This repo serves as the solution for the 8 case studies from the #8WeekSQLChallenge. It showcases my ability to tackle various SQL challenges and demonstrates my proficiency in SQL query writing and problem-solving skills.
πŸ‘©πŸ»β€βš•οΈ Health Analytics Case Study Health analysis I answer business questions related to patients data, such as average and median measurements per user, types of measurements for active users, and median blood pressure values for users.
🦠 Covid-19 and the Impact on Malaysia Stock Market Data cleaning, data analysis A project close to 🏑 home. Inspired by Alex Freberg's Data Exploration Project, I analysed global and local Covid-19 cases & the impact on Malaysia stock market from Jan 2020 to Jul 2021 using SQL and Tableau.

Python

Project Link Area Project Description Libraries
πŸ‘©πŸ»β€πŸ’» CS50P - Ongoing Programming This repo contains the solution to the problem sets in Harvardx CS50P Introduction to Programming with Python. -
πŸ“Ί TMDb Movie Analysis Data Wrangling & EDA I analysed more than 10,000 TMDb movies and getting the answers to - Which actor(s) is associated with higher revenue and profit, Does a higher budget constitute to a higher profit, and Which director produced the highest grossing movie? pandas, matplotlib
⛽️ Fuel Economy Data Wrangling & EDA Analysis on vehicles’ fuel economy in 2008 and 2018 to understand usage of alternative sources of fuel, changes in greenhouse gas and smog ratings over the decade, and vehicle features associated with better fuel economy. pandas, matplotlib
🍷 Wine Quality Data Wrangling & EDA A study on red and white wine samples and understanding whether certain types of wine and their qualities (alcohol level, sugar content and acidity level) are associated with higher wine quality. pandas, matplotlib
🌀 Explore Weather Trends Time-series analysis In this time-series analysis, I use moving average method to analyze local and global temperature data and compare the temperature trends where I live to overall global temperature trends. pandas, matplotlib
πŸ› Super Store Analysis EDA Analysis of sales data to find out highest revenue and profit product categories and top customer segments. pandas, matplotlib, seaborn
πŸƒπŸ»β€β™€οΈ Bellabeat Fitness Tracking Analysis EDA Discovered insights into whether users are using the FitBit app for tracking health habits, their frequency of usage across the week and whether there is correlation between the hours logged, number of steps taken and calories burnt. pandas, matplotlib, seaborn

Tableau

Project Link Project Description Dashboard Link
πŸ¦„ Maven Unicorn Challenge Cleansed and transformed data on privately-owned companies (start-ups) valued at over $1 billion using Python. Visualised key insights using Tableau, including the timeline of valuations, the top 10 countries and investors with the highest valuations, the most successful unicorns, and the average time it takes to reach a $1 billion valuation. Dashboard
🦠 Covid-19 and the Impact on Malaysia Stock Market A project close to 🏑 home. Inspired by Alex Freberg's Data Exploration Project, I analysed global and local Covid-19 cases in Malaysia and the impact on the KLSE stock market from Jan 2020 to Jul 2021 using SQL and Tableau. Dashboard

Katie's Guides

πŸ“š SQL Resources

Looking to learn SQL for data analysis but don't know where to start?

Check out my Linkedin post and GitHub guide where I've compiled a comprehensive list of free SQL resources! From YouTube videos to interactive websites, courses, practice sites, and projects, this list has got you covered.

Screenshot 2021-09-29 at 9 28 28 PM

πŸ‘©πŸ»β€πŸ’» Transition into Data Analytics Guide

Are you keen on pursuing a career in data analytics, but feeling lost on how to take the first steps?

Explore my comprehensive repo here, which contains all the essential resources you require to develop the technical expertise in SQL, Python, and Tableau!

✏️ How to Create a GitHub Portfolio

Are you new to GitHub and wondering how to showcase your coding skills to potential employers or clients? Look no further!

My step-by-step tutorial here will guide you through creating a professional portfolio on GitHub.

In my guide, you will learn:

  • How to create your profile on GitHub and add relevant information
  • How to customize Markdown files to create a visually appealing portfolio
  • How to create a new repository for each project and add project details and code
  • Follow these steps and you'll have an impressive portfolio to showcase your coding projects in no time!