akabzw24/README.md

👋 Welcome! I'm Bozhao Wang

🎓 Economics Undergraduate @ UCalgary | Aspiring Data Scientist


👀 About Me

My name is Bozhao Wang. I am a data science enthusiast with expertise in machine learning, predictive modelling, and statistical analysis. With a background in economics and statistics, I focus on applying quantitative methods to real-world problems in fintech, ESG, and urban economics. I'm actively seeking opportunities in data science, business analytics, and related roles!


🛠️ Technical Skills

Programming Languages:

  • Python, SQL

Machine Learning & Statistical Modeling:

  • Classification & Regression (Logistic Regression, Random Forest, XGBoost)
  • Hyperparameter Tuning (GridSearchCV, RandomizedSearchCV)
  • Handling Imbalanced Data (SMOTE)
  • Model Evaluation (AUC, Precision, Recall, Confusion Matrix)
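
As a small illustration of the evaluation metrics listed above, here is a minimal pure-Python sketch that derives confusion-matrix counts, precision, and recall for a binary classifier. The labels are hypothetical and the helper names (`confusion_counts`, `precision_recall`) are made up for this example; in the projects themselves a library such as scikit-learn would normally be used for this.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels, where 1 is the positive class."""
    tp = fp = fn = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == 1 and actual == 1:
            tp += 1
        elif predicted == 1 and actual == 0:
            fp += 1
        elif predicted == 0 and actual == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def precision_recall(y_true, y_pred):
    """Precision = tp / (tp + fp); recall = tp / (tp + fn)."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical labels: 1 = default, 0 = no default
y_true = [1, 0, 0, 1, 0, 0, 0, 1, 0, 0]
y_pred = [1, 0, 0, 0, 0, 1, 0, 1, 0, 0]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
precision, recall = precision_recall(y_true, y_pred)
accuracy = (tp + tn) / len(y_true)
print(tp, fp, fn, tn, precision, recall, accuracy)
```

On imbalanced data, accuracy can look strong even when many positives are missed, which is why precision and recall are reported alongside it.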

Data Handling & Visualization:

  • Exploratory Data Analysis (EDA)
  • Data cleaning and preprocessing
  • Data visualization
  • Dashboard development (Power BI, Excel)

Tools & Platforms:

  • GitHub
  • Jupyter Notebooks
  • VS Code
  • Microsoft Excel
  • Power BI

📂 My Projects

  • Credit Risk Prediction Using Supervised Machine Learning Model
    Built and compared supervised learning models to predict credit card default using imbalanced financial data. Applied SMOTE oversampling and model evaluation metrics to enhance predictive performance and support risk assessment strategies.
  • Urban System Revenue Prediction with XGBoost (DSMLC Competition)
    Applied XGBoost regression modelling to predict municipal revenue in urban systems using infrastructure investment data. Feature engineering, log transformation, and model tuning achieved high predictive accuracy.
  • Quantify Energy Risk Case Competition 2025
    Built classification models (Logistic Regression, Random Forest, XGBoost) to predict high-loss CAT events. Created an interactive Power BI dashboard with parametric triggers and strategic recommendations for renewable expansion.
  • Demographic Trends and Housing Analysis in Calgary (Capstone Project)
    Conducted regression analysis on Calgary's housing supply and population growth using historical census and building permit data. Identified key factors influencing demographic shifts and housing demands to inform urban planning strategies.
  • Detecting COVID-19 Health Misinformation Targeting Older Adults
    Developed and compared TF-IDF + Logistic Regression and fine-tuned BERT classifiers on COVID-19 tweets, evaluated cross-platform robustness on senior-focused Reddit posts, applied SHAP for interpretability, and used LDA topic modelling to uncover key misinformation themes.
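
For readers unfamiliar with the SMOTE oversampling mentioned in the credit risk project, the sketch below shows the core idea in pure Python: each synthetic minority sample is an interpolation between a minority point and one of its nearest minority neighbours. This is an illustrative simplification, not the code used in the project; the function name `smote_like_oversample`, the toy data, and the parameter defaults are assumptions. Real workflows typically rely on the `SMOTE` class from the imbalanced-learn package.

```python
import random


def smote_like_oversample(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority points by squared Euclidean distance to base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])  # one of the k nearest neighbours
        gap = rng.random()  # interpolation weight in [0, 1)
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, neighbour)))
    return synthetic


# Hypothetical minority-class feature vectors (for example, defaulted accounts)
minority = [(1.0, 2.0), (1.5, 1.8), (2.0, 2.2), (1.2, 2.5)]
new_points = smote_like_oversample(minority, n_new=6)
print(len(new_points))
```

Oversampling of this kind should be applied to the training split only, so that synthetic points do not leak into the evaluation data.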

🌱 Currently Learning & Building

  • 💻 Learning NLP and LLMs
  • Advancing my skills in machine learning and Python

💞️ Collaboration Interests

  • 🎯 Open to hackathons, case competitions, and interdisciplinary collaborations
  • 🚀 Open to collaboration and internships in Data Science, Business Analytics, or Applied Research.
  • 📢 How to reach me: BozhaoWang24@gmail.com

📫 Connect with Me

LinkedIn

Pinned

  1. Demographic-Trends-and-Housing-Analysis-in-Calgary Public

    Capstone project: regression analysis of Calgary's housing supply and population growth using historical census and building permit data.

  2. wsp-esg-stock-pitch Public

    Investment pitch for WSP Global (TSE: WSP) submitted to the 2025 CFAC Portfolio Management Competition. Includes ESG-focused equity research, 3-scenario DCF model, and peer valuation benchmarking.

  3. Credit-Risk-Prediction-Using-Supervised-Machine-Learning-Model Public

    A machine learning project for predicting credit card default risk using supervised techniques such as Logistic Regression, Random Forest, and XGBoost. Applied SMOTE, and hyperparameter t…

    Jupyter Notebook

  4. quantify-energy-risk-case-2025 Public

    Quantify 2025 Energy Risk & Insurance Case: modeling high-loss CAT events and developing parametric risk triggers to support renewable energy expansion. Includes ML code, written report, and Present…

    Jupyter Notebook

  5. covid-misinfo-nlp Public

    Detecting COVID-19 health misinformation targeting older adults using TF-IDF, logistic regression, and BERT

    Jupyter Notebook

  6. Urban-revenue-prediction-XGBoost Public

    DSMLC Final Competition project using XGBoost to predict urban project revenue. Data provided by Urban Systems in Calgary.

    Jupyter Notebook