Projects
Take a look at key highlights from my portfolio — find the complete list on GitHub or contact me for deeper dives.
Nutrient Composition of Common Foods in Canada: Analyzing the Canadian Nutrient File
Tech Stack:

Interactive dietary tools leveraging the Canadian Nutrient File — a Tableau dashboard for in-depth nutrient comparisons and a Streamlit app featuring dynamic visualizations, clustering analytics, and an AI chatbot for personalized nutrition guidance. Presented at YYC DataCon 2025.
View on GitHubAF Risk Prediction Using ECG & EHR Data
Tech Stack:

An end-to-end pipeline and interactive app for forecasting new-onset atrial fibrillation using routinely collected 12-lead ECG signals and EHR data. The XGBoost model identifies high-risk patients, while the Streamlit interface and DeepSeek chatbot provide intuitive exploration and insights. Presented at Statistical Society of Canada Annual Meeting 2025.
View on StreamlitCredit Card Default
Tech Stack:

A predictive modelling project to forecast credit card defaults using demographic and financial data from 30,000 account holders — combining exploratory data analysis, visualization, and machine learning (regularized classifiers and XGBoost) for robust risk prediction.
View on GitHubEnhancing Bank's Personal Loan Approaches
Tech Stack:

A sequential neural network model predicting customer likelihood to accept personal loan offers — achieving 98% accuracy, 96% precision, and 87% recall — to drive targeted marketing strategies.
View on GitHubSleep-Apnea Severity Estimation from EEG Spectrograms via ResNet-18
Tech Stack:

An end-to-end deep learning pipeline that regresses continuous AHI from EEG spectrograms using a ResNet-18 backbone. With subject-stratified validation, it achieves an RMSE of 6.8 events/hour and a Pearson correlation coefficient of 0.76, delivering accurate severity rankings across the entire AHI range. The workflow combines robust preprocessing, transfer learning optimization, and scalable inference for real-time sleep apnea screening.
View on StreamlitData-Driven Department Optimization
Tech Stack:

A suite of data-driven solutions optimizing operations across HR, Marketing, Sales, Operations, and PR. Leveraging machine learning, deep learning, and advanced analytics to tackle departmental challenges — from predicting employee turnover with logistic regression and deep nets, to customer segmentation via K-Means and autoencoders, sales forecasting with Prophet, medical diagnostics with deep models, and PR sentiment analysis using Naive Bayes and logistic regression.
View on GitHubGDP & Population: 2024 Olympic Medals
Tech Stack:

An analysis of how GDP, population, and socio-economic indicators relate to medal performance at the 2024 Olympics — using correlation matrices, regression models, and clustering techniques to uncover key drivers of success.
View on GitHubBeWell360: Your Daily Holistic Wellness Log
In Progress

BeWell360 is a holistic wellness tracker that uses data science and analytics to transform daily logs — nutrition, fitness, sleep, body composition, professional development, and personal growth — into actionable insights for balanced living.
View on GitHubDatabase Chatbots
Tech Stack:

AI-powered agents for conversational interaction with relational data — transforming natural language prompts into SQL queries and insights for data-driven decision-making.
View on GitHubDesigning an Intelligent Agent for the Wumpus World
Tech Stack:

An intelligent Wumpus World agent that navigates a grid of hazards and searches for gold under uncertainty. It perceives environmental cues — breezes near pits and stenches near the Wumpus — applies logical inference to deduce safe paths, and balances exploration with caution to avoid fatal encounters, demonstrating key AI concepts such as decision-making under uncertainty, logical reasoning, and informed search strategies.
View on GitHubYT Channel Analytics
Tech Stack:

A Streamlit app for analyzing YouTube channel performance — tracking subscriber growth, views, watch hours, likes, comments, and shares to uncover engagement insights.
View on StreamlitWine Dataset Analysis
Tech Stack:

An exploratory analysis of red and white wine quality — examining the proportion of high-quality wines, identifying key differentiators (e.g., alcohol content, sulphur dioxide), and evaluating their impact on ratings.
View on GitHubUnlocking Superconductor Potential: Predicting Critical Temperatures with Multiple Regression
Tech Stack:

A multiple regression analysis to predict superconductors' critical temperature (Tc), identify the most impactful material features, and evaluate model performance.
View on GitHubLife Expectancy EDA: Key Influencing Factors
Tech Stack:

A focused exploratory analysis of life expectancy drivers — examining healthcare investment, vaccination coverage, and socio-economic indicators — and contrasting trends across developed and developing countries.
View on GitHubLinkedIn and Mental Health: UCalgary MDSA Students
Tech Stack:

An interactive dashboard examining how LinkedIn engagement influences the mental health of UCalgary MDSA students — developed from a carefully designed survey to test hypotheses such as whether exposure to peers' success posts heightens anxiety or whether supportive comments enhance job-search confidence. The dashboard presents trends in confidence, anxiety, and comparison behaviours using simplified versions of the original survey questions, which were more comprehensive and formally structured — for example, "How motivated or anxious do you feel after browsing LinkedIn?" and "How often do you compare your progress to others online?" Users can explore correlations, observe sentiment shifts, and filter results by gender, age range, student status, and program specialization to reveal demographic patterns and identify areas where targeted support may be most beneficial.
View on Tableau PublicLEGO Set Analysis: Patterns in Build Complexity, Price Points, and Themes
Tech Stack:

An interactive visualization of the LEGO universe, analyzing 4,385 sets to reveal trends in average pieces, prices, and themes across age ranges. Users can filter by theme group, specific theme, or target age range to examine how complexity and pricing vary across categories. Key metrics — including total sets, average piece count, and average price — enable comparisons between themes, identification of high-value categories, and detection of design and pricing trends over time. Designed for collectors, enthusiasts, and data analysts, it offers a clear and engaging way to explore LEGO's evolution through data.
View on GitHubArticles
Browse my data science articles for practical tips, project breakdowns, and industry analysis — more on Medium.
What's in Your Food? A Data-Driven Nutrient Analysis

This article explores the nutritional content of commonly consumed foods in Canada, emphasizing the importance of balanced diets for maintaining overall health. Using data from The Canadian Nutrient File by Health Canada, the analysis covers 12 key nutrients, compares nutrient densities, and evaluates protein-to-fat ratios across food categories. The dataset, refined through thorough preprocessing, enabled detailed insights into nutrient levels, highlighting nutrient-rich and nutrient-poor foods.
View articleAnalyzing 2024 Olympic Medals: The Role of GDP and Population

This article explores the relationship between a country's population size and GDP per capita and its success in the 2024 Olympic Games. By analyzing these variables, we aim to understand how they influence the total number of medals won by different countries. The article also involves clustering countries based on their Olympic performance and socioeconomic context, providing insights into patterns and trends affecting their athletic achievements. Through this analysis, we seek to uncover the extent to which economic and demographic factors contribute to Olympic success.
View articleUnderstanding the P-Value in Hypothesis Testing

In hypothesis testing, we start with the null hypothesis (H0), assuming no effect or difference. We then collect our observed results — the actual data from the study — and calculate the p-value, which tells us how surprising these results are if H0 is true. A small p-value (e.g., < 0.05) leads us to reject H0, considering the alternative hypothesis (H1) as a possible explanation. A large p-value suggests our results align with H0, so we fail to reject it. Importantly, the p-value is not the probability that H0 is true; instead, it's the probability of observing our results, or more extreme ones, under the assumption that H0 is correct.
View articleEssential DAX Functions for Power BI: The Story Behind the Code

This article is a clear, practical guide to DAX in Power BI, focusing on understanding what you want to calculate rather than memorizing formulas. It explains how to choose between measures and calculated columns, how row and filter context work, and how to use key functions for totals, logic, and time-based calculations. With simple examples of CALCULATE, SUMX, IF, SWITCH, and more, it also covers text formatting, selection tools, and ways to filter or rank data. Tips on using VAR/RETURN help make formulas easier to read and faster to run, encouraging you to see DAX as both a language and a tool for building powerful reports.
View articleAbout
I'm a Data Science and Analytics specialist passionate about building ETL pipelines and end-to-end data solutions. My work spans interactive dashboards, forecasting models, and machine learning applications that deliver actionable insights and support strategic decisions across healthcare, government, finance, and technology.
-
Summary of Skills
Programming & Data Analysis
Python (pandas, NumPy), SQL — data wrangling & quality assurance, exploratory analysis & feature engineering, statistical testing & inference
Data Visualization & BI Platforms
Power BI, Power Automate, Tableau, D3.js, Matplotlib, Seaborn — report development, dashboard design, automated workflows
Machine Learning & Deep Learning
scikit-learn, XGBoost, LightGBM, Prophet, TensorFlow/Keras, PyTorch — model training & validation, cross-validation, hyperparameter tuning, ensemble methods
MLOps & Cloud Infrastructure
AWS (S3, SageMaker), Streamlit, Git — model development & tuning, rapid prototyping of model UIs, code versioning & collaboration
-
Employment Experience
Data Scientist & Co-Founder, Oct 2025–Present
SnowyOwl Data, Edmonton, Alberta
• Leading the development of BeWell360, a holistic wellness tracker that uses data science and analytics to deliver insights on nutrition, sleep, fitness, and personal development.
• Designing wellness challenges with Canada-inspired themes to promote healthier lifestyles and boost user engagement.
• Leveraging data-driven insights to improve user experience and encourage consistent participation.Data Science Analyst, May 2025–Present
BGE Indoor Air Quality Solutions, Edmonton, Alberta
• Built Power BI reports and dashboards across Finance, HR, Operations, and IT, enabling cross-functional reporting and data-driven leadership decisions.
• Created a company-wide Power BI JSON theme to enforce visual consistency and accelerate development.
• Automated reporting pipelines from Access databases using Power BI and Power Automate, streamlining workflows and improving operational efficiency.
• Designed and deployed a reporting solution for the Diamond Rewards program with Power BI, Power Automate, SharePoint, and Forms, providing real-time insights on participation, redemptions, and inventory.Research Assistant (Machine Learning), May 2025–Jul 2025
University of Calgary, Calgary, Alberta
• Engineered end-to-end EEG preprocessing workflows in PyTorch, automating 30s windowing, artifact rejection, cohort normalization, and STFT spectrogram extraction.
• Leveraged transfer learning on an ImageNet-pretrained ResNet-18 with k-fold cross-validation, learning-rate scheduling, early stopping, and data augmentation to train a continuous AHI regression model.
• Validated model performance on held-out subjects to ensure reliable AHI estimation for downstream analytics.
• Details: https://resnet-18-based-eeg-ahi-regression-pipeline.streamlit.appWeb Publisher (Data Visualization), Sep 2020–Aug 2024
Health Canada, Ottawa, Ontario
• Developed custom AEM components using D3.js to ingest Health Canada datasets and render interactive charts and tables, enabling dynamic data exploration and actionable insights.
• Built and maintained WCAG 2.1 AA-compliant pages on Canada.ca in Adobe Experience Manager, ensuring accessible presentation of data-rich content.Web Developer (Programming & Data Analysis), Mar 2019–Sep 2019
OPIN, A Portage CyberTech Company, Ottawa, Ontario
• Developed Power BI dashboards from web-traffic exports to track monthly usage patterns and top pages, enabling data-driven decisions around marketing and UX prioritization.
• Built and maintained WCAG 2.1 AA-compliant Drupal websites for clients such as Holland Bloorview, Hydro Ottawa, and York Region DSB, improving usability and accessibility.
• Refactored and documented HTML, CSS, JavaScript, and PHP codebases, enhancing site performance, maintainability, and scalability.Doctoral Researcher (Data Analysis & Visualization), Nov 2013–Mar 2016
National Academy of Sciences of Ukraine, Kyiv, Ukraine
• Collected, validated, and structured experimental datasets for statistical modelling, enforcing rigorous data-quality checks to ensure accuracy and consistency.
• Produced analytical reports and data visualizations that drove insights, supporting peer-reviewed publications and patented innovations.
• Presented findings at international conferences, translating complex analyses into clear, actionable insights for diverse audiences. -
Education
Master of Data Science and Analytics (Honours), 2025
University of Calgary, Calgary, AlbertaCertificate in Artificial Intelligence, 2024
University of Toronto School of Continuing Studies, Toronto, OntarioCertificate in Data Science, 2022
University of Toronto School of Continuing Studies, Toronto, OntarioOntario College Diploma (Honours) in Internet Applications and Web Development, 2019
Algonquin College of Applied Arts and Technology, Ottawa, Ontario -
Awards
Diamond Rewards — Be the Expert, 2025
BGE Indoor Air Quality Solutions, Edmonton, AlbertaAssistant Deputy Minister's Merit Award — Collaboration & Service Excellence, 2024
Health Canada, Ottawa, OntarioAssistant Deputy Minister's Merit Award — Contribution to the Improvement of the Health of Canadians, 2024
Health Canada, Ottawa, OntarioCOVID-19 Commemorative Coin — Support & Contribution to Canada's COVID-19 Response Efforts, 2023
Public Health Agency of Canada, Ottawa, Ontario
Contact
If you have any questions or would like to connect, feel free to reach out:
+1 613 700 4510