Welcome!

I'm Alina Yildir, and I believe every dataset tells a story.

From exploratory data analysis and machine learning to deep learning, MLOps, and cloud computing—see how I turn raw data into actionable insights.

Dive in and uncover the narrative—reach out if something sparks your curiosity.

Get started

Projects

Below is a selection of my recent work. You can explore my GitHub profile or contact me directly for additional examples.

Nutrient Composition of Common Foods in Canada: Analyzing the Canadian Nutrient File

Tech Stack: NumPy Pandas Tableau Streamlit LangChain OpenAI API Chatbot

Interactive dietary tools leveraging the Canadian Nutrient File—a Tableau dashboard for in-depth nutrient comparisons and a Streamlit app featuring dynamic visualizations, clustering analytics, and an AI chatbot for personalized nutrition guidance. Presented at YYC DataCon 2025.

View on GitHub

AF Risk Prediction Using ECG & EHR Data

Tech Stack: NumPy Pandas Seaborn XGBoost Streamlit LangChain DeepSeek API Chatbot

An end-to-end pipeline and interactive app for forecasting new-onset atrial fibrillation using routinely collected 12-lead ECG signals and EHR data. The XGBoost model identifies high-risk patients, while the Streamlit interface and DeepSeek chatbot provide intuitive exploration and insights. Presented at Statistical Society of Canada Annual Meeting 2025.

View on Streamlit

Credit Card Default

Tech Stack: NumPy Pandas Seaborn Scikit-Learn XGBoost

A predictive modelling project to forecast credit card defaults using demographic and financial data from 30,000 account holders—combining exploratory data analysis, visualization, and machine learning (regularized classifiers and XGBoost) for robust risk prediction.

View on GitHub

Enhancing Bank's Personal Loan Approaches

Tech Stack: NumPy Pandas Seaborn Scikit-Learn TensorFlow Keras

A sequential neural network model predicting customer likelihood to accept personal loan offers—achieving 98% accuracy, 96% precision, and 87% recall—to drive targeted marketing strategies.

View on GitHub

Sleep Apnea Classification Using EEG Spectrograms

Tech Stack: ResNet50 YOLOv5 YOLOv8

An end-to-end pipeline for four-class sleep apnea severity classification using EEG spectrograms from PSG recordings. YOLOv8 delivers fast, high-accuracy detection across healthy, mild, moderate, and severe cases, while ResNet64 and YOLOv5 benchmarks underscore YOLOv8's streamlined architecture and superior performance.

View on GitHub

Data-Driven Department Optimization

Tech Stack: NumPy Pandas Seaborn Scikit-Learn XGBoost Prophet TensorFlow Keras NLTK

A suite of data-driven solutions optimizing operations across HR, Marketing, Sales, Operations, and PR. Leveraging machine learning, deep learning, and advanced analytics to tackle departmental challenges—from predicting employee turnover with logistic regression and deep nets, to customer segmentation via K-Means and autoencoders, sales forecasting with Prophet, medical diagnostics with deep models, and PR sentiment analysis using Naive Bayes and logistic regression.

View on GitHub

GDP & Population: 2024 Olympic Medals

Tech Stack: NumPy Pandas Seaborn Scikit-Learn

An analysis of how GDP, population, and socio-economic indicators relate to medal performance at the 2024 Olympics—using correlation matrices, regression models, and clustering techniques to uncover key drivers of success.

View on GitHub

Database Chatbots

Tech Stack: Python Streamlit LangChain OpenAI API Chatbot

AI-powered agents for conversational interaction with relational data—transforming natural language prompts into SQL queries and insights for data-driven decision-making.

View on GitHub

Designing an Intelligent Agent for the Wumpus World

Tech Stack: Python NetworkX Pomegranate

An intelligent Wumpus World agent that navigates a grid of hazards and searches for gold under uncertainty. It perceives environmental cues—breezes near pits and stenches near the Wumpus—applies logical inference to deduce safe paths, and balances exploration with caution to avoid fatal encounters, demonstrating key AI concepts such as decision-making under uncertainty, logical reasoning, and informed search strategies.

View on GitHub

YT Channel Analytics

Tech Stack: Python NumPy Pandas Streamlit

A Streamlit app for analyzing YouTube channel performance—tracking subscriber growth, views, watch hours, likes, comments, and shares to uncover engagement insights.

View on Streamlit

Wine Dataset Analysis

Tech Stack: NumPy Pandas Seaborn

An exploratory analysis of red and white wine quality—examining the proportion of high-quality wines, identifying key differentiators (e.g., alcohol content, sulphur dioxide), and evaluating their impact on ratings.

View on GitHub

Unlocking Superconductor Potential: Predicting Critical Temperatures with Multiple Regression

Tech Stack: R

A multiple regression analysis to predict superconductors' critical temperature (Tc), identify the most impactful material features, and evaluate model performance.

View on GitHub

Life Expectancy EDA: Key Influencing Factors

Tech Stack: R

A focused exploratory analysis of life expectancy drivers—examining healthcare investment, vaccination coverage, and socio-economic indicators—and contrasting trends across developed and developing countries.

View on GitHub

Sales & Profit Analytics

Tech Stack: Tableau

An interactive dashboard for analyzing sales and profit trends—with customizable filters like year, category, region, and city—so businesses can track performance, identify top products and markets, and optimize strategies.

View on Tableau Public

British Airways Review Analytics

Tech Stack: Tableau

An interactive dashboard for customer reviews—track satisfaction, cabin crew, entertainment, food, and comfort; explore sentiment trends, compare seat classes and regions, and filter by date, traveller type, and region to pinpoint improvement areas.

View on Tableau Public

Articles

Browse through my collection of data science articles, where I share insights from my projects, practical tips, and updates on industry trends. You can explore more of my work on Medium.

What's in Your Food? A Data-Driven Nutrient Analysis

This article explores the nutritional content of commonly consumed foods in Canada, emphasizing the importance of balanced diets for maintaining overall health. Using data from The Canadian Nutrient File by Health Canada, the analysis covers 12 key nutrients, compares nutrient densities, and evaluates protein-to-fat ratios across food categories. The dataset, refined through thorough preprocessing, enabled detailed insights into nutrient levels, highlighting nutrient-rich and nutrient-poor foods.

View article

Analyzing 2024 Olympic Medals: The Role of GDP and Population

This article explores the relationship between a country's population size and GDP per capita and its success in the 2024 Olympic Games. By analyzing these variables, we aim to understand how they influence the total number of medals won by different countries. The article also involves clustering countries based on their Olympic performance and socioeconomic context, providing insights into patterns and trends affecting their athletic achievements. Through this analysis, we seek to uncover the extent to which economic and demographic factors contribute to Olympic success.

View article

Understanding the P-Value in Hypothesis Testing

In hypothesis testing, we start with the null hypothesis (H0), assuming no effect or difference. We then collect our observed results—the actual data from the study—and calculate the p-value, which tells us how surprising these results are if H0 is true. A small p-value (e.g., < 0.05) leads us to reject H0, considering the alternative hypothesis (H1) as a possible explanation. A large p-value suggests our results align with H0, so we fail to reject it. Importantly, the p-value is not the probability that H0 is true; instead, it's the probability of observing our results, or more extreme ones, under the assumption that H0 is correct.

View article

About

Having developed a solid technical foundation in data science through my academic studies and hands-on projects, I am eager to share my expertise. Below is a summary of my skills, education, recent certifications, work experience, and awards. I look forward to discussing how my background can support your goals.

  • Exploratory Data Analysis (EDA)

    SQL, pandas, matplotlib, seaborn, Tableau, Power BI

    Machine Learning

    scikit-learn, XGBoost, LightGBM, Prophet

    Deep Learning

    TensorFlow, Keras, PyTorch

    MLOps & Cloud

    AWS (S3, SageMaker, Lambda), FastAPl, Streamlit, Docker, Git, GitHub

  • Deep Learning Research Assistant, May 2025 - Aug 2025
    University of Calgary, Calgary, Alberta
    • Developed a deep learning model to analyze EEG signals and accurately diagnose sleep disorders such as insomnia and sleep apnea.

    Communications Officer, Sep 2020 - Aug 2024
    Health Canada, Ottawa, Ontario
    • Developed and maintained over 500 WCAG 2.1 AA-compliant web pages on Canada.ca using Adobe Experience Manager, HTML, and Web Experience Toolkit, reinforcing the government's commitment to inclusivity and accessibility for millions of citizens.
    • Created 50+ WCAG 2.1 AA-compliant PDF forms using Foxit, significantly enhancing user experience and supporting the mission of equal access to information.
    • Facilitated cross-functional team discussions, leveraging collaborative communication to efficiently align efforts and consistently meet project deadlines, achieving a 100% on-time completion rate.

    Web Developer, Mar 2019 - Sep 2019
    OPIN, A Portage CyberTech Company, Ottawa, Ontario
    • Developed and maintained Drupal websites compliant with WCAG 2.1 AA standards, including Holland Bloorview, Hydro Ottawa, and York Region District School Board, focusing on usability to promote easy access and interaction with site features, serving over 300,000 users monthly.
    • Produced efficient, well-tested, refactored, documented, maintainable, and extendable HTML, CSS, JavaScript, and PHP code, improving overall site functionality and user engagement.

    Doctoral Researcher, Nov 2013 - Mar 2016
    A. V. Dumansky Institute of Colloid and Water Chemistry, National Academy of Sciences of Ukraine
    • Collected and prepared data for analysis, ensuring accuracy, completeness, reliability, relevance, and timeliness, significantly elevating the quality of research findings and optimizing project deliverables.
    • Conducted advanced statistical analyses across five research projects, generating actionable insights that contributed to the successful development of two patented devices:
    1. Goncharuk, V.V., Taranov, V.V., & Kurliantseva, A.Y. (2017). Device for photometric determination of nitrates in aqueous solutions. [UA Patent No. 116728]. https://sis.nipo.gov.ua/en/search/detail/801092/
    2. Taranov, V.V., & Kurliantseva, A.Y. (2015). Device for determining particles. [UA Patent No. 97578]. https://sis.nipo.gov.ua/en/search/detail/885891/
    • Developed comprehensive data visualizations and analytical reports using R, ensuring that research findings were easily interpretable for grant proposals and academic publications.
    • Presented findings at over 10 international conferences and published three research papers in peer-reviewed journals, resulting in increased citations and broadening the impact of the research:
    1. Goncharuk, V.V., Kurliantseva, A.Y., Taranov, V.V., & Nifantova, L.S. (2016). Quality and quantitative assessment of the impact of magnetic field and ultrasound on water with different concentrations of deuterium. Journal of Water Chemistry and Technology, 38(3), 143–148. https://doi.org/10.3103/S1063455X16030048
    2. Goncharuk, V.V., Taranov, V.V., Kurliantseva, A.Y., & Syroeshkin, A.V. (2015). Phase transition in waters with different content of deuterium. Journal of Water Chemistry and Technology, 37(5), 219–223. https://doi.org/10.3103/S1063455X15050021
    3. Goncharuk, V.V., Kurliantseva, A.Y., & Taranov, V.V. (2014). Detection of heterogeneities of water medium. Journal of Water Chemistry and Technology, 36(5), 205–210. https://doi.org/10.3103/S1063455X14050014
    (Publications and patents are listed under the name Alina Kurliantseva.)

  • Master of Data Science and Analytics, Sep 2024 - Aug 2025
    University of Calgary, Alberta

    Certificate in Artificial Intelligence, 2024
    University of Toronto School of Continuing Studies, Ontario

    Certificate in Data Science, 2022
    University of Toronto School of Continuing Studies, Ontario

    Ontario College Diploma (Hons) in Internet Applications and Web Development, Sep 2016 - Jun 2019
    Algonquin College of Applied Arts and Technology, Ottawa, Ontario

  • 2023 Assistant Deputy Minister's Merit Award – Collaboration and Service Excellence, 2024
    Health Canada, Ottawa, Ontario
    Recognized for exceptional collaboration and service delivery within or outside the Branch.

    2023 Assistant Deputy Minister's Merit Award – Contribution to the Improvement of the Health of Canadians, 2024
    Health Canada, Ottawa, Ontario
    Acknowledged for making a significant contribution to improving the health of Canadians.

    COVID-19 Commemorative Coin, 2023
    Public Health Agency of Canada, Ottawa, Ontario
    Awarded for support and contribution to Canada's COVID-19 response efforts.

Contact

If you have any questions or would like to connect, feel free to reach out:

+1 613 700 4510

yildir.a.mdsa@gmail.com