Skip to content
View nellocoder's full-sized avatar

Block or report nellocoder

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nellocoder/README.md

Profile views

πŸ‘‹ Kishoyian Duncan

πŸ”¬ Research Analyst Β· πŸ“Š Data Scientist Β· πŸ“ˆ Statistician

Research analyst and data scientist with expertise in statistical modelling, ML pipelines, and Monitoring, Evaluation, and Learning (MEL) frameworks for evidence-based decision making.

Translating complex data into actionable insights for policy, public health, and operational decision-making.



πŸš€ Featured Projects

Regression pipeline predicting airline delay minutes from flight volume and carrier data.

Metric Result
RΒ² Score ~0.95 on monthly aggregates
Key Insight Carrier identity explains ~30% of delay variance beyond traffic volume
Stack Python Β· Pandas Β· Scikit-learn Β· Seaborn Β· Jupyter

End-to-end ML workflow for diabetes risk prediction using the Pima Indians dataset.

Metric Result
Accuracy ~75% with Logistic Regression
Segmentation K-Means (k=3) identifies distinct metabolic-risk patient profiles
Stack Python Β· Pandas Β· Scikit-learn Β· Matplotlib

Compartmental ODE model evaluating educational campaign impact on alcohol-use population dynamics.

Metric Result
Model Type SIR-style compartmental ODE
Output Equilibrium behaviour & sensitivity analysis via phase-plane visualisation
Stack Python Β· SciPy Β· NumPy Β· Matplotlib

Statistical and network-based analysis of crime patterns for intelligence-led decision making.

Metric Result
Method Network centrality & exploratory spatial analysis
Insight Identifies core distribution nodes and brokerage roles in urban networks
Stack Python Β· NetworkX Β· Pandas Β· Matplotlib

πŸ“Œ Portfolio Standards

All featured repositories are built to these standards:

  • βœ… Reproducible β€” Every repo includes requirements.txt and step-by-step execution instructions
  • βœ… Documented β€” Methodologies, assumptions, and limitations are explicitly stated
  • βœ… Licensed β€” Open-source licenses for reuse and collaboration
  • βœ… Decision-focused β€” Outputs designed for policy-makers and operational teams, not just technical peers

🧠 Core Areas

  • πŸ“Š Statistical Modelling β€” Inference, regression, and experimental design
  • πŸ€– Machine Learning β€” End-to-end pipelines for regression, classification, and clustering
  • πŸ₯ Public Health Research β€” Epidemiological analysis and intervention modelling
  • πŸ—ΊοΈ Crime & Geospatial Analysis β€” Pattern detection, network analysis, and spatial statistics
  • πŸ“‹ MEL Systems β€” Monitoring, Evaluation, and Learning framework design and implementation
  • πŸ”¬ Mathematical Modelling β€” Compartmental ODEs for assessing public health intervention impact

πŸ› οΈ Tech Stack

Category Tools
Languages Python Β· R Β· SQL Β· TypeScript
Data & ML Pandas Β· Scikit-learn Β· Statsmodels Β· NumPy
Visualization Matplotlib Β· Seaborn Β· Power BI Β· ggplot2
Data Collection KoboToolbox Β· REDCap Β· ODK Β· SPSS Β· STATA
Workflows Git Β· Jupyter Β· RMarkdown Β· GitHub Actions

πŸ“ About

Based in Nairobi, Kenya πŸ‡°πŸ‡ͺ

Work focuses on applying statistical and machine learning methods to public health, crime analysis, and decision-support systems.

Principles: Reproducibility Β· Interpretability Β· Policy-relevant insights

Contributions and replications are welcome. All flagship repositories include open-source licenses and end-to-end execution instructions.


πŸ‘¨β€πŸ« Teaching & Academic Contributions

  • πŸ“š Developed instructional materials for undergraduate mathematics (Algebra, Calculus)
  • πŸ“ Produced LaTeX-based academic and technical documents
  • πŸŽ“ Supported training in data collection tools (KoboToolbox, ODK) and reproducible analysis workflows

πŸ” Currently Working On

  • πŸ—ΊοΈ Crime pattern analysis using geospatial and statistical methods
  • πŸ” Strengthening reproducible research workflows with CI/CD for Jupyter notebooks
  • πŸ“Š Building MEL dashboards for public health programme evaluation

🀝 Open To

  • Research collaborations in public health, epidemiology, and social policy
  • MEL system design, implementation, and third-party evaluation
  • Data science and analytics consultancy for NGOs and government agencies
  • Academic partnerships, peer review, and teaching opportunities

πŸ“« Let's Connect

Open to research collaborations, MEL consultancy, and data-driven projects.

Pinned Loading

  1. diabetes-prediction-analysis diabetes-prediction-analysis Public

    End-to-end machine learning pipeline for diabetes risk prediction with EDA, preprocessing, and classification

    Jupyter Notebook 1

  2. Mathematical-Modeling-of-Alcoholism Mathematical-Modeling-of-Alcoholism Public

    Compartmental ODE model analyzing the impact of educational interventions on substance use

    Jupyter Notebook 1

  3. movie-sentiment-analyzer movie-sentiment-analyzer Public

    It uses sentimental analysis to analyze and predict reviews on movies.

    Jupyter Notebook 1

  4. flight-delay-prediction flight-delay-prediction Public

    Regression-based machine learning model for predicting airline delays using Python and scikit-learn

    Jupyter Notebook 3

  5. Crime_analysis Crime_analysis Public

    Exploratory and statistical analysis of crime data patterns with potential geospatial insights

    Jupyter Notebook