I am a self-motivated individual who has developed a passion for Statistics and Data Analysis across my tertiary studies. I am currently working as a Lead Analyst where my main tasks involve analysing data solutions, developing technical reports and mentoring and supporting team members.
Open Polytechnic | Te PÅ«kenga
Lead Analyst - Strategic Insights Team
01/2021 - present
Analyst - Strategic Insights Team
01/2021 - 11/2021
Massey University
Research Assistant
08/2020 - 12/2020
Harmonic Analytics
Shiny Developer Intern
03/2020 - 08/2020
Victoria University of Wellington
Master of Applied Statistics
2019 - 2020
Otago University
Bachelor of Science in Statistics
2015 - 2018
Communication & Writing skills
Creativity & Critical observation
Teamwork skills
Work well under pressure
R Studio, R Shiny, R Markdown
Python, Jupyter Notebook
HTML and CSS
Excel, Power BI, SSMS
Research study documenting the cluster analysis of ordinal and binary lingustics data (obtained from the Ewave data source). The model-based clustering approach using finite mixtures was proposed and the general ordinal models were described in the context of one-mode and two-mode hard clustering.
Shiny application allowing the user to perform interactive analysis on Kaggle suicide statistics accumulated at world level. I built this application when I first learned how to use Shiny for a project assignment at university in 2019.
Shiny dashboard presenting time series analysis of randomly generated data. This dashboard was reproduced using the structure of the dashboard I had previously created for a client during my internship at Harmonic Analytics.
In this report summarised a variety of methods used for fitting time series data. The aim is to investigate and compare different models' perfomances and predictive ability based on accuracy metrics and prediction results.
Applying multinomial logistic regression and Kaplan-Meier survival methods to health data. The purpose of this study is to examine the long term effect of Carpentier-Edwards Ring or Band annuloplasty in patients under the age of 18 years old. ICMR is a recently released medical term short for Isolated Congenital Mitral Regurgitation.
I wrote this report in 2019 for a university assignment. The research question was raised to investigate the relationship between the amount of fish intake and mercury levels found in fisherman hair living in Doha fishing village, Kuwait. I recently did the analysis again and rewrote the findings in order to improve my statistical writing skills.
This follow-up report explores three different statistical methodologies, contingency table analysis, Bayesian approach to multiple multinomial logistic regression and Random Forest classification algorithm and their application to health data. The results were reported to conclude the relationship between the side effect of annuloplasty Band and the clinical outcome of Mitral Regurgitation in patients with mitral valve diseases.
This short notebook describes the classification using Random Forest in a multi-class setting, where one class is fitted against all the other classes for each classifier. Confusion matrix and other performance metrics were reported to compare amongst models fitted for imbalanced and balanced health data.
Data visualisation of the number of positive COVID-19 cases (fictious) by ethnicity at Statistical Area 2018 geographical level, using highcharter
and flexdashboard
packages. Please make sure the dashboard is viewed in full screen for a better experience.
Application made in Python Dash, summarising data description and sklearn
machine leanring (Regression and Classficiation) algorithms. The data used is available at this github source.