Me


Vivian Ellis

Data Scientist

I'm a Data Scientist with over three years of experience building predictive models and creating interactive reports that drive product innovation and provide valuable data insights.

Develop Icon

Machine Learning

I accurately predict customer lifetime value using advanced techniques, such as MBG/NBD models with Deep Neural Networks and XGBoost. Regression analysis allows me to gain valuable insights into customer behavior and its connection to predicted lifetime value, allowing me to develop precise strategies for maximizing customer value. By applying these advanced ML methodologies, I can efficiently pinpoint high-value customers and offer marketing strategies to boost profit.

Analyze Icon

Analyze

As a skilled data analyst, I provide strategic guidance by conducting comprehensive data analysis using statistical techniques and machine learning algorithms to uncover patterns and trends in large datasets. My adeptness at leveraging data to guide strategic decision-making and optimize business processes is a key asset of mine.

Visualize Icon

Visualize

I have developed over 20 Looker Studio dashboards featuring visualizations for experimental analysis, behavioral insights, and model metric tracking. My expertise in data visualization helps convey complex data in an understandable and actionable format, aiding stakeholders in making informed decisions based on clear, data-driven insights. Besides Looker Studio, I have experience in Streamlit, Seaborn, Plotly, and Bokeh.

About Me

Hi, I'm Vivian 👋🏻

I am Vivian, a data scientist with extensive experience developing machine learning models using Deep Neural Networks and XGBoost. My work directly impacts customer lifetime value (CLV) prediction and provides data-driven insights for making informed decisions for various campaigns, including political and marketing initiatives. Feel free to explore my portfolio to learn more about my data visualization and machine learning expertise.

Data Analyst

Eastern Washington University

Sept 2017 - July 2019

Implementation Consultant

Fast Enterprises

Aug 2019 - Mar 2021

Data Analyst

PredictWise

May 2021 - Aug 2022

Jr. Data Scientist

Ocurate

Aug 2022 - Oct 2023

Data Scientist

Ocurate

Oct 2023 - June 2024

Portfolio

Selections from my recent personal projects

Project Image

Mushroom Classification

July 2024

Random Forest Overfitting Feature Engineering

What mushroom characteristics cause certain death and which are most edible.

Project Image

Predicting Housing Prices

June 2024

XGBoost Ridge Regression Ensemble Models Feature Engineering

A Kaggle competition to predict the final price of a home.

Project Image

A/B Testing

June 2024

Hypothesis Testing T-test Two Way ANOVA

This project is an A/B test conducted on an e-commerce website to determine the impact of a website variation on profit and customer satisfaction.

Project Image

Significance of a Fireplace in the Home

June 2024

Hypothesis Testing Shapiro-Wilk Kruskal-Wallis Dunn test

This projects determines the statistical significance between the cost of a house and how many fireplaces are present in the home.

Project Image

Urbanism Score

May 2024

EDA Data Cleaning Statistical Analysis Data Visualization Streamlit

Find US cities by focusing on your preferences for public amenities. Specify which amenities matter most to you, and the project will generate a personalized ranking of cities.

Project Image

Cohort Analysis

Dec 2023

Looker Studio

A monthly interval of historical and future revenue of customers grouped by first-order month.

Project Image

Video Game Sales

Oct 2023

Bokeh

What's the highest player rating of all time? And could it match the title of the best-selling game?

Project Image

Customer Insights

May 2023

Looker Studio Elastic Net

An acquisition dashboard that illuminates profitable marketing campaigns and the importance of customer attributes.

Project Image

At-Risk Churn Customers

Jan 2023

Looker Studio Z-test Power Analysis

An experiment conducted to evaluate the effectiveness of discount strategies on reducing customer churn and their impact on customer lifetime value (CLV)

Project Image

Podcast AD Hoc Retrieval

June 2020

Bayesian Network Vector Space Model Information Retrieval

A recommendation system to enhance the search functionality within Spotify and enable users to find a jump-in point for relevant podcast episodes.

Project Image

MySQL Auto-Complete

Dec 2017

Databases

A database design tool that assists users in making context aware suggestions.