Hello, I’m Nathan! This portfolio is intended to show my growing knowledge and skillset as it relates to data analysis and data science. I’ve always had a strong passion for analytics, and upon receving my degree in Health Administration I decided to pursue this passion further. Over the past year I have made countless strides in strenghting my abilites as a detail-oriented thinker and working with data tools. I have gained vital experience working with Excel, SQL, Tableau, and others. My future goals are to, hopefully, land a position working with data on a day-to-day basis, and to eventually enter a Master’s program with a focus on Data Science.
Below, you will be able to find some projects I have worked on in the past as well as some other skills and certificates I have obtained. My intention is to continuosuly update this portfolio, so I hope you’ll be able to follow my journey!
In this section, you will find some of the more in-depth projects I have completed, with a brief description of each and code attached.
Description: This Project’s focus was to analyze a datset of Yelp reviews of various businesses from 2005 to 2022. The goal was to help a fictional start-up coffee company in providing insights for location, market demographics, and other various business advice. This dataset included 5 distinct tables with information on businesses, reviews, Yelp users, tips users could leave, and times users checked into the business. In order to accompolish the intended goals, the data was loaded into Google Cloud, cleaned & processed using Google BigQuery, and thustly analyzed. Exploratory Data Analysis and correlation analysis were performed as well as creating a predictive model based on text sentiment analysis on keywords left on reviews. All visualizations were created using Tableau.
Skills: Data Analysis, ETL, Data Modeling, Data Cleaning, Relational Database Mgmt, Cloud Storage
Technology: Google Cloud, SQL (Google Big Query), Microsoft Excel, Tableau
Description: The aim of this project was to provide insights on global financial metrics for various products and segments. The original data was taken from an Excel file containing 700 rows and 16 columns. This data included various financial metrics (profit, COGS, sales, etc.) for products in various countries and market segments. One area I found in which profit could potentiallly be increased was in altering the discounts offered for the products. Per the line and column chart in the center of the dashboard, products classifed in the high and medium discounts bands produced significantly lower profit then low and none. On top of this, there were over double the amount of products produced in these low profit bands. Lowering discounts on these products, could provide a key opportunity in increasing profitability.
Skills: Data Cleaning, Dashboard Creation, Data Visualization, Data Modeling & Transformation, Financial Analysis
Technology: Power BI
Findings/Results: Profit was stagnant for most of the year, with a sharp increase in the winter months, specifically Ocotober. Gross Sales by month followed this trend as well. Predictably, Goverment was the domineering segment, contributing to 65% of the total profit. One area I found in which profit could potentiallly be increased was in altering the discounts offered for the products. Per the line and column chart in the center of the dashboard, products classifed in the high and medium discounts bands produced significantly lower profit then low and none. On top of this, there were over double the amount of products produced in these low profit bands. Lowering discounts on these products, could provide a key opportunity in increasing profitability.
This section comprises some additional, smaller projects that I have worked on.
### A/B Testing in SQL Dataset: ‘dsv1069 on Mode Analytics, tables detailing users, events, orders, etc. for an online retailer
Goal: Determine if making an item-level change on website configuration will affect the amount of items that are viewed and ultimately ordered
Skills: Joins, Subqueries, AB Testing, Stastistics, Hypothesis Testing
### Insurance Claims Analysis
dataset To open the excel file click “open raw” and then download
Dataset: Free public dataset downloaded from Kaggle.com. Contains demographic data on 1,339 patients and charges billed to their respective insurances.
Goal: Determine if any causual factors (BMI, age, smoker status, etc.) can explain dollar amount charged to insurance
Skills: Microsoft Excel, Data Cleaning, Conditional Formatting, Linear Regression, Statistics, Visualizations, Dashboards