Data Science Projects


Applied Machine Learning for Healthcare

Machine learning algorithms in Python applied to real-world life science problems.

Projects: Autism Screening, DNA Classification, Breast Cancer Detection, Heart Disease Prediction

Methods: SVM, KNN, and neural networks. Libraries: pandas, NumPy, scikit-learn, and Keras.


Shelf Finder (tentative)

A machine learning script that detects text in an image, then uses that text to map the location of the shelf.

Click for more details for this project.




Language Processing in Python

Click here for NLP projects

Text Mining the Bible

Text mining with the tm (Text Mining) package in R.

Blog

PDF of Project


Potential Attacker Analysis

A simple data analysis showing that potential attackers can be identified.

Project tools: R and Excel

Project in PDF format






Titanic Project (Kaggle)

The purpose of this project was to understand the algorithms available for a classification task, using the Titanic dataset. The data describes passengers by attributes such as gender, age, and passenger class. The trained models are then used to predict which passengers survived.

Project tools: R and Excel | SVM, naive Bayes, kNN, and Random Forest models

Project in PDF Format

R code in PDF Format
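
As an illustration of one of the model types listed for the Titanic project, here is a minimal k-nearest-neighbors sketch. It is written in plain Python rather than the R used in the project, and it is not the project's code. The function name `knn_predict`, the feature encoding, and the six training rows are all made-up assumptions for demonstration, not values from the Kaggle dataset.

```python
from collections import Counter
from math import dist


def knn_predict(train_rows, train_labels, query, k=3):
    """Predict a label by majority vote among the k nearest training rows."""
    # Rank training indices by Euclidean distance to the query point.
    nearest = sorted(
        range(len(train_rows)),
        key=lambda i: dist(train_rows[i], query),
    )[:k]
    votes = Counter(train_labels[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical toy rows: (sex: 0 = male, 1 = female; passenger class; age scaled to 0-1).
train_rows = [
    (1, 1, 0.35), (1, 2, 0.28), (1, 1, 0.50),
    (0, 3, 0.30), (0, 3, 0.45), (0, 2, 0.60),
]
train_labels = [1, 1, 1, 0, 0, 0]  # 1 = survived, 0 = did not survive

prediction = knn_predict(train_rows, train_labels, (1, 1, 0.40), k=3)
```

In practice the features would need consistent scaling, since passenger class spans a wider numeric range than the scaled age and would otherwise dominate the distance.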

Personal Equity Plan (Apriori Algorithm example)

The purpose of this report is to use available algorithms to accomplish a classification task. The data is a CSV file containing people's demographic and banking attributes, along with whether they participate in a Personal Equity Plan (PEP).

Project tools: R and Excel | Apriori algorithm

Project in PDF format
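
For readers unfamiliar with Apriori: it is usually used to find frequent itemsets, from which association rules are derived. The sketch below shows only the frequent-itemset step, in plain Python rather than the R used in the project. The function name `apriori`, the attribute labels, and the four transactions are hypothetical and are not taken from the PEP data.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset meeting min_support."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions that contain every item in the itemset.
        return sum(itemset <= t for t in transactions) / n

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items}
    current = {s for s in current if support(s) >= min_support}

    frequent = {}
    k = 1
    while current:
        for s in current:
            frequent[s] = support(s)
        k += 1
        # Join step: combine frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in frequent for sub in combinations(c, k - 1))
        }
        current = {c for c in candidates if support(c) >= min_support}
    return frequent


# Hypothetical toy transactions; each set lists attributes true for one person.
transactions = [
    {"married", "mortgage", "pep"},
    {"married", "pep"},
    {"married", "mortgage"},
    {"single", "pep"},
]
frequent = apriori(transactions, min_support=0.5)
```

With a minimum support of 0.5, the pair `{"married", "pep"}` is kept while `{"mortgage", "pep"}` is dropped, so no three-item candidate survives the prune step.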

Classification by location

This assignment explored what information could be gathered about students' locations by using their zip codes and pinning them to a map.

Project tools: R and Excel for creating the maps, charts, and graphs | zipcode and ggmap packages in R

Project in PDF format


IMDB User Rating Prediction Model

This project classifies user ratings based on movie information, in this case the movie genre.

Project tools: R and Excel used

Project and code in PDF Format





Transformations

This project explored data transformations.

Project tools: R and Excel | plyr, ggplot2 and stringr in R

Project in PDF format


Distribution

This project investigates the distribution of price per carat, conditional on color.

Project tools: R and Excel used

Project in PDF Format

Code in PDF Format
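
To make "price per carat, conditional on color" concrete, the sketch below groups rows by color and summarizes price divided by carat within each group. It is plain Python rather than the R used in the project. The function name `price_per_carat_by_color` and the five rows are hypothetical, chosen only to show the calculation.

```python
from collections import defaultdict
from statistics import median


def price_per_carat_by_color(rows):
    """Group (color, carat, price) rows by color and summarize price per carat."""
    groups = defaultdict(list)
    for color, carat, price in rows:
        groups[color].append(price / carat)
    return {
        color: {
            "n": len(values),
            "median": median(values),
            "min": min(values),
            "max": max(values),
        }
        for color, values in sorted(groups.items())
    }


# Hypothetical toy rows: (color grade, carat weight, price).
rows = [
    ("D", 0.5, 2000.0), ("D", 1.0, 6000.0),
    ("G", 0.5, 1500.0), ("G", 1.0, 5000.0), ("G", 2.0, 12000.0),
]
summary = price_per_carat_by_color(rows)
```

Comparing the per-color summaries side by side is the tabular counterpart of plotting one distribution per color.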


Mapping Earthquakes

Final project mapping earthquakes that occurred over the preceding 50 years.

Project tools: R and Excel for creating the maps, charts, and graphs | Adobe Illustrator for cleaning up the images

Project in PDF format

PDF of R code










CUDA Test

A simple speed test showing the advantage of running computations on the GPU.

Project tools: Python and NumPy

PDF of CUDA Project



Database Management Project

This project required us to define a business problem, a solution, business rules, and an entity-relationship diagram (ERD).

Project in PDF format