Dr. Martin's Data Mining Page



Summer Reading
Here are some suggestions for learning more about data mining and statistics:
Think Stats
Think Bayes
Standard Deviations: Flawed Assumptions, Tortured Data, and Other Ways to Lie with Statistics

How to Lie with Statistics
Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are
Weapons of Math Destruction


Wine Group
Using Data Science to Understand What Makes Wine Taste Good - from freeCodeCamp
How to Use Machine Learning to Predict the Quality of Wines - from freeCodeCamp
Wine Quality Data Set - from the Machine Learning Repository at UC Irvine

Brix Article


Health Group
New England Journal of Medicine Article
LA Times Article
Kids Count Data Center


WPST Group






Resources

Data Mining with WEKA:
WEKA
FutureLearn Course

Udacity Statistics Courses:
https://www.udacity.com/course/intro-to-descriptive-statistics--ud827
https://www.udacity.com/course/intro-to-inferential-statistics--ud201

Udemy Machine Learning with Python and R
https://www.udemy.com/machinelearning/learn/v4/overview



Internships

Pathways to Science
Code 2040




Books for Interviewing
Cracking the Coding Interview
Programming Interviews Exposed: Secrets to Landing Your Next Job 3rd Edition


=======================================
NLP

Books

Natural Language Processing with Python -Analyzing Text with the Natural Language Toolkit

Steven Bird, Ewan Klein, and Edward Loper
http://www.nltk.org/book/

Natural Language Toolkit
https://www.nltk.org/