NLP, ML & NN

Predict Political Affiliation Based on Tweets

A Binary Classification Problem
 
  1. Pull Tweets with Tweepy, a Python Client for the Twitter API
  2. Prepare Text for NLP: Tokenize, Remove Stop Words, Add Sentiment Polarity Scores
  3. Perform Vectorizations like Bag-of-Words, TF-IDF, GloVe, Word2Vec
  4. Experiment with NLP Techniques: Lemmatization and POS (Part-of-Speech) Tagging
  5. Build Machine Learning Classification Models and Neural Networks (RNN, CNN, ANN)
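
Steps 2 and 3 above can be sketched with the standard library alone. This is a minimal illustration, not the project's actual pipeline: the stop-word list, the two sample tweets, and the function names are all assumptions made up for the example, and the TF-IDF formula is the plain unsmoothed textbook variant (libraries often apply smoothing, so their numbers will differ).

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; real projects use a fuller one).
STOP_WORDS = {"a", "an", "and", "the", "to", "of", "is", "in", "for", "we", "our"}


def tokenize(text):
    """Lowercase, drop URLs, keep alphabetic tokens, remove stop words."""
    text = re.sub(r"https?://\S+", " ", text.lower())
    tokens = re.findall(r"[a-z']+", text)
    return [t for t in tokens if t not in STOP_WORDS]


def bag_of_words(tokens):
    """Bag-of-words: raw term counts for one document."""
    return Counter(tokens)


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    tf  = term count / document length
    idf = log(number of documents / documents containing the term)
    """
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc) or 1  # guard against empty documents
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical sample tweets, invented for this sketch.
tweets = [
    "We need to cut taxes for working families",
    "We need to protect healthcare for working families",
]
docs = [tokenize(t) for t in tweets]
bow = [bag_of_words(d) for d in docs]
weights = tf_idf(docs)
```

Terms shared by every tweet (here "need", "working", "families") get a weight of zero, while terms unique to one tweet ("cut", "taxes") carry the signal a classifier could use to separate the two classes.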
Sentiment

NLP Analysis and Visualizations

Multivariate Comparisons
 
  1. Pre-Process/Clean Text Data from Twitter
  2. Assign Sentiment Scores using VADER
  3. Perform Feature Engineering
  4. Explore Distributions and other Visualizations
  5. Build Word Clouds
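
As a rough sketch of the feature-engineering and word-cloud steps above, the snippet below derives a few simple per-tweet features and counts word frequencies, which is the kind of input a word-cloud renderer typically consumes. The feature names, helper functions, and sample tweets are hypothetical and chosen only for illustration; sentiment scoring is not shown here.

```python
import re
from collections import Counter


def tweet_features(text):
    """Derive simple numeric features from the raw text of one tweet."""
    return {
        "n_chars": len(text),
        "n_words": len(text.split()),
        "n_hashtags": len(re.findall(r"#\w+", text)),
        "n_mentions": len(re.findall(r"@\w+", text)),
        "n_exclaims": text.count("!"),
    }


def word_frequencies(texts, stop_words=frozenset()):
    """Count lowercase alphabetic words across all texts, skipping stop words."""
    counts = Counter()
    for text in texts:
        for word in re.findall(r"[a-z]+", text.lower()):
            if word not in stop_words:
                counts[word] += 1
    return counts


# Hypothetical sample tweets, invented for this sketch.
tweets = ["Vote today! #Election @voter", "Vote early, vote often"]
features = [tweet_features(t) for t in tweets]
freqs = word_frequencies(tweets)
```

The per-tweet dictionaries can be loaded into a table for plotting distributions, and the frequency counts can be handed to whichever word-cloud tool is in use.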
Flask & Bokeh

Interactive Visualizations on Flask

 
Display Visualizations in a Dashboard
 
  1. Build an Online Dashboard using Flask, a Lightweight Pythonic Web Framework
  2. Display Data in Interactive Graphs using Bokeh
  3. Perform Vectorizations like Bag-of-Words, TF-IDF, GloVe, Word2Vec
  4. Enable Form Elements and HTTP POST to drive Interactive Visualizations  
Big Data

Use PySpark to Analyze Big Data

Multivariate Comparisons
 
  1. Pre-Process/Clean Text Data from Twitter
  2. Assign Sentiment Scores using VADER
  3. Perform Feature Engineering
  4. Experiment with NLP Techniques: Lemmatization and POS (Part-of-Speech) Tagging
  5. Build Machine Learning Classification Models and Neural Networks (RNN, CNN, ANN)

A Few of My Favorite Data Science Links

Data Science Portfolio and Machine Learning Projects