NLP, mL & NN
Predict Political Affiliation Based on Tweets
A Binary Classification Problem
- Pull Tweets from Tweepy, the Twitter API
- Prepare Text for NLP; Tokenize, Remove Stop Words, Add Sentiment Polarity Scores
- Perform Vectorizations like Bag-of-words, TFIDF, GloVe, Word2Vec
- Experiment with NLP Techniques; Lemetization and POS (Part-Of-Speech) Tagging
- Build Machine Learning Classification Models and Neural Networks (RNN, CNN, ANN)
Sentiment
NLP Analysis and Visualizations
Multivariate Comparisons
- Pre-Process/Clean Text Data from Twitter
- Assign Sentiment Scores using VADAR
- Perform Feature Engineering
- Explore Distributions and other Visualizations
- Build Word Clouds
Flask & Bokeh
Interactive Visualizations on Flask
Display Visualizations in a Dashboard
- Build an Online Dashboard using Flask, a Lightweight Pathonic Web Framework
- Display Data in Interactive Graphs using Bokeh
- Perform Vectorizations like Bag-of-words, TFIDF, GloVe, Word2Vec
- Enable Form Elements and HTTP POST to drive Interactive Visualizations
BIG Data
Use PySpark to Analyze Big Data
Multivariate Comparisons
- Pre-Process/Clean Text Data from Twitter
- Assign Sentiment Scores using VADAR
- Perform Feature Engineering
- Experiment with NLP Techniques; Lemetization and POS (Part-Of-Speech) Tagging
- Build Machine Learning Classification Models and Neural Networks (RNN, CNN, ANN)
A few of my Favorite Data Science Links






Previous
Next