Skip to content

Used two APIs to pull data from Reddit.com subforums and classify data, and extract insights.

Notifications You must be signed in to change notification settings

colinsimon/Classifying-NLP-Data

Repository files navigation

Project 3: Web APIs & NLP

Natural Language Processing (NLP) Experiments

Modeling and Evaluation

Colin Simon - 4/24/2020

Document Summary

In this project, we will gather data from two different reddit.com subforums('subreddits'). Then, we'll attempt to create a model that can classify which subreddit any given post came from.

- We will focus more on the effectiveness and analysis of the models in general than the utility of the predictor itself 
Key Files in folder:
- 01 Preliminary code.ipynb <-- unused
- 02 Pull Data - PRAW.ipynb <-- unused
- 03 Pull Data Pushshift API.ipynb <-- this data used
- 04 Explore Data.ipynb <-- primary file
- Language differences between Saving and Investing.key <-- slide deck
This Readme only lists contents of the following file:
- 04 Explore Data.ipynb <-- primary file  

Contents:

About

Used two APIs to pull data from Reddit.com subforums and classify data, and extract insights.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •