What follows is a brief description of each folder and its contents
- dynamicPages: Contains script responsible for collecting online newspaper articles from dynamic website
- Written in Python and using the Selenium package
- eventRegistry: Contains script responsible for collecting articles from the Event Registry API
- Written in Python
- facebook: Contains script responsible for collecting posts from public facebook pages
- Written in Javascript and using the Puppeteer package
- news: Contains multiple scripts responsible for collecting articles from various online newspaper websites based on provided configuration
- Written in Python and using the Scrapy framework
- reddit: Contains script responsible for collecting reddit comments from the official Reddit API
- Written in Python and using the Praw framework
- twitter: Contains script responsible for collecting tweets from the official Twitter API
- Written in Python
- youtube: Contains script responsible for collecting youtube video comments from the Youtube Data API
- Written in Python