Step-by-step guide on how to get started with your text mining project along with examples of past text mining projects from UW researchers and students.
Tools for Web Scraping
Programming based
Python - Scrapy, BeautifulSoup
Selenium
R - rvest, RCrawler
Software
Parse Hub
Dexi.io
Scraping-bot.io
Tools for Text Cleaning
TextClean - Collection of open-source tools for cleaning & normalizing text documents in R
OpenRefine - Open-Source data cleansing tool by Google