Text Mining
Are you looking for patterns in large sets of text or researching ways to make sense of textual data using sentiment analysis, topic modeling, or other methods? Whether you’re new to text mining or stuck on a specific text mining question, we’re here to help!
What support is available for Text Mining?
We can help you with:
- Starting text mining projects
- Web Scraping, Information Retrieval, Text Collection Methods (API)
- Machine Learning for Classification & Clustering
- Natural Language Processing
- Python, R, SQL
Resources
Tools
- Web Scraping:
- Programming-based - Beautiful Soup, Scrapy, Selenium
- Commercial Software (Free/Paid) - ParseHub, Dexi.io, Scraping-bot.io
- Text Cleaning:
- TextClean - Collection of open-source tools for cleaning & normalizing text documents in R
- OpenRefine - Open-source data cleaning tool (formerly Google Refine)
- Trifacta Wrangler - Free tool for data preparation
- Text Analytics & Visualization:
- Rosette Text Analytics - Suite of interoperable components for text analytics
- WordStat - Advanced Content Analysis
- Apache OpenNLP - Document Categorizer and more
- Natural Language Toolkit (NLTK) - Suite of Python libraries for natural language processing