Amazon Video Games Reviews (Topic Model)
Per-Document-Topic-Distribution (CSV)
Python Analysis: Random Forest Classifier (Jupyter Notebook)
This data has been gathered from the Guardian Open Platform. Please click on the link below to directly downloaded a well-curated test file for exploring the options of the MineMyText platform. Choosing a 50-topics-analysis is a good start for the first experiments with this dataset. Lemmatizing and removing numbers enhances the quality of the result. Please note that the analysis will take roughly 5-10 minutes to run through. You will receive an email once the analysis is ready.
Guardian World News (JSON)
Note: In case that the download does not work properly, please make a right click on the link and choose "Save as ..." to save the file on your harddrive.
We have also performed a completed analysis of the Guardian World News (3 Months) for demo purposes.