Most library newspaper databases prohibit text mining. Exceptions are listed below:
For more detailed information on text mining library paid databases, read the guide here.
There are some tools that allow you to conduct text analysis using contemporary and historic newspapers, while prohibiting or limiting full-text download. Some of these targeted tools are listed below, with a description of their content and capabilities:
If you require a customized and contemporary corpus of news articles that cannot be acquired at any of the above sources, you may be able to manually compile a corpus of articles from free, publicly-available, online news sources. News sources that are published with a Creative Commons CC-0 BY attribution (for example, ProPublica or OpenNewswire) would be the most straightforward.
Researchers must always do their due diligence, read and adhere to the Terms of Service for the data source, and comply with local laws around copyright, data collection, and privacy. Check out the Web Scraping a Corpus page of this guide for more information.
This work is licensed under a Creative Commons Attribution NonCommercial 4.0 International License. | Details of our policy