Text and Data Mining

Information on text and data mining resources available through the Library

Constellate (formerly JSTOR Data for Research)

Constellate is a text analytics service from the parent company of JSTOR. The Library subscribes at the Pedagogy level, which allows you to create and save up to five datasets using materials from JSTOR and partner publishers. Constellate does not allow you to download data for local analysis, but it does provide online tools, including hosted Jupyter Notebooks with Python.

Log in with your CNetID and password to access the online tools and save datasets.

HathiTrust Research Center

The HathiTrust Research Center (HTRC) enables computational analysis of the HathiTrust corpus. It offers a variety of analysis tools that comply with copyright law. You must create an account at HathiTrust Analytics to use the Research Center.