Text and Data Mining

Information on text and data mining resources available through the Library

Constellate (formerly JSTOR Data for Research)

Constellate is a text analytics service from the parent company of JSTOR. The Library subscribes at the Pedagogy level, which allows you to create and save up to five datasets using materials from JSTOR and partner publishers. Constellate does not allow you to download datasets for local analysis, but it does provide online tools, including online Jupyter Notebooks with Python.

Create a JSTOR account to access the online tools and save datasets. You can also choose the "login with Google" option to sign in with your CNetID.

HathiTrust Research Center

The HathiTrust Research Center (HTRC) enables computational analysis of the HathiTrust corpus. It offers a variety of tools for performing analysis while complying with copyright law. You must create an account at HathiTrust Analytics to use the Research Center, but you can now log in with your CNetID. You will receive a verification email to finish creating your account.