The Library subscribes to the Gale Digital Scholar Lab, an interactive interface that supports text mining for the majority of the Gale Primary Sources acquired by the library, but limits any dataset to 10,000 documents.
Researchers may request text mining access to content from most Gale Primary Sources. These are delivered to the Library as XML and PDF files. The Library has acquired the collections listed below. Contact us about others.
Note: the Financial Times will not license their historical archive for text mining at this time
The Library has secured text mining rights from the publisher, Adam Matthew The Library has purchased multiple archives from Adam Matthew, including the collections below.
The Text Creation Partnership is a joint effort to transcribe historical texts from three major databases
The Library subscribes to the full versions of these databases which provide page images but not easy access to machine readable text. The TCP has made that available for researchers to download for selected titles from two of these databases.