Text and Data Mining

Information on text and data mining resources available through the Library

What Is Text Mining?

Text mining is the process of discovering previously unknown information from unstructured text. It generally consists of three steps:

  • Information retrieval (gathering the relevant documents)
  • Information extraction (unearthing information of interest from these documents)
  • Data mining (discovering new associations among the extracted pieces of information)

Source: Anandarajan M., Hill C., Nolan T. (2019) Introduction to Text Analytics. In: Practical Text Analytics. Advances in Analytics and Data Science, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-319-95663-3_1
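
To make the three steps listed above more concrete, here is a minimal Python sketch that runs them on a tiny in-memory corpus. Everything in it is an illustrative assumption rather than a method from the cited source: the sample documents, the term list, and the function names (retrieve, extract, mine) are all hypothetical, and simple keyword matching and co-occurrence counting stand in for the far richer techniques used in real projects. The sketch downloads nothing; it only works with text already held in memory.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical toy corpus; a real project would use texts you are licensed to analyse.
DOCUMENTS = {
    "doc1": "Aspirin reduces fever. Aspirin may irritate the stomach.",
    "doc2": "Ibuprofen reduces fever and inflammation.",
    "doc3": "The library opens at nine.",
    "doc4": "Aspirin and ibuprofen both treat inflammation.",
}

# Hypothetical list of terms we want to track.
TERMS_OF_INTEREST = {"aspirin", "ibuprofen", "fever", "inflammation", "stomach"}


def tokenize(text):
    """Split text into lowercase alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def retrieve(documents, query_terms):
    """Information retrieval: keep documents that mention any query term."""
    return {
        doc_id: text
        for doc_id, text in documents.items()
        if query_terms & set(tokenize(text))
    }


def extract(documents, vocabulary):
    """Information extraction: record which tracked terms each document contains."""
    return {
        doc_id: vocabulary & set(tokenize(text))
        for doc_id, text in documents.items()
    }


def mine(extracted):
    """Data mining: count how often pairs of terms appear in the same document."""
    pair_counts = Counter()
    for terms in extracted.values():
        for pair in combinations(sorted(terms), 2):
            pair_counts[pair] += 1
    return pair_counts


if __name__ == "__main__":
    relevant = retrieve(DOCUMENTS, {"aspirin", "ibuprofen"})
    extracted = extract(relevant, TERMS_OF_INTEREST)
    for pair, count in mine(extracted).most_common(3):
        print(pair, count)
```

In this toy setup the retrieval step discards the unrelated document, the extraction step reduces each remaining document to the tracked terms it contains, and the mining step surfaces the term pairs that co-occur most often, which is the kind of association a researcher might then examine more closely.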

Text Mining Using Library Resources

Most Library resources do not permit automated or systematic downloading of articles, or the creation of corpora for textual analysis. Using scripts or software to download content can cause the entire campus to lose access and can lead to the suspension of individual accounts. Please contact the Library before beginning a project so that we can advise you on your options.