Whenever possible, the Yale University Library works with database vendors to include text and data mining (TDM) rights in license agreements. This means that for some databases (generally those with out-of-copyright materials), Yale researchers can access the raw text for the purposes of mining. Vendors that currently permit text and data mining on specific collections include: Adam Matthew, Gale, ProQuest, and others. For additional information on these agreements and which collections are available for mining, please visit the Text and Data Mining LibGuide.
What about material that hasn’t been digitized yet? The Digital Humanities Lab can provide tools for the creation of digital corpora for text and data mining purposes (rather than for preservation or personal archives) to researchers with current DHLab awards. For longer duration or all-purpose scanning, researchers may use the machines in the Nave of Sterling Memorial Library or in the lower level of Bass Library.
Digital humanities is a robust, rapidly developing area of research and teaching. To read the latest, stop by the DHLab and browse our collection of desk copies. The collection includes theoretical books, as well as ones that teach specific coding skills. For a complete listing of what we have, visit our LibraryThing page. If there is a book you need that you can’t get, contact the DHLab to see if we could order a copy.
Also, to help collate resources for reading up on past and current methods and inquiries in digital humanities, we’ve created a collaborative Yale Digital Humanities Lab Zotero account. The site contains links to DH blogs, projects, tools, and syllabi.