Data wrangling
Principal investigator (PI)
Anna Shechtman, Graduate student, English Literature and Film & Media Studies
Job description
The PI is looking to hire a computer programmer to work part time (10-20 hours a week) from May through August at a rate of $20/hour. The work will support a digital humanities project that tracks the evolution of language and ideas in a large corpus of American magazines, 1950-2000. Along with contributing to cutting-edge research in digital humanities and cultural analytics, the selected applicant will have a chance to learn more about machine learning and natural language processing. They will also have a chance to build up CS project credentials, since the code produced may be used by a large number of later users.
Tasks
The selected applicant will be asked to:
- Write code in a virtual machine environment powered by Ubuntu
- Parse and wrangle large amounts of textual data (books), turning largely unstructured data into highly structured data
- Run “off the shelf” algorithms on this data to produce output
Qualifications
An applicant will ideally be:
- a highly proficient programmer in a commonly used language; the PI’s preference is for coding to be done in Python
- comfortable working in Ubuntu
Familiarity with text mining and machine learning is not required but is a plus; the selected applicant will not be writing algorithms from scratch.
Job details
Pay rate: $20/hour
Contact: Anna Shechtman
This project is funded with a Digital Humanities Lab Seed Grant.