Historical research now crucially involves the acquisition and use of digital sources. In History 9877A, students learn to find, harvest, manage, excerpt, cluster and analyze digital materials throughout the research process, from initial exploratory forays through the production of an electronic article or monograph which is ready to submit for publication.
In this course you will learn to apply techniques that are currently used by fewer than one percent of working historians. Computation won’t magically do your research for you, but it will make you much more efficient. You can focus on close reading, interpretation and writing, and use machines to help you find, summarize, organize and visualize sources.
Prerequisites, Workload, Blogging and Evaluation
There are no prerequisites for the course other than a willingness to learn new things and the perseverance to keep working when you’re confused or when you realize that you could spend a lifetime learning about the topics and technologies that we will cover in class, and still not master them all. Students will come into the course with very different levels of experience and expertise. Some, probably most, will be familiar only with the rudiments of computer and internet use. A few may already be skilled programmers.
This course also requires that you spend at least a little bit of time each day (say 20-30 minutes) practicing your new skills. It’s a lot like learning a new language, learning to play a musical instrument or going to the gym. It is going to be hard at first, but be patient with yourself and ask a lot of questions. With daily practice, you will soon find ways to do your research and coursework faster and more efficiently. If you can’t commit to regular practice, however, you should probably not take this course. The techniques that you learn in this class build cumulatively week-by-week. In addition to regular practice, it is essential that you attend every meeting of the class and do the readings carefully.
Every student in the class will have an academic blog and will be required to make weekly posts to it. These entries do not have to be long (300-500 words per week is ample). The use of blogging is to encourage you to engage in ‘reflective practice,’ that is, to force you to think about your learning and research as you are doing it. It also provides me with feedback for how the course is going. You can use each week’s blog entry to talk about what you learned, things that were clear or not, things you would like to know how to do, and so on.
If you do not already have a professional blog you will need one. Before the first class you should go to either WordPress or Blogger (not both) and create an account and a blog. If possible, create the blog under your own name; if not, choose something professional sounding. Post an introductory message about yourself and then send me the URL of your blog so that I can add you to the course blogroll for History 9877A.
You will be graded on your participation in class (20%) and on your reflective blogging (80%). There will be no midterm or final examinations, and no final paper.
Text and Requirements
There is one required text for this course, which is free. Download the Mathematica notebook version to use with this course.
Turkel, William J. Digital Research Methods with Mathematica. 2015.
To get the most out of this class, you will need a Windows, Mac or Linux laptop, which you should bring to every class.
You should purchase a desktop license for Wolfram Research’s Mathematica software. (Don’t let the name scare you, you won’t need any particular training in mathematics to do well in the class).
You can purchase the license for the semester (US $45), for the year (US $70) or permanently (US $140). If you are unwilling or unable to purchase the software, please do not take this course.
Students and Auditors
- Amenta, Annette
- Bozzo, Nina
- Clarke, Sam
- Comstock, John
- Craig, Vanessa
- Horkoff, Mitchell
- Johnson, Tristan
- Joubert, Carla
- Keelan, Geoff
- Komarnitsky, Nick
- Makarow, Philippe
- McKee, Taylor
- Medeiros, Vanessa
- Muhammedi, Shez
- Paterson, Joanne
- Rayment, Erik
- Sawatsky, Aaron
- Start, Mary Beth
- Sviezhentsev, Maksym
- Usher, Emily
- Weller, Alexandra
- 01. Sep 16.
Slides. 1A: Introduction to Digital History, 1B: Word Frequency.
Readings. Introduction, What You Need to Know to Get Started, Begin Chapter 01: Analyzing Text.
- 02. Sep 23.
Slides. 2A: Text Search, 2B: N-gram Frequency.
Readings. Finish Chapter 01: Analyzing Text.
- 03. Sep 30.
Slides. 3A: KWIC, 3B: Pattern Matching.
Readings. Begin Chapter 02: Pattern Matching.
- 04. Oct 07.
Slides. 4A: Capitalized Phrases, 4B: Collocations.
Readings. Finish Chapter 02: Pattern Matching.
- 05. Oct 14.
Slides. 5A: Associations, 5B: Named Entities.
Readings. Chapter 03: Who and What.
- 06. Oct 21.
Slides. 6A: Timelines, 6B: Maps.
Readings. Chapter 04: When and Where.
- 07. Oct 28.
Slides. 7A: Batch Downloading, 7B: Corpus Search.
Readings. Begin Chapter 05: Information Retrieval.
- 08. Nov 04.
Slides. 8A: Document Vectors, 8B: TF-IDF.
Readings. Finish Chapter 05: Information Retrieval.
- 09. Nov 11.
Slides. 9A: Markup Languages, 9B: Scraping.
Readings. Begin Chapter 06: Internet Sources.
- 10. Nov 18.
Slides. 10A: Page images and OCR, 10B: Image Processing.
Readings. Finish Chapter 06: Internet Sources.
- 11. Nov 25.
Slides. 11A: Identifying and Classifying Images, 11B: Photogrammetry and Georectification.
Readings: Chapter 07: Image Processing (only read sections with explanatory text).
- 12. Dec 02.
Slides. 12A: Application Program Interfaces (APIs), 12B: Entity Network Spidering
- 13. Dec 09.
Slides. 13A: JSTOR Data for Research.