Machine Reading the Archive

Machine Reading the Archive 2019 - end of programme workshop This public workshop will mark the end of the 2019 programme of Machine Reading the Archive, a digital methods development programme organised by Cambridge Digital Humanities with the support of the Isaac Newton Trust and the Researcher Development Fund. It will...

Digital Mapping for Historians This intensive workshop will provide an overview of a range of applications of digital mapping in historical research projects and introduce GIS tools and software. No prior knowledge is required. Participants will need to bring a laptop. Please register here.

Archival Photography: An Introduction This session focusses on providing photography skills for those undertaking archival research. Dr Oliver Dunn has experience spanning a decade filming documents for major academic research projects. He will go over practical approaches to finding and ordering materials in the archive...

Sources to Data (Workshop) This workshop will examine database creation from historical documents. Extracting data from these can be hard work and calls for an unusual combination of skills. You may need to digitise and transcribe from primary sources, and then design and build a database from scratch with the information...

Introduction to Text-Mining with Python 2 This session will introduce topic modelling, which looks for clusters of words that summarise the meaning of documents. We will talk about how to choose the sort of text mining that suits your research. Some knowledge of Python is required, as gained from '...

Introduction to Text-Mining with Python 1 This session will introduce basic methods for reading and processing text files in Python. We will walk through an example that reads in a large text corpus, splits it into tokens (words) and sentences, removes unwanted words (stopwords), counts the words (frequency analysis), and...
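As a rough illustration of the steps named in this session description (splitting into sentences and tokens, removing stopwords, counting words), here is a minimal sketch using only the Python standard library. It is not the workshop's own material: the function names, the tiny stopword set, and the sample text are all assumptions made for this example, and the regex-based splitting is deliberately naive.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "it"}


def split_sentences(text):
    """Naively split text after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text)
    return [p.strip() for p in parts if p.strip()]


def tokenize(text):
    """Lowercase the text and return runs of letters/apostrophes as tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(text, stopwords=STOPWORDS):
    """Count tokens that are not in the stopword set."""
    return Counter(t for t in tokenize(text) if t not in stopwords)


# Made-up sample text standing in for a real corpus file.
sample = "The archive is large. The archive holds letters and maps. Maps are useful!"
sentences = split_sentences(sample)
freqs = word_frequencies(sample)
```

For a real corpus, the sample string would be replaced by text read from a file, and a dedicated library would normally handle tokenisation and stopwords more robustly than these regular expressions.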

Optical Character Recognition is a term used to describe techniques for converting images containing printed or handwritten text into a format that can be searched and analysed computationally. This workshop will introduce several such tools along with some practical techniques for using them, and will also highlight OCR...

This workshop will examine strategies for transforming a variety of sources, ranging from crumbling manuscripts to printed documents and books, into structured digital data. Tutor: Oliver Dunn. Level: Introductory / Foundation. For more information and to book, click here. This course is open to Cambridge University research...