The dataset I’ve chosen to work with is the Albany Muster Rolls 8th Militia dataset. It is a census of recruits for the Revolutionary war in the early 1760s, and is a textual dataset. There are a total of 944 men listed in this dataset, with 13 categories filled out. These categories include last name, first name, enlist date, age, where the recruit was born, what their previous occupation was, whose command they were under, their physical attributes, and what volume and page their information could be found on in the physical text.