Earlier this month, we presented the results of the pilot phase of VerbCorner -- our citizen science project probing the nature of linguistic structure -- at a scientific conference (the Workshop on Events in Language and Cognition). You can see the poster describing the work here.

For those who don't know or don't remember, in VerbCorner, we're trying to work out the grammar rules that apply to verbs. Why do you say Agnes looked at the wall but not Agnes saw at the wall? Why do you say Bart filled the glass with water but not Bart poured the glass with water? Many -- but not all -- linguists believe that grammatical idiosyncrasies are explained by the meanings of the verbs, but evidence is sketchy. Volunteers have been visiting our website to help analyze the meanings of verbs so we can find out.

High-Quality Analyses by Volunteers

Our initial work -- the pilot for the pilot, if you will -- suggested that we could get high-quality analyses from volunteers. But that was based on a very small sample. As of late Feb, over 10,000 volunteers had contributed over 525,000 analyses. In general, the agreement between different volunteers was pretty high -- which is a good sign. Just as importantly, we had a smaller set of 'test' items, for which we knew what professional linguists would say. When we combine the analyses of different volunteers for the same sentence in order to get a 'final answer', the results match the analyses of professional linguists very well. This shows that we can trust these results.

Where Quantity Becomes Quality

Just as importantly, we were able to analyze a lot of sentences. In the VerbCorner project, we are trying to determine which sentences have which of a very specific set of aspects of meaning. One aspect is whether the sentence involves something changing physical form (example: Agnes broke the vase as opposed to Agnes touched the vase). Another aspect is whether the sentence involves anything applying physical force to anything else (ex: Agnes pushed Bart as opposed to Agnes looked at Bart).

For purposes of bookkeeping, let's call one aspect of meaning for one sentence an 'item.' After combining across different volunteers, the results were clear enough to definitively code 31,429 items. This makes VerbCorner the largest study of it's kind by far. (A typical study might only look at a few hundred items.)

This quantity makes a big difference. Given how small studies usually are, they can only look at one tiny corner of the language. The problem is that that corner might not be representative. Imagine studying what Americans are like by only surveying people in Brooklyn. This tends to lead to disagreements between different studies; one linguist studies "Brooklyn" and another studies "Omaha", and they come to very different conclusions! Unfortunately, language is so complex and so vast, one person can only analyze one corner. This is why we are recruiting large numbers of volunteers to help!

The results

One major question we had was how much the rules of verb argument structure (that is, the kinds of grammatical rules described above) depend on meaning. Some linguists think they depend entirely on meaning: If you know the meaning of a verb, you know what its grammar will be like. Others think meaning has very little role to play. Most linguists are probably somewhere in the middle.

The results suggest that the first group is right: These rules depend almost entirely on meaning. Or maybe even entirely; it's so close it is hard to tell.

The reason I say "suggest," however, is that while we have the biggest study of its kind, it still only covers about 1% of English. So we've gone from studying Brooklyn to studying all of NYC. It's an improvement, but not yet enough.

This is why I called this first phase a "pilot". We wanted to see if we could get high-quality, clearly-interpretable results from working with volunteers. Many researchers thought this would be impossible. After all, linguists have to go through a lot of schooling to learn how to analyze sentences. But a key finding of the Citizen Science movement is that there are a lot of smart enthusiasts out there who may not be professionals but can very much contribute to science.

The next phase

We have set a goal of reaching 50,000 completed items by July 1st. That will require upping our game and increasing the rate at which we're analyzing items by almost 4x. But the beauty of Citizen Science is that this does not really require that much work on anyone's part. If 3,000 volunteers each spend about one hour contributing to the project, we'll more than hit that goal. So please help out, and please tell your friends. You can contribute here.

It is exciting times at GamesWithWords.org as we settle into our new digs at Boston College. The brick-and-mortar lab now has a name: the Language Learning Laboratory @ Boston College (L3@BC). As we build out the group, expect to see a lot more activity around the site, including new features, projects, etc. Speaking of, we are hiring a research assistant. See the posting below:

The brand-new Language Learning Laboratory at Boston College is recruiting a full-time research assistant. The research assistant will work closely with the PI (Dr. Joshua Hartshorne) and graduate students in the lab. Primary responsibilities will include coordinating the lab's crowdsourcing and citizen science activities. For example, over 10,000 volunteers have contributed over 500,000 linguistic judgments as part of the laboratory's VerbCorner project. The research assistant will help coordinate these volunteers for this and other similar projects. S/he will also manage undergraduate researchers working on these projects and engage in public outreach activities such as blogging or creating educational materials. S/he will assist in data-analysis and have the opportunity to attend and present at major scientific conferences.

Candidates should have an undergraduate degree in psychology, neuroscience, linguistics, computer science, or a related field (or a good explanation as to why they are qualified anyway). Candidates should also have familiarity with one or more computer programming languages (e.g., Python, R, Matlab, C++) or an exceptional quantitative background (i.e., degree in mathematics). Experience with any of the following would be an added advantage: laboratory research, data analysis, management/supervision, science outreach, journalism, machine learning.

Review of applications will begin immediately. Start date is flexible but not later than 9/1/2016. Members of groups underrepresented in science are particularly encouraged to apply. International candidates are welcomed but must have an MA or equivalent.

To apply, send to l3atbc@gmail.com: a CV and a one-page essay explaining why you are interested in the position and how it fits with your past experiences and future goals. Please also arrange for letters of recommendation from 2-3 references to be sent to l3atbc@gmail.com.

Please be sure that your CV lists your degree(s), major/minor, GPA, any relevant classes (psychology, linguistics, computer science, etc.), programming languages with which you have experience (and the nature of that experience), and any other experiences/qualifications you feel are particularly relevant.

About

The focus of lab and blog is language -- what it is, how we understand it, and what we can do with it. At the blog, we discuss research, findings and controversies. At the lab, we try to create new research, findings and controversies.