This summer, I’ve been doing qualitative research on ways that Reddit moderators develop common interests as they face the company, as they face their subscribers, and as they relate to other moderators. Just in the top 20,000 subreddits by subscribers, Reddit has 50,790 moderators. This July, moderators of 2,278 subreddits joined a “blackout,” demanding better communication and improved moderator tools. The blackout is one moment in the wider research I’m doing, a moment where tensions and common cause rose to the surface. Blacked-out subreddits constituted 60% of the top 10 subreddits, 29% of the top 100, and 5% of the top 20,000 subreddits, representing a total of 134.8 million combined subscriptions.

Since I can only get so far by reading Reddit threads, I’m now interviewing Reddit moderators to learn more about your experience as a moderator and your experience of the blackout. If you are interested to talk, please message me on Reddit at /u/natematias.

Work In Progress: Charting the Reddit Blackout of July 2015

Since I’m also a software engineer and quantitative researcher, I’ve been complementing my qualitative work with data analysis on what I was able to collect from the public API, combined with /u/GoldTesting’s dataset of blackout participation. Mostly, I’ve used that data to decide where to look and who to reach out to. The conversations I found led me to think about several hypotheses I could also test statistically:

When moderators discussed the blackout with their subscribers, many debated the idea of “solidarity,” wondering if they were too small to have common cause with larger subs or if they were too small to make a difference. Others expressed strong opinions that joining the blackout meant standing with other moderators or standing for Reddit users as a whole.

The conversations I found led me to think about several hypotheses I could test statistically:

H1: Larger subreddits were more likely to join the blackout, maybe because their moderators were part of ModTalk, where much of the blackout was discussed, or because they felt a blackout would make a difference, or because they felt common cause with other mods of large subs.

H2: Subreddits with more moderators were more likely to join the blackout, perhaps mods in these subs would have greater solidarity with others.

H3: Subreddits with mods who also moderate other subreddits that participated in the blackout were more likely to join the blackout

To illustrate the data used for my statistical tests, here are two network graphs of shared moderators between subreddits. The first graph includes the top 20,000 subreddits in terms of subscribers (as of mid-June 2015). The graph one filters only subreddits with more than 10,128 subscribers. In the network graphs, subreddits that did not black out are tinted blue, while yellow-tinted subreddits joined the blackout.

The charts are laid out using the ForceAtlas2 layout on Gephi, which has separated out some of the more prominent subreddit networks, including the ImaginaryNetwork, the “SFW Porn” Network, and toward the center, the ShitRedditSays “fempire”. These networks are notable because some of them made network-wide decisions about their participation in the blackout.

Using this dataset, I conducted a logistic regression testing the above hypotheses.

H1: Larger subreddits were more likely to join the blackout. This hypothesis is supported. On average in the population of top 20k subreddits, there is a large positive relationship between the log-transformed subscriber count and a subreddit’s probability of joining the blackout, holding all else constant.

H2: Subreddits with more moderators were more likely to join the blackout. This hypothesis is supported, very very weakly. I wouldn’t make much of this.

H3: Subreddits with mods who also moderate other subreddits that participated in the blackout were more likely to join the blackout. This is supported. On average in the top 20,000 subreddits, there is a positive relationship between the log of moderator roles in other blackout subs and a subreddit’s probability of joining the blackout, a relationship that is mediated by the overall number of moderators shared with other subs, holding all else constant.

So, is there evidence of moderator “solidarity” ? Yes, if we consider H1 to be a test of solidarity associated with similar subscriber numbers, and if we consider H2 to be a test of solidarity related to the number of moderators one works together with, then yes, we see support for the solidarity hypothesis. However, my qualitative research shows that many subreddits voted on this issue, indicating that subscribers also matter to this picture. Furthermore, many mods of smaller subs also expressed solidarity, even if smaller subs were less likely to participate. So more work needs to be done.

CAVEATS: This is just a preliminary statistical test. I have much more work to do before publication:

I need to define better hypotheses that can answer theoretically-meaningful questions

I need to do much more work to confirm the validity of my data collection, data processing, and models

I need better definitions of “solidarity”

This needs to be peer reviewed

In particular, I plan to spend more time with network scientists to understand the best way to set up my dataset for statistical analysis. There are many ways to project a complex network onto a single table for statistical tests, and I may need to try a different approach. Note also that this model does not include time as a factor, and that I use the term “predict” to refer to statistical inference rather than some ability to predict participation in the blackout before it occurred.

I’m sharing these preliminary results because I hope they’ll attract interest from Reddit moderators, and hopefully lead me to more interviews and data while I still have time to talk to people and enrich my understanding of what happened. If you are a Reddit moderator and want to talk with me, please message me at /u/natematias.

boyd, d. 2011. “White Flight in Networked Publics: How Race and Class Shaped American Teen Engagement with MySpace and Facebook.” in Race After the Internet, edited by L. Nakamura and P. Chow-White. New York: Routledge.

Floridi, L. (2009) The Information Society and Its Philosophy: Introduction to the Special Issue on “The Philosophy of Information, Its Nature, and Future Developments”, The Information Society, 25, 153-158.

Fuchs, Thomas, and Ludger Woessmann. 2004. “Computers and Student Learning: Bivariate and Multivariate Evidence on the Availability and Use of Computers at Home and at School.” in CESifo Working Paper Series No. 1321. Munich: CESifo Group.

Livingstone, Sonia, and Ellen Helsper. 2007. “Gradations in Digital Inclusion: Children, Young People and the Digital Divide.” New Media & Society 9:671-696.

Livingstone, Sonia, and Ellen Helsper. 2010. “Balancing opportunities and risks in teenagers’ use of the internet: the role of online skills and internet self-efficacy.” New Media & Society 12:309-329.

Min, S.-J. (2010). From the Digital Divide to the Democratic Divide: Internet Skills, Political Interest, and the Second-Level Digital Divide in Political Internet Use. Journal of Information Technology & Politics. doi:10.1080/19331680903109402

Metzger, M. J. 2007. “Making sense of credibility on the Web: Models for evaluating online information and recommendations for future research.” Journal of the American Society for Information Science and Technology 58:2078-2091.

Wu, X. (2008). When Pirated Films Met the Internet: The Chinese Cultural Public Sphere of Movies in an Unorthodox Globalization. Paper presented at the 2008 Annual Conference of the International Association of Media and Communication Research. Stockholm, Sweden. July.

Zhang, Chan, Mario Callegaro, and Melanie Thomas. 2008. “More than the digital divide? Investigating the differences between internet and non-internet users.” in Annual Conference of the Midwest Association for Public Opinion Research. Chicago, IL.

Zheng, Y, (2009) “Different spaces for e-development? What we can learn from the Capability Approach”, Journal of Information Technology for Development, 15(2), p. 66-82

I’m delighted to be teaching an intro seminar for all the new Ph.D. students in my department’s graduate program. One of my goals is to give these students a place to talk about the environment of graduate school itself. How does getting a Ph.D. work? What do you need to know?

This task has made me reflective. At first I thought I should pass along readings that had been inspirational for me during grad school. That sure didn’t work. Here is the advice I apparently once loved:

I wrote the paper with which this book begins on a microcomputer. Though this first experience with one frightened me a little at first, writing soon seemed so much less work that I wondered how I had managed before. —Writing for Social Scientists, p. 151

Fortunately, these days every legitimate library has a copy machine, and each copy costs about a dime. —How to Write a Thesis, p. 86

The process of getting a Ph.D. is very old. Wikipedia claims the first Ph.D. was awarded in Paris in 1150. I thought Ph.D. advice would be more likely to stand the test of time.

These days you’ll find better dissertation advice on Tumblr. Or at least you’ll find some comic relief from Tumblrs like When in Academia…

(That’s some great tagging.)

The upshot is that it looks like a fair amount of the advice about how to get a Ph.D. has to do with the available communication technology of the time. Both the stuff that’s in everyday use, and also the scholarly communication infrastructure (which I’ve also blogged about recently).

Has anyone reading this ever attended a conference paper sale? (No, that’s not about buying pre-written term papers.) Or have you ever received an academic journal article “preprint request postcard?” Here’s an image of one:

So far I’ve come up with a list of things that seem to still be helpful. Caveats: I’m aiming to help the social science and humanities students interested in communication and information. Our first year students won’t be teaching yet, so I am not focusing on teaching with this list.

Becker, H. S. (2007). Writing for Social Scientists. Chicago: University of Chicago Press. — Don’t let the title of this book fool you, it is equally applicable to graduate students in the humanities and professional programs. I’m excerpting the following sections:

Strunk, W., Jr. & White, E. B. (2000). The Elements of Style. (4th ed.) New York: Longman. (Important: You must avoid any “Original Edition” or public domain reprint that does not include E. B. White as a co-author. The version without E. B. White is a different book.)

…however…

I see that it’s a list woefully lacking in anything like “social media savvy for Ph.D. students” or “How new forms of scholarly communication are changing the dissertation.” I’m sure there are other newish domains I’ve left out, too. What am I missing? Can anyone help me out? Please add a comment or e-mail me.

About Us

The Social Media Collective (SMC) is a network of social science and humanistic researchers, part of the Microsoft Research labs in New England and New York. It includes full-time researchers, postdocs, interns, and visitors. Beginning in 2009, the researchers who now lead the initiative are: Nancy Baym, danah boyd, Kate Crawford, Tarleton Gillespie, and Mary Gray. Our primary purpose is to provide rich contextual understanding of the social and cultural dynamics that underpin social media technologies. We use a variety of methodologies and span multiple disciplines.

Subscribe by Email!

Enter your email address to subscribe to the SMC blog. You'll get an email when we update the blog.