I love the smell of data in the morning!
Interesting and surprising statistics about digital media and devices. Compiled & curated by Dan Calladine, Aegis Media - dan.calladine@aemedia.com -
All views expressed are my own. Please email me if you have any queries, amendments or suggestions. Follow me on twitter - I'm @dancall

Tuesday, 29 March 2011

50% of tweets with links are generated by just 20,000 twitter users

"We study several longstanding questions in media communications research, in the context of the microblogging service Twitter, regarding the production, flow, and consumption of information. To do so, we exploit a recently introduced feature of Twitter---known as Twitter lists---to distinguish between elite users, by which we mean specifically celebrities, bloggers, and representatives of media outlets and other formal organizations, and ordinary users. Based on this classification, we find a striking concentration of attention on Twitter---roughly 50% of tweets consumed are generated by just 20K elite users---where the media produces the most information, but celebrities are the most followed."
Source: Shaomei Wu, Cornell University, Jake M. Hofman, Yahoo! Research, Winter A. Mason, Yahoo! Research, Duncan J. Watts, Yahoo! Research, 'Twitter Flow' 28th March 2011
(Includes a link to download the full research report)Note 1 - Why just just looked at tweets with links (& in fact just tweets with B.t.ly Urls:
From the full pdf:
"In addition to the follower graph, we are interested in the content being shared on Twitter|particularly URLs|and so we examined the corpus of all 5B tweets generated over a 223 day period from July 28, 2009 to March 8, 2010 using data from the Twitter \ rehose," the complete stream of all tweets2. Because our objective is to understand the flow of information, it is useful for us to restrict attention to tweets containing URLs, for two reasons. First, URLs add easily identi able tags to individual tweets, allowing us to observe when a particular piece of content is either retweeted or subsequently reintroduced by another user. And second, because URLs point to online content outside of Twitter, they provide a much richer source of variation than is possible in the typical 140 character tweet. Finally, we note that almost all URLs broadcast on Twitter have been shortened using one of a number of URL shorteners, of which the most popular is http://bit.ly/. From the total of 5B tweets recorded during our observation period, therefore, we focus our attention on the subset of 260M containing bit.ly URLs."

Note 2 - According to the authors of the report, in an email to me in response to my queries, 'attention' is actually defined as being potentially exposed to tweets. So if I follow @aplusk I am paying attention to his tweets. It's a bit of a shame that they've used this very low measure of involvement, rather than to look at clicks, re-tweets or replies, in my opinionSo in fact the title should say "50% of the potential impressions generated by tweets with links are generated by just 20,000 users"