Computer scientists learn to predict which photos will go viral

Apr 04, 2014 by Tom Abate

(Phys.org) —It's hard to predict which of the many millions of photos on Facebook will spring from obscurity and go viral. But Stanford researchers have found some hints by studying "cascades," the term used to describe photos or videos being shared multiple times.

"It wasn't clear whether information cascades could be predicted because they happen so rarely," said Jure Leskovec, assistant professor of computer science.

According to data provided by Facebook scientists in a recent collaboration with university scientists, only 1 in 20 photos posted on the social network gets shared even once. And just 1 in 4,000 gets more than 500 shares – a lot but hardly an epidemic.

"It is very hard to quantify what going viral means," said Leskovec. "Anyone would say 'Gangnam Style' went viral, but that's a singular event," he said, referring to the YouTube video that has been viewed almost 2 billion times.

In a paper being presented at the International World Wide Web Conference, the team of scientists, including Leskovec, Stanford doctoral student Justin Cheng, Facebook researchers Lada Adamic and P. Alex Dow, and Cornell University computer scientist Jon Kleinberg, will describe how they accurately predicted, 8 out of 10 times, when a photo cascade would double in shares; that is, if a photo got 10 shares, would it get 20? If it got 500, would it reach 1,000, and so on?

They began by analyzing 150,000 Facebook photos, each of which had been shared at least five times. The data were stripped of names and identifiers to protect privacy.

A preliminary analysis of those photos revealed that, at any given point in a cascade, there was a 50-50 chance that the number of shares would double.

The scientists then looked for variables that might help them predict doubling events more accurately than a coin toss, including the rate and speed at which photos were shared, and the structure of sharing (photos reposted in multiple networks proved to create stronger cascades).

After factoring several criteria into their analysis the computer scientists were able to accurately predict doubling events almost 80 percent of the time.

Their algorithm became more accurate the more times a photo was shared. For photos shared hundreds of times, their accuracy rate approached 88 percent.

The speed of sharing was the best predictor of cascade growth. Simply analyzing how quickly a cascade unfolded predicted doublings 78 percent of the time.

How a photo was shared – scientists call this the structure of the cascade – was the next best predictive factor. Photos that spread among different friendship networks or fan groups indicated a breadth of interest. Structure proved 67.1 percent accurate at predicting doubling when used alone.

But the researchers found no simple trick to ensure widespread sharing.

"Even if you have the best cat picture ever, it could work for your network, but not for my boring academic friends," Leskovec said, adding, "You have to understand your network."

In the next five minutes, roughly a half-million photos will be shared online. Embarrassing or not, many are only intended for a certain audience—family, or friends maybe—not the whole world. And yet, relatively few will ...

A new algorithm designed at the University of Toronto has the power to profoundly change the way we find photos among the billions on social media sites such as Facebook and Flickr. This month, the United ...

Recommended for you

Using complex computational algorithms based on repeated sampling, a University of Arkansas graduate student helped spearhead a project to develop a computer simulator that provides customized electron beam ...

A team of researchers at Google has been looking into ways to change the way links are retrieved by its famous search engine—instead of ranking them based on popularity, the researchers are looking into ...

Visitors to interactive virtual worlds want the ability to significantly affect the outcome of a story, but authoring these digital experiences is extremely complex. A new platform developed by Disney Research ...

If you live in a city, you know that a fair amount of your movement around town is social in nature. But how much, exactly? A new study co-authored by MIT researchers uses a novel method to infer that around ...

Social networks such as Facebook, LinkedIn and Twitter play an increasingly central role in our lives. Centrality is also an important concept in the theory of social networks. Centrality of an individual, called a "node" ...

User comments : 0

Please sign in to add a comment.
Registration is free, and takes less than a minute.
Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.

Javascript is currently disabled in your web browser. For full site functionality, it is necessary to enable Javascript.
In order to enable it, please see these instructions.