Data marketing for Web 3.0

I’ll recap what I said to accompany the above slides, a kind of running commentary like they have on DVD’s. I’ll put slide numbers in brackets, like [4] for slide #4, so you can follow along.

Marketing with Linked Data

I want to talk to you about [1] the emerging field of data marketing.

Web 3.0 means different things to different people [2]. Without launching into a big nomenclature debate — there’s something ironic in arguing the semantics of the semantic web — I’d humbly propose the following categories based on different waves of linking. After all, linking is what the web is all about:

Web 1.0 was about linking pages. The giants that emerged out of that wave included Google and Yahoo!, who took advantage of those linked pages to help us find great content.

Web 2.0 was about linking people. Social networking exploded, from sharing photos on flickr and videos on YouTube, to connecting with our friends and colleagues on Facebook, LinkedIn, and Twitter.

Web 3.0 will be about linking data. Who will be the leaders of this wave? Maybe you…

Data for customers

I’ll touch on 4 data marketing topics in this presentation [5]. Let’s begin with the big picture, thinking of data for customers.

Data helps us make decisions [6].

Data helps us impress our boss [7].

Data helps us persuade people [8].

Data is the fuel of the 21st century [9].

Yet despite its ubiquity in our lives — particularly our professional lives — finding the data we want in any given circumstance is hard work [10].

And then once we find it, we tend to stow it away in silos [11]. Spreadsheets. Documents. Databases. We end up with data that is dispersed, duplicated, out-of-date, and difficult to relate from one application to the next. To paraphrase the children’s rhyme: data, data, everywhere, but not a drop to drink.

But a new generation of Web 3.0 companies is working to make data “open” [12] — linkable and shareable. Freebase, Infochimps, Factual, Calais, Flowing Data, Wolfram Alpha. These companies are making it easier for us to find and use the data we want.

But they won’t have a monopoly on linked data [13]. The beauty of semantic web standards, such as RDF and OWL, are that they will enable any web site to publish structured data and link it to other sets of related data, anywhere on the web. Just like we all did with HTML. Linked data is the great wave of Web 3.0.

Now, marketers love data [14]. We may not be database architects — at least not yet — but we appreciate information: market research, web analytics, CRM records, social media monitoring. We know the value of good data.

As Sting said [15], “If you love something, set it free.” (This may be the first — and last — time someone cites Sting for explaining the semantic web.)

Set data free.

The essence of Web 3.0 for marketers is to think about data for customers, not just data about customers [16]. Shift from data stalking to data talking. What data could you share with your prospects, customers, and partners to strengthen your relationships, build your brand, and win more business?

Data + search = SEO++

One of the first places data can help is with search marketing [17].

The web’s a big place [18]. Carl Sagan would say, “billions and billions” of pages. Just doing a search for a specific restaurant by name in a specific city can return 80,000 hits. It can be hard for us — or our customers — to find answers.

Yahoo! was one of the first major search engines to start leveraging structured data from web sites in their search results with SearchMonkey [19]. When you do a search for a restaurant — such as “slanted door san francisco” — you might see a particularly useful summary from Yelp, right in the results. Aggregate rating, number of reviews, address, phone number, a photo of the interior.

That’s a heck of a lot more useful than a random line or two of text.

Google does something similar with their Rich Snippets feature [20]. As does Bing [21].

As a marketer, the benefit of adding search engine friendly structured data to your web site is simple: more traffic [22]. Yahoo! reported that search results that embedded these useful snippets of structured data received 15% more click-throughs than the competition.

Although the search engines claim adding structured data to your web site won’t directly improve your ranking [23], to prevent spammers from gaming the system with bogus data, there is a virtuous cycle here. A better listing gets you more clicks, more visitors — some percentage of whom will link back to your site, thereby improving your ranking, which in turn wins you even more clicks.

To clarify, the data bubbling up in these search results was published by Yelp [24]. Yelp is a classic example of a company that has a value proposition rooted in data. They offer extensive data about restaurants — featuring the collective intelligence of over 9 million user reviews — that makes them a data authority in the subject.

The value of that authoritative brand position was estimated at over $500 million in recent M&A rumors.

Adding structured data to your web site is pretty straightforward, using microformats and RDFa [25]. If you’re not technical, just think of this as extended HTML.

Embedding this kind of data in your web site is a natural extension of search engine optimization (SEO) [26]. Whoever is doing your SEO now should be able to help with this. But since structured data takes SEO a little further, I call this SEO++ — a mix of SEO and data objects. It’s reminiscent of the paradigm shift to object-oriented programming, from C to C++.

They used a more advanced vocabulary called GoodRelations [28] to more fully describe their products. GoodRelations can describe components of products, accessories, delivery options, payment options, warranties — just about anything you could want for defining e-commerce offerings in a structured format.

The payoff, according to Best Buy, was a 30% increase in site traffic from organic search [29].

This is why search is a great place to dip your toe with data marketing. It provides a clear business incentive — more traffic to your web site — and there’s an existing budget to invest in it [30]. This year over $2 billion will be spent on SEO; within 4 years, that will be more than $5 billion.

Data branding

Now let’s look beyond search and think more broadly about data marketing and what it can mean for your brand [31].

Josh Jones-Dilworth, who handles PR for many of the leading companies in the semantic web and linked data space, wrote an excellent article on Mashable, Marketing in 2010: It’s All About the Data. He starts with this brilliant insight [32]:

Data shapes conversations and markets.

That is the essence of data branding — positioning your company through the data you create and publish — and, in a linked data world, how you relate that data to others.

In data branding, you want to ask yourself three questions:

1. What data, if widely disseminated, would grow my market? [33]

For example, consider Google Insights for Search [34]. Google’s “database of intent” is arguably their most valuable asset. But rather than lock it in a vault, they let anyone mine this data for trends. With their Keyword Tool you can even drill down into specific number of impressions for all related keywords. This is smart, as it helps get marketers more engaged in search. Ultimately, this data encourages more — and more successful — search advertising.

2. What data would position you as a leading authority in your field? [35]

The New York Times is struggling, like all newspapers, to find their place in the digital world. But Evan Sandhaus, the paper’s resident Semantic Technologist, is forging an exciting strategy [36]: to have the “paper of record” become the authoritative data source of record — at least for the paper’s areas of expertise with news, public figures, politics. They have over 150 years of original content to draw upon. Check out data.nytimes.com and developer.nytimes.com to glimpse this future.

But they’ll have competition. Thomson Reuters also has claim to being a data authority, particularly with companies, law, markets, and finance [37]. And they’ve got a strong competitive advantage with their Open Calais technology to help web developers leverage that data quickly and easily.

CNET is a data authority with product reviews. Yelp is a data authority with restaurant reviews. Google is a data authority with search behaviors. What will your company be a data authority for?

3. What data vocabularies (ontologies) will define your market? [38]

To unlock the real power of linked data, you must do more than publish structured data [39]. You need to use vocabularies — or, more technically, ontologies — that other people will recognize and use to publish or relate their data too. The interlinked web of data is the real vision.

For example, there’s a well-established vocabulary for social network data — friend of a friend (FOAF) — and a very popular vocabulary for publishing called the Dublin Core. GoodRelations, which we saw earlier, is another.

But for most areas of expertise and most domains of knowledge — probably many in your specific market — there aren’t well-defined vocabularies. Yet. You have an opportunity to be the first mover, to shape the nomenclature and structure of information in your space.

Reflect for a moment on the brand potential here, defining the structure of shared data as a way of shaping the use of that data in your market — in ways that are particularly synergistic and relevant to your position.

Engaging in such data branding initiatives will involve a new ecosystem of players [40] — developers and popular data applications. Search engines are one example of a data application, but there will be hundreds, thousands as the semantic web blossoms.

Data business models

Finally, I’d like to mention a bit about business models for data [41].

Implementing linked data doesn’t happen automagically. Companies need to think about how they can get return on such investments [42]. We briefly discussed this with SEO++, but there’s a wide range of possibilities.

Once you realize that data has tremendous value — and that the interoperability of linked data will only increase its usefulness and accessibility — you can start thinking creatively of many ways to capture that value.

That’s going to be one of the great adventures in marketing for this next decade.

Comments

What happened to DBpedia re. your slides? Freebase and DBpedia mutually inclusive variants of the same thing: Linked Open Data Spaces.
Remember, Calais, New York Times, BBC, and many other service providers in the Linked Open Data realm tap into DBpedia.
Put bluntly, Linked Open Data is the House that DBpedia built 🙂
Links:
1. http://dbpedia.org/About
2. http://dbpedia.org/resource/DBpedia .

Hi, Kingsley.
You’re right — DBpedia absolutely should be there. My mistake. I originally started that slide with Infochimps in mind, specifically to use them as an example of a “data” company, but not a LOD company per se. I wanted to demonstrate how different ventures were heading towards a common destination from very different origins. But then the slide started to grow, and my heuristic for selecting logos broke down into a random function.
I’ll make sure to give DBpedia its due credit in the next one!

Hello, thank you for your presentation that s helpfull and clarify a lot the ways that companies and professionals will be concerned by the semantic technologies. Im french so i apologize in advance for my english.
I ve just a different point of view regarding the page 23 titled “catalyse a virtuous cycle”. I ve writed a doc for one of my client to answer to his question “why is it important to consider SW for our referencement?” and i ve explained that : “more detailed informations in the SE helps people make a better decison and while the click is more qualitative then the rebounds decrease and the time spent on the site grow up and then the rank is higher. The advantage is quality and not quantity.
Thank you again, Ghalem http://blog.eoweo.com

Very good presentation. I esp liked the 42nd slide on the 8 business models.
Would LOVE to re-blog but it seems I can only use a TypePad blog. My existing blog uses another host. Can bloggers like you who are customers of TypePad encourage them to ‘open’ their tool to other blog hosts so that people like me can indeed re-blog your very interesting post?