In 10 years of fixing other people's SQL databases, I've noticed that the less the original developer knew, the more complex the databases are ... and the more complex the problems. Here I offer a refreshing approach for simple SQL database design.
Read more.

Scaling is a perennial problem. One day you are happily serving 10,000 users and suddenly that pesky CNN picks you on you and you have to deal with a million users. It isn't all about putting the latest hardware, more disk or more RAM. Scaling is a subtle art of discovering pain points in the application and using various Open Source software and technologies to get you to where you want.
Read more.

Apache CouchDB can serve complete web apps, without a middle-tier application server. Because these apps can be deployed to any running CouchDB node (including user's local machines), they present potential for end-user innovation, but because of view source but also through peer based replication. We'll learn to use the CouchApp JavaScript and HTML framework to build sharable applications.
Read more.

This tutorial will show you how to get started with Gearman, the flexible job queuing system used to power websites such as LiveJournal and Digg. We'll cover common architectures, installation, APIs, and deployment. A few use cases will be described and built, including a Map/Reduce cluster and database-driven URL mining application.
Read more.

The GeoDjango project provides a set of extensions to the python Django framework that allows for the easy and rapid development of spatially enabled applications. Using GeoDjango's model-driven design methods, PostGIS's spatial database extensions to PostgreSQL, and OpenLayers, we will explain and demonstrate how to build powerful spatially enabled applications.
Read more.

Protecting your data, by any and all means possible, is no longer an option. Rather, it is mandated by today's security conscious management. This tutorial will demonstrate a hands on methodology of using the latest encryption and cipher technology available in PostgreSQL. Following best condoned practices used in the industry today, PostgreSQL can be used to manage your data securely.
Read more.

Semantic Technologies provide a simple, standardized methodology for representing, combing and sharing data and serve as the foundation for creating communities of open data. These technologies are both easy to learn and easy to use. This tutorial will introduce you to semantic programming using a variety of open source tools and programming techniques that you can use on your projects today.
Read more.

Ubuntu One isn't just a set of services for Ubuntu, it's a platform for you to build your own services too. Stuart Langridge explains the APIs Ubuntu One offers to developers and shows some examples of applications you could build that take advantage of storage in the cloud and synchronised databases for your apps: build your own on the desktop or the web to work collaboratively with Ubuntu One.
Read more.

Many people know how to use memcached, the popular caching system powering much of web1+. Most folks, though, don't know how not to use it, and how improper usage can cause data problems, poor site/application performance, and an incredibly grumpy DBA. Learn what memcached is good for, and what it's not good for from those that have learned the wrong way.
Read more.

Come learn the fundamentals of how to leverage Gearman, the open-source, distributed job queuing system. Originally designed to scale LiveJournal.com, Gearman is now faster than ever and can help you build your own scalable applications. Gearman's generic design allows it to be used as a building block for almost any use - from speeding up your website to building your own Map/Reduce cluster.
Read more.

Sphinx Full Text search engine became increasingly popular over years powering search for number of Alexa 100 sites as Craigslist and NetLog. Sphinx combines powerful full text search features with ease of use and high performance. Being specially designed for indexing database content it is natural fit for modern database powered web sites.
Read more.

YAML is the serialization language that enables sharing of complex data between Perl, Python, Ruby, PHP and Java. It does it so in a human friendly manner. Many popular frameworks use YAML, including Ruby on Rails.
In this talk, Ingy döt Net, one of the authors of the YAML specification, will show you how to share data objects not feasible by JSON or XML.
Read more.

How do you choose the right filesystem for your database management system? Administrators have a variety of filesystems to choose from, as well as volume management and hardware or software RAID. This talk will examine how different the performance of filesystems really are, and how do you go about systematically determining which configuration will be the best for your application and hardware.
Read more.

SQL is from Mars, Objects are from Venus.
This talk is for software developers who know SQL but are stuck trying to implement common object-oriented structures in an SQL database. Mimicking polymorphism, extensibility, and hierarchical data in the relational database paradigm can be confusing and awkward, but they don't have to be.
Read more.

CouchDB's web API and offline replication capabilities make it ideally
suited to power a sea-change in the relationships between users and
service providers. I'll talk about the benefits and challenges of the
P2P web as well as give a brief overview of the technologies that make
CouchDB an "obvious" extension to the current architecture of the web.
Read more.

In this panel talk a number of core Drizzle developers will explain where development sits today, critical tools involved, best practices that were used to get here, and how a vibrant open-source developer community has been built.
Read more.

Using JRuby, apps created with Ruby frameworks like Rails or Merb can now be deployed to Google's highly scalable infrastructure. This talk, will provide an overview of App Engine, with attention to current features and apis. We will also show some demos, including deployment to the production environment, and provide some insight into (and best practices for) using the App Engine Datastore.
Read more.

Nearly all Web Applications need persistent solutions to be effective. For Perl and Ruby, the choice is generally "use an Object-Relational Mapper to put data into an SQL database", but with Smalltalk's object model, pure-object storage is also available as an option. We'll look at ORM and Object solutions for web apps built with Seaside, including a few commercial solutions like GemStone/S
Read more.

Nowadays, data is everywhere: databases, spreadsheets, the web...if only we could access it at on time, at the right place, in the right form...
Turning data into information is a struggle. Like diamonds are mined and cut to create jewels, so must data be extracted and transformed to create information.
Learn how the open source data integration tool Kettle helps to fight your data dragons.
Read more.

PostgreSQL 8.4 is the first Open Source database management system to handle trees and lists using SQL:2008-compliant Common Table Expressions and Windowing functions. You'll learn how these work, see intriguing examples, and walk out ready to use them to your advantage.
Read more.

A graph db stores data in a network structure rather than in relational tables. This model is well suited for many web use cases such as tagging, metadata annotations, social networks, wikis and other network-shaped or hierarchical data sets. This talk will introduce Neo4j: a high-performance, transactional open source graph db, which frequently outperforms RDBMSs with >1000x for such use cases.
Read more.

A storage engine for MySQL based on the InnoDB storage engine, designed to better scale on modern hardware, and including a variety of other features useful in high performance environments. It is fully backwards compatible, and so can be used as a drop-in replacement for standard InnoDB.
Read more.

You unit test your application API. You unit test your presentation layer. You write integration and acceptance tests. But your database is tested only as a side-effect to testing everything else. That's a pretty important part of the stack to just leave to the assumption it works as expected!
Come to this talk to learn about the tools that enable integrated unit tests for your database.
Read more.

Besides MySQL release officially available from Sun there are multiple patches and extensions developed by community. In this Presentation we will look into them to see what extra features patches from Google, Percona and OurDelta offer and how can you use them to make your MySQL life more fun.
Read more.

We have embarked on a mission to share more of what we do on the development side of The Times. So far, we’ve done that via conference presentations, open-source software, blog posts and (most recently and probably most importantly) our APIs. We see our site as more than just a source of news and information: it’s a platform on which news and information become building blocks.
Read more.

David Dooling (The Genome Center at Washington University in St. Louis)

Average rating:

(3.33, 3 ratings)

It has long been know that free/libre/open source software (FLOSS), especially GNU/Linux and Perl, played a major role in the Human Genome Project. This presentation will discuss the use and development of FLOSS in the post-genomics era, focusing on its pervasive use in sequencing the first cancer genome at The Genome Center at Washington University in St. Louis.
Read more.

Everyone agrees that we need coding standards, but they often overlook the need to define a naming standard for thier SQL and database related items. This talk we not be a top-down explination of "the right way to do it", but rather we'll explore the key issues you need to be aware of, from all sides, and help you determine the right standards for your organization.
Read more.

Replication. Partitioning. Relational databases. Bigtable. Dynamo.
There is no one-size-fits-all approach to scaling your database, and the CAP theorem proved that there never will be. This talk will explain the advantages and limits of the approaches to scaling traditional relational databases, as well as the tradeoffs made by the designers of newer systems like Google's Bigtable.
Read more.

Diamond Sponsors

Platinum Sponsor

Gold Sponsors

Silver Sponsors

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the conference, contact Sharon Cordesse at scordesse@oreilly.com