The Network as a Scientific Instrument

By Nicole Hemsoth

June 10, 2013

In June 2012, Greg Bell was named the head of the U.S. Department of Energy’s Energy Sciences Network, better known as ESnet.

Funded by the DOE Office of Science, and managed and operated by the ESnet team at Lawrence Berkeley National Laboratory, ESnet provides reliable, high-performance networking capabilities to thousands of researchers tackling many of the world’s most pressing scientific and engineering problems: finding sources of clean energy, understanding climate change, developing advanced materials, and discovering the fundamental nature of our universe. ESnet interconnects scientists at more than 40 DOE sites with experimental and computing facilities in the U.S. and abroad, and with collaborators around the world.

Invited to give the closing keynote address at the 2012 NORDUnet conference in Oslo, Norway, Bell delivered a presentation entitled “Network as Instrument: The View from Berkeley,” in which he argued that it’s time to start thinking about research networks as instruments for discovery, not just infrastructures for service delivery. The talk struck a chord with the audience, and Bell has since been invited to give versions of the presentation at conferences in the United States and Canada. Most recently, he contributed the April 25 keynote address at the THINK Conference 2013 organized by ORION, the high-speed network linking 1.8 million researchers in Ontario, Canada.

A video of Bell giving a version of this presentation at a meeting on the genomics of energy and the environment, sponsored by the DOE Joint Genome Institute, can be found at the end of the article.

In this Q&A for HPCwire, Berkeley Lab Computing Sciences Communications Manager Jon Bashor talks with Bell about his vision, ESnet news and more.

Question: To start, can you give us a short description of ESnet?

Bell: We’re the Department of Energy’s high-performance networking facility, engineered and optimized for large-scale science. ESnet was created in 1986, making it one of the longest-operating research networks in the world.

ESnet interconnects the entire national lab system, including its supercomputer centers and dozens of large-scale user facilities. Thanks to ESnet, tens of thousands of scientists around the world can transfer data, access remote resources, and collaborate productively.

ESnet is more than a network, though — it’s a collection of skilled and dedicated people, and a great place to work. Even though we’re located near Silicon Valley, we find it relatively easy to attract talent, because we do cutting-edge engineering in the service of scientific discovery.

Q: In a sense, ESnet has always been at the forefront of handling Big Data, and now the rest of the community is catching up. How big is Big Data on ESnet?

Bell: Scientific data sets can be truly enormous, up to petabytes in size, and they’re growing rapidly. Sometimes we use the term “Extreme Data” to distinguish data at this scale from the Big Data you’ve read about in other contexts.

The advent of extreme-data science naturally has an impact on the amount of traffic ESnet carries. In fact, we’re growing about twice as fast as the commercial internet — our traffic doubles every 18 months. I don’t foresee this trend slowing down any time soon, because the underlying exponential drivers just keep cranking along.

Everyone reading this understands that high-performance computing changed the way large-scale science is conducted. It’s clear that data intensity will have an important impact as well. Modeling and simulation will continue to be critically important, but these tools will be supplemented by new techniques that can extract insight from complex data sets, exchanged and accessed over ultra-fast research networks.

Q: Although ESnet was created in 1986, its profile seems to have risen considerably in the past five years or so. What’s behind this?

Bell: More and more researchers are discovering that networks are critical to their science. Faster networks mean faster discovery. In addition, ESnet was lucky enough to receive significant stimulus funds a few years ago. That investment allowed us to build the world’s first 100 Gbps network at continental scale, in partnership with Internet2. We finished that project just in time: the previous-generation network was showing its age, and we were beginning to outgrow it. Our new architecture gives us lots of headroom, and the ability to develop new architectures for maximizing scientific productivity.

We’ve also significantly ramped up our activity in the area of science engagement, partnership, and outreach. We understand that building the world’s fastest science network is not sufficient. We need to make it useful to scientists, and easy to use. That’s harder than it sounds, and we’re still developing models for helping scientists take full advantage of the “fast lanes” we’ve engineered for them.

One final contributor is the success we’re having with applied research and innovation. This critical activity has been enhanced by our dedicated, national-scale 100 Gbps research testbed, which has supported dozens of researchers in the public and private sectors. We’re really trying to push the envelope on a range of topics — including software-defined networking, alternatives to TCP, and security models for 100 Gbps and faster networks.

While we appreciate the recognition, it’s not really important unless it helps us advance our overall mission, which is to accelerate discovery for DOE’s Office of Science. And I do think it’s having that effect. Vendors are coming to us to ask about the unique challenges of supporting science, and our users are beginning to have much higher expectations of ESnet. These are both good developments.

Q: You’ve also been busy. Your talk describing the network as an instrument of discovery has led to multiple invited presentations in North America and Europe — and most recently you gave a version of it as the April 25 keynote address at the THINK conference organized by ORION, the high-speed network in Ontario, Canada. What’s the gist of your presentation?

Bell: My overall goal is to inspire the audience to start thinking about networks differently. Modern research networks such as ESnet and Internet2 (and similar networks around the world) can do a lot more than most people imagine. I try to explain how certain collaborations have profited by incorporating advanced networks into their discovery processes. High-energy physics pioneered this model, and other fields are following. I make the argument that research networks such as ESnet have evolved into extensions of large-scale discovery instruments. For example, the discovery of the Higgs Boson would not have been possible without a worldwide grid computing infrastructure, interconnected by high-speed research networks. Harvey Newman at Caltech pioneered this idea years ago, and the world has finally caught up.

In these presentations, I also give concrete advice about how people can improve networking in their own back yard. ESnet maintains a website devoted to this sort of simple, practical advice: fasterdata.es.net. If you want to start learning about how to use advanced networks more effectively, this is the place to start. It’s a very popular website, with more hits than www.es.net.

Q: Why do you think the message has resonated so well in the networking community?

Bell: It’s not surprising that networkers like to hear that their work is important! But there are a couple of deeper reasons as well. In recent years, networking had become a little dull. Thanks to the challenges of extreme data (and also to the advent of software defined networking), it’s a really exciting place to be again. This new energy is very obvious at networking conferences, and in the academic research community. There are a lot of eyes on networking at the moment.

Q: Last question: What is ESnet focusing on for the coming year? For the next five years?

Bell: Over the next five years, our challenge will be to accommodate the remarkable growth curve in DOE science traffic while simultaneously making the network useful to many more researchers. It’s hard to believe, but even with our new 100 Gbps network and access to underlying optical capacity to carry multiple terabits per second, we will begin to feel a little cramped by 2018-20. At that point, we think we’ll need to light up a new nationwide optical fiber footprint. Whatever else we do, we’re always in a mode of acceleration and growth!

In the coming year, we’ll focus on recruiting about eight new staff, most of them technical. When you consider that we now have about 40 employees, adding eight is significant. We take recruitment very seriously at ESnet. We look for people who are at the top of their game technically, but that’s not enough — they need to be flexible, great communicators, and exemplary colleagues.

MapD Technologies, the GPU-accelerated database specialist, said it is working with university researchers on leveraging graphics processors to advance geospatial analytics.
The San Francisco-based company is collabor Read more…

By George Leopold

A collaboration between the Department of Energy’s National Energy Research Scientific Computing Center (NERSC), Intel and five Intel Parallel Computing Centers (IPCCs) has resulted in a new Big Data Center (BDC) that Read more…

By Linda Barney

Spreading the use of machine learning tools is one of the goals of Google’s PAIR (People + AI Research) initiative, which was introduced in early July. Last week the cloud giant released deeplearn.js as part of that in Read more…

Can’t wait to see next week’s solar eclipse? You can at least catch glimpses of what scientists expect it will look like. A team from Predictive Science Inc. (PSI), based in San Diego, working with Stampede2 at the Read more…

By John Russell

Microsoft has acquired cloud computing software vendor Cycle Computing in a move designed to bring orchestration tools along with high-end computing access capabilities to the cloud. Terms of the acquisition were not disclosed. Read more…

By Tiffany Trader

This week Cray announced that it is picking up Seagate's ClusterStor HPC storage array business for an undisclosed sum. "In short we're effectively transitioning the bulk of the ClusterStor product line to Cray," said CEO Peter Ungaro. Read more…

By Tiffany Trader

The Russian Quantum Center today announced it has overcome the threat of quantum cryptography by creating the first quantum-safe blockchain, securing cryptocurrencies like Bitcoin, along with classified government communications and other sensitive digital transfers. Read more…

By Alex Woodie

In this contributed perspective piece, Intel’s Jim Jeffers makes the case that CPU-based visualization is now widely adopted and as such is no longer a contrarian view, but is rather an exascale requirement. Read more…

By John Russell

On Thursday, Google announced that MIT math professor and computational number theorist Andrew V. Sutherland had set a record for the largest Google Compute Engine (GCE) job. Sutherland ran the massive mathematics workload on 220,000 GCE cores using preemptible virtual machine instances. Read more…