This is an annotated CAD tool layout of the Princeton Piton Processor showing 25 cores.
CREDIT
Princeton University

Increased power and slashed energy consumption for data centers

Princeton University researchers have built a new computer chip that promises to boost performance of data centers that lie at the core of online services from email to social media.

Data centers – essentially giant warehouses packed with computer servers – enable cloud-based services, such as Gmail and Facebook, as well as store the staggeringly voluminous content available via the internet. Surprisingly, the computer chips at the hearts of the biggest servers that route and process information often differ little from the chips in smaller servers or everyday personal computers.

By designing their chip specifically for massive computing systems, the Princeton researchers say they can substantially increase processing speed while slashing energy needs. The chip architecture is scalable; designs can be built that go from a dozen processing units (called cores) to several thousand. Also, the architecture enables thousands of chips to be connected together into a single system containing millions of cores. Called Piton, after the metal spikes driven by rock climbers into mountainsides to aid in their ascent, it is designed to scale.

“With Piton, we really sat down and rethought computer architecture in order to build a chip specifically for data centers and the cloud,” said David Wentzlaff, an assistant professor of electrical engineering and associated faculty in the Department of Computer Science at Princeton University. “The chip we’ve made is among the largest chips ever built in academia and it shows how servers could run far more efficiently and cheaply.”

Wentzlaff’s graduate student, Michael McKeown, will give a presentation about the Piton project Tuesday, Aug. 23, at Hot Chips, a symposium on high performance chips in Cupertino, California. The unveiling of the chip is a culmination of years of effort by Wentzlaff and his students. Mohammad Shahrad, a graduate student in Wentzlaff’s Princeton Parallel Group said that creating “a physical piece of hardware in an academic setting is a rare and very special opportunity for computer architects.”

Other Princeton researchers involved in the project since its 2013 inception are Yaosheng Fu, Tri Nguyen, Yanqi Zhou, Jonathan Balkind, Alexey Lavrov, Matthew Matl, Xiaohua Liang, and Samuel Payne, who is now at NVIDIA. The Princeton team designed the Piton chip, which was manufactured for the research team by IBM. Primary funding for the project has come from the National Science Foundation, the Defense Advanced Research Projects Agency, and the Air Force Office of Scientific Research.

The current version of the Piton chip measures six by six millimeters. The chip has over 460 million transistors, each of which are as small as 32 nanometers – too small to be seen by anything but an electron microscope. The bulk of these transistors are contained in 25 cores, the independent processors that carry out the instructions in a computer program. Most personal computer chips have four or eight cores. In general, more cores mean faster processing times, so long as software ably exploits the hardware’s available cores to run operations in parallel. Therefore, computer manufacturers have turned to multi-core chips to squeeze further gains out of conventional approaches to computer hardware.

In recent years companies and academic institutions have produced chips with many dozens of cores; but Wentzlaff said the readily scalable architecture of Piton can enable thousands of cores on a single chip with half a billion cores in the data center.

“What we have with Piton is really a prototype for future commercial server systems that could take advantage of a tremendous number of cores to speed up processing,” said Wentzlaff.

The Piton chip’s design focuses on exploiting commonality among programs running simultaneously on the same chip. One method to do this is called execution drafting. It works very much like the drafting in bicycle racing, when cyclists conserve energy behind a lead rider who cuts through the air, creating a slipstream.

At a data center, multiple users often run programs that rely on similar operations at the processor level. The Piton chip’s cores can recognize these instances and execute identical instructions consecutively, so that they flow one after another, like a line of drafting cyclists. Doing so can increase energy efficiency by about 20 percent compared to a standard core, the researchers said.

A second innovation incorporated into the Piton chip parcels out when competing programs access computer memory that exists off of the chip. Called a memory traffic shaper, this function acts like a traffic cop at a busy intersection, considering each programs’ needs and adjusting memory requests and waving them through appropriately so they do not clog the system. This approach can yield an 18 percent performance jump compared to conventional allocation.

The Piton chip also gains efficiency by its management of memory stored on the chip itself. This memory, known as the cache memory, is the fastest in the computer and used for frequently accessed information. In most designs, cache memory is shared across all of the chip’s cores. But that strategy can backfire when multiple cores access and modify the cache memory. Piton sidesteps this problem by assigning areas of the cache and specific cores to dedicated applications. The researchers say the system can increase efficiency by 29 percent when applied to a 1,024-core architecture. They estimate that this savings would multiply as the system is deployed across millions of cores in a data center.

The researchers said these improvements could be implemented while keeping costs in line with current manufacturing standards. To hasten further developments leveraging and extending the Piton architecture, the Princeton researchers have made its design open source and thus available to the public and fellow researchers at the OpenPiton website: http://www.openpiton.org

“We’re very pleased with all that we’ve achieved with Piton in an academic setting, where there are far fewer resources than at large, commercial chipmakers,” said Wentzlaff. “We’re also happy to give out our design to the world as open source, which has long been commonplace for software, but is almost never done for hardware.”

The Latest on: Piton processor

November 9, 2016 - Princeton University researchers have developed a 25-core processor, dubbed Piton named after the metal spikes used by rock climbers, which has been designed to be flexible, highly scalable, fast and energy-efficient to satisfy the demands of ...

October 5, 2016 - Back in 2010, an Intel researcher said 1,000-core processors would be feasible. We're in that era, and the race to make chips faster and more power efficient is gaining steam. The latest mega-chip is a 1,024-core processor called Epiphany V, which was ...

August 30, 2016 - The Piton processor has been developed by researchers at Princeton University. It is a custom 25 core processor and the cool thing about the Piton processor is that it can scale up to have one million cores inside a single system. It is targeted at ...

August 30, 2016 - Another rumor about the specifications suggests that the device will be powered by Intel's upcoming Kaby Lake processor which might suggest why Microsoft hasn't made an official announcement yet since the processor isn't available in the market right now.

August 28, 2016 - Nintendo promises that it's learned from its past mistakes, mainly the improper communication surrounding the Wii U. The Japanese console giant says it absolutely has to change its ways and make sure consumers know exactly what its new NX console ...

August 27, 2016 - Princeton's new computer chip is called Piton, and it's a many-core open-source research processor aimed at revolutionizing data-center and enterprise-grade cloud-based solutions that power the internet--from email to Facebook and Twitter--with a cheap ...

August 26, 2016 - Researchers at Princeton University recently showed off a 25-core processor they designed specifically for data centers. It's called "Piton," named after the metal spikes rock climbers hammer into cracks or seams of mountainsides to anchor their ...

August 26, 2016 - Princeton Piton Processor, is a many-core designed by Prof. Wentzlaff's research group in March, 2015. It was taped-out in IBM's 32nm SOI process. Some of Piton's features are listed below: 25 modified OpenSPARC T1 cores. Directory-based shared ...

August 26, 2016 - Researchers want to give a 25-core open-source processor called Piton some serious bite. The developers of the chip at Princeton University have in mind a 200,000-core computer crammed with 8,000 64-bit Piton chips. It won't happen anytime soon, but ...

August 23, 2016 - Piton could substantially increase processing speed while slashing energy usage. The chip architecture is scalable — designs can be built that go from a dozen to several thousand cores, which are the independent processors that carry out the ...

December 2, 2016 - "In a developed market, you are competing with cheaper forms of conventional power generation, such as gas and also hydro. Energy storage costs still have some way to come down for a hybrid plant like Kennedy Energy Park to be competitive," he said.

December 1, 2016 - SEATTLE - When firefighter paramedics Morlon Malveaux and Mark Pedeferri learned that their powerhorse diesel ambulance was going to be traded for a gas-powered hybrid they were more than a little concerned. The two, who run a Medic One rig ...