<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Steve Leibson &#187; Design</title>
	<atom:link href="http://low-powerdesign.com/sleibson/index.php/category/design/feed/" rel="self" type="application/rss+xml" />
	<link>http://low-powerdesign.com/sleibson</link>
	<description>Leibson's Laws and the Penalties for Breaking Them</description>
	<lastBuildDate>Wed, 01 Feb 2012 00:01:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.3</generator>
		<item>
		<title>Do you believe in 21st century Intelligent Design?</title>
		<link>http://low-powerdesign.com/sleibson/2012/02/01/do-you-believe-in-21st-century-intelligent-design/</link>
		<comments>http://low-powerdesign.com/sleibson/2012/02/01/do-you-believe-in-21st-century-intelligent-design/#comments</comments>
		<pubDate>Wed, 01 Feb 2012 00:01:15 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[Design]]></category>
		<category><![CDATA[Low-Power]]></category>
		<category><![CDATA[Honeywell]]></category>
		<category><![CDATA[Learning Thermostat]]></category>
		<category><![CDATA[Nest]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=779</guid>
		<description><![CDATA[Late last month, columnist Mike Cassidy wrote about visionary Clayton Christensen’s Innovator’s Dilemma in the San Jose Mercury News and his words reminded me that it was time, past time, to make yet another blog-based plea for intelligent design. No, &#8230; <a href="http://low-powerdesign.com/sleibson/2012/02/01/do-you-believe-in-21st-century-intelligent-design/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Late last month, columnist Mike Cassidy wrote about visionary Clayton Christensen’s Innovator’s Dilemma in the <strong>San Jose Mercury News </strong>and his words reminded me that it was time, past time, to make yet another blog-based plea for intelligent design. No, I’m not talking about “intelligent design” in the form of an alternative to evolutionary theory. Not that one. I’m talking about “intelligent design” in the form of adding more microprocessors and more software to all electronic designs in a valiant attempt to produce products are more aware of the context of their surroundings. In other words, products that are far less stupid. At the same time, I believe that this form of intelligent design can help you cut product manufacturing cost.</p>
<p>More value at lower cost. Doesn’t that seem like a good deal?</p>
<p>Here’s what Cassidy wrote that triggered this blog:</p>
<p>“Clay Christensen has an idea: Scare the hell out of yourselves.</p>
<p>OK, that&#8217;s not precisely the way he put it. But the author of &#8220;The Innovator&#8217;s Dilemma&#8221; is all about new ideas. Not just new &#8212; but different, unorthodox, radical, uncertain, frightening and disruptive. You don&#8217;t solve old problems with old ideas. The other day, Christensen held a one-man teach-in for non-profits and their supporters at San Jose&#8217;s Mexican Heritage Plaza, preaching the gospel of &#8220;disruptive innovation.&#8221; It&#8217;s an idea that is embedded in Silicon Valley&#8217;s DNA. It is also an idea that is a lot easier to talk about than to actually deploy.”</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2012/01/Honeywell-Thermostat-circa-1952.png"><img class="alignright size-full wp-image-780" title="Honeywell Thermostat circa 1952" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2012/01/Honeywell-Thermostat-circa-1952.png" alt="" width="200" height="201" /></a>So what’s the connection with “intelligent design”? I was immediately reminded of a great new product, a home thermostat of all things, that I’d just written up from last month’s CES show.(See “<a href="http://eda360insider.wordpress.com/2012/01/13/two-minutes-of-system-design-expertise-from-matt-rogers-vp-of-engineering-and-founder-of-nest-and-designer-of-the-thermostat-of-the-future/" target="_blank">Friday Video: Two minutes of system-design expertise from Matt Rogers, VP of Engineering and founder of Nest and designer of the thermostat of the future</a>”) <a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2012/01/Honeywell-Thermostat-circa-1952.png"></a>The thermostat is from a new company called <a href="www.nest.com " target="_blank">Nest</a> and the product is called the Nest Learning Thermostat. It looks like an updated 21<sup>st</sup> century version of the old golden Honeywell manual thermostats that were common in the 1950s and 1960s. (Noted industrial designer Henry Drefuss created the Honeywell thermostat’s iconic circular industrial design in 1952.)</p>
<p>However, unlike those old Honeywell thermostats that were based on bimetallic temperature-sensing coil springs and mercury tilt switches, the Nest Learning Thermostat is based on a 32-bit microprocessor. And a TCP/IP stack. And WiFi. In adding these specific technologies to its Learning Thermostat, Nest demonstrates a grasp of Christensen’s concept of the “Innovator’s Dilemma.”</p>
<p>How?</p>
<p>First, understand that the context of what we mean by “home” has changed. Frequently in the modern Western world, there’s no one home. That means the home’s heating and cooling requirements are different. Also, our definition of “home” increasingly includes a home WiFi network, possibly with access to the outside world through the home’s broadband router. Add some intelligence and connectivity to a thermostat, and you can disruptively change how we heat and cool our homes with a large resulting energy savings.</p>
<p>After installation, the Nest Learning Thermostat needs to know three things:</p>
<ol>
<li>What’s your Zip code (Yes, I know there’s a US bias built in here. Early intelligence has its limits.)</li>
<li>Should the thermostat start to heat or cool your home?</li>
<li>What temperatures should the Nest Thermostat use to heat or cool your home while you are away?</li>
</ol>
<p>After that, you set the thermostat to the desired current temperature and the Nest Learning Thermostat then starts to observe your daily habits (with respect to heating and cooling only!). It monitors when you turn up the heat (like in the morning) and when you turn it down (like before you go to sleep). It notes when you turn on the cooling (like when the afternoon sun starts to make it overly warm for you).</p>
<p>The Thermostat learns your daily routing and your weekend routine (weekend habits are different for most people) and by the eight day, the thermostat has developed a pretty good idea of your habits and strives to maintain a comfortable home for you by catering to those habits. The Nest Learning Thermostat can start to heat your home several minutes before you rise in the morning. (This is a major feature for those of us whose first job in the morning is to turn up the heat for our spouses so they can get up.)</p>
<p>You can log into your thermostat from your office to adjust the heat so that your home is warm by the time you return from work or to let the thermostat know you’re going out for dinner and the heat can be delayed for a few hours. And of course, you can do the same from your smartphone no matter where you are on the planet (assuming you’ve got cellular coverage wherever you are).</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2012/02/Nest-Learning-Thermostat.png"><img class="alignright size-full wp-image-782" title="Nest Learning Thermostat" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2012/02/Nest-Learning-Thermostat.png" alt="" width="250" height="244" /></a>The Nest Learning Thermostat also tries to train you after it learns your habits. It wants you to learn better habits in terms of energy consumption. It does this by starting to display a small green leaf when you turn down the heat or taper off on the air conditioning relative to your habitual heating and cooling use. This leaf tells you that you’re saving energy and thus money. It’s a subtle form of coercion and some people won’t like being told what to do by a thermostat. Others—the ones most likely to buy this product—will appreciate the watchful eye.</p>
<p>None of this would be possible if the Nest Learning Thermostat did not have a 32-bit microprocessor and a WiFi connection. Note that it took an entirely new company to build a thermostat like this. Although Honeywell still makes thermostats, microprocessor-based ones at that, it’s not building anything like the Nest Learning Thermostat. At least not this year. We just had to replace the thermostat in our condo and it is a Honeywell thermostat. It’s a standard setback thermostat design with an LCD. I have to tell it the time. I have to program it for a setback sequence. And Honeywell knew that the user interface on this product was so unintuitive that it kindly included a fold-out instruction booklet that pokes out of the top left of the thermostat, as you can see from this photo.</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2012/02/New-Honeywell-Thermostat.png"><img class="alignright size-full wp-image-783" title="New Honeywell Thermostat" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2012/02/New-Honeywell-Thermostat.png" alt="" width="560" height="378" /></a></p>
<p>Frankly, I find the display of the new Honeywell thermostat extremely confusing. It shows the current temperature on the left side in large characters, the setpoint temperature in somewhat smaller characters on the right side, and the time in even smaller characters in the middle. To me, it looks like the display information was simply thrown on the display with little consideration to how the information is portrayed. The time is not the central piece of information on a thermostat yet that information is central to the display, albeit in small characters. The two important and conjoined pieces of information, the current and setpoint temperatures, are spaced nearly as far apart on the display as possible. And there’s no fixing this design with a firmware update. That LCD’s permanently configured to save manufacturing cost.</p>
<p>Not, repeat not intelligent design.</p>
<p>You can bet that the Nest Intelligent Thermostat does not have an ungainly instruction booklet poking out of its sleek, smooth industrial design. To anyone who has ever used a regular dumb thermostat, the Nest Learning Thermostat’s everyday operational use appears to be entirely intuitive. And if you want to do more complex things with the Nest Learning Thermostat, you interact with it through a Web page, not a handful of multi-use rubber buttons and a limited (for cost reasons) LCD. For these reasons (and a few more), I think the Nest Learning Thermostat is one of the best examples of 21<sup>st</sup> century intelligent design I’ve seen. It offers up several lessons in thoughtful design that I hope you will appreciate as much as I do.</p>
<p>Note: To read Mike Cassidy’s entire column, click on the following link: “<a href="http://www.mercurynews.com/mike-cassidy/ci_19778468" target="_blank">Clay Christensen sees Silicon Valley non-profits&#8217; dilemma as the innovator&#8217;s dilemma</a>”</p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2012/02/01/do-you-believe-in-21st-century-intelligent-design/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>2011: A great year for low-power design, wasn’t it?</title>
		<link>http://low-powerdesign.com/sleibson/2011/12/17/2011-a-great-year-for-low-power-design-wasn%e2%80%99t-it/</link>
		<comments>http://low-powerdesign.com/sleibson/2011/12/17/2011-a-great-year-for-low-power-design-wasn%e2%80%99t-it/#comments</comments>
		<pubDate>Sat, 17 Dec 2011 18:46:25 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[2.5D]]></category>
		<category><![CDATA[ARM]]></category>
		<category><![CDATA[Design]]></category>
		<category><![CDATA[Low-Power]]></category>
		<category><![CDATA[Microcontroller]]></category>
		<category><![CDATA[Multicore]]></category>
		<category><![CDATA[SOC]]></category>
		<category><![CDATA[2000T]]></category>
		<category><![CDATA[NXP]]></category>
		<category><![CDATA[Virtex-7]]></category>
		<category><![CDATA[Xilinx]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=734</guid>
		<description><![CDATA[2011 was a great year for low-power design. I don’t think I can remember a year as good to low-power designers. I thought I’d devote this blog to a review of some major developments in 2011 that made low-power designers’ &#8230; <a href="http://low-powerdesign.com/sleibson/2011/12/17/2011-a-great-year-for-low-power-design-wasn%e2%80%99t-it/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>2011 was a great year for low-power design. I don’t think I can remember a year as good to low-power designers. I thought I’d devote this blog to a review of some major developments in 2011 that made low-power designers’ lives easier. In fact, there’s so much to talk about that I’m splitting this blog post in two. In the first half, I’ll write about significant developments in standard silicon offerings including microcontrollers, embedded application processors, and FPGAs. In part B, I’ll discuss some of the year’s most significant developments in design at the silicon level and the implications for people who design ASICs, SoCs, and ASSPs. It truly was a bountiful year.<strong> </strong></p>
<h3><strong>Low-power microcontrollers</strong></h3>
<p>If there ever was a year for microcontroller advancement, this was it. Every major microcontroller vendor had something new and exciting on the low-power front. So many developments that I can only hit the highlights:</p>
<p>In August, ARM’s Alan Rampon wrote a blog post listing 17 microcontroller vendors that were offering a broad range of low-power devices based on various ARM Cortex-M series processor cores. The vendor list includes:</p>
<ul>
<li>Analog Devices (Cortex-M3)</li>
<li>Atmel (Cortex-M3)</li>
<li>Broadcom (Cortex-M3)</li>
<li>Cypress Semiconductor (Cortex-M3)</li>
<li>Dust Networks (Cortex-M3)</li>
<li>Ember (Cortex-M3)</li>
<li>Energy Micro (Cortex-M0, M3)</li>
<li>Freescale Semiconductor (Cortex-M4)</li>
<li>Fujitsu (Cortex-M3)</li>
<li>Holtek (Cortex-M3)</li>
<li>Nuvoton (Cortex-M0)</li>
<li>NXP (Cortex-M0, M3, M4)</li>
<li>ON Semiconductor (Cortex-M3)</li>
<li>Samsung (Sortex-M0, M3)</li>
<li>ST Microelectronics (Cortex-M3)</li>
<li>Texas Instruments (Cortex-M3)</li>
<li>Toshiba (Cortex-M3)</li>
</ul>
<p>That list is probably somewhat dated already, but you get the idea. The proliferation of low-power microcontrollers greatly accelerated during 2011. One such device that really sticks in my mind (because it’s recent), is the onset of shipments of the <a href="http://eda360insider.wordpress.com/2011/12/05/asymmetric-dual-core-nxp-lpc4300-microcontrollers-split-tasks-between-arm-cortex-m4-and-m0-cores-cost-3-75-and-up/" target="_blank">NXP Semiconductor LPC4350</a>, which packs an ARM Cortex-M4 and an ARM Cortex-M0 into one microcontroller that costs less than $4 in quantities of 10,000.</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/12/NXP-LPC4350-block-diagram.jpg"><img class="aligncenter size-full wp-image-738" title="NXP LPC4350 block diagram" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/12/NXP-LPC4350-block-diagram.jpg" alt="" width="560" height="454" /></a></p>
<h3>This microcontroller is on the forefront of a new wave of processor design called “asymmetric multiprocessing” and there’s a real “wave-of-the-future” look to this development. (See “<a href="http://eda360insider.wordpress.com/2011/12/07/more-news-on-the-asymmetric processing-soc-front/" target="_blank">More news on the asymmetric processing SoC front</a>”)</h3>
<h3><strong>Asymmetric Multiprocessing</strong></h3>
<p>The microprocessor is 40 years old (last month!) and silicon microprocessor implementations have really advanced over those four decades while many of our design memes have not. In particular, I’m thinking of the meme that says “processors are expensive, so layer as many tasks as possible on a processor to save money.” The net effect of this meme is to make us develop increasingly complex multitasking schemes in an attempt to get processor utilization up to 80% or 90% or perhaps even 95%.</p>
<p>Now any engineer can tell you that when you load any component to near 100%, you have just sent and engraved, gold-plated invitation to Murphy, asking for an audience. In other words, something will go wrong. You won’t always get the latency you expected. You won’t always get the bandwidth you need.</p>
<p>So you’d better ask yourself: Are complex multitasking systems really worth the effort when I can get two 32-bit microprocessor cores in one device for less than $4? You’d better be serious coming up with that answer. I believe that asymmetric multiprocessing will remake all of design, including low-power design, during this coming decade.</p>
<p>Asymmetric multiprocessor design wasn’t the only innovation that loomed in 2011. Xilinx finally <a href="http://low-powerdesign.com/sleibson/2011/03/01/xilinx-zynq-epps-create-a-new-category-that-fits-in-among-socs-fpgas-and-microcontrollers/" target="_blank">announced</a><a href="../2011/03/01/xilinx-zynq-epps-create-a-new-category-that-fits-in-among-socs-fpgas-and-microcontrollers/"></a> the first four members of its new Zynq 7000 EPP (Extensible Processing Platform) family, which fuses a processor complex containing two ARM Cortex-A9 processor cores with an FPGA fabric.</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/12/Xilinx-EPP-Block-Diagram-v4.jpg"><img class="aligncenter size-full wp-image-737" title="Xilinx EPP Block Diagram v4" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/12/Xilinx-EPP-Block-Diagram-v4.jpg" alt="" width="587" height="469" /></a></p>
<p>Now assembling systems with microprocessors and FPGAs isn’t new. In fact, putting processor cores and FPGA fabrics onto the same piece of silicon isn’t particularly new either. However, doing it right? That is new. And this development fits into the low-power design world because putting the processor complex and the FPGA fabric on chip with a massive on-chip interconnect between the two cuts interface power significantly by reducing the interconnect frequency. You don’t need GHz interconnect clock rates when you have thousands of wires for parallel interconnect.</p>
<h3><strong>2.5D IC Assembly</strong></h3>
<p>Speaking of Xilinx, the company started shipping engineering samples of the Virtex-7 2000T FPGAs to customers last month and this too is a low-power design story. The story is completely told in this graphic:</p>
<p>&nbsp;</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/12/Xilinx-Virtex-7-2000T.jpg"><img class="aligncenter size-full wp-image-735" title="Xilinx Virtex 7 2000T" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/12/Xilinx-Virtex-7-2000T.jpg" alt="" width="560" height="319" /></a></p>
<p>&nbsp;</p>
<p>The Xilinx Virtex-7 2000T is a very large FPGA with two million logic elements. But it’s not a monolithic piece of silicon. Rather, the Virtex-7 2000T consists of four “identical” FPGA tiles, each with half a million logic elements (and a ton of other stuff). The FPGA tiles are mounted on a silicon interposer, which establishes more than 10,000 connections between each tile (56,000 connections in total). The silicon interposer is a fascinating piece of technology. It’s a 65nm IC with four layers of metal on each side of the die and no transistors. It’s a silicon circuit board that must be made in a wafer fab. In this case, TSMC owns the fab. The interposer is as large as the stepper reticule will allow. The advantage here is that each FPGA tile is a quarter of the size of the interposer, and die yield has an exponential relationship to die size. The smaller the die, the better the yield percentage. So 2.5D assembly makes a lot of sense in several different ways.</p>
<p>The 2.5D IC assembly-with-interposer approach taken to create the Xilinx Virtex-7 2000T allows the FPGA tiles to use lower power I/O drivers because these drivers will only be driving short, closely controlled traces between adjacent tiles. That system-design knowledge saves power. Although the Xilinx Virtex-7 2000T uses four identical die fabricated with a 28nm process technology to realize the active elements, 2.5D IC assembly permits heterogeneous die assembly as well, as shown in this <a href="http://eda360insider.wordpress.com/2011/11/16/3d-thursday-how-xilinx-developed-a-2-5d-strategy-for-making-the-worlds-largest-fpga-and-what-the-company-might-do-next-with-the-technology/" target="_blank">image</a> from Xilinx:</p>
<p>&nbsp;</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/12/Xilinx-2000T-next-step.jpg"><img class="aligncenter size-full wp-image-736" title="Xilinx 2000T next step" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/12/Xilinx-2000T-next-step.jpg" alt="" width="560" height="171" /></a></p>
<p>&nbsp;</p>
<p>As you can see, 2.5D IC assembly allows designers the freedom to intermix die from radically different IC technologies such as logic, memory (DRAM, Flash, SRAM, etc.), analog, and RF. It’s a pc-board-like technology but on a much smaller scale. The resulting 2.5D device may well be better optimized and cost less than it might if the design team attempted to place everything on one monolithic die. That’s a topic I’ll take up in Part B of this blog entry.</p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2011/12/17/2011-a-great-year-for-low-power-design-wasn%e2%80%99t-it/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What if 2.5D got really cheap? How would that affect low-power design?</title>
		<link>http://low-powerdesign.com/sleibson/2011/11/17/what-if-2-5d-got-really-cheap-how-would-that-affect-low-power-design/</link>
		<comments>http://low-powerdesign.com/sleibson/2011/11/17/what-if-2-5d-got-really-cheap-how-would-that-affect-low-power-design/#comments</comments>
		<pubDate>Thu, 17 Nov 2011 18:09:37 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[2.5D]]></category>
		<category><![CDATA[CMOS]]></category>
		<category><![CDATA[Design]]></category>
		<category><![CDATA[Flash]]></category>
		<category><![CDATA[Low-Power]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=727</guid>
		<description><![CDATA[Last week, silicon-interposer foundry Deca Technologies unstealthed. I found out from an article in the San Jose Mercury News and just published a blog about the announcement in my other blog, the EDA360 Insider. Deca is a subsidiary of Cypress &#8230; <a href="http://low-powerdesign.com/sleibson/2011/11/17/what-if-2-5d-got-really-cheap-how-would-that-affect-low-power-design/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Last week, silicon-interposer foundry Deca Technologies unstealthed. I found out from an <a href="http://www.mercurynews.com/business/ci_19297216" target="_blank">article</a> in the San Jose Mercury News and just published a <a href="http://eda360insider.wordpress.com/2011/11/17/is-cypress-subsidiary-deca-technologies-onto-2-5d-packaging-in-a-big-way/" target="_blank">blog</a> about the announcement in my other blog, the <strong>EDA360 Insider</strong>. Deca is a subsidiary of Cypress Semiconductor and the outspoken President and CEO of Cypress, TJ Rodgers, was good for a quote, as always:</p>
<p>“We want to use the dense, reliable silicon interconnect inherent in Moore’s Law to integrate the dissimilar chips used in today’s systems, but we face an economic barrier because the interconnect on silicon chips is 1,000 times more expensive than the interconnect on PC boards.</p>
<p>“We could enable a new silicon-based interconnect paradigm if we could make silicon interconnect wafers for $10, just what silicon solar wafers cost today. The problem of mapping solar technology onto Moore’s Law is straightforward, but difficult, and we believe DecaTech has the answer.”</p>
<p>Now don’t take that $10 per interposer wafer to the bank. I get the impression that’s a long-term goal, not a short-term pricing roadmap. However, even a 10x drop in interposer costs will have a big influence on the future of 2.5D assembly technology.</p>
<p>And why should we as low-power systems designers care? Because interconnect is expensive and because interconnect now largely determines system performance. First, think about expense. Let your mind go back 40 years (if it can) to the birth of the microprocessor, which we celebrate this month. In the 1970s, microprocessor interconnect meant a bus. Not on a board but in a system. One of the most successful early microprocessor buses was the S-100 bus. It was named for the 100-pin edge connectors and the 100-conductor bus used to interconnect system boards in the original Altair 8800 microcomputer introduced in 1975 and subsequently adopted by several microcomputer vendors including Imsai, Vector Graphics, North Star Computers (formerly known as Kentucky Fried Computers), and Processor Technology and by board vendors including Godbout Electronics/Compupro and Morrow Micro-Stuff/ThinkerToys.</p>
<p>Back then, due to the nascent state of semiconductor integration, you would to have a processor board, one or likely more than one memory boards, a video board, and one or more I/O boards. A major system expense was just the half dozen or so 100-pin edge connectors and the simple but large circuit board that implemented the bus. The S-100 connectors were expensive and you needed a lot of energy to drive the bus lines because they were physically large and because—as bus speeds increased—they required resistive termination to prevent ringing and you needed even more energy to drive the termination resistors.</p>
<p>By 1981 when the IBM PC appeared, things were getting somewhat better. We still had half a dozen edge connectors but we were down to 62 edge-connector pins (for 8-bit systems). Add another 36 pins when we jumped to 16 bits and we found ourselves right back at close to a 100-pin bus. So much for progress.</p>
<p>For board-to-board interconnect, things are going serial (think of the PCI evolution to PCIe) but chip-to-chip interconnect on a board is still largely parallel with lots of pins on a chip looking to connect to lots of pins on other chips. There is still impedance in those pcb traces and you still need relatively big drivers that consume significant amounts of energy to drive those traces. Hence a movement to go serial for chip-to-chip interconnect on a board—an extension of the migration of buses to serialized versions.</p>
<p>High-speed serial buses incur their own costs. There’s the energy cost of driving even a few wires at multi-GHz speeds and there’s the performance hit in the form of latency increase that you get when you serialize and then deserialize a data stream. It’s not all wine and roses.</p>
<p>So we try to put as much as we can on one IC, but that’s not an ideal solution either. Not all IC processing is the same and chips with different functions are really better off being fabricated with different IC fabrication processes. For example, NAND Flash and DRAM processes push as far down the Moore’s Law curve as they can get, as fast as they can to boost density and cut cost per bit. CMOS logic processes are right behind the memory processes but use more layers of on-chip metal interconnect. Because they require more random connectivity than memories. Analog ICs typically operate at higher supply voltages and they’re nowhere near the leading/bleeding edge of IC processor technology. It’s not economical to put all of these different functions on one die, and so we see renewed interest in multichip modules, known by the 21<sup>st</sup>-centrury name: 2.5D IC assembly using silicon interposers.</p>
<p>2.5D assembly using bare semiconductor die attached to small interposers instead of big circuit boards significantly changes the parallel/serial I/O equation. Suddenly, you don’t need such big I/O drivers on the chip because there’s no wire bond, no IC package interconnect, and significantly shorter traces to drive. Suddenly, massively parallel I/O consumes only a fraction of the energy it previously needed and the balancing equation that calculates the breakeven point between parallel I/O and serialized, high-speed I/O alters. The balance alters to favor parallel I/O more and serial I/O less.</p>
<p>And when major changes like that happen to such equations, the way we design systems also changes.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2011/11/17/what-if-2-5d-got-really-cheap-how-would-that-affect-low-power-design/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>1972: When scientific calculators truly went low power</title>
		<link>http://low-powerdesign.com/sleibson/2011/09/19/1972-when-scientific-calculators-truly-went-low-power/</link>
		<comments>http://low-powerdesign.com/sleibson/2011/09/19/1972-when-scientific-calculators-truly-went-low-power/#comments</comments>
		<pubDate>Mon, 19 Sep 2011 02:04:51 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[Design]]></category>
		<category><![CDATA[Low-Power]]></category>
		<category><![CDATA[calculator]]></category>
		<category><![CDATA[HP]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=660</guid>
		<description><![CDATA[Dave Cochran recently wrote about his long engineering career at Hewlett-Packard on the www.hpmemory.org Web site. Who? What Web site? Well, the Web site is an amazing living museum that&#8217;s a tribute to Bill and Dave&#8217;s HP. And Dave Cochran is &#8230; <a href="http://low-powerdesign.com/sleibson/2011/09/19/1972-when-scientific-calculators-truly-went-low-power/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Dave Cochran recently <a href="http://www.hpmemory.org/timeline/dave_cochran/a_quarter_century_at_hp_00.htm" target="_blank">wrote about</a> his long engineering career at Hewlett-Packard on the <a href="http://www.hpmemory.org/">www.hpmemory.org</a> Web site. Who? What Web site? Well, the Web site is an amazing living museum that&#8217;s a tribute to Bill and Dave&#8217;s HP. And Dave Cochran is likely one of the most important people you’ve never heard about in the annals of low-power design. He spent 25 years at HP, starting as a part-time test technician in 1956 and departing as a celebrated HP engineer in 1981.</p>
<p>Between those two years, Cochran worked on a huge number of projects including the HP 204B audio oscillator where he used transistors and a hugely ingenious double-spiral-cam potentiometer actuator to design the famous tungsten light bulb out of Bill Hewlett’s original audio oscillator design. (That double-cam actuator makes a linear pot do unnatural things and you need to see the short, 3-second video to believe it!) But it’s what he did in the 1960s and 1970s that make him the topic of this particular blog post.</p>
<p>Cochran was working at HP Labs when Malcolm McMillan and Tom Osborne dropped by with two very different calculator prototypes. It was the legendary Barney Oliver, Grand Wizard of HP Labs who conjured the idea of merging these two very different machines into one massively powerful scientific calculator. McMillan’s design—called Athena—could perform transcendentals using Jack Volder’s CORDIC algorithms but it had a fixed-point architecture so it was not deemed accurate enough for repeated engineering calculations. Osborne’s design was barely more than a simple 4-banger calculator but it had a really elegant, floating-point hardware design based on the 1960s version of a VLIW processor.</p>
<p>Cochran got the job of trying to come up with a way to unify the two architectures. He writes “I was looking at Osborne&#8217;s architecture and trying to figure out what an algorithm was. I even flew down to Southern California to talk with Jack Volder who had developed the <a href="http://en.wikipedia.org/wiki/CORDIC" target="_blank">CORDIC</a>  transcendental functions used in the Athena machine and talked to him for about an hour. He referred me to the original papers by Meggitt where he&#8217;d gotten the pseudo division, pseudo multiplication generalized functions.</p>
<p>My job was to determine how many digits and what the operation time was required; what the architecture had to be; how many registers did we need, clock speed, etc? Other people were coming back with their inputs on cathode ray tube display, keyboards [and so on]. Should we use transistors or small-scale integration? There was no large-scale integration, but there was medium-scale integration, MSI which meant maybe 10 transistors in a chip.”</p>
<p>From Cochran’s architectural contributions, plus substantial work from other engineers in HP Labs and Tom Osborne (who remained an HP consultant for many more years), HP introduced the HP 9100 Scientific Calculator in late 1968. It’s a marvelous machine and it was a real design breakthrough for its day. It also weighed 40 pounds and drew 70 Watts out of a wall socket.</p>
<p>Now 70W isn’t all that much compared to the amount of electrical power you’d need to replicate the HP 9100’s computational abilities with a minicomputer or a mainframe, so you could consider this a lower-power design for its day. But it’s what happened next that really takes us to the domain of miraculously low power.</p>
<p>Cochran writes: “As soon as the 9100 started showing success in the market place Hewlett started to bug me personally. I know he also talked to Tom Osborne about it, what do you think, and so on. But he would come into the lab and he&#8217;d look for me and he&#8217;d say, ‘Hey, how are you coming with putting the 9100 in my shirt pocket?’ He said, ‘I want all that computational power in my shirt pocket.’”</p>
<p>OK. From 40 pounds and 70W to something that runs off batteries and fits in your shirt pocket. Now that’s a stretch. Oh, and one thing I forgot to mention. The HP 9100 calculator design had exactly one integrated circuit in it. That IC was used in the magnetic card reader. Osborne didn’t like the primitive ICs offered at the time. He didn’t design the card reader but the rest of the machine was implemented with discrete transistors, magnetic-core RAM, rope memory (go look that one up), and a large capacitive ROM fabricated from a 16-layer printed-circuit board. None of that technology was headed for a shirt pocket. Not in this universe.</p>
<p>Cochran then writes about his shirt-pocket epiphany: “Tom Whitney and I went down to Fairchild Semiconductor on Ellis Street, Mountain View, and they wanted to show us a calculator architecture that they were planning to provide to various companies that wanted to build calculators and semiconductors. So we went down to look at it. And I looked at it, and oh, this could do the algorithms. See, I knew. By this time, I had already fit the algorithms into a small-scale integrated machine, the 9100. So I knew exactly what architecture I needed, the capabilities of the architecture. I didn&#8217;t know what it was going to look like, but I knew what its capabilities had to be.</p>
<p>It was I think September of 1970; I saw a design that was different than anything else. It was not your classic computer architecture as taught at the universities. It was all shift register. It was designed for the technology at the time. When talking to the people at Fairchild I meet a fellow, Rich Whicker, who later came to work at HP. I said, &#8220;God, this design, did you think of this?&#8221; He says, &#8220;No. We got it from Sweda, the cash register company.&#8221; Sweda at the time was trying to make an electronic cash register or Point-of-Sale products and they were using shift registers.</p>
<p>Shift registers were the densest form of integration of integrated circuits at the time; you had to keep the clock moving and so on. It had no static memory. So here was a design using shift registers a 20-digit chipset that could satisfy anybody making a four-function machine. Add, subtract, multiply, and divide. It could give you the numbers as big as most people wanted, but it was all fixed point. 20 digits should be more than enough for anybody. You could have the decimal point anywhere in that stream.</p>
<p>I got really cranked up about seeing that architecture at Fairchild, I got very excited. And I&#8217;m whispering in Tom Whitney&#8217;s ear, ‘God, this is great.’ And I&#8217;m trying not to be too excited while I&#8217;m there. When we drove away from there and, I said, ‘God, that&#8217;s exactly—you know, I can tweak that architecture just a little bit. We don&#8217;t need the full 20 digits, we can do this and this and this. And gosh, yes, I can do it, I can do it, I can do it.’”</p>
<p>And that’s when the HP 35 Pocket Scientific Calculator crossed over from the realm of the impossible to the realm of the possible. When David Cochran thought that it could. Two years later, in 1972, it became a reality. A pocket scientific calculator that ran on three NiCd batteries in a pack. The rest, as they say, is history.</p>
<p>Be sure to read the whole story at <a href="http://www.hpmemory.org/timeline/dave_cochran/a_quarter_century_at_hp_00.htm#chapter_08">http://www.hpmemory.org/timeline/dave_cochran/a_quarter_century_at_hp_00.htm#chapter_08</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2011/09/19/1972-when-scientific-calculators-truly-went-low-power/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Imagine no uninterruptible power supplies. I wonder if you can. A sad story of six fried hard disk drives</title>
		<link>http://low-powerdesign.com/sleibson/2011/08/17/imagine-no-uninterruptable-power-supplies-i-wonder-if-you-can-a-sad-story-of-six-fried-hard-disk-drives/</link>
		<comments>http://low-powerdesign.com/sleibson/2011/08/17/imagine-no-uninterruptable-power-supplies-i-wonder-if-you-can-a-sad-story-of-six-fried-hard-disk-drives/#comments</comments>
		<pubDate>Wed, 17 Aug 2011 12:30:03 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[Design]]></category>
		<category><![CDATA[DRAM]]></category>
		<category><![CDATA[Low-Power]]></category>
		<category><![CDATA[Agiga Tech]]></category>
		<category><![CDATA[AGIGARAM]]></category>
		<category><![CDATA[Dell]]></category>
		<category><![CDATA[Server]]></category>
		<category><![CDATA[UPS]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=641</guid>
		<description><![CDATA[This is the story of six fried hard disk drives and why they died needlessly of heat failure as told to me by my good friend Ron Sartore, founder and CEO of AgigA Tech, at this month’s Flash Memory Summit. &#8230; <a href="http://low-powerdesign.com/sleibson/2011/08/17/imagine-no-uninterruptable-power-supplies-i-wonder-if-you-can-a-sad-story-of-six-fried-hard-disk-drives/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>This is the story of six fried hard disk drives and why they died needlessly of heat failure as told to me by my good friend Ron Sartore, founder and CEO of <a href="www.agigatech.com" target="_blank">AgigA Tech</a>, at this month’s <a href="www.flashmemorysummit.com" target="_blank">Flash Memory Summit</a>. It’s also the story of the disaster’s aftermath and why it shouldn’t happen again—to anyone. Finally, it’s the story of why we just might need to reconsider our views regarding the use of uninterruptible power supplies to supply emergency back-up power to servers. Sometimes, truth is stranger than fiction.</p>
<p>AgigA Tech’s servers sit in a small room in the company’s San Diego corporate headquarters. One recent Saturday, these servers were quietly doing their jobs when an unscheduled power outage occurred. Thanks to uninterruptible power systems backing up the servers, the servers continued to do their jobs. Unfortunately, the air conditioning system at AgigA Tech headquarters has no backup power and did not continue to do its job. The temperature in the server room at AgigA Tech started to climb from the waste heat being thrown off by the equipment.</p>
<p>There was no one in the building to notice.</p>
<p>Eventually, power from the grid came back on and the cooling system restarted. However, by that time the temperature in the server room had climbed high enough to cook six of the hard disk drives in the server room’s RAID arrays. Fortunately, the RAID arrays performed as designed and there was no data loss. But if the power outage had lasted longer and if the servers had continued to run without cooling, eventually all of the RAID drives would have died. Even the best RAID system cannot preserve data when all of its drives fail. As it stands, even the hard disk drives that did not fail are suspect because they’ve been heat stressed. They too need to be replaced quickly before they fail prematurely as well.</p>
<p>After this small weekend near-disaster, my friend Ron Sartore started to ponder the ramifications and lessons of the incident. First, he realized that his servers should be sensing ambient temperature and shutting down gracefully when the server room overheats. In fact, Ron sort of assumed that’s what would happen. Bad assumption, as it turned out. “How often do you test that?” asked Ron. “No one wants to pull the plug on these things to find out” he added. I’ll bet a lot of business owners with little server rooms make precisely the same assumptions that Ron and his IT team did when they designed their server room.</p>
<p>Next, Ron began to think about the uninterruptible power supplies from an engineering perspective, which really calls into question the entire concept of uninterruptible power supplies for servers. There’s clearly no reason to continue to operate servers during a power outage if there’s no cooling available. In fact, there’s a clear reason to shut them down as quickly as possible to prevent overheating and hardware failure due to lack of cooling.</p>
<p>Now large data centers—like the ones operated by Amazon, Microsoft, and Google—have on-site Diesel generators that operate both the servers and the data center’s cooling systems during a power outage. These companies cannot afford to have their servers shut down. Every minute—actually every second—that the servers are down means lost revenue, lost profit, and lost customers. But most companies are not Amazon, Microsoft, or Google.</p>
<p>There are hundreds of thousands of companies in the US and millions in the world that run their servers in small server rooms or even closets where there are uninterruptible power systems designed to keep servers running as long as possible but where there is no backup power for cooling. For these companies, their server-system designs will cook and kill their hard drives rather quickly in the event of a power failure. We design smaller server systems this way almost without thinking. The UPS is a check-box item, meaning we don’t even think critically about including one.</p>
<p>Ron’s story made me think about UPS costs—both the acquisition cost and the operating cost over the life of the UPS. You see most uninterruptible power systems are designed so that they always supply power to the attached servers. Even while a UPS is running from the power grid, it’s still consuming and wasting energy. Today’s best UPS designs are perhaps 95% efficient. That means they consume about 5% of their input energy. All…the…time.</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/08/UPS-to-server-efficiency.jpg"><img class="aligncenter size-full wp-image-642" title="UPS to server efficiency" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/08/UPS-to-server-efficiency.jpg" alt="" width="338" height="278" /></a></p>
<p>Less expensive UPS designs might be only 90% efficient. They waste about 10% of the energy consumed by the attached server(s).</p>
<p>In reality, a UPS that’s not operating at maximum load is likely to run at somewhat reduced efficiency because UPS manufacturers tend to rate their products’ efficiency at maximum load. In addition, IT managers prefer to overspecify the capacity of a UPS, which is normally good engineering practice but here it pretty much guarantees that the UPS will not operate at maximum efficiency.</p>
<p>Where does that wasted energy from the UPS go? It’s converted to waste heat, of course. Ron told me that his home UPS (purchased from Costco) runs hot even though it is lightly loaded—13W for a small PC appliance. Lightly loaded, a UPS might convert just as much power into waste heat as it delivers to the load.</p>
<p>And where does the waste heat from the UPS go? Right into the cooling system, assuming the cooling system is operational. During a power failure, it probably isn’t.</p>
<p>How much energy is needed for cooling? It depends on the cooling system. I’ve often heard from various presenters talking about energy needs for cooling data centers that it’s a 1:1 ratio—for every Watt of power emitted by equipment as waste heat, you need another Watt to remove the waste heat. Ron tells me that many of today’s cooling systems are actually more efficient than that. Some need only half a Watt to remove a Watt of waste heat from the server room—K factor of 2 for you HVAC engineers. Some high-efficiency cooling systems achieve a K factor of 3. So you can multiply the waste heat generated by a UPS by a factor of between 1.33 and 2 to determine the actual energy cost of UPS inefficiency.</p>
<p>Finally, we get down to computing the actual costs for using a UPS in a server room. Let’s start with a <a href="http://accessories.us.dell.com/sna/productdetail.aspx?c=us&amp;l=en&amp;s=bsd&amp;cs=04&amp;sku=330-7523&amp;SMCID=702&amp;CAWELAID=416871679&amp;dgc=SS&amp;cid=27722&amp;lid=628335" target="_blank">2700W Dell UPS Short Depth Rack High Efficiency Online power-backup unit</a>. When I looked it up, the purchase cost for the UPS was $1360.99 plus $115.68 for tax for a grand total of $1476.67. Shipping, at least, appears to be free. The UPS has a 3-year expected life and is rated as 95% efficient. About 5% of the energy it consumes is converted to waste heat when it’s fully loaded.</p>
<p>How much will it cost to run this UPS over its three-year expected life? Let’s use <a href="http://www.pge.com/tariffs/electric.shtml" target="_blank">northern-California electric rates for commercial/general service</a> from Pacific Gas and Electric where I live. PG&amp;E charges about $0.20/kWh in the summer and about $0.15/kWh in the winter. On average, that’s about $0.175/kWh over the course of a year. Waste heat from the 2700W Dell UPS is about 135W (95% efficient) with another 70W or so needed to remove the waste heat through the air-conditioning system (assuming a K factor of 2). Total power required to have the UPS online all the time in case of a power-grid failure is about 200W continuously because the power for the servers is always flowing through the UPS.</p>
<p>Do the math and it works out to about $0.84 per day just to run the UPS, which is $306.60 per year in electricity cost for power-failure insurance. Over the three year rated life of the UPS, you’ll spend another $919.80 to run this UPS continuously—nearly as much as you spent for the initial UPS purchase. The total cost of adding this UPS to your system is about $2400 every three years using a back-of-the-envelope sort of calculation. If the UPS or air-conditioning systems are less efficient than the ones used in this example calculation, then the energy costs will be higher.</p>
<p>Now let’s be crystal clear here. Ron Sartore isn’t professionally disinterested in this story. He’s not exactly objective. His company, AgigA Tech, makes a line of <a href="http://www.agigatech.com/agigaram.php" target="_blank">AGIGARAM</a> DDR2 and DDR3 memory modules that combine DRAM and NAND Flash memory with a controller on board that can independently transfer data back and forth between the module’s DRAM and NAND Flash without going through the server processor. In the case of a power outage, the on-board AGIGARAM controller backs up the data in the DRAM and puts it into the NAND Flash on the memory module using a small amount of standby power supplied by an independent bank of ultra capacitors connected directly to the memory module. Once backed up, the server data is safely stored for 10 years in the NAND Flash even without power. Standard servers and server-management software aren’t designed to exploit the features of this type of server memory that can safely back itself up. So even AgigA Tech’s IT department can’t configure a standard server system using AGIGARAM. At least not yet.</p>
<p>Bottom line, you or your customers may well be spending thousands of dollars for acquiring and powering a server UPS but you will not get the power-failure insurance you expect. You will not get a system that protects your data very well in the event of a power failure, as AgigA Tech has discovered. Instead of reliable backup, equipping a small data center with a UPS can cause hard disk drives to fry should there actually be a power failure—as they did at AgigA Tech—and it costs thousands of dollars extra in UPS costs to allow this to happen. “We paid good money to self-destruct ourselves” said Ron. Even though no data was lost in this example because the drives that failed were installed in RAID arrays, this approach seems like very poor engineering design. Ron has convinced me.</p>
<p><strong>PS:</strong> While writing this blog entry, I received a letter from the IEEE. The envelope prominently featured this statement on the outside: “The BEST project plans include <strong>dependable</strong> backups for ‘out-of-the-blue’ accidents.” Although it might appear that the IEEE was thinking about this very blog entry when it mailed this letter to me, it turns out they’re just trying to sell me accidental death and dismemberment insurance. However, the coincidence is uncanny. In reality, that’s exactly what we’re discussing here—reducing the cost and improving the effectiveness and energy efficiency of accidental death and dismemberment insurance for servers.</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/08/IEEE-Envelope.jpg"><img class="aligncenter size-full wp-image-643" title="IEEE Envelope" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/08/IEEE-Envelope.jpg" alt="" width="560" height="376" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2011/08/17/imagine-no-uninterruptable-power-supplies-i-wonder-if-you-can-a-sad-story-of-six-fried-hard-disk-drives/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Will Compaan’s HotSpot Parallelizer technology take us to the promised land of parallel computing?</title>
		<link>http://low-powerdesign.com/sleibson/2011/07/17/will-compaan%e2%80%99s-hotspot-parallelizer-technology-take-us-to-the-promised-land-of-parallel-computing/</link>
		<comments>http://low-powerdesign.com/sleibson/2011/07/17/will-compaan%e2%80%99s-hotspot-parallelizer-technology-take-us-to-the-promised-land-of-parallel-computing/#comments</comments>
		<pubDate>Sun, 17 Jul 2011 01:31:01 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[Design]]></category>
		<category><![CDATA[Low-Power]]></category>
		<category><![CDATA[Multicore]]></category>
		<category><![CDATA[Compaan]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=621</guid>
		<description><![CDATA[In connection with my just-written blog entry on the massively parallel SpiNNaker project (see below), I want to relate some information about another meeting I had last March at the DATE (Design Automation and Test) conference in Grenoble, France. I &#8230; <a href="http://low-powerdesign.com/sleibson/2011/07/17/will-compaan%e2%80%99s-hotspot-parallelizer-technology-take-us-to-the-promised-land-of-parallel-computing/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>In connection with my just-written blog entry on the massively parallel SpiNNaker project (see below), I want to relate some information about another meeting I had last March at the DATE (Design Automation and Test) conference in Grenoble, France. I met with Compaan (<a href="http://www.compaandesign.com/">www.compaandesign.com</a>) and got a presentation on the company’s HotSpot Parallelization technology.</p>
<p>Here’s how it works. You start with application code written in C. You add pragmas around known code hotspots to switch on Compaan’s HotSpot Parallelizer and to switch it off. You discover these hotspots using regular code-analysis techniques already used for other sorts of software-specific optimizations. So far, nothing new here.</p>
<p>Then you submit the code to the Compaan HotSpot Parallelizer for analysis. The Parallelizer analyzes the code and creates a Kahn Process Network (KPN, <a href="http://en.wikipedia.org/wiki/Kahn_process_networks">http://en.wikipedia.org/wiki/Kahn_process_networks</a>) that consists of many independently executable processes and the communications linkages needed to pass data between these processes. What you then end up with is several independent C programs that can be compiled and run on one processor, run on several processors, run or on some mix of processors and hardware built using a C-to-hardware compiler. Here’s a picture of the process:</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/07/Compaan-HotSpot-Parallelizer.jpg"><img class="aligncenter size-full wp-image-622" title="Compaan HotSpot Parallelizer" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/07/Compaan-HotSpot-Parallelizer.jpg" alt="" width="545" height="369" /></a></p>
<p>The advantage of this approach is that it’s entirely automatic once you mark the hotspots with pragmas. To use this approach, your design will need to consist of deterministic, independent processes. Parallelization consists of creating a Kahn Process Network and then generating C code for the various independent programs. That generated code must include all of the inter-program communications needed to operate the KPN. Inter-program communications take place through FIFOs, which might be real hardware FIFOs or, more likely, FIFOs implemented in shared memory.</p>
<p>You could do this by hand and in a simple system you can do this by hand. In a complex system, you’ll want all the automation you can muster because otherwise the complexity will kill you, your team, and your project.</p>
<p>Once you have the several C programs that constitute the KPN, you can decide where each will execute. Some might execute on the same processor. That’s convenient because the inter-program communications is simple and takes place in the processor’s memory space. However, you’ll get no acceleration by running everything on one processor. In fact, you’ll likely slow things down with the added inter-program communications overhead. So, you might choose a multicore processor. Compaan’s HotSpot Parallelizer would seem to be a fast way to accelerate code execution for multicore designs. You might also wish to take some of the C programs in the KPN and transform them into hardware to maximize the acceleration potential. It’s your choice, based on cost/performance tradeoffs that are familiar to every System Realization team.</p>
<p>Compaan’s got some case histories that are certain to interest you. Just ask them to share.</p>
<p>Now the reason I’m writing about this product in my Low-PowerDesign blog is because you must use parallelization to drop power consumption. Stacking every possible task on one multi-GHz processor is not going to result in low-power operation and we should all know that by now. However, there are naysayers who tremble and say “we don’t know how to code for parallel execution.” Well, Compaan’s HotSpot Parallelizer apparently does.</p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2011/07/17/will-compaan%e2%80%99s-hotspot-parallelizer-technology-take-us-to-the-promised-land-of-parallel-computing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Think Globally, Act in Parallel. What can you do with one million ARM cores acting in parallel and how do you get there?</title>
		<link>http://low-powerdesign.com/sleibson/2011/07/16/think-globally-act-in-parallel-what-can-you-do-with-one-million-arm-cores-acting-in-parallel-and-how-do-you-get-there/</link>
		<comments>http://low-powerdesign.com/sleibson/2011/07/16/think-globally-act-in-parallel-what-can-you-do-with-one-million-arm-cores-acting-in-parallel-and-how-do-you-get-there/#comments</comments>
		<pubDate>Sat, 16 Jul 2011 23:47:06 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[ARM]]></category>
		<category><![CDATA[CMOS]]></category>
		<category><![CDATA[Design]]></category>
		<category><![CDATA[DRAM]]></category>
		<category><![CDATA[Low-Power]]></category>
		<category><![CDATA[Networking]]></category>
		<category><![CDATA[SDRAM]]></category>
		<category><![CDATA[SOC]]></category>
		<category><![CDATA[SRAM]]></category>
		<category><![CDATA[Cortex-M0]]></category>
		<category><![CDATA[Intel]]></category>
		<category><![CDATA[Samsung]]></category>
		<category><![CDATA[SpiNNaker]]></category>
		<category><![CDATA[UMC]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=615</guid>
		<description><![CDATA[Professor Steve Furber’s SpiNNaker project is in the news again. I wrote about Furber’s massively parallel brain-emulation project back on March 30 after listening to his keynote at this year’s DATE (Design Automation and Test Europe) conference in Grenoble, France. &#8230; <a href="http://low-powerdesign.com/sleibson/2011/07/16/think-globally-act-in-parallel-what-can-you-do-with-one-million-arm-cores-acting-in-parallel-and-how-do-you-get-there/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Professor Steve Furber’s SpiNNaker project is in the news again. I wrote about Furber’s massively parallel brain-emulation project back on March 30 after listening to his keynote at this year’s DATE (Design Automation and Test Europe) conference in Grenoble, France. (See “<a href="http://low-powerdesign.com/sleibson/2011/03/30/the-incredible-vanishing-power-of-a-machine-instruction-is-this-the-way-to-the-brain/" target="_blank">The incredible vanishing power of a machine instruction. Is this the way to the brain?</a>”) Furber’s DATE keynote title says it all: “Biologically-inspired massively-parallel architectures—computing beyond a million processors.” Furber and his team are referencing nature to help them tackle the really hard processing problems we need to solve in the future through massively parallel, brain-like computing. Brain-like computing—go slow, go wide, go massively parallel—seems to offer a proven, low-power approach to solving some of these big computational problems.</p>
<p>The SpiNNaker project is again in the news at EETimes Europe (see “<a href="http://www.electronics-eetimes.com/en/a-million-arm-cores-to-host-brain-simulator.html?cmp_id=7&amp;news_id=222908354&amp;vID=209" target="_blank">A million ARM cores to host brain simulator</a>”) and the idea of harnessing one million ARM processor cores is certainly a big idea. It excites me. However, we’re still at the humble beginnings of the project.</p>
<p>The SpiNNaker project’s first test chip harnesses 18 ARM9 cores on one 130nm chip manufactured by UMC in Taiwan. This is a 100M-transistor chip and, like most many-processor SoCs, the SpiNNaker SoC mostly consists of memory. The memory needs to be close to the processors for speed and for low-power consumption and there are 55 32Kbyte SRAM blocks on the SpiNNaker die. That’s 14 million bits of SRAM and, frankly speaking, that’s really not very much SRAM. Eighteen processors isn’t really a large number of processors either when your stated goal is one million.</p>
<p>The ARM processors on the SpiNNaker chip use packet communications to emulate the electrical spike communications that occur among the neurons in human and animal brains. From a hardware perspective, I think it’s easy to conceive of a system-level design like this and even conceptually scaling the design to a million connected ARM9 processors isn’t really hard, as long as you don’t try to enumerate all of the processors in your mind. However, with 18 processors per chip, you’ll need approximately 55,600 chips to build an interconnected network of one million processors. That’s still a mighty big box of hardware. More on that in a bit.</p>
<p>The rub is that we really don’t have many good ideas for programming such a massively parallel system. The SpiNNaker project seems to be mostly a hardware endeavor with the explicitly stated intent of developing a hardware testbed for brain researchers who will use SpiNNaker systems for studying various theories of brain function. Presumably, we’ll learn more about massively parallel programming by working with these systems and no doubt we will. As Furber says in a quote published in the EETimes Europe article, “We don&#8217;t know how the brain works as an information-processing system, and we do need to find out. We hope that our machine will enable significant progress towards achieving this understanding.&#8221;</p>
<p>Each SpiNNaker chip in the current design is bundled with a 166MHz, 1Gbit DDR SDRAM and packaged in a 300-pin BGA package. But we’re not going to be building million-processor testbeds with 18 processors per packaged chip. I’m almost absolutely, positively certain about that. This first SpiNNaker prototype just doesn’t scale to one million processors very easily. So the question is, how to get there?</p>
<p>Well, possible clues to answer that question can be found in two recent blogs that I wrote on the <strong>EDA360 Insider</strong> blog. First, Samsung has just announced successful tapeout of a 20nm test chip incorporating an ARM Cortex-M0 processor core. (See “<a href="http://eda360insider.wordpress.com/2011/07/12/samsung-20nm-test-chip-includes-arm-cortex-m0-processor-core-how-many-will-fit-on-the-head-of-a-pin/" target="_blank">Samsung 20nm test chip includes ARM Cortex-M0 processor core. How many will fit on the head of a pin?</a>”) Now an ARM Cortex-M0 processor is not as powerful as an ARM9 processor, but then it’s not supposed to be. It’s designed for control-oriented applications and its 3-stage execution pipeline isn’t designed to get maximum speed from any given process technology. However, we’re building a system that emulates a brain that operates at a few hundred Hertz (that’s <strong>Hertz</strong>, not kilohertz, megahertz, or gigahertz) so I really don’t think the clock speed is all that critical when you’re talking about a million processors. The ARM Cortex-M0 processor core is still a 32-bit RISC processor and I am guessing with a high degree of confidence that it’s fully up to the task of executing the required electrical-spike calculations, albeit not quite as quickly as an ARM9 processor.</p>
<p>What’s interesting about a 12-to-14Kgate ARM Cortex-M0 processor implemented in 20nm process technology is that my calculations suggest that more than half a million ARM Cortex-M0 processors would fit on a chip the size of an Intel “Tukwila” Itanium processor (OK, that’s a big chip, but it’s a commercial one) and that calculation is based on the published number for the area required by an ARM Cortex-M0 implemented in 90nm process technology, not 20nm. Now there’s a lot of slop in this calculation. First, there’s the disparity of using 90nm numbers instead of 20nm numbers. Then there’s the disparity caused by putting no memory at all into the calculation. I just mentally tiled processors edge to edge. Ditto, there’s no on-chip interconnect.</p>
<p>So you probably won’t get half a million ARM Cortex-M0 processor cores on one 20nm chip. But you might get 100,000 or 200,000 ARM Cortex-M0 processor cores on a chip along with an interesting amount of memory and the required interconnect. Now we’re talking about only a handful of chips to get to one million processors. We’re talking about a tabletop box. Now we’re getting into the realm of the feasible for million-processor systems.</p>
<p>The second related blog entry I recently wrote in <strong>EDA360 Insider</strong> that also bears on this very interesting endeavor was about an announcement from Imec, a global research company. Just days ago, Imec announced that it and its partners successfully assembled a custom logic chip with two DRAMs in a stacked 3D configuration. (See “<a href="http://eda360insider.wordpress.com/2011/07/14/3d-thursday-imec-prototypes-3d-chip-stack-finds-some-thermal-surprises/" target="_blank">3D Thursday: IMEC prototypes 3D chip stack, finds some thermal surprises</a>”.) This 3D stacked-chip prototype allowed Imec to test out some process ideas for manufacturing 3D stacked chip assemblies and to make some critical thermal tests to verify thermal models that will be so necessary when 3D assembly goes mass market. The 3D chip stack uses copper-tin micro-bumps and compression bonding for the electrical and mechanical assembly of the chip stack and you can see photos of the assembled stack below.</p>
<p>Here’s a photo of the overall chip stack:</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/07/Imec-3D-Chip.bmp"><img class="aligncenter size-full wp-image-616" title="Imec 3D Chip" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/07/Imec-3D-Chip.bmp" alt="" /></a></p>
<p>And here’s a close-up of the edge of the chip stack to show the three stacked die.</p>
<p><a href="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/07/Imec-3D-Chip-Closeup.bmp"><img class="aligncenter size-full wp-image-617" title="Imec 3D Chip Closeup" src="http://low-powerdesign.com/sleibson/wp-content/uploads/2011/07/Imec-3D-Chip-Closeup.bmp" alt="" /></a></p>
<p>The 3D Stack’s base chip is approximately 750µm thick. The two top components in the chip stack are each 25µm thick. There’s more technical info in the referenced <strong>EDA360 Insider</strong> blog.</p>
<p>I am convinced that 3D stacking of logic and RAM chips will be absolutely essential to developing massively parallel, low-power systems like the ones envisioned by the SpiNNaker project. First, the only way to feed data and instructions to massively parallel processing chips is through large amounts of on-chip memory and through high-bandwidth, low-energy channels connected to large off-chip memories. 3D assembly techniques permit both Wide I/O and high-speed serial I/O channels to work most effectively and at minimal energy levels and I expect to see rapid adoption of 3D assembly—even and perhaps especially in high-volume, cost-sensitive applications such as mobile phone handsets—in the next few years. This is precisely the sort of manufacturing technology we require to think seriously about million-processor systems.</p>
<p>Now all we need to do is figure out how to program them.</p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2011/07/16/think-globally-act-in-parallel-what-can-you-do-with-one-million-arm-cores-acting-in-parallel-and-how-do-you-get-there/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Need to cut IP power? (Who doesn’t?) “Press here” says Calypto</title>
		<link>http://low-powerdesign.com/sleibson/2011/06/12/need-to-cut-ip-power-who-doesn%e2%80%99t-%e2%80%9cpress-here%e2%80%9d-says-calypto/</link>
		<comments>http://low-powerdesign.com/sleibson/2011/06/12/need-to-cut-ip-power-who-doesn%e2%80%99t-%e2%80%9cpress-here%e2%80%9d-says-calypto/#comments</comments>
		<pubDate>Sun, 12 Jun 2011 20:29:33 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[Clock Gating]]></category>
		<category><![CDATA[CMOS]]></category>
		<category><![CDATA[Design]]></category>
		<category><![CDATA[EDA]]></category>
		<category><![CDATA[Low-Power]]></category>
		<category><![CDATA[SRAM]]></category>
		<category><![CDATA[Calypto]]></category>
		<category><![CDATA[PowerPro]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=599</guid>
		<description><![CDATA[All SoCs are built with IP blocks. Some of those are legacy IP blocks. Some are purchased from other vendors. Some are developed in-house. All of them draw power—static and dynamic power. At nanometer lithographies, the way to cut static &#8230; <a href="http://low-powerdesign.com/sleibson/2011/06/12/need-to-cut-ip-power-who-doesn%e2%80%99t-%e2%80%9cpress-here%e2%80%9d-says-calypto/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>All SoCs are built with IP blocks. Some of those are legacy IP blocks. Some are purchased from other vendors. Some are developed in-house. All of them draw power—static and dynamic power. At nanometer lithographies, the way to cut static power is through circuit tricks like high-Vt transistors and by powering down entire blocks when not needed. The way to cut dynamic power within an IP block is to stop clocking anything that doesn’t need to be clocked. Designers can gate clocks during the development of an IP design but what about existing IP blocks? Some can be retrofitted with clock gating but the ease of that exercise depends on how familiar the IP designer is with that IP block and how well documented the block is.</p>
<p> </p>
<p>Face it, <span style="text-decoration: line-through;">some</span> most IP blocks aren’t that well documented. You may never know enough about the internals of a purchased IP block to fiddle with its clocking. Legacy IP blocks may have been long abandoned by their designers who have gone off to other tasks, other companies, other planes of existence. Even a block you’ve designed yourself may have scrolled off your own internal memory window long ago.</p>
<p> </p>
<p>Designers everywhere have a common solution for these sorts of problems. “Give me a tool to do this” they demand from EDA vendors. “I just want to push the button.”</p>
<p> </p>
<p>Usually, that’s easier said than done. Calypto’s got a tool you can try however. It’s called PowerPro and comes in two flavors: CG and MG. The CG flavor is based on the company’s SLEC sequential logic equivalency checker. That’s a tool that checks to see if modified IP block “A prime” works the same as original IP block “A.” It’s a general-purpose EDA tool with a variety of uses and one of those uses is for comparing an IP block’s function before and after clock gating.</p>
<p> </p>
<p>Calypto’s PowerPro CG encapsulates the SLEC EDA tool to produce a “done for you” tool that can automatically insert clock gating into an IP design. It also checks to make sure the IP block’s behavior doesn’t change as a result of the added clock gating. Usually the insertion process takes 4 to 8 hours according to Calypto CEO Doug Aitelli who spoke to me about the product at DAC 2011 in San Diego. What do you get for this overnight run? Usually 10% to 30% reduction in dynamic power said Aitelli. Sometimes as much as 60%. Not bad for “pushing the button” I’d say.</p>
<p> </p>
<p>There’s another flavor of PowerPro called PowerPro MG. Nope, not named for a cute little British sports car, “MG” stands for “memory gating.” We tend to forget that today’s SoCs are more than half memory measured by die area. Usually SRAM. We sort of allude to this fact when we talk about MPSoCs—multiple processor SoCs. With each of those processors comes a boatload of on-chip SRAM for fast execution. However, we don’t seem to explicitly call out the memory. We tend to ignore it. I guess MMSoC—mostly memory SoC—doesn’t have the same cachet as MPSoC in our processor-centric world.</p>
<p> </p>
<p>However, if more than half of an SoC is SRAM, it makes sense to pay some attention to reducing the power consumption of an SoC’s on-chip SRAM blocks. That’s what Calypto’s PowerPro MG does. It can automatically add memory gating to an SoC design by evaluating the design’s behavior across many cycles.</p>
<p> </p>
<p>It also goes a step further. Many SRAM blocks for SoCs now have a sleep mode where the memory’s operating power can be reduced by shutting down peripheral circuitry such as address decoders and sense amps while keeping the memory storage array alive. According to Calypto’s Aitelli, most SoC designers find these sleep modes too hard to use, so they simply don’t use them. They don’t have the time. But those sleep modes are still there just waiting to be used. PowerPro MG will add the necessary sleep/wake-up state machine to exploit this little-used memory feature. Push the button, save power.</p>
<p> </p>
<p>Just a story from a chance meeting at DAC. Par for the course. There’s always something new to learn, something new to try.</p>
<p> </p>
<p>To read my blog on the Low-Power Report Card Panel at DAC, click <a href="http://j.mp/kwX9HI" target="_blank">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2011/06/12/need-to-cut-ip-power-who-doesn%e2%80%99t-%e2%80%9cpress-here%e2%80%9d-says-calypto/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The incredible vanishing power of a machine instruction. Is this the way to the brain?</title>
		<link>http://low-powerdesign.com/sleibson/2011/03/30/the-incredible-vanishing-power-of-a-machine-instruction-is-this-the-way-to-the-brain/</link>
		<comments>http://low-powerdesign.com/sleibson/2011/03/30/the-incredible-vanishing-power-of-a-machine-instruction-is-this-the-way-to-the-brain/#comments</comments>
		<pubDate>Wed, 30 Mar 2011 03:25:56 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[Design]]></category>
		<category><![CDATA[Low-Power]]></category>
		<category><![CDATA[Networking]]></category>
		<category><![CDATA[SOC]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=525</guid>
		<description><![CDATA[I attended DATE (Design and Test Europe) this month in Grenoble and was fascinated by Steve Furber’s keynote titled “Biologically-inspired massively-parallel architectures—computing beyond a million processors.” Furber’s introductory remarks really clarify what’s been happening to the energy cost per instruction &#8230; <a href="http://low-powerdesign.com/sleibson/2011/03/30/the-incredible-vanishing-power-of-a-machine-instruction-is-this-the-way-to-the-brain/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I attended DATE (Design and Test Europe) this month in Grenoble and was fascinated by Steve Furber’s keynote titled “Biologically-inspired massively-parallel architectures—computing beyond a million processors.” Furber’s introductory remarks really clarify what’s been happening to the energy cost per instruction executed over the past 60 years—and what’s likely to happen in the future. Strike that—make it “what’s got to happen.” Just in case you didn’t know, Furber was the principal designer of the original ARM processor back when “ARM” stood for “Acorn RISC Machine.” Acorn was a leading UK personal computer maker and in the early 1980s, it decided it needed its own microprocessor. The rest, as they say, is history. Acorn is gone. ARM is here, big time.</p>
<p>But back to Furber. Today, he’s the ICL Professor of Computer Engineering at the School of Computer Science, Manchester University, UK and his CV sports a long list of impressive achievements. Let’s just say he’s been busy since leaving ARM. These days, he and his group at Manchester University are developing digital ways to emulate organic brain functions. In essence, his group is developing digital analogs of neural networks. Now electronic neural networks aren’t something new. I can remember discussing them when I was a college freshman. That was 1971. Not new. Not recent.</p>
<p>The Manchester University team is developing an SoC with a “massively” parallel network of eighteen ARM 968 RISC processors all mutually interconnected through a Silistix self-clocked network on chip (NoC). Furber had a hand in the early development of this NoC, also at Manchester University. (See, he’s been busy, like I said.) The project is called SpiNNaker. (<a href="http://apt.cs.man.ac.uk/projects/SpiNNaker/" target="_blank">http://apt.cs.man.ac.uk/projects/SpiNNaker/</a>)</p>
<p>Now there’s a reason for repeatedly emphasizing Furber’s connections to Manchester University and he discussed it in his keynote. Any serious discussion of the history of computing must include the Manchester University Mark I “Baby,” which was the first fully programmable, stored-program digital computer to go online. Baby executed its first program in 1948. ENIAC, developed at the Moore School of Electrical Engineering at the University of Pennsylvania and usually called the first fully electronic computer, was operational two years before the Manchester Baby. But ENIAC was physically programmed with wires—at least initially. Eventually, ENIAC was retrofitted with some programmability but the Manchester Baby was first.</p>
<p>When operational, the Manchester Baby computer executed roughly 800 instructions per second. That was a heck of a lot faster than the mechanical calculators and punched-card equipment of the day but it’s laughably slow when compared to today’s processors. (Even the Intel 4004, the world’s first commercial microprocessor introduced in 1971, executed 108,000 instructions/second.) More to the point for the purposes of this blog, the Manchester Baby consumed approximately 5 Joules of energy to execute each instruction.</p>
<p>Fast forward to today and those ARM 968 microprocessors in the SpiNNaker chip. An ARM 968 processor executes roughly 20 million instructions per second, dissipating 10^-10 Joules per instruction. In other words, the per-instruction energy consumption needed to execute a machine instruction has improved by a factor of about 50 billion in 60 years.</p>
<p>Now the old, worn comparison usually asks you to consider what the world would be like today if automobile manufacturers had improved the energy consumption of their products by a factor of 50 billion in 60 years. That’s not the point here.</p>
<p>Furber’s point is this: if the energy cost per instruction had not improved by such a huge amount since 1948, this world would be a very different place. There would be no cell phones, no iPads, no personal computers, no personal music players, and very few embedded systems of any sort. These would simply be impractical for reasons of all three “P”s: price, performance, and power.</p>
<p>We have relied almost exclusively on Moore’s Law to get to this point.</p>
<p>That ride’s over.</p>
<p>At today’s bleeding-edge IC fabrication process lithographies, 28nm, we’re imaging individual atoms. Layers are a handful of atoms thick. The number of atoms in a transistor is so shockingly few that dopant atoms no longer operate statistically. The resulting on-chip parametric variability is becoming a very real problem that forces physical designers to use bigger and bigger guard bands on design rules. Speed and power gains are slowing from IC generation to generation. We have arrived at the point of rapidly diminishing returns and we’re clearly not getting another factor of 50 billion improvement in the power needed to execute a machine instruction from here on.</p>
<p>Yet the guidepost pointing to lower power operation is frustratingly close and familiar. It sits between your ears. We have chosen to design processors that execute one (or perhaps a few) instructions at one time, but at a very high execution rate. The higher the better. The brain is designed with an entirely different approach. It’s a highly parallel machine where “parallel” means a lot more than 18 processors. The brain contains approximately 10^11 neurons with 10^15 synapses. The neurons are the brain’s processors and the synapse connectivity is the brain’s memory and programming.</p>
<p>Neurons are very simple and very slow processors, but there are a lot of them working in parallel.</p>
<p>The entire brain human operates at roughly 100W—about the power consumption of a PC processor—but the brain runs at 100Hz. Although we can certainly get a lot of processing done with 100W, it’s not a drop in the bucket compared to the brain’s audio and visual processing abilities, let alone its ability for abstract thought. And we can’t get anything done at 100Hz. Our programming models cannot currently accommodate brain-style processing. We do not yet understand parallelism on the brain’s scale.</p>
<p>In addition, our processing systems are remarkably intolerant of failure. Microprocessors represent single-point failure nodes in most embedded designs with a few exceptions such as majority-voting avionics systems where single-point failure usually means death, so we go “massively” parallel with three processors.</p>
<p>The brain however is very tolerant of failure. Our brains lose neurons all the time. In fact, some of us hurry that process a bit by regularly drinking alcohol and killing off a few extra neurons a day. So what? When you’ve got 10^11 neurons, you’re not going to miss a few and of course the brain doesn’t.</p>
<p>The goal of the SpiNNaker project is to create an early parallel platform that will allow brain researchers to study the operation of a machine that can digitally emulate mechanisms that the brain uses to process a wide range of sensory data, to control an incredibly complex system of muscles and organs, to deal with the complex issues of written and spoken language, and to make huge leaps in abstract thought. SpiNNaker will not produce a leap by a factor of 50 billion, but perhaps it will get us going on the right path, now that we’ve managed to come this far.</p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2011/03/30/the-incredible-vanishing-power-of-a-machine-instruction-is-this-the-way-to-the-brain/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>18th Annual Electronic Design Process Symposium brings together the top thinkers of the EDA world, April 7-8, Monterey, CA</title>
		<link>http://low-powerdesign.com/sleibson/2011/02/25/18th-annual-electronic-design-process-symposium-brings-together-the-top-thinkers-of-the-eda-world-april-7-8-monterey-ca/</link>
		<comments>http://low-powerdesign.com/sleibson/2011/02/25/18th-annual-electronic-design-process-symposium-brings-together-the-top-thinkers-of-the-eda-world-april-7-8-monterey-ca/#comments</comments>
		<pubDate>Fri, 25 Feb 2011 07:30:13 +0000</pubDate>
		<dc:creator>sleibson321</dc:creator>
				<category><![CDATA[Design]]></category>
		<category><![CDATA[EDA]]></category>
		<category><![CDATA[Low-Power]]></category>
		<category><![CDATA[SOC]]></category>

		<guid isPermaLink="false">http://low-powerdesign.com/sleibson/?p=497</guid>
		<description><![CDATA[EDPS (The Electronic Design Process Symposium) provides an exchange of ideas among the top thinkers, movers, and shakers who focus on how chips and systems are designed in the electronics industry. Attendees of this elite workshop have met each year &#8230; <a href="http://low-powerdesign.com/sleibson/2011/02/25/18th-annual-electronic-design-process-symposium-brings-together-the-top-thinkers-of-the-eda-world-april-7-8-monterey-ca/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>EDPS (The Electronic Design Process Symposium) provides an exchange of ideas among the top thinkers, movers, and shakers who focus on how chips and systems are designed in the electronics industry. Attendees of this elite workshop have met each year in Sand City (Monterey, CA) since 1993. It has attracted some of the most far-seeing people in electronics as speakers. (Plus me.) It’s a forum for the Design community to discuss EDA’s state-of-the-art with an eye towards improving electronics design processes and EDA/CAD methodologies, rather focusing on individual tools themselves.</p>
<p><strong>Schedule:</strong></p>
<p><strong>Thursday, April 7, 2011 </strong></p>
<ul>
<li><strong>8:30 AM:</strong> Check-ins and On-site Registration</li>
<li><strong>9:00 AM:</strong> Morning Keynote Speaker</li>
<li><strong>10:00 AM:</strong> Parallel EDA</li>
<li><strong>Noon:</strong> Keynote Speaker</li>
<li><strong>1:30 PM:</strong> High-Level Design</li>
<li><strong>4:30 PM:</strong> Cloud Computing</li>
</ul>
<p><strong>Friday, April 8, 2011</strong></p>
<ul>
<li><strong>9:00 AM:</strong> Low-Power Design</li>
<li><strong>11:30 AM:</strong> 3D ICs</li>
</ul>
<p>This is an inexpensive conclave with an emphasis on discussion and networking. It’s only $280 for IEEE Members; $100 for unemployed IEEE members. Save $50 through March 18th.</p>
<p>Full information and registration: <a href="www.eda.org/edps" target="_blank">www.eda.org/edps</a></p>
]]></content:encoded>
			<wfw:commentRss>http://low-powerdesign.com/sleibson/2011/02/25/18th-annual-electronic-design-process-symposium-brings-together-the-top-thinkers-of-the-eda-world-april-7-8-monterey-ca/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.580 seconds -->

