<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Antipatter &#187; architecture</title>
	<atom:link href="http://antipatter.com/tag/architecture/feed/" rel="self" type="application/rss+xml" />
	<link>http://antipatter.com</link>
	<description>The Web, The Business, The Smoke and Mirrors</description>
	<lastBuildDate>Tue, 15 Nov 2011 15:34:53 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Single Point of Failure</title>
		<link>http://antipatter.com/2008/07/single-point-of-failure/</link>
		<comments>http://antipatter.com/2008/07/single-point-of-failure/#comments</comments>
		<pubDate>Mon, 21 Jul 2008 18:53:16 +0000</pubDate>
		<dc:creator>loren</dc:creator>
				<category><![CDATA[commentary]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[management]]></category>
		<category><![CDATA[tech policy]]></category>

		<guid isPermaLink="false">http://antipatter.com/?p=14</guid>
		<description><![CDATA[
If you follow tech news, you may have heard about how badly the city of San Francisco screwed up with their network administrator.  Although this is an unusually spectacular blowup, the conditions that existed to create this situation are, sadly, replicated throughout the I.T. world.  This is about an animal called the Bus Test.  In [...]
]]></description>
			<content:encoded><![CDATA[<p>If you follow tech news, you may have heard about how badly the city of San Francisco <a title="SF Network Lockout" href="http://www.infoworld.com/article/08/07/18/30FE-sf-network-lockout_1.html" target="_blank">screwed up with their network administrator</a>.  Although this is an unusually spectacular blowup, the conditions that existed to create this situation are, sadly, replicated throughout the I.T. world.  This is about an animal called the <a title="Bus Test" href="http://www.isp-planet.com/business/bus-test.html" target="_blank">Bus Test</a>.  In essence, the Bus Test idea is that the overall system should survive if any one person who is closest to it is hit by a bus.  Or disappears.  Or goes rogue.</p>
<p>In system design, be it a network, application or any other piece of automated infrastructure, we eschew <em>single points of failure</em>.  We know that in the real world, things go wrong, and consequently we design systems that have redundancy built in.  If something fails, the system can transfer operations over to a redundant subsystem, and keep on going.  That&#8217;s why, for example, websites have back-up load balancers or data servers.  These apparently redundant elements are there to keep the system online if the primary subsystem breaks.</p>
<p>Unfortunately, in the I.T. world, it is common to neglect that <strong>the human operators of automated systems are also effectively part of that system</strong>.  They can also constitute a single point of failure, and we should be avoiding this problem with humans as well as machines, for the same reasons.</p>
<p><strong>No single operator (be they employee or principal) should ever control exclusive passwords or knowledge about a critical system.</strong> To do so makes the system fragile, and sets it up for the kind of snafus that are currently plaguing San Francisco.</p>
<p>In the linked article, it&#8217;s described how the network administrator in question was unwilling to allow anyone else to work with the network because he felt they were incompetent, and the configuration for the network was extremely complicated.  I can&#8217;t help thinking that centralizing control over the network was a band-aid on a tumor, however.  Essentially the city was running a network that was too complicated for them to staff properly, and was relying on a bad management decision in order to cover for it.</p>
<p>Essentially this is trading risk for cost.  However, it raises the question about whether it was acceptable risk and whether the decision was made consciously, or if it just happened through the ignorance of the network administrator&#8217;s supervisors.  I&#8217;m betting the latter.   This was sweeping the problem under the carpet, and the city of San Francisco is now paying the cost.  Bad policy.</p>
<div class="acc_license"></div><!---->]]></content:encoded>
			<wfw:commentRss>http://antipatter.com/2008/07/single-point-of-failure/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

