<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>VM /ETC &#187; fail over</title>
	<atom:link href="http://vmetc.com/category/fail-over/feed/" rel="self" type="application/rss+xml" />
	<link>http://vmetc.com</link>
	<description>Go Green with Virtualization. Go UGLY Green with vmetc.com.</description>
	<lastBuildDate>Tue, 27 Jul 2010 12:17:30 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Simply Automating Virtual Machine IP Addressing For Disaster Recovery Sites (without scripting)</title>
		<link>http://vmetc.com/2010/01/12/simply-automating-virtual-machine-ip-addressing-for-disaster-recovery-sites-without-scripting/</link>
		<comments>http://vmetc.com/2010/01/12/simply-automating-virtual-machine-ip-addressing-for-disaster-recovery-sites-without-scripting/#comments</comments>
		<pubDate>Wed, 13 Jan 2010 00:51:10 +0000</pubDate>
		<dc:creator>Rich Brambley</dc:creator>
				<category><![CDATA[dr]]></category>
		<category><![CDATA[esx]]></category>
		<category><![CDATA[fail over]]></category>
		<category><![CDATA[how to]]></category>
		<category><![CDATA[vSphere]]></category>
		<category><![CDATA[vizioncore]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[replication]]></category>
		<category><![CDATA[virtualization]]></category>
		<category><![CDATA[vreplicator]]></category>

		<guid isPermaLink="false">http://vmetc.com/2010/01/12/simply-automating-virtual-machine-ip-addressing-for-disaster-recovery-sites-without-scripting/</guid>
		<description><![CDATA[If you are looking at various options to automate virtual machine (VM) ip address reconfiguration when failing over virtual machines to a disaster recovery (DR) site, this post explains an option so simple it is beautiful. To give full credit, the Vizioncore vReplicator 2.5 Best Practices document enlightened me to the strategy of using a [...]]]></description>
			<content:encoded><![CDATA[<p>If you are looking at various <strong>options to automate virtual machine (VM) ip address reconfiguration when failing over virtual machines to a disaster recovery (DR) site, </strong>this post explains an option so simple it is beautiful. To give full credit, the <a href="http://www.vizioncore.com/products/vReplicator/documents/vReplicator-Best-Practices-v1.20.pdf" target="_blank">Vizioncore vReplicator 2.5 Best Practices</a> document enlightened me to the strategy of <strong>using a local only VMware vSwitch and an extra virtual NIC (vNIC) in each VM.</strong> It’s been a long time since I had a “<a href="http://vmetc.com/2008/05/07/virtual-security-solutions/" target="_blank">ton</a> <a href="http://vmetc.com/2007/09/13/replicate-your-vmfs-partitions-netapp/" target="_blank">of</a> <a href="http://vmetc.com/2007/09/10/inside-vmware-consolidated-backup-perspectives-from-the-field-par303/" target="_blank">bricks</a>” moment, but this concept crashed down on me with the realization of a configuration that works in any version of ESX, doesn’t require extra software or hardware, and better yet, doesn’t have to be scripted! Just configure some extra virtual networking and forget about it!</p>
<p>Here is a general outline for automating the DR ip addressing with this method:</p>
<p><strong><span style="text-decoration: underline;">At the Primary Site</span></strong></p>
<ul>
<li>For these instructions assume the production vSwitch at the primary site has a Portgroup named VM Network</li>
<li>Build a new vSwitch and do not attach any physical NICs (local only isolated switch). Create a Portgroup named DR Network</li>
<li>For each VM you need to fail over to a DR site, add an extra vNIC and attach it to the DR Network Portgroup</li>
</ul>
<p><strong><span style="text-decoration: underline;">At the DR Site</span></strong></p>
<ul>
<li>Create your DR site production vSwitch, attach physical NICs and add a Portgroup named DR Network.</li>
<li>Create another vSwitch and do not attach any physical NICs (local only isolated switch). Create a Portgroup named VM Network</li>
</ul>
<p>All you have to do for this to work is</p>
<p><span id="more-5272"></span><center><p><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "8919425963";
google_ad_width = 468;
google_ad_height = 15;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</p></center></p>
<p>configure the extra vNIC in each VM at the primary site with the correct VLAN and ip address of the DR site production network. This ip address configuration will be cloned in the replicated DR VM, and when it’s time to power on the VM at the DR site both vNICs will connect to their respective virtual networks. The DR Network Portgroup will now be on the vSwitch with the physical connectivity to the DR production network and the VM Network vNIC will be isolated to a local only vSwitch.</p>
<p>Vizioncore deserves the credit for showing me the idea, but there’s no reason why this won’t work with any VM replication or VM backup product. It’s not automated DR fail over workflow like VMware SRM, but using this extra virtual networking strategy eliminates some manual or even scripted ip re-addressing.</p>
<p>Simple!</p>
<p><center><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "2700754787";
google_ad_width = 336;
google_ad_height = 280;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</center></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fvmetc.com%2F2010%2F01%2F12%2Fsimply-automating-virtual-machine-ip-addressing-for-disaster-recovery-sites-without-scripting%2F';
  addthis_title  = 'Simply+Automating+Virtual+Machine+IP+Addressing+For+Disaster+Recovery+Sites+%28without+scripting%29';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://vmetc.com/2010/01/12/simply-automating-virtual-machine-ip-addressing-for-disaster-recovery-sites-without-scripting/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>VMware SRM 4.0 Released &#8211; Supports vSphere 4.0, NFS, vCenter Linked Mode</title>
		<link>http://vmetc.com/2009/10/05/vmware-srm-4-0-released-supports-vsphere-4-0-nfs-vcenter-linked-mode/</link>
		<comments>http://vmetc.com/2009/10/05/vmware-srm-4-0-released-supports-vsphere-4-0-nfs-vcenter-linked-mode/#comments</comments>
		<pubDate>Mon, 05 Oct 2009 05:01:35 +0000</pubDate>
		<dc:creator>Rich Brambley</dc:creator>
				<category><![CDATA[fail over]]></category>
		<category><![CDATA[srm]]></category>
		<category><![CDATA[vSphere]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[NFS]]></category>
		<category><![CDATA[vcenter]]></category>

		<guid isPermaLink="false">http://vmetc.com/?p=4830</guid>
		<description><![CDATA[VMware has announced an upgrade release of Site Recovery Manager (SRM).&#160; Available today, SRM version 4 not only adds the much anticipated compatibility for vSphere 4 but also provides support for NFS storage, allows multi instance replication between single site pairs, and can be managed in vCenter Linked Mode. More details on the new version [...]]]></description>
			<content:encoded><![CDATA[<p>VMware has announced an upgrade release of Site Recovery Manager (<a href="http://www.vmware.com/products/srm/" target="_blank">SRM</a>).&nbsp; <strong>Available today, SRM version 4 not only adds the much anticipated compatibility for vSphere 4 but also provides support for NFS storage, allows multi instance replication between single site pairs, and can be managed in vCenter Linked Mode.</strong> More details on the new version follow in the paragraphs below.</p>
<p>VMware customers who currently own SRM with active Sales and Support (SnS) contracts will receive the upgrade at no additional cost. VMware has not changed the per processor licensing model or cost for new customers wishing to purchase.</p>
<p>Some quick links for SRM 4:
<ul>	
<li>SRM 4. Release notes &#8211; <a href="http://www.vmware.com/support/srm/srm_releasenotes_4_0.html" target="_blank">http://www.vmware.com/support/srm/srm_releasenotes_4_0.html</a></li>
<p>	
<li>Upgrade KB article &#8211; <a href="http://kb.vmware.com/kb/1013166" target="_blank">http://kb.vmware.com/kb/1013166</a></li>
<p>	
<li>SRM download &#8211; <a href="http://downloads.vmware.com/d/" target="_blank">http://downloads.vmware.com/d/</a> and select Site Recovery Manger</li>
<p>	
<li>Michael White&#8217;s post <a href="http://blogs.vmware.com/uptime/2009/10/srm-40-is-here-the-wait-for-vsphere-and-nfs-support-is-over.html" target="_blank">SRM 4.0 is here!  The wait for vSphere and NFS support is over!</a> on the VMware Uptime Blog covers upgrade strategy options.</li>
<li>VMware&#8217;s documentation for all version of SRM &#8211; <a target="_blank" href="http://www.vmware.com/support/pubs/srm_pubs.html">http://www.vmware.com/support/pubs/srm_pubs.html</a></li>
<p></ul>
<p><span style="text-decoration: underline;"><strong>vSphere 4.0 Supported</strong></span></p>
<p><span style="text-decoration: underline;"><strong></strong></span>The main requirement to enable the new compatibility and features in SRM is upgrading vCenter to version 4. In fact, <span id="more-4830"></span></p>
<p><center><p><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "8919425963";
google_ad_width = 468;
google_ad_height = 15;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</p></center><br />after upgrading vCenter, SRM 4 can manage fail over workflow of ESX 3.0.3 to the current 4.x host versions. When storage replication is established between vSphere 4 site pairs, new vSphere features such as Fault Tolernance (FT) for VMs and Distributed Switches (vDS) are also understood and supported by SRM. FT must be manually recreated once a VM is failed over to the secondary site, but the inclusion of a production VM with FT configured in DR site workflow planning is a critical.<br /><span style="text-decoration: underline;"><strong><br />Multi Instance Replication From a Single Site Pairing</strong></span></p>
<p>SRM 4 can manage the fail over of multiple VMware Datacenters configured from a single pairing of a primary vCenter instance and a secondary vCenter instance at a recovery site (DR site). The key to this configuration is the dependency on and the location of the 2 vCenter 4 servers. Although I do not recall it being specifically stated in the product pre release call I participated in, technically speaking, it seems as if VI storage at multiple physical locations could be replicated to multiple DR sites if all the sites involved were managed by only the 2 vCenters. This implies vCenter management, and the necessary Site Recovery Agents (SRA), would be remote to some of the shared storage replication taking place. Since the storage replication is configured independent of vCenter and SRM, SRA connectivity and SRM workflow automation via the SRM vCenter plugins would be possible.</p>
<p><span style="text-decoration: underline;"><strong>NFS Support</strong></span></p>
<p>New SRAs are being released from the storage vendors for automating fail over between sites with NFS storage replication in place. Although not all vendors currently on the SRM HCL will have NFS SRAs available today, we were told most of the missing vendor agents will be available immediately. Each vendor has to go through a separate qualification process for the new storage protocol.</p>
<p>My notes about NFS included a bullet about SRM 4 now being able to support up to 1000 VMs. This is up from a 500 VM maximum previously.&nbsp; This is per VMware Datacenter in vCenter. Multi VMware datacenter pairings would be the method to go beyond the 1000 max. I&#8217;m not sure if this is only for NFS storage, or if the maximum number of VMs applies to all supported SRM storage protocols.</p>
<p><span style="text-decoration: underline;"><strong>vCenter Linked Mode Supported</strong></span></p>
<p><a href="http://pubs.vmware.com/vsp40_e/admin/wwhelp/wwhimpl/common/html/wwhelp.htm#href=c_using_vcenter_server_in_linked_mode.html&amp;single=true" target="_blank">vCenter Linked Mode</a> enables an administrator to manage objects from multiple instances of vCenter in a single VI Client (VIC) session. With SRM now supporting Linked Mode, VI administrators can more easily monitor and manage SRM at both the primary and secondary sites. Note that SRAs must be installed separately on both instances of vCenter, but the SRM plugin can be installed once in a administrator&#8217;s VIC connecting to a Linked Mode vCenter.</p>
<p><span style="text-decoration: underline;"><strong>Other Notes on SRM 4</strong></span></p>
<p>Probably the largest feature of interest which SRM users have been<br />waiting for has still not been implemented in this latest release &#8211; an<br />automated fail back method. The process still requires manual reconfiguration effectively reversing the primary and secondary sites after storage device replication is established (in the reverse direction as well).</p>
<p>If you are interested in possible long term futures and the roadmap for SRM, check out the session titled BC3421<br /><a href="http://www.vmworld.com/docs/DOC-3498">SRM Architecture &amp; Features: The Road Ahead</a> from VMworld 2009 recently in San Francisco. PDF of the session available at <a href="http://www.vmworld.com/docs/DOC-3498" target="_blank">http://www.vmworld.com/docs/DOC-3498</a>.<center><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "2700754787";
google_ad_width = 336;
google_ad_height = 280;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</center></p>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" alt="" src="http://img.zemanta.com/pixy.gif?x-id=1f59694e-a865-8050-a332-1ef7f99c1c46" /></div>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fvmetc.com%2F2009%2F10%2F05%2Fvmware-srm-4-0-released-supports-vsphere-4-0-nfs-vcenter-linked-mode%2F';
  addthis_title  = 'VMware+SRM+4.0+Released+%26%238211%3B+Supports+vSphere+4.0%2C+NFS%2C+vCenter+Linked+Mode';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://vmetc.com/2009/10/05/vmware-srm-4-0-released-supports-vsphere-4-0-nfs-vcenter-linked-mode/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Implementing VMware SRM: Pay Attention to that Man Behind the Curtain</title>
		<link>http://vmetc.com/2009/02/15/implementing-vmware-srm-pay-attention-to-that-man-behind-the-curtain/</link>
		<comments>http://vmetc.com/2009/02/15/implementing-vmware-srm-pay-attention-to-that-man-behind-the-curtain/#comments</comments>
		<pubDate>Sun, 15 Feb 2009 15:07:45 +0000</pubDate>
		<dc:creator>Rich Brambley</dc:creator>
				<category><![CDATA[fail over]]></category>
		<category><![CDATA[gestaltit]]></category>
		<category><![CDATA[srm]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[disaster recovery]]></category>
		<category><![CDATA[disasterrecovery]]></category>
		<category><![CDATA[site recovery manager]]></category>

		<guid isPermaLink="false">http://vmetc.com/?p=3339</guid>
		<description><![CDATA[Now that I&#8217;ve sat the VMware Site Recovery Manager (SRM) class, done the labs, and had some design and implementation time with the product I am reminded of a scene from the movie The Wizard of Oz. &#8220;Pay no attention to that man behind the curtain&#8221; is a famous line from the movie which comes [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://vmetc.com/wp-content/uploads/2009/02/021509-1413-implementin1.png" alt="" align="right" />Now that I&#8217;ve sat the VMware <a href="http://vmware.com/products/srm/" target="_blank">Site Recovery Manager</a> (SRM) class, done the labs, and had some design and implementation time with the product I am reminded of a scene from the movie <a href="http://www.imdb.com/title/tt0032138/" target="_blank">The Wizard of Oz</a>. &#8220;Pay no attention to that man behind the curtain&#8221; is a famous line from the movie which comes from the scene when Dorothy and gang discover that the mighty and powerful Wizard they fear is really just an elaborate machine controlled by an ordinary man.</p>
<p>I am not suggesting that SRM is a sham. In fact, it provides automation of virtual infrastructure fail over between sites that is truly wizard-like. Understand however, <strong>VMware SRM software is just the last piece of the total data center recovery &#8220;machine&#8221;</strong>. Many organizations may be seeking the semblance of automated site fail over, but have they really considered in detail what it takes to start up their business critical systems at a secondary location?</p>
<p>A simple determination of readiness for SRM&#8217;s wizardry is answering this question:<span id="more-3339"></span><br />
<center><p><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "8919425963";
google_ad_width = 468;
google_ad_height = 15;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</p></center><br />
&#8220;Can you create (or have you already created) <a href="http://www.boche.net/blog/index.php/2009/01/01/datacenters-need-shutdownstartup-order/" target="_blank">a document listing the complete shut down and start up of your business infrastructure?</a>&#8221; Many call this a disaster recovery playbook or runbook. Better yet, have you provisioned and tested the physical resources you need to actually fail over to another location based on the runbook that was created? If and when that&#8217;s done, there are numerous business continuity <a href="http://www.yellow-bricks.com/2009/02/12/vmware-ha-or-vmware-srm-what-should-i-use/" target="_blank">technologies considered other than SRM</a>. Those that have already realize that SRM along with consolidation to virtual infrastructure will <a href="http://blogs.vmware.com/uptime/2009/02/failback-absolutely-absolutely-absolutely.html" target="_blank">replace several sections of their runbook</a> and several pieces of hardware.  The point, however, is that SRM does not replace the contents of the entire runbook.</p>
<p>For those that are considering SRM, <a href="http://www.yellow-bricks.com/2008/11/20/srm-its-just-too-easy/" target="_blank">take the time</a> to at least put on paper every possible step you would need to <a href="http://www.mikedipetrillo.com/mikedvirtualization/2009/01/powering-up-your-datacenter-from-scratch.html" target="_blank">restart your business in another data center</a>. This includes the mundane like providing power and cooling, racks, internet access, and telephones as well as the mission critical like employee access, email, and business processing. Analyze how much time it would take to rebuild and restart systems, and what recovery point objectives (RPO) and recovery time objectives (RTO) are acceptable for each system or service. When it&#8217;s time to <a href="http://blog.scottlowe.org/2008/09/16/bc1693-architecting-dr-solutions-with-vmware-srm/" target="_blank">map your disaster site fail over solution to the runbook</a> you will clearly see the efficiencies and the speed that VI 3.5 and SRM allow.</p>
<p>Therefore, unlike the line from The Wizard of Oz, you better pay close attention to the man and machine behind the curtain in order to achieve the prodigious expectation and pyrotechnic results of a SRM implementation.<center><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "2700754787";
google_ad_width = 336;
google_ad_height = 280;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</center></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fvmetc.com%2F2009%2F02%2F15%2Fimplementing-vmware-srm-pay-attention-to-that-man-behind-the-curtain%2F';
  addthis_title  = 'Implementing+VMware+SRM%3A+Pay+Attention+to+that+Man+Behind+the+Curtain';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://vmetc.com/2009/02/15/implementing-vmware-srm-pay-attention-to-that-man-behind-the-curtain/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Enterprise-class High Availability and Disaster Recovery and Management for VMware ESX Environments #BC2370</title>
		<link>http://vmetc.com/2008/09/19/enterprise-class-high-availability-and-disaster-recovery-and-management-for-vmware-esx-environments-bc2370/</link>
		<comments>http://vmetc.com/2008/09/19/enterprise-class-high-availability-and-disaster-recovery-and-management-for-vmware-esx-environments-bc2370/#comments</comments>
		<pubDate>Sat, 20 Sep 2008 04:37:49 +0000</pubDate>
		<dc:creator>Rich Brambley</dc:creator>
				<category><![CDATA[dr]]></category>
		<category><![CDATA[fail over]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[vmworld]]></category>
		<category><![CDATA[cluster]]></category>
		<category><![CDATA[disaster recovery]]></category>
		<category><![CDATA[symantec]]></category>
		<category><![CDATA[vcs]]></category>
		<category><![CDATA[vima]]></category>
		<category><![CDATA[virtualization]]></category>
		<category><![CDATA[vmworld2008]]></category>

		<guid isPermaLink="false">http://vmetc.com/2008/09/19/enterprise-class-high-availability-and-disaster-recovery-and-management-for-vmware-esx-environments-bc2370/</guid>
		<description><![CDATA[I attended this VMworld 2008 session on Wednesday 09.18 at 9:00 AM. The presenter was Sunder Parameswaran who is a Senior Product Manager at Symantec. The session was about using Veritas Cluster Server (VCS) in VMware virtual infrastructure (VI) to overcome application high availability and disaster recovery (DR) challenges. My main interest in this session [...]]]></description>
			<content:encoded><![CDATA[<p><!-- 		@page { size: 8.5in 11in; margin: 0.79in } 		P { margin-bottom: 0.08in } 		H3 { margin-bottom: 0.08in } 		H3.western { font-family: "Arial", sans-serif } --></p>
<p style="margin-bottom: 0in;">I attended this VMworld 2008 session on Wednesday 09.18 at 9:00 AM. The presenter was Sunder Parameswaran who is a Senior Product Manager at Symantec. The session was about using Veritas Cluster Server (VCS) in VMware virtual infrastructure (VI) to overcome application high availability and disaster recovery (DR) challenges.</p>
<p style="margin-bottom: 0in;">My main interest in this session was on the topic of Geo Clustering with VCS. Also known as Metro Clustering, this is the ability to have one node of the cluster at your primary data center location and the second node at a separate physical location like a disaster recovery site.</p>
<p style="margin-bottom: 0in;">Sunder began by outlining various VI challenges.</p>
<p><span id="more-978"></span><center><p><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "8919425963";
google_ad_width = 468;
google_ad_height = 15;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</p></center></p>
<ul>
<li>
<p style="margin-bottom: 0in;">Need for heterogeneous management of mission critical applications in both physical and virtual<br />
servers.</p>
<ul>
<li>
<p style="margin-bottom: 0in;">VI makes management more complex because a distinct set of tools is used today for both physical and virtual systems</p>
</li>
</ul>
</li>
<li>
<p style="margin-bottom: 0in;">VI creates a larger emphasis for handling unplanned downtime and DR</p>
</li>
</ul>
<p style="margin-bottom: 0in;">With over companies using VCS for over 10 years in the field, VCS provides the following features to meet<br />
these challenges:</p>
<ul>
<li>
<p style="margin-bottom: 0in;">Can manage mixed physical and virtual environments</p>
</li>
<li>
<p style="margin-bottom: 0in;">can manage various server operating systems</p>
<ul>
<li>
<p style="margin-bottom: 0in;">solaris</p>
</li>
<li>
<p style="margin-bottom: 0in;">aix</p>
</li>
<li>
<p style="margin-bottom: 0in;">linux</p>
</li>
<li>
<p style="margin-bottom: 0in;">windows</p>
</li>
<li>
<p style="margin-bottom: 0in;">VMware</p>
</li>
</ul>
</li>
</ul>
<p style="margin-bottom: 0in;">Next Sunder talked specifically about how VCS provides application aware high availability for VMware above<br />
and beyond VI Enterprise HA features of an ESX Cluster. The session discussion also covered VM application clustering advantages unique from Microsoft&#8217;s clustering (MSCS)</p>
<ul>
<li>
<p style="margin-bottom: 0in;">VCS can monitor and fail over several mission critical applications such as Exchange, SQL, Oracle,<br />
and SAP</li>
<li>
<p style="margin-bottom: 0in;">There is no need for passive standby hardware</p>
</li>
<li>
<p style="margin-bottom: 0in;">Requires agents in the server OS</p>
</li>
<li>
<p style="margin-bottom: 0in;">provides DR ability to cluster between physical data center locations</p>
</li>
</ul>
<h3 class="western">Geo Clusters (Metro Clusters) with VCS</h3>
<ul>
<li>
<p style="margin-bottom: 0in;">Requires synchronis replication between 2 sites within 80 km of each other</p>
</li>
<li>
<p style="margin-bottom: 0in;">It is directly effected by the link latency and the quality of the replication technology</p>
</li>
<li>
<p style="margin-bottom: 0in;">single click cluster configuration that is completely automated.</p>
</li>
<li>
<p style="margin-bottom: 0in;">can handle split brain network scenarios</p>
</li>
</ul>
<p style="margin-bottom: 0in;">
<h3 class="western">Global DR with VCS</h3>
<ul>
<li>
<p style="margin-bottom: 0in;">VCS Firedrill provides zero downtime testing of DR plans</p>
</li>
<li>
<p style="margin-bottom: 0in;">requires array based replication</p>
</li>
<li>
<p style="margin-bottom: 0in;">leverages storage snapshots of thefiler at the DR site</p>
</li>
<li>
<p style="margin-bottom: 0in;">can be scheduled (example of DR testing every month)</p>
</li>
<li>
<p style="margin-bottom: 0in;">provides network isolation of DR test servers</p>
</li>
<li>
<p style="margin-bottom: 0in;">Is not fault tolerant and client will notice downtime</p>
</li>
</ul>
<p style="margin-bottom: 0in;">
<h3 class="western">VMware HA versus VCS</h3>
<ul>
<li>
<p style="margin-bottom: 0in;">VCS protets against application crashes where HA watches for ESX or VM crashes</p>
</li>
<li>
<p style="margin-bottom: 0in;">VCS can control the specific start up order order</p>
</li>
<li>
<p style="margin-bottom: 0in;">VCS protects against admin errors in the VM configuration.</p>
<ul>
<li>
<p style="margin-bottom: 0in;">Configuration changes</p>
</li>
<li>
<p style="margin-bottom: 0in;">admin error</p>
</li>
<li>
<p style="margin-bottom: 0in;">Server NIC failure (not usually a problem in VMs, but possible)</p>
</li>
</ul>
</li>
<li>
<p style="margin-bottom: 0in;">VCS protects against storage connectivity failures</p>
</li>
</ul>
<p style="margin-bottom: 0in;">
<h3 class="western">VCS Compatibility</h3>
<ul>
<li>
<p style="margin-bottom: 0in;">VCS is not supported in ESX version 2.5.x or earlier</p>
</li>
<li>
<p style="margin-bottom: 0in;">VCS is certified with various SAN vendor replication technologies</p>
<ul>
<li>
<p style="margin-bottom: 0in;">HDS Truecopy / Universal</p>
</li>
<li>
<p style="margin-bottom: 0in;">EMC SRDF</p>
</li>
<li>
<p style="margin-bottom: 0in;">IBM Metromirror</p>
</li>
<li>
<p style="margin-bottom: 0in;">HP EVA is on the roadmap</p>
</li>
<li>
<p style="margin-bottom: 0in;">NetApp Snap Mirror is on the roadmap</p>
</li>
</ul>
</li>
<li>
<p style="margin-bottom: 0in;">VCS Application support</p>
<ul>
<li>
<p style="margin-bottom: 0in;">IIS</p>
</li>
<li>
<p style="margin-bottom: 0in;">SQL 2005 and 2005</p>
</li>
<li>
<p style="margin-bottom: 0in;">Apache</p>
</li>
<li>
<p style="margin-bottom: 0in;">Linux</p>
</li>
<li>
<p style="margin-bottom: 0in;">Oracle 10g</p>
</li>
<li>
<p style="margin-bottom: 0in;">SAP</p>
</li>
<li>
<p style="margin-bottom: 0in;">Exchage 2003 and 2007</p>
</li>
<li>
<p style="margin-bottom: 0in;">Websphere</p>
</li>
<li>
<p style="margin-bottom: 0in;">Sharepoint</p>
</li>
<li>
<p style="margin-bottom: 0in;">Weblogic on Linux</p>
</li>
<li>
<p style="margin-bottom: 0in;">Framework to customize for any application</p>
</li>
<li>
<p style="margin-bottom: 0in;">is not supported / can not use or combine with MSCS</p>
</li>
</ul>
</li>
</ul>
<p style="margin-bottom: 0in;">
<h3 class="western">VCS Considerations for VMware Vmotion, DRS, and HA</h3>
<ul>
<li>
<p style="margin-bottom: 0in;">Considered by Symantec as complimentary solutions</p>
</li>
<li>
<p style="margin-bottom: 0in;">Postion VCS for applications and scenarios where Vmotion and DRS do not cover availability needs.</p>
</li>
<li>
<p style="margin-bottom: 0in;">VMware HA is power off crash consistent fail over and VCS is clustered fail over</p>
<ul>
<li>
<p style="margin-bottom: 0in;">recommendation is to disable VMware HA when using VCS</p>
</li>
</ul>
</li>
<li>
<p style="margin-bottom: 0in;">VCS installs agents on each ESX host. State of each host is known by other hosts</p>
</li>
</ul>
<p style="margin-bottom: 0in;">
<h3 class="western">Protecting VirtualCenter Server with VCS</h3>
<ul>
<li>
<p style="margin-bottom: 0in;">Discussion was focused on VC as a VM, but I imagine this applies to physical VCs as well</p>
</li>
<li>
<p style="margin-bottom: 0in;">Has additional logic to auto reconnect ESX hosts after fail over to second node</p>
</li>
</ul>
<ul>
<li>
<ul>
<li>
<p style="margin-bottom: 0in;">Same goes for Geo Clustering of VC (Metro Clustering)</p>
</li>
</ul>
</li>
</ul>
<p style="margin-bottom: 0in;">
<h3 class="western">VCS Futures</h3>
<ul>
<li>
<p style="margin-bottom: 0in;">Future VCS ESX installations will utilize VIMA. It will be optional whether using a dedicatd VIMA or a<br />
shared VIMA</li>
<li>
<p style="margin-bottom: 0in;">Prioritized applications – the ability to give priority to specific application fail overs</p>
</li>
<li>
<p style="margin-bottom: 0in;">browser based management</p>
</li>
<li>
<p style="margin-bottom: 0in;">enable active server provisioning based on workload</p>
</li>
<li>
<p style="margin-bottom: 0in;">multi tier application control and support</p>
</li>
</ul>
<p style="margin-bottom: 0in;">
<p style="margin-bottom: 0in;">There are currently 2 licensing models for VCS. You can purchase licensing by number of VMs or by the ESX<br />
CPU.</p>
<p class="technorati-tags"><a rel="tag" href="http://technorati.com/tag/virtualization">virtualization</a>, <a rel="tag" href="http://technorati.com/tag/vmware">vmware</a>, <a rel="tag" href="http://technorati.com/tag/veritas">veritas</a>, <a rel="tag" href="http://technorati.com/tag/vcs">vcs</a></p>
<p><center><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "2700754787";
google_ad_width = 336;
google_ad_height = 280;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</center></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fvmetc.com%2F2008%2F09%2F19%2Fenterprise-class-high-availability-and-disaster-recovery-and-management-for-vmware-esx-environments-bc2370%2F';
  addthis_title  = 'Enterprise-class+High+Availability+and+Disaster+Recovery+and+Management+for+VMware+ESX+Environments+%23BC2370';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://vmetc.com/2008/09/19/enterprise-class-high-availability-and-disaster-recovery-and-management-for-vmware-esx-environments-bc2370/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>VMware Site Recovery Manager Overview</title>
		<link>http://vmetc.com/2008/05/08/vmware-site-recovery-manager-overview/</link>
		<comments>http://vmetc.com/2008/05/08/vmware-site-recovery-manager-overview/#comments</comments>
		<pubDate>Thu, 08 May 2008 15:38:33 +0000</pubDate>
		<dc:creator>Rich Brambley</dc:creator>
				<category><![CDATA[dr]]></category>
		<category><![CDATA[fail over]]></category>
		<category><![CDATA[srm]]></category>
		<category><![CDATA[esx]]></category>
		<category><![CDATA[labs]]></category>
		<category><![CDATA[lefthand]]></category>
		<category><![CDATA[Partner Exchange 2008]]></category>
		<category><![CDATA[replication]]></category>
		<category><![CDATA[SAN]]></category>
		<category><![CDATA[site recovery manager]]></category>
		<category><![CDATA[storagevmotion]]></category>
		<category><![CDATA[virtualcenter]]></category>
		<category><![CDATA[vmware]]></category>

		<guid isPermaLink="false">http://vmetc.com/2008/05/08/vmware-site-recovery-manager-overview/</guid>
		<description><![CDATA[One of the hands on labs I attended at VMware Partner Exchange was the Site Recovery Manager (SRM) lab. In the lab I was able to get a good understanding of the technical details of how the yet to be released product is configured. The lab then walked us through the fail over process and [...]]]></description>
			<content:encoded><![CDATA[<p>One of the hands on labs I attended at VMware Partner Exchange was the <a href="http://www.vmware.com/vmworldnews/srm.html" target="_blank">Site Recovery Manager (SRM)</a> lab. In the lab I was able to get a good understanding of the technical details of how the yet to be released product is configured. The lab then walked us through the fail over process and workflow. This post  is a high level summary of what I learned. This post is not intended to be a detailed how to, but instead just a logical overview about what it will take to set up SRM.<br />
<span id="more-351"></span><center><p><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "8919425963";
google_ad_width = 468;
google_ad_height = 15;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</p></center></p>
<p><strong>SRM requirements:</strong></p>
<ul>
<li>VirtualCenter licensed and running at both the primary and secondary sites.</li>
<li>ESX hosts licensed and running at both the primary and secondary sites.</li>
<li>Backend database for SRM at both the primary and secondary site.</li>
<li>Shared storage capable of, and licensed for, block level replication (SAN based) at both the primary and secondary sites.</li>
<li>Site Recovery Agent (SRA) installer provided by the SAN manufacturer.</li>
</ul>
<p>Note: if your SAN manufacturer does not create a SRA you can not use SRM. There is not an official SRA manufacturer list at this time. The instructors in my lab indicated most all major SAN manufacturers are working on developing SRAs, but the list will not be official until product release. My lab used a <a href="http://vmetc.com/wp-admin/LeftHand%20Networks" target="_blank">LeftHand Networks</a> SRA.</p>
<p><strong>Preparation for SRM:</strong></p>
<p><u>LUNs </u></p>
<p>Organize your LUNs / volumes for SRM because you will probably not need all of your VMs to be replicated. For example, your VC server, Update Manager server, and SRM server (if they are VMs) should not be replicated. If other similar infrastructure server VMs should not be replicated then move these VMs out of the LUNs / Volumes containing the VMs that will be replicated. Move all of your non critical VMs to their own LUNs. The idea is to reduce replicated traffic as much as possible by syncing only select LUNs containing your critical VMs. Obviously, this will be a significant project for some organizations by itself. Leverage <a href="http://www.vmware.com/products/vi/storage_vmotion.html" target="_blank">storage vmotion</a> to minimize VM downtown during this process as much as possible.</p>
<p><u>Database </u></p>
<p>I was told the database requirements are exactly the same as VirtualCenter. Since the product has not been fully user tested yet, my instructors indicated there are no solid best practices yet for SRM database design. In other words, can SRM be installed on VC? Can Update Manager and SRM be installed on the same server? Should all of these servers be VMs or physical servers? Personally, considering VirtualCenter, SRM, Guided Consolidation, and Update Manager all require a database, I am of the opinion it will make the most sense to use ODBC connections to a stand alone database server. If I am going to make all of these servers VMs then I would make a separate server for each as well. Be sure to include this design in the LUN preparations!</p>
<p><strong>Lab and Configuration overview:</strong></p>
<p><u>Primary Site</u></p>
<ol>
<li>Install SRM code on SRM server (SRM was installed on VC in my lab)</li>
<li>Install SRM plugin in VI Cleint connecting to VirtualCenter and enable plugin</li>
<li>Install SRA on SRM server</li>
</ol>
<p><u>Secondary Site</u></p>
<ol>
<li>Install SRM code on SRM server</li>
<li>Install SRM plugin in VI Client connecting VirtualCenter and enable plugin</li>
<li>Install SRA on SRM server</li>
</ol>
<p><u>Replication from Primary Site</u></p>
<ol>
<li>Use SRM plugin to configure and verify replication to secondary site</li>
</ol>
<p><u>Test fail over and work flow using the SRM plugin from the primary site</u></p>
<p>When the lab materials are released to the public I will post a copy here on vmetc.com. I do not have them as of this writing.<center><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "2700754787";
google_ad_width = 336;
google_ad_height = 280;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</center></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fvmetc.com%2F2008%2F05%2F08%2Fvmware-site-recovery-manager-overview%2F';
  addthis_title  = 'VMware+Site+Recovery+Manager+Overview';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://vmetc.com/2008/05/08/vmware-site-recovery-manager-overview/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>XenServer integrates everRun VM for HA features</title>
		<link>http://vmetc.com/2008/03/25/xenserver-integrates-everrun-vm-for-ha-features/</link>
		<comments>http://vmetc.com/2008/03/25/xenserver-integrates-everrun-vm-for-ha-features/#comments</comments>
		<pubDate>Tue, 25 Mar 2008 11:35:07 +0000</pubDate>
		<dc:creator>Rich Brambley</dc:creator>
				<category><![CDATA[XenServer]]></category>
		<category><![CDATA[XenSource]]></category>
		<category><![CDATA[citrix]]></category>
		<category><![CDATA[dr]]></category>
		<category><![CDATA[fail over]]></category>
		<category><![CDATA[feature comparison]]></category>
		<category><![CDATA[replication]]></category>
		<category><![CDATA[vmetc.com]]></category>
		<category><![CDATA[vmworld]]></category>
		<category><![CDATA[business continuity]]></category>
		<category><![CDATA[esx]]></category>
		<category><![CDATA[highavailability]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[server2008]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[windows]]></category>
		<category><![CDATA[Xen]]></category>

		<guid isPermaLink="false">http://vmetc.com/2008/03/25/xenserver-integrates-everrun-vm-for-ha-features/</guid>
		<description><![CDATA[Compared to VMware ESX Enterprise Edition, business continuity and high availability features are lacking when deploying Citrix XenServer &#8220;out of the box.&#8221; Specifically, XenServer does not have the built in equivalent to VI3&#8242;s HA feature. Also missing is a solution similar to VMware&#8217;s soon to be released Site Recovery Manager (SRM). However, Marathon Technologies and [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://vmetc.com/wp-content/uploads/2008/03/everrun-vm.jpg" target="_blank" title="everRun VM diagram"><img src="http://vmetc.com/wp-content/uploads/2008/03/everrun-vm.jpg" style="margin: 10px; width: 240px; height: 173px" alt="everRun VM diagram" align="right" height="173" hspace="10" vspace="10" width="240" /></a>Compared to VMware ESX Enterprise Edition, business continuity and high availability features are lacking when deploying Citrix XenServer &#8220;out of the box.&#8221; Specifically, XenServer does not have the built in equivalent to <a href="http://www.vmware.com/products/vi/vc/ha.html" target="_blank">VI3&#8242;s HA feature</a>. Also missing is a solution similar to VMware&#8217;s soon to be released <a href="http://www.vmware.com/vmworldnews/srm.html" target="_blank">Site Recovery Manager (SRM)</a>. However, Marathon Technologies and XenSource (now a division of Citrix) have worked together to develop <a href="http://www.marathontechnologies.com/fault_tolerant_servers.html">everRun VM</a> as a enterprise class answer to fault tolerant availability for Windows virtual machines hosted on Citrix XenServer. According to Marathon&#8217;s Director of Products, <a href="http://www.virtual-strategy.com/migration/high-availability-through-virtualization.html" target="_blank">Michael Bilancieri</a>, at a recent Atlanta &#8220;Virtualization for the Real World&#8221; event, the integrated solution will be generally available sometime in the 2008 Q2/Q3 time frame.</p>
<p>Quoting from the Best of VMworld (more on this award later in this post) white paper downloadable from the everRun link above:</p>
<p><span id="more-272"></span><center><p><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "8919425963";
google_ad_width = 468;
google_ad_height = 15;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</p></center></p>
<p class="comments-nice"> &#8221; everRun creates a paired virtual machine environment that appears as a single virtual machine environment for both administration and application operations. The environment presents a single IP address and hostname to the application network so users never have to make client-side changes when failures occur. With everRun ComputeThru technology, protection can be configured so that component and even server failures go unnoticed. You simply keep computing through the failure, rather than failing over between servers.&#8221;</p>
<p class="comments-nice">&#8220;The protected virtual machine appears and is managed just like a standard Windows server. Disk data is mirrored synchronously to redundant storage and network and server operations are protected from failure. The administrator loads and configures the application in the protected virtual machine as though it was being loaded onto a physical server and then simply walks away. There is nothing else to do!&#8221;</p>
<p class="comments-nice">&#8220;Failure detection is automatic and recovery policy is embedded. There is no need to create and test complex fail over scripts. everRun works equally well with all classes of storage devices, unlike clusters that require expensive shared storage. Configuration is simple; everRun presents a single protected virtual machine to the administrator and transparently manages redundant resources for availability, removing opportunities for operator-induced errors.&#8221;</p>
<p>The everRun VM virtual appliance is called the everRun Availability Manager, and it is accessible and configured via a web interface. The browser based interface provides single glance access to events and status.</p>
<p></p>
<p>Although everRun VM is not clustering, it is like clustering in that it requires a shared private network between XenServer hosts. The technology is loaded as a virtual appliance on each host that monitors the VMs under it&#8217;s protection. For designs with more than 2 hosts the private network becomes a &#8220;token ring&#8221; -like network for XenServer hosts to communicate the status of VMs. The key requirement to the solution is latency of the private network.</p>
<p>Also unlike clustering, it should also be stressed that the everRun VM solution is application independent. All Microsoft applications are supported (yes, Exchange and SQL). Since the mirroring takes place at the VM level the application is irrelevant to the process.</p>
<p>The everRun VM solution can be scaled for one to many VM protection scenarios or even used as remote site disaster recovery fail over. For remote data center fail over with the previously mentioned latency requirement was recommended by Michael Bilancieri at the Atlanta event to be less than 10 ms.</p>
<p>There are three levels of protection for a VM offered by everRun:</p>
<ol>
<li>Level 1 &#8211; High Availability (Basic fail over)</li>
<li>Level 2 &#8211; Continuous Availability (VM-level assured)</li>
<li>Level 3 &#8211; Fault Tolerance (Uninterrupted computing)</li>
</ol>
<p>My notes and my memory from the event are unclear about the difference between Level 1 and Level 2, but &#8220;big red button&#8221; fail over (power off crash consistent recovery) like VMware ESX&#8217;s feature falls somewhere between these first 2 levels. Level 3 is live fail over like, for lack of a better example, clustering. everRun offers the ability to set the level of protection for each VM independently.</p>
<p><a href="http://searchservervirtualization.techtarget.com/news/article/0,289142,sid94_gci1272002,00.html" target="_blank">Marathon won an award in the New Technology Category from SearchServerVirtualization.com while at VMworld 2007</a>. Interestingly, Marathon provided a demo in their booth at VMware&#8217;s conference running on XenServer. Marathon and XenSource were one of 2 winners in the category from over 100 exhibitors at VMworld.</p>
<p>Finally, everRun is not a solution only for XenServer. It will work with VMware ESX. everRun&#8217;s HA features can compliment or replace current VI3 HA functionality by offering separate protection to VMs, or offer an alternative to SRM as a Ethernet based solution for DR site fail over.</p>
<p><center><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "2700754787";
google_ad_width = 336;
google_ad_height = 280;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</center></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fvmetc.com%2F2008%2F03%2F25%2Fxenserver-integrates-everrun-vm-for-ha-features%2F';
  addthis_title  = 'XenServer+integrates+everRun+VM+for+HA+features';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://vmetc.com/2008/03/25/xenserver-integrates-everrun-vm-for-ha-features/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Designing ESX Resource Pools</title>
		<link>http://vmetc.com/2008/03/04/designing-esx-resource-pools/</link>
		<comments>http://vmetc.com/2008/03/04/designing-esx-resource-pools/#comments</comments>
		<pubDate>Tue, 04 Mar 2008 14:36:41 +0000</pubDate>
		<dc:creator>Rich Brambley</dc:creator>
				<category><![CDATA[cluster]]></category>
		<category><![CDATA[drs]]></category>
		<category><![CDATA[esx]]></category>
		<category><![CDATA[esxi]]></category>
		<category><![CDATA[fail over]]></category>
		<category><![CDATA[how to]]></category>
		<category><![CDATA[services]]></category>
		<category><![CDATA[vi3]]></category>
		<category><![CDATA[design]]></category>
		<category><![CDATA[planning]]></category>
		<category><![CDATA[VI 3.5]]></category>
		<category><![CDATA[vmware]]></category>

		<guid isPermaLink="false">http://vmetc.com/2008/03/04/designing-esx-resource-pools/</guid>
		<description><![CDATA[How do you design resource pools in an ESX Cluster? There are two strategies that are the most popular in my experience. The first strategy creates resource pools based on CPU and Memory shares for host resource conflict management, and the second strategy uses reservations and limits to guarantee physical resources and ensure VM containment. [...]]]></description>
			<content:encoded><![CDATA[<p>How do you design resource pools in an ESX Cluster? There are two strategies that are the most popular in my experience. The first strategy creates resource pools based on CPU and Memory shares for host resource conflict management, and the second strategy uses reservations and limits to guarantee physical resources and ensure VM containment. This post will use a 3 ESX host example to explain both strategies. Please feel free to comment on the pros and cons of each or why you think one is better than the other.</p>
<p>In the example scenario three ESX hosts each have 16 GB RAM and 2 dual core 3.0 Ghz CPUs. The three hosts will all be members of the same ESX cluster.<span id="more-243"></span></p>
<p><center><p><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "8919425963";
google_ad_width = 468;
google_ad_height = 15;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</p></center></p>
<p>I tried to create design names that illustrate the strategy. I am not aware of any specific resource pool design naming from VMware.</p>
<h3>The Tug of War design</h3>
<p>This design is simple and does not limit any VMs from any physical resources. Using the ESX shares mechanism, if two or more VMs are competing for the same physical resources the tug of war that results will be decided by the resource pool memberships of the VMs.</p>
<p>The <strong>ESX cluster</strong> will have three resource pools defined.</p>
<ul class="unIndentedList">
<li> A &#8220;High&#8221; resource pool will have no initial reservation and unlimited/expandable RAM and CPU settings. CPU and Memory shares will be set to high. This resource pool will be devoted for mission-critical VMs.</li>
<li> A second &#8220;Normal&#8221; resource pool will have no initial reservation and unlimited/expandable RAM and CPU settings. CPU and Memory shares will be set to normal. Normal priority service VMs will be placed into that resource pool and the shares guarantee that VMs in this resource pool do not have priority over mission critical VMs.</li>
<li> A third &#8220;Low&#8221; resource pool will have no initial reservation and unlimited/expandable RAM and CPU settings. CPU and Memory shares will be set to low. Low priority service VMs, development, and non critical VMs will be placed into this resource pool and the shares guarantee that VMs in this resource pool do not have priority over normal or high VMs.</li>
</ul>
<p>The three resource pools configured by shares will ensure that the proper VM has priority to physical resources when a conflict occurs. Resource conflict management can take place with the full capacity of the ESX cluster&#8217;s physical resources.</p>
<h3>The Pizza Design</h3>
<p><strong><span style="color: #ff6600">updated 03.09.08</span></strong></p>
<p>This design takes the sum total of all physical resources and slices it up across the resource pools. Although the following design only uses two resource pools, many more &#8220;slices&#8221; could be created.<strong><span style="color: #ff6600"> The most basic Pizza Design would be to reserve all memory and cpu, but the following example helps also illustrate reservations and limits.</span></strong></p>
<p>The <strong>ESX cluster</strong> will have two resource pools defined.</p>
<ul class="unIndentedList">
<li> A &#8220;Critical Services&#8221; resource pool will have an initial reservation of 32GB RAM and 8GHz CPU, and unlimited/expandable RAM and CPU settings. This resource pool will be devoted for mission-critical VMs. <span style="color: #ff6600"><strong>Shares for RAM will be set to high, but shares for CPU will be set to normal.</strong></span></li>
<li> A second &#8220;Non Critical Services&#8221; resource pool will have no reservation and a limit of 16GB RAM and 4GHz CPU. Shares for Memory and CPU will be set to normal. Low priority service VMs will be placed into this resource pool <strong><span style="color: #ff6600">and the limits guarantee that VMs in this resource pool do not over consume resources.</span></strong></li>
<li><span style="color: #ff6600"><strong>The remaining balance of CPU is fully available to the Critical Services pool because of the expandable reservation.</strong></span></li>
</ul>
<p><span style="color: #ff6600"><strong>This specific design example was based on a mix of VMs that  were expected to contend for memory more often then cpu.</strong></span></p>
<h3>Final Thoughts</h3>
<p>I usually start a design with the tug of war strategy. Resource Pools can be confusing, complicated, and difficult to troubleshoot if the design gets complex. Starting out simple makes sense until a specific VM performance issue is determined. On the other hand, the pizza design is a better starting point if you know you will have heavily used development VMs mixed in with your production VMs. The pizza design ensures that if a developer launches a rogue script on one of the VMs then the production system will not be impacted.</p>
<p>LOL! After writing this I am hungry for some pizza &#8230;.</p>
<table align="center" border="0">
<tr>
<td><center><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "2700754787";
google_ad_width = 336;
google_ad_height = 280;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</center></td>
<td></td>
</tr>
</table>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fvmetc.com%2F2008%2F03%2F04%2Fdesigning-esx-resource-pools%2F';
  addthis_title  = 'Designing+ESX+Resource+Pools';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://vmetc.com/2008/03/04/designing-esx-resource-pools/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Planning ESX host capacity</title>
		<link>http://vmetc.com/2008/01/12/how-many-vms-should-run-on-each-esx-host/</link>
		<comments>http://vmetc.com/2008/01/12/how-many-vms-should-run-on-each-esx-host/#comments</comments>
		<pubDate>Sat, 12 Jan 2008 10:56:42 +0000</pubDate>
		<dc:creator>Rich Brambley</dc:creator>
				<category><![CDATA[capacity analysis]]></category>
		<category><![CDATA[fail over]]></category>
		<category><![CDATA[vi3]]></category>
		<category><![CDATA[vmotion]]></category>
		<category><![CDATA[capacityplanner]]></category>
		<category><![CDATA[drs]]></category>
		<category><![CDATA[esx]]></category>
		<category><![CDATA[highavailability]]></category>
		<category><![CDATA[planning]]></category>
		<category><![CDATA[vmware]]></category>

		<guid isPermaLink="false">http://vmetc.com/?p=189</guid>
		<description><![CDATA[How many VMs should run on each ESX host? The answer is determined mostly by the physical resources of the host&#8217;s platform (storage, ram, cpu, etc.). Before VI3 introduced ESX Clusters with DRS and HA squeezing as many VMs on each ESX host as possible was acceptable. Today it&#8217;s not just ESX host capacity, but [...]]]></description>
			<content:encoded><![CDATA[<p align="left">How many VMs should run on each ESX host? The answer is determined mostly by the physical resources of the host&#8217;s platform (storage, ram, cpu, etc.). Before VI3 introduced ESX Clusters with DRS and HA squeezing as many VMs on each ESX host as possible was acceptable. Today it&#8217;s not just ESX host capacity, but ESX Clusters need to be take into consideration. Planning Cluster capacity means ensuring availability of VMs while maintaining acceptable host performance in a fail over scenarios.</p>
<p align="left"><img src="http://www.vmware.com/files_inline/images/products_ha_diagram_02.gif" alt="VMWare HA" align="left" height="188" hspace="15" vspace="10" width="243" />First, what is a fail over scenario? The first thing that comes to mind is a problem. One or more of your ESX hosts unexpectedly crashed. This is considered unplanned downtime. Another fail over scenario to consider is planned downtime such as rebooting  after applying ESX patches. For both of these types of scenarios you want to make sure your VMs stay online.</p>
<p align="left">VMware&#8217;s solution for planned downtime is VMotion. The solution for unplanned downtime is the HA feature of ESX Clusters. When determining your ESX capacity be sure to allow room to leverage these features.</p>
<p align="left"> VMotion migrates a VM to a different ESX host without users losing connectivity. Evacuating an ESX server by VMotion enables you <span id="more-189"></span><br />
<center><p><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "8919425963";
google_ad_width = 468;
google_ad_height = 15;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</p></center><br />
to reboot ESX for planned downtime. The ESX HA feature recognizes an ESX host that is no longer online and communicating with the other ESX hosts in the cluster, or considered isolated, and automatically restarts the VM on other available ESX servers with the capacity to run the VM. Making sure that the extra capacity is available is the trick.</p>
<p align="left">I&#8217;ve heard a lot of companies adopting an N + 1 mentality recently. This is typically based on a consolidation estimate&#8217;s results. If the capacity analysis study determined that 4 ESX hosts are needed then include an extra ESX host  just for added capacity and fail over resources. That&#8217;s a good start, but it might not be enough.</p>
<p align="left"><a href="http://vmetc.com/wp-content/uploads/2008/01/availability-of-vms-and-sizing-esx-host-clusters.jpg" target="_blank"><img src="http://vmetc.com/wp-content/uploads/2008/01/availability-of-vms-and-sizing-esx-host-clusters.jpg" style="margin: 10px 15px; width: 400px; height: 302px" alt="Sizing ESX Clusters" align="right" height="302" hspace="15" vspace="10" width="400" /></a>Remember that a capacity analysis project is centered around the &#8220;WOW! factor&#8221; of demonstrating how many servers can be consolidated on ESX hosts. The ESX host utilization is often targeted to be 80% or higher &#8211; meaning host as many VMs as possible until the host&#8217;s cpu and ram is 80% in use. Using the N + 1 strategy that means that losing one host will work fine. 80% load spread over 4 hosts is maintained by having the 5th ESX host. What if you need to allow for 2 ESX host failures? In that case there is a problem. Here&#8217;s some math to demonstrate why:</p>
<ul>
<li>original design was 4 ESX servers at 80% = 320%, but if you lose 2 ESX hosts, even with the N + 1 server, 3 hosts will be running at over 106% utilization.</li>
</ul>
<p align="left">That may be acceptable for a short period of time, but an Administrator should be ready to add another ESX server ASAP to maintain normal  performance levels. It gets worse when you lose more ESX hosts simultaneously. A consolidation scenario where ESX hosts are only 65% utilized helps the numbers, but it also means more hosts.</p>
<p align="left">The reality of the IT budget and the reliability of modern hardware keep most companies from planning for more than 1 ESX host failure. Maybe that&#8217;s why the N + 1 strategy is so popular? If this is your case you maybe able to still plan for more than 1 host failure  by categorizing your VMs as critical and non critical. Create 2 separate ESX Clusters. Enable DRS and HA on the Cluster with the critical VMs, and only enable DRS on the non critical cluster. The N + 1 design already gives you enough capacity for planned downtime (by rotating maintenance on one ESX at a time), and by limiting the number of VMs that HA will restart, you might be able to maintain critical VM availability through multiple unplanned host failures. You can achieve the same design with a single cluster by setting the non critical VMs fail over behavior to not restart in the properties of the Cluster.</p>
<p align="left">Monitoring your ESX Clusters is critical to be sure to maintain the design&#8217;s expected availability. Obviously, As your number of VMs increases there will quickly come a point where the number of ESX hosts will need to increase.</p>
<p align="left">For more information about HA check out <a href="http://www.vmware.com/products/vi/vc/ha.html" target="_blank">VMware&#8217;s HA page</a>. For more information on VMotion then check out <a href="http://www.vmware.com/products/vi/vc/vmotion.html" target="_blank">VMware&#8217;s VMotion page</a>.</p>
<p><center><script type="text/javascript"><!--
google_ad_client = "pub-9435712307568301";
google_ad_slot = "2700754787";
google_ad_width = 336;
google_ad_height = 280;
//--></script>
<script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
</center></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fvmetc.com%2F2008%2F01%2F12%2Fhow-many-vms-should-run-on-each-esx-host%2F';
  addthis_title  = 'Planning+ESX+host+capacity';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://vmetc.com/2008/01/12/how-many-vms-should-run-on-each-esx-host/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
