<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-GB">
	<id>https://alteeve.com/w/index.php?action=history&amp;feed=atom&amp;title=Fence_Loop</id>
	<title>Fence Loop - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://alteeve.com/w/index.php?action=history&amp;feed=atom&amp;title=Fence_Loop"/>
	<link rel="alternate" type="text/html" href="https://alteeve.com/w/index.php?title=Fence_Loop&amp;action=history"/>
	<updated>2026-04-04T08:19:44Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.42.3</generator>
	<entry>
		<id>https://alteeve.com/w/index.php?title=Fence_Loop&amp;diff=5636&amp;oldid=prev</id>
		<title>Digimer: Created page with &quot;{{header}}  A fence loop is a condition in HA clusters where each node fences the other on boot. It occurs when quorum has been disabled and the ...&quot;</title>
		<link rel="alternate" type="text/html" href="https://alteeve.com/w/index.php?title=Fence_Loop&amp;diff=5636&amp;oldid=prev"/>
		<updated>2013-12-15T19:52:13Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;{{header}}  A fence loop is a condition in &lt;a href=&quot;/w/AN!Cluster_Tutorial_2&quot; title=&quot;AN!Cluster Tutorial 2&quot;&gt;HA clusters&lt;/a&gt; where each node fences the other on boot. It occurs when &lt;a href=&quot;/w/Quorum&quot; title=&quot;Quorum&quot;&gt;quorum&lt;/a&gt; has been disabled and the ...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{header}}&lt;br /&gt;
&lt;br /&gt;
A fence loop is a condition in [[AN!Cluster Tutorial 2|HA clusters]] where each node fences the other on boot. It occurs when three conditions hold together: [[quorum]] has been disabled, the cluster stack starts automatically with the [[OS]], and the network connection between the nodes has failed.&lt;br /&gt;
&lt;br /&gt;
Consider:&lt;br /&gt;
&lt;br /&gt;
* A two-node cluster has to have quorum disabled, because a lone surviving node can never hold a majority of votes.&lt;br /&gt;
* The network connection used by [[corosync]] or [[drbd]] fails, triggering a [[fence]] action.&lt;br /&gt;
* Node 1 wins the initial fence and Node 2 reboots.&lt;br /&gt;
* Node 2, on boot, starts corosync or drbd automatically, can&amp;#039;t connect to Node 1 and calls a fence.&lt;br /&gt;
* Node 1 reboots. On boot, it starts corosync or drbd, fails to connect to Node 2 and calls a fence.&lt;br /&gt;
* Node 2 reboots. On boot, it starts corosync or drbd, fails to connect to Node 1 and calls a fence.&lt;br /&gt;
&lt;br /&gt;
This loop continues until the network connection is repaired.&lt;br /&gt;
&lt;br /&gt;
There are three ways to mitigate this:&lt;br /&gt;
&lt;br /&gt;
# Use 3 or more nodes so that quorum can be enabled.&lt;br /&gt;
# Set fence actions to &amp;lt;span class=&amp;quot;code&amp;quot;&amp;gt;off&amp;lt;/span&amp;gt; instead of &amp;lt;span class=&amp;quot;code&amp;quot;&amp;gt;reboot&amp;lt;/span&amp;gt;, thus preventing a fenced node from booting.&lt;br /&gt;
# Disable the cluster stack from starting on boot.&lt;br /&gt;
&lt;br /&gt;
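The loop described above, and the effect of the second and third mitigations, can be sketched as a toy simulation. This is a minimal model in Python, not real cluster configuration: the names simulate_fence_loop, autostart, fence_action and max_fences are hypothetical and are not part of corosync, drbd or any fence agent. The max_fences cap is an assumption that stands in for the network eventually being repaired.&lt;br /&gt;

```python
def simulate_fence_loop(autostart, fence_action, max_fences=6):
    """Return fence events as (fencing_node, fenced_node) tuples.

    Toy model only: the inter-node link is assumed to stay down for the
    whole run, and max_fences stands in for the link being repaired.
    """
    # The link has failed, so node 1 wins the initial fence against node 2.
    events = [(1, 2)]
    for _ in range(max_fences - 1):
        if fence_action == "off":
            break  # the fenced node stays powered off and never boots
        if not autostart:
            break  # the fenced node boots but does not start the cluster stack
        fencer, fenced = events[-1]
        # The rebooted node cannot reach its peer, so it fences it in turn.
        events.append((fenced, fencer))
    return events


if __name__ == "__main__":
    # Unmitigated: reboot fencing with the cluster stack starting on boot.
    print(simulate_fence_loop(autostart=True, fence_action="reboot"))
    # Mitigation 2: fence action set to off.
    print(simulate_fence_loop(autostart=True, fence_action="off"))
    # Mitigation 3: cluster stack does not start on boot.
    print(simulate_fence_loop(autostart=False, fence_action="reboot"))
```

In the unmitigated case the events alternate between (1, 2) and (2, 1) until the cap is reached. With either mitigation, only the initial fence occurs.&lt;br /&gt;
&lt;br /&gt;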
Of these, the third option is usually the best. If a node was fenced in a production cluster, something likely went wrong. Having the node boot without joining the cluster lets you log into it and examine what happened. Once you&amp;#039;ve determined that the node is healthy, rejoin it to the cluster. If a node has a recurring problem, allowing it to rejoin the cluster automatically could mean that it gets fenced repeatedly. This is, on the surface, safe. However, fence actions can be disruptive and are never totally risk-free.&lt;br /&gt;
&lt;br /&gt;
{{footer}}&lt;/div&gt;</summary>
		<author><name>Digimer</name></author>
	</entry>
</feed>