Now that we are all going to have to dive into our
router
configurations, wouldn't it be a
good time to make some changes that are long overdue?
Like getting rid of the IPIP mesh and replace it with
something more
modern and supported
by off-the-shelf routers, works behind NAT, etc?
As long as it doesn’t create failure points!
This event is the perfect example. The entire worldwide 44.x community didn't even
know there was a problem for about 8 hours! And, once reported, everyone was more
interesting in griping about the address sale than fixing the DNS problem. In fact, no
one even knew how to/who could fix the problem because the 44.in-addr.arpa server is
handled by a very select few (one?).
The IPIP mesh may be non-standard, but it is distributed, without any single point of
failure. To get between two points, the two gateways have to have IP connectivity to each
other. That's it. The two end-points can troubleshoot directly.
But every proposal I've seen on this list involves adding at least two other ham
points of failure. For example, I would presumably connect to some other ham's BGP
node and the other end of the connection would do the same. Why? Do these hubs have 24x7
support, like my ISP does? Do the responsible people ever go to work, go out to eat, get
sick, go on vacation, ...? Are they going to be available to troubleshoot on my schedule?
What if they just don't feel like it today?
The six gateway machines in our network don't even use the single-point-of-failure
44-style RIP server. We download the gateway/route list every 6 hours (suitable for our
needs). If FTP fails, file doesn't exist, file has zero size, number of changes seems
unreasonable, etc., (all of which have happened over the years), we send an alert to our
folks and continue on with the previous list of routes and try again later. As a result,
since 2009, we have had exactly zero outages!
So: Standard protocols? Absolutely! If it doesn't add failure points between
peers/gateways, I'm for it!
As you say, we'll need some reliable way to distribute the peer info. Perhaps a few
mirrored servers spread around the world, enabling us to try another one if our closest
server fails. But for peer info, not for forwarding. Requiring some artificial overlay
routing hierarchy or forwarding hop between end-points smells like taking a step back to
the 80s and hop-by-hop BBS forwarding.
Michael, N6MEF