[outages] Meta: outages outage
Jared Mauch
jared at puck.nether.net
Tue Oct 5 07:31:03 EDT 2021
> On Oct 5, 2021, at 5:34 AM, Jürgen Botz via Outages <outages at outages.org> wrote:
>
> OK, aside from the Facebook outage, and no doubt because of it, we effectively had an outages outage... messages to the lsit were massively
> delayed, causing lots of duplicate reports and general failure to have
> any meaningful confirmations, etc.
>
> Why? What happened? Well, outages is hosted on a server
> (puck.nether.net) with a couple dozen other lists, and some of these lists probably have members with email addresses at facebook domains,
> and the SMTP server kept trying to look those up and having wait for
> timeouts? Something like that, except that since there were no routes
> to facebook's nameservers there shouldn't have been any need to wait
> for timeouts. Anyone here form nether.net actually take a look?
>
> It's more than idle curiosity, I'd like make my mail servers more
> resilient to this kind of situation, and the outages list of all
> probably should be, too.
Sure, I’ve gone and tweaked a few more things, there’s a balancing act here between having a single system send out lots of concurrent mail to a server, eg:
60756540E50* 22760 Tue Oct 5 01:28:00 outages-bounces at outages.org
(host itchy.cerento.com[199.190.154.20] said: 451 Only one recipient at a time (in reply to RCPT TO command))
REDACTED at cerento.com
Some systems are explicitly configured to not be optimized (see above, then imagine how many people in parallel @gmail might get a message)
And there are many people with old subscriptions or domains that don’t exist anymore which gets to be exciting when everything is going on at once:
(connect to mail.bestii.com[172.96.180.81]:25: Connection timed out)
redacted at bestii.com
(connect to canadawebhosting.com[74.201.58.138]:25: Connection timed out)
redacted at canadawebhosting.com
(connect to muhpanel.ironusmaidenus.com[51.15.246.204]:25: Connection refused)
redacted at ironusmaidenus.com
(connect to mail.mailhost4.com[198.50.245.163]:25: Connection refused)
redacted at mailhost4.com
(connect to mail.etsms.com[50.232.238.69]:25: Connection timed out)
redacted at etsms.com
(connect to keyedupmedia.com[204.11.56.48]:25: Connection timed out)
redacted at keyedupmedia.com
(connect to metacloud.com[72.52.10.14]:25: Connection timed out)
redacted at metacloud.com
(connect to smtp.naturalwireless.com[52.119.91.194]:25: Connection timed out)
redacted at naturalwireless.com
redacted at naturalwireless.com
(host s1.mail.pciwest.net[2604:2400:a::425] said: 450 4.7.1 Bad Attachment .com (in reply to end of DATA command))
redacted at presys.com
(delivery temporarily suspended: connect to mail.talueee.com[159.69.230.243]:25: No route to host)
redacted at talueee.com
(connect to mail-1.meridian-enviro.com[198.58.69.47]:25: Connection timed out)
redacted at meridian-enviro.com
(reason unavailable)
(All of these are from the same message btw)
I was away for personal reasons yesterday so missed out on all the fun. I’ll check how this message goes out and see if the tweaks helped.
- Jared
More information about the Outages
mailing list