2024-05-21
The DNS server failed and has been restored.
1.Date of Occurrence
From 5/18 (Sat) around 0:00 to 5/20 (Mon) around 0:00 (under scrutiny)
2.Impact
Unable to resolve names from compute nodes.
3.Causes etc.
Due to recovery from the mass node downtime caused by user jobs that occurred on 5/17 (Fri.), the DNS servers became overloaded and unresponsive.