[Failure Report] 2024-05-18: DNS Server failure

2024-05-21

The DNS server failed and has been restored.

1.Date of Occurrence

 From 5/18 (Sat) around 0:00 to 5/20 (Mon) around 0:00 (under scrutiny)

2.Impact

 Unable to resolve names from compute nodes.

3.Causes etc.

 Due to recovery from the mass node downtime caused by user jobs that occurred on 5/17 (Fri.), the DNS servers became overloaded and unresponsive.