ssh failures on archiving

Description

Dear helpdesk,

I get the occasional ssh failure (1 in 10 or so of archive attempts) when archiving data. My current work around is for the archiver to ignore errors and continue. The archiver seems to produce partial conversions but no output when this happens. Any suggestions?

I have had a look at your setup and can confirm that it is an SSH problem and not anything to do with the UM or archiving. I have reduced your setup to the attached simple problem which shows the effect on ARCHER. Just copy the script to /work and

I have done some investigation and discussed with the sysadmin team
re: the termination of your PP jobs. The reason is that the only
modes of access to the PP nodes that is supported are (from
https://www.archer.ac.uk/documentation/user-guide/connecting.php#sec-2.1.2)
1. Via the serial queues
2. Via direct interactive SSH
As a result, processes running on the PP nodes which are not from an
interactive SSH session or a current serial batch job may be
terminated, and this is what is happening in your case.

However, they are aware of the need to solve this issue and will be in further contact with me.