Fortran Input/Output error with error code -5

kelvin490 · Dec 16, 2015

I got a problem running my FORTRAN program in high performance computer cluster. It runs well in my PC but I want to have mass production of data with different initial conditions so I put it in a cluster node with eight cores, simulate eight sets of data.

The program can run without problem in home directory but since I need extra memory space a scratch hard disk is added and I run the programs in this disk.

After a while the program stopped and there is an error message:

PGFIO/stdio: Input/output error
PGFIO-F-/formatted write/unit=6/error code returned by host stdio - 5.
File name = stdout formatted, sequential access record = 181
In source file TipNew8.f90, at line number 2018
FORTRAN STOP

I have run it several times, similar error occurs but the error occurs at different lines. Also it stopped at different time steps each time I run it. This kind of error seems quite random since it occurs at different steps and different lines. Every time it occurs at lines with "write" or "print" function. It runs without problem when I run it in my PC using Microsoft Visual Studio with PGI compiler.

Does anyone have ideas what's wrong with the program?

jedishrfu · Dec 16, 2015

Is your program writing to a disk file? Do you have enough disk space on this scratch disk?

You could check with the "df -h" command if this is linux.

kelvin490 · Dec 16, 2015

jedishrfu said:

Is your program writing to a disk file? Do you have enough disk space on this scratch disk?

You could check with the "df -h" command if this is linux.

I have checked and there is enough space.

DrClaude · Dec 17, 2015

I suggest you contact the support staff responsible for the cluster. I don't think there is much we can do to help you without access to the system.

gsal · Dec 17, 2015

I don't exactly know what "cluster" means, but here are some ideas, maybe...

Are you compiling your program in your PC and then running it in the cluster?
Can you run your program in the cluster without taking advantage of the cluster aspect of it? like just one instance of it? does it run this way?
Is there such a thing as compiling your program in the cluster? for assured compatiblity?
What does cluster mean? Many independent instances of the same program? Are they all writing to the exact same file? or are the file names different?

256bits · Dec 18, 2015

kelvin490 said:

I have checked and there is enough space.

Space shouldn't matter if your program has checked before doing the heavy processing, in which case that is one termination mode.
Are all channel resources made to be sure to be allocated before the run.
Program error handling ...

Sounds though something similar is happening, such as a buffer overflow somewhere, or a node conflict and timeout to disk access.

Is that your software or from the cluster I don't know enough about it. Is it from the network links - is that a possibility.

Random means that the error is indeterminate - ie works really well until the error occurs and you have complete collapse, such as adding the scratch disk has led to an overwhelming accumulation of data.

that;s about all I know.

DrClaude · Dec 18, 2015

256bits said:

or a node conflict and timeout to disk access.

Now that you mention it, this is what I would investigate first. You should be careful that different nodes are not trying to write to a file at the same time. It is very good practice to have one node handle all input/output.

Fortran Input/Output error with error code -5

Similar threads

How to increase phone signal strength by lying about it

A Crisis for Newly Minted CompSci Majors -- entry level jobs gone

Who is responsible for the software when AI takes over programming?

How to calculate Tension for a series of connected points?

Learning Assembly and computer architecture for x86

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers