I just compiled the High-Performance Linpack benchmark version 2.2 linked with OpenBLAS version 0.2.19. These are the current versions as of today. Results indicate that the NanoPi T3 is capable of 12.49 Gflops when solving systems of linear equations. This is the standard benchmark used to rank the Top 500 Supercomputers in the world. For reference, this makes the NanoPi T3 about 7.5 million times slower than the Sunway TaihuLight, which is currently the fastest computer in the world. Another point of reference is the Raspberry Pi 3B, which scores about 6.2 Gflops on the same benchmark.

Note that in order to repeatedly get the fastest timing it was necessary to take the cover off the T3 and cool it with a fan as shown below.

.

Other single-board computers require similar cooling. The output of the benchmark was

An explanation of the input/output parameters follows:T/V : Wall time / encoded variant.N : The order of the coefficient matrix A.NB : The partitioning blocking factor.P : The number of process rows.Q : The number of process columns.Time : Time in seconds to solve the linear system.Gflops : Rate of execution for solving the linear system.

- The matrix A is randomly generated for each test.- The following scaled residual check will be computed: ||Ax-b||_oo / ( eps * ( || x ||_oo * || A ||_oo + || b ||_oo ) * N )- The relative machine precision (eps) is taken to be 1.110223e-16- Computational tests pass if scaled residuals are less than 16.0