Just for fun, I ran a match with Scorpio using lc0 net 32742 on a Volta GPU (about 20knps) vs stockfish-9 on a single thread.
This is probably a very unfair setup for stockfish, but my goal was just to see whether this setup would beat stockfish anyway.
Btw my supervised network is way weaker than the leela nets at the moment, and I am not sure I will ever reach the strength of the leela nets with
this approach.

Score is 2-1-4 in favor of scorpio+leela at the moment.
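
For a rough sense of scale, a match score can be converted to an Elo difference with the standard logistic formula. The sketch below assumes that "2-1-4" means 2 wins, 1 loss and 4 draws (the post does not say which order is used), and the function name elo_diff is made up for illustration. With only 7 games the error bars are far larger than the estimate itself, so treat the number as a curiosity rather than a measurement.

```python
import math

def elo_diff(wins, losses, draws):
    """Estimate the Elo difference implied by a match result.

    Uses the standard logistic model: expected score = 1 / (1 + 10^(-d/400)).
    """
    games = wins + losses + draws
    if games == 0:
        raise ValueError("no games played")
    score = (wins + 0.5 * draws) / games  # fraction of points scored
    if score <= 0.0 or score >= 1.0:
        raise ValueError("Elo difference is unbounded for a 0% or 100% score")
    return -400.0 * math.log10(1.0 / score - 1.0)

# Assumed reading of "2-1-4": 2 wins, 1 loss, 4 draws -> 4/7 of the points
print(round(elo_diff(2, 1, 4)))  # prints 50
```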

I am getting 20knps on the V100. I think the leela benchmarks done by nvidia mention 25knps on an RTX 2080 Ti, so pretty close.
I may actually be able to get 25knps by increasing the batch size (number of threads in my case) to 256 or 512, but 128 is what I prefer.
I am not able to get the lc0 backend to work on the linux system I am using (I don't have admin privileges to install some packages, such as meson).

My nets are stuck at 2900+ after training on 20 million games collected from the internet.
Either supervised learning creates nets with a bunch of holes, or I am missing something in my training code (such as SWE etc).

I make sure I don’t have libprotobuf or protoc installed. The meson build will download the appropriate package for you.

I had a pre-installed protoc on the system which was causing the problem. I changed things so that the build won't find it, and now it compiles.
It looks like I get about 17knps using the lc0 cudnn backend and about 19knps using the scorpio backend. So it looks like scorpio's backend
is at least as fast. Note that the RTX 2080 Ti is Turing architecture while the V100 is Volta, so I am not surprised that the RTX numbers are higher than what I am getting here.