For me, the baseline is always a comparison to some other system.
The other system need not be software. It might be implemented by, say, people wielding pencils and paper forms. If that exists, you can measure its response time, availability, reliability, error rate, and so on.
The other system need not exist. I can always posit an imaginary ...