One nice-to-have feature would be support for reporting progress during long-running computations -- something like the mkl_progress or acml_progress routines available in MKL and ACML. The reporting does not have to be particularly fine-grained.

Is it feasible to consider something like this?

I don't mind doing some of the legwork -- at least for the LU decomposition implementation -- but I would appreciate some input on the best way to go about it.
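
To make the request more concrete, here is a rough sketch in C of the kind of hook I have in mind. Everything in it is an assumption for illustration: the callback type `progress_fn`, the routine `lu_with_progress`, and the `report_every` parameter are hypothetical names, not part of this library, MKL, or ACML. The factorization is a plain unblocked textbook LU with partial pivoting on a column-major matrix, used only to show where a coarse-grained report could be issued and how a nonzero return from the callback could cancel the run.

```c
#include <assert.h>
#include <math.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical callback type: the routine reports how many columns are
 * finished; a nonzero return value asks the routine to stop early. */
typedef int (*progress_fn)(const char *stage, int done, int total, void *user);

/* In-place LU with partial pivoting on a column-major n x n matrix.
 * Returns 0 on success, k+1 if pivot k is exactly zero, and -1 if the
 * callback requested cancellation. ipiv uses 0-based row indices. */
int lu_with_progress(int n, double *a, int lda, int *ipiv,
                     int report_every, progress_fn cb, void *user)
{
    assert(n >= 0 && lda >= n && report_every > 0);
    for (int k = 0; k < n; ++k) {
        /* Pick the largest entry in column k, on or below the diagonal. */
        int p = k;
        double best = fabs(a[k + (size_t)k * lda]);
        for (int i = k + 1; i < n; ++i) {
            double v = fabs(a[i + (size_t)k * lda]);
            if (v > best) { best = v; p = i; }
        }
        ipiv[k] = p;
        if (best == 0.0)
            return k + 1;

        /* Swap rows k and p across all columns. */
        if (p != k) {
            for (int j = 0; j < n; ++j) {
                double t = a[k + (size_t)j * lda];
                a[k + (size_t)j * lda] = a[p + (size_t)j * lda];
                a[p + (size_t)j * lda] = t;
            }
        }

        /* Compute the multipliers, then update the trailing submatrix. */
        for (int i = k + 1; i < n; ++i)
            a[i + (size_t)k * lda] /= a[k + (size_t)k * lda];
        for (int j = k + 1; j < n; ++j)
            for (int i = k + 1; i < n; ++i)
                a[i + (size_t)j * lda] -=
                    a[i + (size_t)k * lda] * a[k + (size_t)j * lda];

        /* Coarse-grained report: every report_every columns and at the end. */
        if (cb && ((k + 1) % report_every == 0 || k + 1 == n)) {
            if (cb("lu", k + 1, n, user) != 0)
                return -1;
        }
    }
    return 0;
}

/* Example callback: counts invocations (if user is non-NULL) and prints
 * a status line to stderr. Always lets the computation continue. */
int print_progress(const char *stage, int done, int total, void *user)
{
    int *calls = (int *)user;
    if (calls)
        ++*calls;
    fprintf(stderr, "%s: %d/%d columns\n", stage, done, total);
    return 0;
}
```

A caller would pass `print_progress` (or NULL to disable reporting) together with a `report_every` value such as `n / 20`, clamped to at least 1, to get roughly 5% granularity. In a blocked implementation the natural place for the call would presumably be once per panel rather than once per column.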