As promised, I’ve put together a short video on how to build your own cluster for sensor data analysis with Myriad. Myriad uses Akka’sRemoting feature to (hopefully) make it relatively straightforward to link up several computers in a DIY processing pipeline. If you’re using the GUI tools I wrote for NASA, you just start the GUI on each machine, then point the next machine in the processing pipeline to the remote machine and you’re good to go.

The main use cases I see for this feature are for handling big datasets and for resource-intensive ROI code. If you’ve got enough data to analyze that just reading it might take up all your available RAM, or if your ROI code needs all the CPU/GPU/RAM it can get, you can split the processing up among multiple systems. Have one machine responsible for reading the data and sending subsets to an analysis machine, which is then free to use all its resources for your ROI code. The analysis machine does its work and sends the results to another machine for reporting and compiling the results, and so on.