Cloud computing for bioinformatics.

The recent increase in the quantity and complexity of high-throughput sequencing data has made high-performance distributed computing important. This UROP would involve helping to design and implement a flexible high-performance computing infrastructure that can use both high-memory local compute clusters and massively parallel cloud computing instances. The existing framework in our lab consists of a preliminary hybrid cluster architecture that opportunistically uses Amazon EC2 nodes, as well as analysis tools that were written to run primarily on EC2 nodes. Our goal is to develop methods for automatic load balancing using cloud computing and to modify existing analysis pipelines to use parallel resources.