PI: Daniel Murphy-Olson, CELS

Objective: Modern big data analytics frameworks have diverse collections of actively developed open source libraries to enable analysis of large amounts of data with a minimal amount of software development time. Leveraging these frameworks can enable more efficient initial and iterative analysis of collections of scientific data across many science domains.  This aims to improve our understanding of these frameworks, and how they can be leveraged to improve our knowledge base across collections of scientific data.

Testbed: To evaluate the runtime environment available on the Sage Cray uREKA-GX Analytics Cluster.  Some exclusive access to this cluster may be needed as we evaluate extensions to the Cray-provided environment.