Traces from existing parallel and distributed computing systems are a useful resource for researchers to replicate real-life workloads for their experiments. However, since cloud computing is a relatively new area few such traces are currently available.


Using Chameleon OpenStack KVM Cloud Traces


We have developed a trace data structure based on data from OpenStack Nova, as well as software to extract the appropriate data. We are making available data from the OpenStack KVM cloud operated by the Chameleon testbed for educational projects.

We released Chameleon OpenStack KVM cloud traces to enable researchers to run their experiment and/or simulations with more realisticĀ scientific testbed data. It would be a huge encouragement to us to see your works using our cloud trances. If you use our data in your research, it would be great to let us know. You can find our contact information here.

For a quick start on downloading and using our cloud traces, we provided a Jupyter Notebook example, which you can uploadĀ to Chameleon Jupyter server. For more information about Chameleon Jupyter interface, please visit Chameleon user documentation.


Sharing Your Cloud Traces


We also encourage research groups and cloud testbed providers to share their cloud traces with us. To share your cloud traces, please provide the following information with your cloud traces:

  • Cloud trace format and format version
  • How the cloud trace was generated (i.e. software tools)
  • Cloud trace period
  • Release date
  • Contact information
  • (Optional) Publications

If you are interested in sharing your cloud traces with us, please contact us!


Other Released Cloud Traces


The commercial cloud providers have released their cloud traces, and numerousĀ researches had been done based on these released traces. We listed Microsoft Azure, Alibaba and Google release below for your reference.