Publications

 

Core project publications

  • X. Wang, M. Mubarak, R. Ross and Z. Lan. Trade-off Study of Localizing Communication and Balancing Network Traffic on Dragonfly Systems. To appear in 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2018 [paper].
  • M. Mubarak, P. Carns, J. Jenkins et al. Quantifying I/O and Communication Traffic Interference on Dragonfly Networks equipped with Burst Buffers, in 19th IEEE Cluster Conference, September 2017.
  • Kelvin Li, M. Mubarak, Kwan-Liu Ma, Chris Carothers and Robert B. Ross, “Visual Analysis Techniques for Exploring the Design Space of Large-scale high-radix networks”, in 19th IEEE Cluster Conference, September 2017.
  • Nikhil Jain, Abhinav Bhatele, Louise Howell et al. “Predicting the Performance Impacts of Different Fat-tree Configurations” in Supercomputing (SC), 2017.
  • M. Mubarak, Nikhil Jain, Jens Domke, Noah Wolfe et al. “Toward Reliable Validation of HPC Interconnect Simulation Models”, in Winter Simulation Conference (WSC), 2017.
  • C. Carothers, J. Meredith, M. Blanco, J. Vetter, M. Mubarak et al. “Durango: Scalable Synthetic Workload Generation for Extreme-Scale Application Performance Modeling & Simulation”, in the 5th ACM SIGSIM Conference on Principles of Advanced Discrete-event Simulations (PADS), 2017.
  • N. Wolfe, M. Mubarak, N. Jain, J. Domke, A. Bhatele, C. Carothers and R. Ross. “Preliminary Performance Analysis of Multi-Rail Fat-Tree Networks” in 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), May 2017.
  • M. Mubarak, C. D. Carothers, R. B. Ross, and P. Carns. Enabling Parallel Simulation of Large-Scale HPC Network Systems. In IEEE Transactions on Parallel and Distributed Systems (TPDS) (January 2017).
  • Xu Yang, John Jenkins, Misbah Mubarak, Robert Ross and Zhiling Lan. “Watch out for the Bully! Job Interference Study on Dragonfly Networks” in Supercomputing (SC) 2016.
  • Misbah Mubarak, L. Aliaga, P. Ding, A. Tsaris, A. Lyon, A. Norman, R. Ross. SciDAC Data, A Project to Enable Data-Driven Modeling of Exascale Computing. In 22nd International Conference on Computing in High Energy and Nuclear Physics (CHEP), 2016.
  • P. Ding, A. Lyon, M. Mubarak, A. Norman, R. Ross, L. Sophlin, A. Tsaris. “Analyzing how we do Analysis and consume data, Results from the SciDAC-Data Project” in 22nd International Conference on Computing in High Energy and Nuclear Physics (CHEP), 2016.
  • Philip Carns, Kevin Harms, John Jenkins, Misbah Mubarak, Robert Ross, and Christopher Carothers. Impact of Data Placement on Resilience in Large-Scale Object Storage Systems. In the 32nd International Conference on Massive Storage Systems and Technology (MSST16), 2016.
  • Noah Wolfe, Misbah Mubarak, Christopher Carothers, Robert Ross, and Philip Carns. Modeling a Million-Node Slim Fly Network Using Parallel Discrete-Event Simulation. In Proceedings of the 4th ACM Conference on SIGSIM Principles of Advanced Discrete Simulation (SIGSIM-PADS’16), 2016.
  • Caitlin Ross, Misbah Mubarak, John Jenkins, Philip Carns, Christopher D. Carothers, Robert Ross, Wei Tang, Wolfgang Gerlach, Folker Meyer. A Case Study in Using Discrete-Event Simulation to Improve the Scalability of MG-RAST. In Proceedings of the 4th ACM Conference on SIGSIM Principles of Advanced Discrete Simulation (SIGSIM-PADS’16), 2016.
  • Justin M. LaPre, Elsa J. Gonsiorowski, Christopher D. Carothers, John Jenkins, Philip Carns, Robert Ross. Time Warp State Restoration via Delta Encoding. In Proceedings of the Winter Simulation Conference (WSC), pages 3025-3036. ACM, 2015.
  • Shane Snyder, Philip Carns, Robert Latham, Misbah Mubarak, Robert Ross, Christopher Carothers, Babak Behzad, Huong Vu Thanh Luu, Surendra Byna, and Prabhat. Techniques for Modeling Large-scale HPC I/O Workloads. In 6th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS15), 2015. [paper / slides]
  • Misbah Mubarak, Christopher D. Carothers, Robert B. Ross, and Philip Carns. Using massively parallel simulation for MPI collective communication modeling in extreme-scale networks. In Proceedings of the Winter Simulation Conference (WSC), 2014.
  • Misbah Mubarak, Christopher D Carothers, Robert B Ross, and Philip Carns. A case study in using massively parallel simulation for extreme-scale torus network codesign. In Proceedings of the 2nd ACM SIGSIM/PADS conference on Principles of advanced discrete simulation, pages 27-38. ACM, 2014.
  • Shane Snyder, Philip Carns, Jonathan Jenkins, Kevin Harms, Robert Ross, Misbah Mubarak, and Christopher Carothers. A case for epidemic fault detection and group membership in HPC storage systems. In 5th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS14). Springer, 2014.
  • Christopher D. Carothers, Misbah Mubarak, Robert B. Ross, Philip Carns, Jeffrey S. Vetter, and Jeremy S. Meredith. Combining Aspen with massively parallel simulation for effective exascale co-design. In Workshop on Modeling & Simulation of Exascale Systems & Applications (MODSIM 2013), 2013.
  • Misbah Mubarak, Christopher D. Carothers, Robert B. Ross, and Philip Carns. Modeling a million-node dragonfly network using massively parallel discrete-event simulation. In 3rd International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS12), 2012.
  • Ning Liu, Jason Cope, Philip Carns, Christopher Carothers, Robert Ross, Gary Grider, Adam Crume, and Carlos Maltzahn. On the role of burst buffers in leadership-class storage systems. In Proceedings of 28th IEEE MSST conference, 2012.
  • Ning Liu, Christopher Carothers, Jason Cope, Philip Carns, and Robert Ross. Model and simulation of exascale communication networks. Journal of Simulation, 2012.
  • N. Liu and C. Carothers, Modeling Billion-Node Torus Networks Using Massively Parallel Discrete-Event Simulation, Proceedings of the 25th ACM/IEEE/SCS Workshop on Principles of Advanced and Distributed Simulation (PADS 2011), Nice, France, June, 2011 [slides, paper]
  • J. Cope, N. Liu, S. Lang, P. Carns, C. Carothers, and R. Ross, CODES: Enabling Co-design of Multilayer Exascale Storage Architectures, Workshop on Emerging Supercomputing Technologies (WEST 2011), in conjunction with the 25th International Conference on Supercomputing (ICS 2011), May, 2011 [slides, paper]
  • N. Liu, C. Carothers, J. Cope, P. Carns, R. Ross, A. Crume, C. Maltzahn, Modeling a Leadership-scale Storage System, 9th International Conference on Parallel Processing and Applied Mathematics 2011 (PPAM 2011) [slides, paper]

Other CODES publications

  • N. Jain, A. Bhatele, X. Ni, T. Gamblin, L. Kale. “Partitioning Low-diameter Networks to Eliminate Inter-job Interference”, in 31st IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2017.
  • Nikhil Jain, Abhinav Bhatele, Sam White, Todd Gamblin, Laxmikant V Kale. “Evaluating HPC networks via simulation of parallel workloads” in Supercomputing (SC) 2015.
  • Xiaoqing Luo, Frank Mueller, Philip Carns, John Jenkins, Robert Latham, Robert Ross and Shane Snyder, “HPC I/O Trace Extrapolation”, ESPT2015: Workshop on Extreme-Scale Programming Tools.
  • Bilge Acun, Nikhil Jain, Abhinav Bhatele, Misbah Mubarak, Christopher D. Carothers and Laxmikant Kale. Preliminary Evaluation of a Parallel Trace Replay Tool for HPC Network Simulations. In Workshop on Parallel and Distributed Agent-Based Simulations (PADABS), Euro-Par 2015.
  • Ning Liu, Adnan Haider, Xian-He Sun, Dong Jin. FatTreeSim: Modeling a Large-scale Fat-Tree Network for HPC Systems and Data Centers Using Parallel and Discrete Event Simulation. ACM SIGSIM Conference on Principles of Advanced Discrete Simulation (PADS). 2015. Best paper award.
  • Ning Liu, Xi Yan, Xian-He Sun, Jonathan Jenkins, Robert Ross. YARNSim: Hadoop YARN Simulation System. IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). 2015.
  • W. Tang, J. Jenkins, F. Meyer, R. Ross, R. Kettimuthu, L. Winkler, X. Yang, T. Lehman, and N. Desai. Data-Aware Resource Scheduling for Multi-Cloud Workflows: A Fine-Grained Simulation Approach. In Proc. of IEEE International Conference on Cloud Computing Technology and Science (CloudCom), 2014. (EIC workshop) [paper]