ACCESS MMS Team

The ACCESS Monitoring and Measurement (MMS) team is responsible for monitoring and measurement of the NSF funded ACCESS Cyberinfrastrucure facilities including compute, storage and networking. This data is collected in a database and accessed via XDMoD. Development and adding new features to XDMoD is a large component of the team's work. The team is diverse featuring five groups providing a variety of expertise.

University at Buffalo Center for Computational Research

UB/CCR is the lead institution in the development and maintenance of XDMoD. They are responsible for maintenance of the code and database and integrating contributions from the other teams into XDMoD.

Staff

  • Matthew D. Jones (ACCESS MMS co-PI, Center Director CCR)
  • Robert L. DeLeon (ACCESS MMS co-PI and Project Manager)
  • Joseph P. White (ACCESS MMS co-PI and Lead Developer)
  • Nikolay Simakov (Computational Scientist)
  • Ryan Rathsam (Programmer/Analyst)
  • Gregary Dean (Programmer/Analyst)
  • Conner Saeli (Programmer/Analyst)

Roswell Park

The ACCESS MMS PI, Thomas R. Furlani is the CIO at Roswell Park. He is responsible for directing the ACCESS MMS team effort and maintaining the quality of the work.

Staff

  • Thomas R. Furlani (ACCESS MMS PI)

Texas Advanced Computing Center

TACC is responsible for the maintenance and development of TACC_Stats which is a software product that logs performance data on HPC systems which can be input to XDMoD. They are also tasked with developing a new network monitoring feature which will be added to XDMoD.

Staff

  • Bill Barth (Leader TACC sub award)
  • Stephen Harrell (HPC Engineering Scientist)
  • Matt Cawood (Research Engineering Scientist)
  • Jennifer Schopf (NetSage)

Tufts University

Abani Patra is a Professor at Tufts University. His role is to advise the team on new trends in HPC and other computing modes and new analytical technology to improve how XDMoD tracks usage in the

Staff

  • Abani Patra (ACCESS MMS co-PI)

Case Western Reserve University

Case Western is responsible for developing Application Performance Diagnosis technology which is a machine learning based framework to detect anomalies in performance and detection of root cause of application performance problems.

Staff

  • Vipin Chaudhary (Leader Case Western sub award)
  • Zahra Rahmani (Graduate Student)

San Diego Supercomputer Center

Shava Smallen who is a member of the Cloud Bank team at SDSC is leading the effort to monitor NSF funded public cloud usage and incorporate this data into XDMoD.

Staff

  • Shava Smallen (Leader Cloud Bank sub award)

Recent Publications

2022

  1. Gregary Dean, Joshua Moraes, Joseph White, Robert Deleon, Matthew Jones, and Thomas Furlani. "Performance Optimization of the Open XDMoD Datawarehouse.", Proceedings of the Practice and Experience in Advanced Research Computing ser PEARC '22. Boston, MA, 2022. Association for Computing Machinery, ISBN 9781450391610 article no. 4 doi:10.1145/3491418.3530290
  2. Nikolay A. Simakov, Robert L. Deleon, Yuqing Lin, Phillip S. Hoffmann, and William R. Mathias. "Developing Accurate Slurm Simulator. In Practice and Experience in Advanced Research Computing", Proceedings of the Practice and Experience in Advanced Research Computing ser PEARC '22. Boston, MA, 2022. Association for Computing Machinery, ISBN 9781450391610 article no. 59 doi:10.1145/3491418.3535178

2021

  1. Andrew Burford, Alan Calder, David Carlson, Barbara Chapman, Firat Coskun, Tony Curtis, Catherine Feldman, Robert Harrison, Yan Kang, Benjamin Michalowicz, Eric Raut, Eva Siegmann, Daniel Wood, Robert DeLeon, Mathew Jones, Nikolay Simakov, Joseph White, and Dossay Oryspayev. 2021. "Ookami: Deployment and Initial Experiences." In Practice and Experience in Advanced Research Computing (PEARC '21). Association for Computing Machinery, New York, NY, USA, Article 9, 1–8. doi:10.1145/3437359.3465578
  2. Md Abdullah Shahneous Bari, Barbara Chapman, Anthony Curtis, Robert J. Harrison, Eva Siegmann, Nikolay A. Simakov, Matthew D. Jones. "A64FX performance: experience on Ookami" 2021 IEEE International Conference on Cluster Computing (CLUSTER), 2021, pp. 711-718, doi: 10.1109/Cluster48925.2021.00106.

2020

  1. Jeanette M. Sperhac, Robert L. DeLeon, Joseph P. White, Matthew D. Jones, Andrew Bruno, Renette Jones-Ivey, Thomas R. Furlani, Jonathan Bard, and Vipin Chaudhary, "Towards Performant Workflows, Monitoring and Measuring", Proceedings of the 29th International Conference on Computer Communications and Networks, ser IEEE ICCCN '20, Honolulu, HI (virtual), August 3-6, 2020. IEEE Computer Society. Invited talk. doi: 10.1109/ICCCN49398.2020.9209647
  2. Joseph P. White, Martins Innus, Robert L. DeLeon, Matthew D. Jones, and Thomas R. Furlani, "Monitoring and Analysis of Power Consumption on HPC clusters using XDMoD", Proceedings of the Practice and Experience in Advanced Research Computing, ser PEARC '20, Portland, OR (virtual), July 27-31, 2020. Association for Computing Machinery. Winner of Phil Andrews Most Transformative Contribution and Best Paper in Advanced Research Computing Environments track. doi:10.1145/3311790.3396624
  3. Jeanette M. Sperhac, Benjamin D. Plessinger, Joseph P. White, Robert L. DeLeon, Matthew D. Jones, Thomas R. Furlani, "Improving Science Gateway Monitoring, Caveats and Goals", Proceedings of Science Gateways Conference 2020, ser Gateways '20. Science Gateways Community Institute (SGCI), October 19-21, 2020. doi:10.17605/OSF.IO/KS2HT

2019

  1. Craig A. Stewart, David Y. Hancock, Julie Wernert, Thomas Furlani, David Lifka, Alan Sill, Nicholas Berente, Donald F. McMullen, Thomas Cheatham, Amy Apon, Ron Payne, and Shawn D. Slavin. "Assessment of Financial Returns on Investments in Cyberinfrastructure Facilities: A Survey of Current Methods", Proceedings of the 2019 Practice and Experience in Advanced Research Computing on Rise of the Machines (Learning), ser PEARC '19, Chicago, IL, 2019. Association for Computing Machinery. ISBN 9781450372275, article no. 33. doi:10.1145/3332186.3332228
  2. Craig A. Stewart, David Y. Hancock, Julie Wernert, Thomas Furlani, David Lifka, Alan Sill, Nicholas Berente, Donald F. McMullen, Thomas Cheatham, Amy Apon, Ron Payne, and Shawn D. Slavin. "Assessment of Non-Financial Returns on Cyberinfrastructure: A Survey of Current Methods", Proceedings of Humans in the Loop: Enabling and Facilitating Research on Cloud Computing, ser HARC '19, Chicago, IL, 2019. Association for Computing Machinery. ISBN 9781450372794, article no. 2. doi:10.1145/3355738.3355749
  3. Jeanette M. Sperhac, Robert L. DeLeon, Thomas R. Furlani, Steven M. Gallo, Martins Innus, Matthew D. Jones, Jeffrey T. Palmer, Abani Patra, Benjamin D. Plessinger, Ryan Rathsam, Nikolay Simakov, Joseph P. White, Rudra Chakraborty, and Gregary Dean. "Managing computational gateway resources with XDMoD", Future Generation Computer Systems, 98: pp. 154-166, 2019. doi:10.1016/j.future.2019.03.029

Older Publications

See https://xdmod.access-ci.org/#main_tab_panel:about_xdmod?Publications for a list of older publications from the TAS and XMS projects.

This work is sponsored by the National Science Foundation under award 2137603