ACCESS MMS Team

The ACCESS Monitoring and Measurement (MMS) team is responsible for monitoring and measurement of the NSF funded ACCESS Cyberinfrastrucure facilities including compute, storage and networking. This data is collected in a database and accessed via XDMoD. Development and adding new features to XDMoD is a large component of the team's work. The team is diverse featuring five groups providing a variety of expertise.

University at Buffalo Center for Computational Research

UB/CCR is the lead institution in directing the MMS team effort and the development and maintenance of XDMoD. They are responsible for maintenance of the code and database and integrating contributions from the other teams into XDMoD.

Staff

  • Thomas R. Furlani (ACCESS MMS PI)
  • Matthew D. Jones (ACCESS MMS co-PI, Center Director CCR)
  • Robert L. DeLeon (ACCESS MMS co-PI and Project Manager)
  • Joseph P. White (ACCESS MMS co-PI and Lead Developer)
  • Nikolay Simakov (Computational Scientist)
  • Ryan Rathsam (Programmer/Analyst)
  • Gregary Dean (Programmer/Analyst)
  • Conner Saeli (Programmer/Analyst)
  • Aaron Weeden (Programmer/Analyst)
  • Andrew Stoltman (Programmer/Analyst)

Texas Advanced Computing Center

TACC is responsible for the maintenance and development of TACC_Stats which is a software product that logs performance data on HPC systems which can be input to XDMoD. They are also tasked with developing a new network monitoring feature which will be added to XDMoD.

Staff

  • Stephen Harrell (HPC Engineering Scientist)
  • Matt Cawood (Research Engineering Scientist)
  • Jennifer Schopf (NetSage)

Tufts University

Abani Patra is a Professor at Tufts University. His role is to advise the team on new trends in HPC and other computing modes and new analytical technology to improve how XDMoD tracks usage in the

Staff

  • Abani Patra (ACCESS MMS co-PI)

Case Western Reserve University

Case Western is responsible for developing Application Performance Diagnosis technology which is a machine learning based framework to detect anomalies in performance and detection of root cause of application performance problems.

Staff

  • Vipin Chaudhary (Leader Case Western sub award)
  • Zahra Rahmani (Graduate Student)

San Diego Supercomputer Center

Shava Smallen who is a member of the Cloud Bank team at SDSC is leading the effort to monitor NSF funded public cloud usage and incorporate this data into XDMoD.

Staff

  • Shava Smallen (Leader Cloud Bank sub award)

Recent Publications

2024

  1. Aaron Weeden, Joseph P. White, Robert L. DeLeon, Ryan Rathsam, Nikolay A. Simakov, Conner Saeli, and Thomas R. Furlani. The Data Analytics Framework for XDMoD. SN COMPUT. SCI. 5, 462 (2024). doi:10.1007/s42979-024-02789-2
  1. Nikolay A. Simakov, Matthew D. Jones, Thomas R. Furlani, Eva Siegmann, and Robert J. Harrison. First Impressions of the NVIDIA Grace CPU Superchip and NVIDIA Grace Hopper Superchip for Scientific Workloads, In Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops (HPCAsia '24 Workshops). Association for Computing Machinery, New York, NY, USA, 36–44. doi:10.1145/3636480.3637097

2023

  1. Timothy J. Boerner, Stephen Deems, Thomas R. Furlani, Shelley L. Knuth, and John Towns. 2023. "ACCESS: Advancing Innovation: NSF’s Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support." In Practice and Experience in Advanced Research Computing, ser PEARC ’23 July 23–27, 2023, Portland, OR, USA. Association for Computing Machinery, New York, NY, USA, 4 pages. doi:10.1145/3569951.3597559
  2. N.A. Simakov, R.L. Deleon, J.P. White, M.D. Jones, T.R. Furlani, E. Siegmann, and R.J. Harrison "Are we ready for broader adoption of ARM in the HPC community: Performance and Energy Efficiency Analysis of Benchmarks and Applications Executed on High-End ARM Systems", Proceedings of the HPC Asia 2023 Workshops (HPC Asia '23 Workshops) Association for Computing Machinery, New York, NY, USA, 78–86. doi:10.1145/3581576.3581618

2022

  1. Gregary Dean, Joshua Moraes, Joseph White, Robert Deleon, Matthew Jones, and Thomas Furlani. "Performance Optimization of the Open XDMoD Datawarehouse.", Proceedings of the Practice and Experience in Advanced Research Computing ser PEARC '22. Boston, MA, 2022. Association for Computing Machinery, ISBN 9781450391610 article no. 4 doi:10.1145/3491418.3530290
  2. Nikolay A. Simakov, Robert L. Deleon, Yuqing Lin, Phillip S. Hoffmann, and William R. Mathias. "Developing Accurate Slurm Simulator. In Practice and Experience in Advanced Research Computing", Proceedings of the Practice and Experience in Advanced Research Computing ser PEARC '22. Boston, MA, 2022. Association for Computing Machinery, ISBN 9781450391610 article no. 59 doi:10.1145/3491418.3535178
  3. Liangyi Huang, Sophia Hall, Fei Shao, Arafath Nihar, Vipin Chaudhary, Yinghui Wu, Roger French, and Xusheng Xiao. 2022. "System-Auditing, Data Analysis and Characteristics of Cyber Attacks for Big Data Systems.", In Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM ’22), Association for Computing Machinery, New York, NY, USA, pp. 4872–4876 doi:10.1145/3511808.3557185

2021

  1. Andrew Burford, Alan Calder, David Carlson, Barbara Chapman, Firat Coskun, Tony Curtis, Catherine Feldman, Robert Harrison, Yan Kang, Benjamin Michalowicz, Eric Raut, Eva Siegmann, Daniel Wood, Robert DeLeon, Mathew Jones, Nikolay Simakov, Joseph White, and Dossay Oryspayev. 2021. "Ookami: Deployment and Initial Experiences." In Practice and Experience in Advanced Research Computing (PEARC '21). Association for Computing Machinery, New York, NY, USA, Article 9, 1–8. doi:10.1145/3437359.3465578
  2. Md Abdullah Shahneous Bari, Barbara Chapman, Anthony Curtis, Robert J. Harrison, Eva Siegmann, Nikolay A. Simakov, Matthew D. Jones. "A64FX performance: experience on Ookami" 2021 IEEE International Conference on Cluster Computing (CLUSTER), 2021, pp. 711-718, doi: 10.1109/Cluster48925.2021.00106.

2020

  1. Jeanette M. Sperhac, Robert L. DeLeon, Joseph P. White, Matthew D. Jones, Andrew Bruno, Renette Jones-Ivey, Thomas R. Furlani, Jonathan Bard, and Vipin Chaudhary, "Towards Performant Workflows, Monitoring and Measuring", Proceedings of the 29th International Conference on Computer Communications and Networks, ser IEEE ICCCN '20, Honolulu, HI (virtual), August 3-6, 2020. IEEE Computer Society. Invited talk. doi: 10.1109/ICCCN49398.2020.9209647
  2. Joseph P. White, Martins Innus, Robert L. DeLeon, Matthew D. Jones, and Thomas R. Furlani, "Monitoring and Analysis of Power Consumption on HPC clusters using XDMoD", Proceedings of the Practice and Experience in Advanced Research Computing, ser PEARC '20, Portland, OR (virtual), July 27-31, 2020. Association for Computing Machinery. Winner of Phil Andrews Most Transformative Contribution and Best Paper in Advanced Research Computing Environments track. doi:10.1145/3311790.3396624
  3. Jeanette M. Sperhac, Benjamin D. Plessinger, Joseph P. White, Robert L. DeLeon, Matthew D. Jones, Thomas R. Furlani, "Improving Science Gateway Monitoring, Caveats and Goals", Proceedings of Science Gateways Conference 2020, ser Gateways '20. Science Gateways Community Institute (SGCI), October 19-21, 2020. doi:10.17605/OSF.IO/KS2HT

2019

  1. Craig A. Stewart, David Y. Hancock, Julie Wernert, Thomas Furlani, David Lifka, Alan Sill, Nicholas Berente, Donald F. McMullen, Thomas Cheatham, Amy Apon, Ron Payne, and Shawn D. Slavin. "Assessment of Financial Returns on Investments in Cyberinfrastructure Facilities: A Survey of Current Methods", Proceedings of the 2019 Practice and Experience in Advanced Research Computing on Rise of the Machines (Learning), ser PEARC '19, Chicago, IL, 2019. Association for Computing Machinery. ISBN 9781450372275, article no. 33. doi:10.1145/3332186.3332228
  2. Craig A. Stewart, David Y. Hancock, Julie Wernert, Thomas Furlani, David Lifka, Alan Sill, Nicholas Berente, Donald F. McMullen, Thomas Cheatham, Amy Apon, Ron Payne, and Shawn D. Slavin. "Assessment of Non-Financial Returns on Cyberinfrastructure: A Survey of Current Methods", Proceedings of Humans in the Loop: Enabling and Facilitating Research on Cloud Computing, ser HARC '19, Chicago, IL, 2019. Association for Computing Machinery. ISBN 9781450372794, article no. 2. doi:10.1145/3355738.3355749
  3. Jeanette M. Sperhac, Robert L. DeLeon, Thomas R. Furlani, Steven M. Gallo, Martins Innus, Matthew D. Jones, Jeffrey T. Palmer, Abani Patra, Benjamin D. Plessinger, Ryan Rathsam, Nikolay Simakov, Joseph P. White, Rudra Chakraborty, and Gregary Dean. "Managing computational gateway resources with XDMoD", Future Generation Computer Systems, 98: pp. 154-166, 2019. doi:10.1016/j.future.2019.03.029

Older Publications

See https://xdmod.access-ci.org/#main_tab_panel:about_xdmod?Publications for a list of older publications from the TAS and XMS projects.

This work is sponsored by the National Science Foundation under award 2137603