Difference between revisions of "AI Surrogate Models LDRD"

From epsciwiki
Jump to navigation Jump to search
Line 63: Line 63:
 
* [https://jeffersonlab-my.sharepoint.com/:w:/g/personal/xmei_jlab_org/EbW3wNREEVxFm12IvsRs0AEB2BFHyuUrvU5T7OZJtJlqaQ?e=jHHfT9 The NVIDIA GPU profiling tool for the GPU Roofline analysis], Xinxin Mei
 
* [https://jeffersonlab-my.sharepoint.com/:w:/g/personal/xmei_jlab_org/EbW3wNREEVxFm12IvsRs0AEB2BFHyuUrvU5T7OZJtJlqaQ?e=jHHfT9 The NVIDIA GPU profiling tool for the GPU Roofline analysis], Xinxin Mei
 
* [https://jeffersonlab-my.sharepoint.com/:w:/r/personal/xmei_jlab_org/Documents/Roofline_model/Advisor%20and%20Likwid%20report.docx?d=wde1d0406d7644135ac5cb3c942751296&csf=1&web=1&e=Uuh3UJ Utilizing Advisor and Likwid for CPU Roofline analysis], Xinxin Mei
 
* [https://jeffersonlab-my.sharepoint.com/:w:/r/personal/xmei_jlab_org/Documents/Roofline_model/Advisor%20and%20Likwid%20report.docx?d=wde1d0406d7644135ac5cb3c942751296&csf=1&web=1&e=Uuh3UJ Utilizing Advisor and Likwid for CPU Roofline analysis], Xinxin Mei
 
+
* [https://github.com/cissieAB/pinn-heat-equation/blob/main/docs/prof_res.md The Roofline analysis of my libtorch C++ GPU PINN-heat equation implementation], Xinxin Mei
 
 
  
 
== Useful Links ==
 
== Useful Links ==

Revision as of 09:37, 1 September 2022

Running Legacy Code on Heterogeneous Hardware via Surrogate Models

Project Description

PHASM ("Parallel Hardware viA Surrogate Models") is a software toolkit, currently under development, for creating AI-based surrogate models of scientific code. AI-based surrogate models are widely used for creating fast and inverse simulations. The project anticipates an additional, future use case: adapting legacy code to modern hardware. Data centers are investing in heterogeneous hardware such as GPUs and FPGAs; meanwhile, many important codebases are unable to take advantage of this hardware's superior parallelism without undergoing a costly rewrite. An alternative is to train a neural net surrogate model to mimic the computationally intensive functions in the code, and deploy the surrogate on the exotic hardware instead. PHASM addresses three specific challenges: (1) systematically discovering which functions can be effectively replaced with a surrogate, (2) automatically identifying, for a given function, the true space of inputs and outputs including those not apparent from the type signature, and (3) integrating a machine learning model into a legacy codebase cleanly and with a high level of abstraction. In the first year of development, a proof of concept has been developed for each challenge. A surrogate API makes it easy to bring PyTorch models into the C++ ecosystem and uses profunctor optics to establish a two-way data binding between C++ datatypes and tensors. A model variable discovery tool performs a dynamic binary analysis using Intel PIN in order to identify a target function's model variable space, including types, shapes, and ranges, and generate the optics code necessary to bind the model to the function. Future work may include exploring the limits of surrogate models for functions of increasing size and complexity, and adaptively generating synthetic training data based on uncertainty estimates.

General Resources

Meetings

Thursday 2-3pm, bi-weekly. Teams.

Meeting ID: 281 724 015 543

Passcode: cWidgE

Minutes from the 25 August 2022 meeting


Documents

Proposals

Presentations

Date Event Presenter Slides
2022-06-27 SRGS 2022 Nathan Brei, David Lawrence PDF
2022-08-24 W&M collab Nathan Brei PDF

Publications

Date Journal Title
ccc aaa bbb


Notes

Useful Links