JIRIAF Meeting Apr. 18 2024

From epsciwiki
Jump to navigation Jump to search


Connection Info:

You can connect using the following link (Meeting ID: 160 126 6529). (Click "Expand" to the right for details -->):

One tap mobile: US: +16692545252,,1608518798# or +16468287666,,1608518798#
Meeting URL: https://jlab-org.zoomgov.com/j/1601266529?pwd=ZkZKL0tjeWFpbmxDeWZob0VmbzNOUT09&from=addon
Meeting ID: 160 126 6529
Passcode: 292304

Join by Telephone
For higher quality, dial a number based on your current location.
Dial:
US: +1 669 254 5252 or +1 646 828 7666 or +1 551 285 1373 or +1 669 216 1590 or 833 568 8864 (Toll Free)
Meeting ID: 160 126 6529

International numbers
Join by SIP
1616903130@sip.zoomgov.com
Join by H.323
161.199.138.10 (US West)
161.199.136.10 (US East)
Meeting ID: 160 851 8798
Passcode: 292304


Agenda:

  • Announcements
  • JFE
    • Code base
      • Current status
        • CIlogon authentication, database, etc.
          • Can folks from ORNL, ALS, or APS log in?
      • Deployment on jiriaf2301
        • Job request queue
        • List of pending and active JRMs
      • Public-facing website
        • Grafana deployment metrics visualization
        • K8S visualization?
  • JRM
    • Tables in Mongodb?
    • Metric server
    • Horizontal autoscaling support
    • Workflow management system.
  • JCS and JMS
    • No proactivity support. JRM/JRMS according to workflow request.
      • Resource Acquisition
        • Time, CPU, and memory requests to steer deployment: SLURM -> JRM
        • Check the job request queue and decide if we need to run more JRMs
      • Remove JRM if the job is completed.
    • Fabric deployment and testing platform.
      • Digital twin prototyping
  • Digital twin
    • Bayesian network-based agent model for a site/workflow.
      • Queueing theory-based mathematical model.
  • Upcoming large scale deployment at NERSC
    • EJFAT data-stream pipeline new metrics.
      • Request and deploy 38 node/JRE
      • Run ERSAP pipeline
      • Confirm 100 Gbps data stream processing with 0 packet loss
      • Reduce resources, i.e., stop JREs individually and measure data processing rate and packet loss.
        • Time-dependent Grafana plots.
  • Deployment at ORNL
  • Documentation and code
    • Centralize the code base in Github.
  • Demo and presentations.
    • Start preparing our second paper.
    • Start working on CHEP24 abstract and presentation
  • AOT

Useful References



Minutes: