EJFAT EPSCI Meeting Sep. 4, 2024

From epsciwiki

The meeting time is 2:30pm.

Connection Info:


Agenda:

  1. Previous meeting
  2. Announcements:
    1. ACAT 2024 Paper - In Review
    2. ACAT 2022 Paper Judges Comments - accepted
  3. Status
    1. ejfat-1 LB up and should behave the same as ESnet Stable LB
    2. ejfat-2 LB up and should behave the same as ESnet Stable LB
    3. ejfat-4 LB up and should behave the same as ESnet Stable LB
    4. ejfat-5 LB - in progress
  4. Topics
    1. Weekly scheduled cluster maintenance
    2. IRI Test Development:
      1. LB version = ESnet Stable version
      2. Data Source:
        1. JLAB, CLAS12, pre-triggered events - 1 channel
      3. Data Sink:
        1. Perlmutter - 40 nodes
        2. ORNL/ESnet/JLab IRI Testbed / Defiant - 4 nodes allocated
        3. JLab - 7 nodes available
        4. ERSAP
      4. Test Plans - JLab, ESnet, NERSC:
      5. Prometheus Dashboards
      6. Analyzing data from Aug 29, 2024
    3. JLab FEG/SRO
      1. Will use an interim UDP solution for event sync
      2. Special Events Issue
    4. E2SAR
      1. e2sar/ibaldin:0.1.0b1 available - MVP completed
      2. ejfat-5 reserved for E2SAR
    5. IB
    6. ejfat-3 - networking corrected - ready for two-FPGA LB work
    7. SC poster submitted - demo in the works
    8. SSDs on ejfat-fs - 20 TB used of 28 TB - mount for EJFAT farm - pending
    9. RAM disks: 1 TB total memory on ejfat-fs, 0.5 TB on others - pending
    10. Lustre storage for EJFAT - not available for Testbed or Ubuntu 20.04
    11. Experiment Halls - beam returns late January/February 2025
    12. Ubuntu 20.04 LTS - support ends in 2025
    13. Time to LAG a U280/NIC(?) at switch
  5. Resources:
    1. HPDF
    2. EJFAT API
    3. EJFAT Status
  6. AOT