Difference between revisions of "EJFAT EPSCI Meeting Sep. 18, 2024"


Revision as of 13:50, 23 September 2024

The meeting time is 2:30pm.

Connection Info:


Agenda:

  1. Previous meeting
  2. Announcements:
    1. ACAT 2024 Paper - In Review
    2. ACAT 2022 Paper - accepted
    3. Scicomp Test Routers installation 09/18/2024
    4. U280s are discontinued.
      1. New LB purchases: U55C
      2. U55C bitfiles available 1 year out
      3. U280 supported indefinitely
  3. Status
  4. Topics
    1. IRI Test Development:
      1. LB version = ESnet Stable version
      2. Data Source:
        1. JLab, CLAS12, pre-triggered events - 1 channel
      3. Data Sink:
        1. Perlmutter - 40 nodes
        2. ORNL/ESnet/JLab IRI Testbed / Defiant - 4 nodes allocated
        3. JLab - 7 nodes available
        4. ERSAP
      4. Test Plans - JLab, ESnet, NERSC:
      5. Prometheus Dashboards
      6. Analyzing data from Aug 29, 2024
      7. The Prometheus dashboard is accessible on port 1717 of the ejfat-fs node. The test data is under "100g-nersc-ornl / ejfat-nersc-ornl". The test interval runs from approximately 17:05 to 18:20 UTC on August 29, 2024. To log in to Grafana, use "ejfat" as both the username and the password.
    2. JLab FEG/SRO
      1. will use interim UDP solution for event sync
      2. Special Events Issue
        1. Completely Out-of-band
        2. Three LBs
        3. Pass-through / Out-of-band
        4. In-band - new Req for LB F/W
        5. Pass-through / 3 LBs
        6. Delay Line Technique
        7. Extra Meta Data in Each Event
        8. LB-CODA
        9. LB-CODA-CP
    3. E2SAR
      1. e2sar/ibaldin:0.1.0b1 available - MVP completed
      2. ejfat-5 reserved for E2SAR
    4. IB
    5. ejfat-3 - ready for FPGA cluster LB install
    6. ejfat-6 - Ubuntu 24.04 being installed - no docker
    7. SC poster submitted - demo in the works
    8. SSD drives on ejfat-fs - 20TB used of 28TB - mount for EJFAT farm - pending
    9. Ram Disks: 1TB Total Mem on ejfat-fs, 0.5 TB others - pending
    10. Experiment Halls - beam returns late January/February 2025
    11. Ubuntu 20.04 LTS - support ends in 2025
    12. LAGing U280 to switch on ejfat-1 - pending
  5. Resources:
    1. HPDF
    2. EJFAT API
    3. EJFAT Status
  6. AOT
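
For the dashboard item under IRI Test Development, the Aug 29, 2024 test window can be converted into the epoch timestamps that a Prometheus range query expects. The sketch below only builds the request URL and does not contact any server. It assumes that the host resolves as "ejfat-fs" and that the standard Prometheus HTTP API is served on the same port 1717 as the dashboard; neither is confirmed by the agenda. The `range_query_url` helper is hypothetical, and `up` stands in for whatever metric is actually of interest.

```python
from datetime import datetime, timezone
from urllib.parse import urlencode

# Test window from the agenda: Aug 29, 2024, about 17:05 to 18:20 UTC.
START = datetime(2024, 8, 29, 17, 5, tzinfo=timezone.utc)
END = datetime(2024, 8, 29, 18, 20, tzinfo=timezone.utc)


def range_query_url(base_url, promql, start=START, end=END, step_s=15):
    """Build a Prometheus /api/v1/query_range URL for the given window.

    Hypothetical helper: it only formats the URL, it sends no request.
    """
    params = {
        "query": promql,                  # PromQL expression to evaluate
        "start": int(start.timestamp()),  # window start, Unix seconds
        "end": int(end.timestamp()),      # window end, Unix seconds
        "step": f"{step_s}s",             # sample resolution
    }
    return f"{base_url.rstrip('/')}/api/v1/query_range?{urlencode(params)}"


# Assumed host name and API port; "up" is a placeholder metric.
url = range_query_url("http://ejfat-fs:1717", "up")
print(url)
```

The same start and end values can be entered in the Grafana time picker (set to UTC) to view the "100g-nersc-ornl / ejfat-nersc-ornl" data for the test interval.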