EJFAT Group Meeting Dec 5, 2024

The meeting time is 11:00am Eastern/USA.

Connection Info:

You can connect using [https://jlab-org.zoomgov.com/j/1611828967?pwd=UVVCS0pUVW5FMlphT0lRQXdoQ0o4Zz09&from=addon ZoomGov Video conferencing (ID: 161 182 8967)]:

Meeting URL
 https://jlab-org.zoomgov.com/j/1611828967

Meeting ID
161 182 8967

Passcode
570041

Want to dial in from a phone?

Dial one of the following numbers:
US: +1 669 254 5252 or +1 646 828 7666 or +1 551 285 1373 or +1 669 216 1590 or 833 568 8864 (Toll Free)

Enter the meeting ID and passcode followed by #

Connecting from a room system?
Dial: bjn.vc or 199.48.152.152 and enter your meeting ID & passcode


Agenda:

  1. Previous meeting
  2. Announcements:
    1. SuperComputing24 Atlanta, GA from Nov 17-22, 2024
    2. ESnet CONFAB event, April 7-11, 2025
      1. EJFAT developer meeting all day Thursday 10th
      2. April 10th 2025 in San Francisco
  3. Topics
    1. IRI Test Development:
      1. Last Test Wednesday Nov 20
        1. Unexpected CP behavior traced to incorrect feedback from backends
      2. Next Test Date/Time = ?????
      3. Data Source:
        1. JLAB, CLAS12, pre-triggered events - 1 channel
      4. Data Sink:
        1. Perlmutter - 80 nodes
        2. ORNL/ESnet/JLab IRI Testbed / Defiant - 4 nodes allocated
        3. JLab - 7 nodes available
        4. FABRIC - nodes available
        5. ERSAP
      5. Test Plans - JLab, ESnet, NERSC:
    2. ALS:
    3. JLab FEG/SRO
      1. Will use interim UDP solution for event sync
      2. Special Events Issue - Completely Out-of-band
        1. Cloud Based message queue
        2. LB isolation from any non-LB processing
        3. Cloud solutions include Kafka and RabbitMQ, ...
    4. E2SAR 0.1.2
      1. segmentation/reassembly complete
      2. .deb packages for Ubuntu 20, 22 and 24
    5. Experiment Halls - beam returns late January/February 2025
    6. Ubuntu 20.04 LTS - support ends in 2025 - next ESnet target 22.04
  4. Status
    1. ejfat-1 - 2-port LAG at switch
    2. ejfat-2
      1. Currently shadowing ESnet Stable deployment for IRI
    3. ejfat-3
      1. two FPGA DP built
      2. FW containers built (Stacey)
      3. Needs Installation Procedure
      4. 4-port LAG at switch
      5. needs CP installation
    4. ejfat-6
      1. Ubuntu 24.04 installed
      2. esnet-smartnic-fw build succeeds with podman
      3. issues with podman compose
  5. EJFAT Phase II
    1. Architecture change in control/data paths for FPGA (SRIOV)
    2. Adding PCIe AES
  6. AOT

Notes

  1. LLDP needs IOMMU
  2. EJFAT nodes:
    1. 16 NUMA domains
    2. DPDK must run the poll mode driver on a CPU in the NUMA domain of the FPGA for LLDP messages
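
With 16 NUMA domains per node, picking a CPU for the DPDK driver by hand is error-prone. The sketch below shows one possible way to look up the CPUs local to the FPGA through the standard Linux sysfs entries `numa_node` and `cpulist`. The helper names and the `sysfs` parameter are illustrative assumptions, not part of any EJFAT tooling, and the PCI address of the FPGA has to be supplied by the caller.

```python
from pathlib import Path


def parse_cpulist(text: str) -> list[int]:
    """Expand a Linux cpulist string such as "0-3,8,10-11" into CPU ids."""
    cpus: list[int] = []
    for part in text.strip().split(","):
        if not part:
            continue  # an empty cpulist yields no CPUs
        if "-" in part:
            first, last = part.split("-")
            cpus.extend(range(int(first), int(last) + 1))
        else:
            cpus.append(int(part))
    return cpus


def cpus_local_to_device(pci_addr: str, sysfs: Path = Path("/sys")) -> list[int]:
    """Return the CPUs in the NUMA node that a PCI device is attached to.

    pci_addr is the full PCI address of the device (domain:bus:device.function).
    sysfs is overridable only so the function can be exercised off-target.
    """
    node_file = sysfs / "bus/pci/devices" / pci_addr / "numa_node"
    node = int(node_file.read_text())
    if node < 0:
        # The kernel reports -1 when it has no NUMA affinity for the device.
        raise RuntimeError(f"no NUMA affinity reported for {pci_addr}")
    cpulist_file = sysfs / "devices/system/node" / f"node{node}" / "cpulist"
    return parse_cpulist(cpulist_file.read_text())
```

Calling `cpus_local_to_device()` with the FPGA's PCI address returns candidate cores; one of them could then be handed to the DPDK application through the EAL core list option (`-l`).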

Minutes