Difference between revisions of "EJFAT Group Meeting May 23, 2024"
Jump to navigation
Jump to search
(Created page with "The meeting time is 11:00am Eastern/USA. === Connection Info: === <div class="toccolours mw-collapsible mw-collapsed"> You can connect using [ https://jlab-org.zoomgov.com/j/...") |
|||
Line 36: | Line 36: | ||
# Announcements: | # Announcements: | ||
# [https://wiki.jlab.org/epsciwiki/index.php/EJFAT#EJFAT_System_Status System Status]: | # [https://wiki.jlab.org/epsciwiki/index.php/EJFAT#EJFAT_System_Status System Status]: | ||
+ | ## LB Installation | ||
+ | ### ejfat-6 LB believed current with ESnet/NERSC | ||
+ | ### ejfat-3 has two U280s - not listing in lspci | ||
+ | ### ejfat-5 hi-speed NIC not configuring | ||
+ | ### Propogate ejfat-6 LB to all ejfat nodes except ejfat-3 | ||
+ | ### All ejfat nodes have IOMMU enabled | ||
# [https://docs.google.com/document/d/1CsBtDZEhK4k9POSeiLF4kzTQVMl7KwZtubVH-aEIyo4/edit NERSC Test Development]: | # [https://docs.google.com/document/d/1CsBtDZEhK4k9POSeiLF4kzTQVMl7KwZtubVH-aEIyo4/edit NERSC Test Development]: | ||
## Data Source: | ## Data Source: | ||
Line 43: | Line 49: | ||
### ERSAP | ### ERSAP | ||
## [https://docs.google.com/document/d/13VvyCMNJW3nIVZMgqOuPn3MBSLmfAl1zLkJAHw8fj04/edit?usp=drivesdk Test Plans - JLab, ESnet, NERSC:] | ## [https://docs.google.com/document/d/13VvyCMNJW3nIVZMgqOuPn3MBSLmfAl1zLkJAHw8fj04/edit?usp=drivesdk Test Plans - JLab, ESnet, NERSC:] | ||
− | ## [http://jiriaf2301:3001/d/bdkbrakggnytcf/test?orgId=1 | + | ## [http://jiriaf2301:3001/d/bdkbrakggnytcf/test?orgId=1 100 Gbps / 25 - 40 Node Test - '''Today 13:00 EDT'''] |
# ORNL/ESnet/JLab IRI Testbed (similar to NERSC) - Ross Miller <rgmiller@ornl.gov> '''project code CSC 266''' | # ORNL/ESnet/JLab IRI Testbed (similar to NERSC) - Ross Miller <rgmiller@ornl.gov> '''project code CSC 266''' | ||
## Waiting on ORNL Internal Network Security | ## Waiting on ORNL Internal Network Security | ||
Line 65: | Line 71: | ||
## Progress of finalizing a reassembly frame format (subordinate to 4.) ( Carl / Stacey ) | ## Progress of finalizing a reassembly frame format (subordinate to 4.) ( Carl / Stacey ) | ||
## Progress on software development for [https://developer.nvidia.com/arm-hpc-devkit NVIDIA Bluefield2 DPU] data steering from NIC to GPU memory ( Amitoj/Cissie ) | ## Progress on software development for [https://developer.nvidia.com/arm-hpc-devkit NVIDIA Bluefield2 DPU] data steering from NIC to GPU memory ( Amitoj/Cissie ) | ||
− | |||
# Resources: | # Resources: | ||
## [https://jeffersonlab.sharepoint.com/sites/HPDF HPDF] | ## [https://jeffersonlab.sharepoint.com/sites/HPDF HPDF] |
Latest revision as of 14:30, 23 May 2024
The meeting time is 11:00am Eastern/USA.
Connection Info:
You can connect using [ https://jlab-org.zoomgov.com/j/1611828967?pwd=UVVCS0pUVW5FMlphT0lRQXdoQ0o4Zz09&from=addon ZoomGov Video conferencing (ID: 161 012 5238)]. (Click "Expand" to the right for details -->):
Meeting URL https://jlab-org.zoomgov.com/j/1611828967 Meeting ID 161 182 8967 Passcode 570041 Want to dial in from a phone? Dial one of the following numbers: US: +1 669 254 5252 or +1 646 828 7666 or +1 551 285 1373 or +1 669 216 1590 or 833 568 8864 (Toll Free) Enter the meeting ID and passcode followed by # Connecting from a room system? Dial: bjn.vc or 199.48.152.152 and enter your meeting ID & passcode
Agenda:
- Previous meeting
- Announcements:
- System Status:
- LB Installation
- ejfat-6 LB believed current with ESnet/NERSC
- ejfat-3 has two U280s - not listing in lspci
- ejfat-5 hi-speed NIC not configuring
- Propogate ejfat-6 LB to all ejfat nodes except ejfat-3
- All ejfat nodes have IOMMU enabled
- LB Installation
- NERSC Test Development:
- Data Source:
- JLAB, CLAS12, pre-triggered events - 1 channel
- Data Sink:
- Perlmutter
- ERSAP
- Test Plans - JLab, ESnet, NERSC:
- 100 Gbps / 25 - 40 Node Test - Today 13:00 EDT
- Data Source:
- ORNL/ESnet/JLab IRI Testbed (similar to NERSC) - Ross Miller <rgmiller@ornl.gov> project code CSC 266
- Waiting on ORNL Internal Network Security
- ALS
- Socializing Experiment Agenda - In Review
- Hoping for First Runs Summer 2024
- JLab FEG/SRO - will use interim UDP solution for event sync - may heat up in a month
- EJFAT Reconfig meetings/Tools to explore options
- EJFAT Phase II
- Need long term solution for CP/Source event sync
- Implementation details in the DAOS gateway.
- Need to spec DAOS Use Cases ?
- Intel standing up special slack channel to discuss DAOS
- Connection Strategy to DAOS
- Specially when to keep track of how the FPGA would DMA event data cells in the future if it was a SmartNIC card. ( Cissie )
- daosfs01 has 2 physical IB cards and can run 2 true engines with each CPU socket hosting one engine.
- CP capable of multi FPGA and multi virtual DP
- Progress of FPGA architecture ( Peter and Jonathan )
- LB FW currently limited to 100 Gbps
- Reassembly work commencing soon
- Progress of finalizing a reassembly frame format (subordinate to 4.) ( Carl / Stacey )
- Progress on software development for NVIDIA Bluefield2 DPU data steering from NIC to GPU memory ( Amitoj/Cissie )
- Resources:
- AOT