Difference between revisions of "EJFAT EPSCI Meeting Dec. 18, 2024"
Jump to navigation
Jump to search
(Created page with "The meeting time is 2:30pm. === Connection Info: === <div class="toccolours mw-collapsible mw-collapsed"> You can connect using [https://teams.microsoft.com/l/meetup-join/19%...") |
|||
(3 intermediate revisions by the same user not shown) | |||
Line 27: | Line 27: | ||
#: | #: | ||
# Announcements: | # Announcements: | ||
+ | ## [https://indico.cern.ch/event/1330797/papers/5796662/ ACAT2024 Paper Accepted] | ||
## ESnet CONFAB event, which runs from April 7 to 11. | ## ESnet CONFAB event, which runs from April 7 to 11. | ||
### EJFAT developer meeting all day Thursday 10th | ### EJFAT developer meeting all day Thursday 10th | ||
### April 10th 2025 in San Francisco | ### April 10th 2025 in San Francisco | ||
## SkuTech Interest in EJFAT | ## SkuTech Interest in EJFAT | ||
+ | ### JLab supplied Letter of Support for pre-proposal for SBIR Phase II | ||
# Topics | # Topics | ||
## '''Local CP testing''' | ## '''Local CP testing''' | ||
Line 43: | Line 45: | ||
#### ERSAP | #### ERSAP | ||
### [https://docs.google.com/document/d/13VvyCMNJW3nIVZMgqOuPn3MBSLmfAl1zLkJAHw8fj04/edit?usp=drivesdk Test Plans - JLab, ESnet, NERSC:] | ### [https://docs.google.com/document/d/13VvyCMNJW3nIVZMgqOuPn3MBSLmfAl1zLkJAHw8fj04/edit?usp=drivesdk Test Plans - JLab, ESnet, NERSC:] | ||
− | ### | + | ### [https://github.com/JeffersonLab/E2SAR/blob/main/scripts/notebooks/EJFAT/E2SAR-U280-lb.ipynb E2SAR Integration] |
− | |||
− | |||
## JLab FEG/SRO | ## JLab FEG/SRO | ||
### will use interim UDP solution for event sync | ### will use interim UDP solution for event sync | ||
Line 51: | Line 51: | ||
#### Cloud message queue solutions include Kafka and RabbitMQ, ... | #### Cloud message queue solutions include Kafka and RabbitMQ, ... | ||
#### LB isolation from any non-LB processing | #### LB isolation from any non-LB processing | ||
− | ## E2SAR 0.1.4 | + | ## [https://github.com/JeffersonLab/E2SAR/ E2SAR 0.1.4] |
### segmentation/reassembly complete | ### segmentation/reassembly complete | ||
### .deb packages for Ubuntu 20, 22 and 24 are now available (they contain E2SAR library, headers, executables as well as appropriate versions of gRPC and Boost dependencies, all installed under /usr/local), as well as the latest Docker image | ### .deb packages for Ubuntu 20, 22 and 24 are now available (they contain E2SAR library, headers, executables as well as appropriate versions of gRPC and Boost dependencies, all installed under /usr/local), as well as the latest Docker image | ||
Line 80: | Line 80: | ||
## U280 Supported indefinitely | ## U280 Supported indefinitely | ||
# [https://www.overleaf.com/project/667d9fa6b50f340b46026ba3 ACAT 2024 Paper] - In Review | # [https://www.overleaf.com/project/667d9fa6b50f340b46026ba3 ACAT 2024 Paper] - In Review | ||
+ | |||
=== Notes === | === Notes === | ||
# LLDP needs IOMMU | # LLDP needs IOMMU |
Latest revision as of 15:15, 19 December 2024
The meeting time is 2:30pm.
Connection Info:
You can connect using Teams Link. (Click "Expand" to the right for details -->):
Agenda:
- Previous meeting
- Announcements:
- ACAT2024 Paper Accepted
- ESnet CONFAB event, which runs from April 7 to 11.
- EJFAT developer meeting all day Thursday 10th
- April 10th 2025 in San Francisco
- SkuTech Interest in EJFAT
- JLab supplied Letter of Support for pre-proposal for SBIR Phase II
- Topics
- Local CP testing
- IRI Test Development:
- Data Source:
- JLAB, CLAS12, pre-triggered events - 1 channel
- Data Sink:
- Perlmutter - 80 nodes
- ORNL/ESnet/JLab IRI Testbed / Defiant - 4 nodes allocated
- JLab - 7 nodes available
- FABRIC - nodes available
- ERSAP
- Test Plans - JLab, ESnet, NERSC:
- E2SAR Integration
- Data Source:
- JLab FEG/SRO
- will use interim UDP solution for event sync
- Special Events Issue - Completely Out-of-band
- Cloud message queue solutions include Kafka and RabbitMQ, ...
- LB isolation from any non-LB processing
- E2SAR 0.1.4
- segmentation/reassembly complete
- .deb packages for Ubuntu 20, 22 and 24 are now available (they contain E2SAR library, headers, executables as well as appropriate versions of gRPC and Boost dependencies, all installed under /usr/local), as well as the latest Docker image
- ALS: E2SAR integration
- IB
- Storage
- SSD drives on ejfat-fs - 20TB used of 28TB - mounted for EJFAT farm - permissions issue
- Ram Disks: 1TB Total Mem on ejfat-fs, 0.5 TB others
- Repurposing /dev/sdb to be used for user storage
- Storage Areas NOT to be backed up could be marked as scratch
- Have an opportunity to consolidate wares on SSD for consistent SC backup procedure ?.
- Experiment Halls - beam returns late January/February 2025
- Ubuntu 20.04 LTS - support ends in 2025 - next ESnet target 22.04
- CP: Control Web UI (127.0.0.1:8081) needs SSH tunnel
- EJFAT II
- ESnet interested in partnering for beachhead in FPGA/GPU AI space
- Separate Project
- FAST program coming up
- May get free help from Xilinx
- Might target VERSA release
- Resources:
- U280’s are discontinued.
- New LB purchases U55C
- U55C bitfiles available 1 year out
- U280 Supported indefinitely
- ACAT 2024 Paper - In Review
Notes
- LLDP needs IOMMU
- FW containers need boot init script
- EJFAT nodes:
- 16 NUMA domains
- DPDK must run portmode driver on CPU in NUMA domain of FPGA for LLDP messages
- EJFAT II
- architecture change in control/data paths for FPGA (SRIOV)
- adding PCIE AES
- AOT