Difference between revisions of "EJFAT EPSCI Meeting Oct. 9, 2024"

Line 36:
  ## [https://docs.google.com/document/d/1CsBtDZEhK4k9POSeiLF4kzTQVMl7KwZtubVH-aEIyo4/edit IRI Test Development]:
  ### '''New Test Thursday Oct 10, 14:30 - 15:30'''
- ### LB version = ESnet Stable version
+ ### LB version = ESnet ??? version
  ### Data Source:
  #### JLAB, CLAS12, pre-triggered events - 1 channel
Line 53:
  #### LB isolation from any non-LB processing
  ## E2SAR
- ### e2sar/ibaldin:0.1.02 available - MVP completed
+ ### e2sar/ibaldin:0.1.2 available - MVP completed
  ### '''ejfat-5 reserved for E2SAR'''
- ### E2SAR 0.1.0 .deb packages for Ubuntu 20, 22 and 24 are now available (they contain e2sar library, headers, executables as well as appropriate versions of gRPC and Boost dependencies, all installed under /usr/local), as well as the latest Docker image
+ ### E2SAR 0.1.2 .deb packages for Ubuntu 20, 22 and 24 are now available (they contain E2SAR library, headers, executables as well as appropriate versions of gRPC and Boost dependencies, all installed under /usr/local), as well as the latest Docker image
  ## IB
  ## ejfat-3 - two FPGA DP built, running - with 4-port LAG at switch, needs CP installation
Line 70:
  ## [https://wiki.jlab.org/epsciwiki/index.php/EJFAT#EJFAT_System_Status EJFAT Status]
  # AOT
+ # Minutes
+ ## /dev/sdb to be used for user storage
+ ## Storage Areas NOT to be backed up could be marked as ''volatile''
+ ## Have an opportunity to consolidate wares on SSD for consistent SC backup procedure.
  <hr>

Latest revision as of 19:10, 9 October 2024

The meeting time is 2:30pm.

Connection Info:


Agenda:

  1. Previous meeting
  2. Announcements:
    1. ACAT 2024 Paper - In Review
    2. U280s are discontinued.
      1. New LB purchases: U55C
      2. U55C bitfiles available 1 year out
      3. U280 Supported indefinitely
  3. Status
  4. Topics
    1. IRI Test Development:
      1. New Test Thursday Oct 10, 14:30 - 15:30
      2. LB version = ESnet ??? version
      3. Data Source:
        1. JLAB, CLAS12, pre-triggered events - 1 channel
      4. Data Sink:
        1. Perlmutter - 40 nodes
        2. ORNL/ESnet/JLab IRI Testbed / Defiant - 4 nodes allocated
        3. JLab - 7 nodes available
        4. ERSAP
      5. Test Plans - JLab, ESnet, NERSC:
      6. Prometheus Dashboards
      7. The Prometheus dashboard can be accessed on port 1717 of the ejfat-fs node. The test data is located at "100g-nersc-ornl / ejfat-nersc-ornl". The test ran from roughly 17:05 to 18:20 UTC on August 29, 2024. To log in to Grafana, use "ejfat" as both the username and the password.
    2. JLab FEG/SRO
      1. Will use an interim UDP solution for event sync
      2. Special Events Issue - Completely Out-of-band
        1. Cloud Based message queue
        2. LB isolation from any non-LB processing
    3. E2SAR
      1. e2sar/ibaldin:0.1.2 available - MVP completed
      2. ejfat-5 reserved for E2SAR
      3. E2SAR 0.1.2 .deb packages for Ubuntu 20, 22 and 24 are now available (they contain E2SAR library, headers, executables as well as appropriate versions of gRPC and Boost dependencies, all installed under /usr/local), as well as the latest Docker image
    4. IB
    5. ejfat-3 - two FPGA DP built, running - with 4-port LAG at switch, needs CP installation
    6. ejfat-6 - Ubuntu 24.04 installed - esnet-smartnic-fw build succeeds with podman, issues with podman compose
    7. SC poster submitted - demo in the works
    8. SSD drives on ejfat-fs - 20 TB used of 28 TB - mounted for EJFAT farm
    9. RAM disks: 1 TB total memory on ejfat-fs, 0.5 TB on the others
    10. Disk sdb - user storage or HA OS fail-over mirror?
    11. Experiment Halls - beam returns late January/February 2025
    12. Ubuntu 20.04 LTS - support ends in 2025
  5. Resources:
    1. HPDF
    2. EJFAT API
    3. EJFAT Status
  6. AOT
  7. Minutes
    1. /dev/sdb to be used for user storage
    2. Storage Areas NOT to be backed up could be marked as volatile
    3. There is an opportunity to consolidate wares on the SSD for a consistent SC backup procedure.
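
For the Prometheus dashboard item under IRI Test Development, the quoted test window (about 17:05 to 18:20 UTC on August 29, 2024) can be turned into the epoch-millisecond `from`/`to` values that Grafana accepts as URL query parameters. The following is only a sketch. The `http` scheme, the bare hostname `ejfat-fs`, and the dashboard path `/d/example` are assumptions for illustration and do not come from the notes; only the port (1717) and the time window do.

```python
from datetime import datetime, timedelta, timezone
from urllib.parse import urlencode

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def to_epoch_ms(dt: datetime) -> int:
    """Convert a timezone-aware datetime to integer milliseconds since the Unix epoch."""
    return (dt - EPOCH) // timedelta(milliseconds=1)


def grafana_time_query(start: datetime, end: datetime) -> str:
    """Build the 'from=...&to=...' query string for an absolute Grafana time range."""
    return urlencode({"from": to_epoch_ms(start), "to": to_epoch_ms(end)})


# Test window quoted in the notes (approximate, UTC).
TEST_START = datetime(2024, 8, 29, 17, 5, tzinfo=timezone.utc)
TEST_END = datetime(2024, 8, 29, 18, 20, tzinfo=timezone.utc)

# Assumptions: scheme, hostname form, and dashboard path are hypothetical.
BASE_URL = "http://ejfat-fs:1717"
DASHBOARD_PATH = "/d/example"

url = f"{BASE_URL}{DASHBOARD_PATH}?{grafana_time_query(TEST_START, TEST_END)}"

if __name__ == "__main__":
    print(url)
```

Opening the printed URL in a browser on a machine that can reach the node should land on the chosen time range once the real dashboard path is substituted.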