Difference between revisions of "EJFAT EPSCI Meeting Aug. 14, 2024"


Revision as of 17:10, 14 August 2024

The meeting time is 2:30pm.

Connection Info:


Agenda:

  1. Previous meeting
  2. Announcements:
    1. ejfat-1 LB up and should behave the same as ESnet Stable LB
  3. Topics
    1. IRI Test Development:
      1. LB version = ESnet Stable version
      2. Data Source:
        1. JLab, CLAS12, pre-triggered events - 1 channel
      3. Data Sink:
        1. Perlmutter - 40 nodes
        2. ORNL/ESnet/JLab IRI Testbed / Defiant - 4 nodes allocated
        3. JLab - 7 nodes available
        4. ERSAP
      4. Test Plans - JLab, ESnet, NERSC:
      5. Prometheus Dashboards
    2. JLab FEG/SRO - will use interim UDP solution for event sync - heats up in August (?)
    3. E2SAR - e2sar/ibaldin:0.1.0a6 available - segmentation done, reassembly not completed yet
    4. IB
  4. System / Administrivia
    1. ejfat-3 - needs configuration corrections for networking, routing, mounting /daqfs - shouldn't be used until further notice
    2. ACAT 2024 Paper submission deadline extended to Aug 18, 2024
    3. TLS cert install across cluster - Done
    4. Turn machines over to sys admin/Puppet (the earlier "except for ejfat-fs" exception is struck out)
    5. SC poster submitted
    6. SSD drives on ejfat-fs - 20 TB used of 28 TB - mount for EJFAT farm access, probably ZFS/RAID for data preservation
    7. RAM disks: 1 TB total memory on ejfat-fs, 0.5 TB on the others
    8. Lustre storage for EJFAT
  5. Resources:
    1. HPDF
    2. EJFAT API
    3. EJFAT Status
  6. AOT