Difference between revisions of "EJFAT EPSCI Meeting Aug. 14, 2024"

From epsciwiki
Jump to navigation Jump to search
(Created page with "The meeting time is 2:30pm. === Connection Info: === <div class="toccolours mw-collapsible mw-collapsed"> You can connect using [https://teams.microsoft.com/l/meetup-join/19%...")
 
 
(2 intermediate revisions by the same user not shown)
Line 27: Line 27:
 
#:
 
#:
 
# Announcements:
 
# Announcements:
 +
## ejfat-1 LB up and should behave the same as ESnet Stable LB
 
# Topics
 
# Topics
 
## [https://docs.google.com/document/d/1CsBtDZEhK4k9POSeiLF4kzTQVMl7KwZtubVH-aEIyo4/edit IRI Test Development]:  
 
## [https://docs.google.com/document/d/1CsBtDZEhK4k9POSeiLF4kzTQVMl7KwZtubVH-aEIyo4/edit IRI Test Development]:  
### LB version = New '''Stable''' version
+
### LB version = ESnet '''Stable''' version
 
### Data Source:  
 
### Data Source:  
 
#### JLAB, CLAS12, pre-triggered events - 1 channel
 
#### JLAB, CLAS12, pre-triggered events - 1 channel
Line 39: Line 40:
 
### [https://docs.google.com/document/d/13VvyCMNJW3nIVZMgqOuPn3MBSLmfAl1zLkJAHw8fj04/edit?usp=drivesdk Test Plans - JLab, ESnet, NERSC:]
 
### [https://docs.google.com/document/d/13VvyCMNJW3nIVZMgqOuPn3MBSLmfAl1zLkJAHw8fj04/edit?usp=drivesdk Test Plans - JLab, ESnet, NERSC:]
 
### Prometheus Dashboards
 
### Prometheus Dashboards
## Roles for
 
### ejfat-fs: file server (no logins) ?, computational workstation ?, ?
 
### SSD drives on ejfat-fs - 20TB used of 28TB - mount for EJFAT farm access (?)
 
 
## JLab FEG/SRO - will use interim UDP solution for event sync - '''heats up in August (?)'''
 
## JLab FEG/SRO - will use interim UDP solution for event sync - '''heats up in August (?)'''
 
## E2SAR - e2sar/ibaldin:0.1.0a6 available - segmentation done, reassembly not completed yet
 
## E2SAR - e2sar/ibaldin:0.1.0a6 available - segmentation done, reassembly not completed yet
 
## IB
 
## IB
# Administrivia
+
# System / Administrivia
## ejfat-3 - needs configuration corrections for networking, routing, mounting /daqfs - should't be used until further notice
+
## ejfat-3 - needs configuration corrections for networking, routing, mounting /daqfs - '''should't be used until further notice'''
 
## [https://www.overleaf.com/project/667d9fa6b50f340b46026ba3 ACAT 2024 Paper] submission deadline extended to '''Aug 18, 2024'''
 
## [https://www.overleaf.com/project/667d9fa6b50f340b46026ba3 ACAT 2024 Paper] submission deadline extended to '''Aug 18, 2024'''
## Awaiting TLS cert install across cluster
+
## <s>Awaiting TLS cert install across cluster</s> - '''Done'''
 +
## Turn machines over to sys admin/Puppet - <s>except for ejfat-fs</s>
 +
## [https://jeffersonlab-my.sharepoint.com/:b:/g/personal/baldin_jlab_org/EaFbLl6TB9BBkA7bG7SwxAwB1_V5HB2rcPNH9KKY846NkQ?e=T1m0q5 SC poster '''submitted''']
 +
## SSD drives on ejfat-fs - 20TB used of 28TB - mount for EJFAT farm access probably ZFS/Raid for data preservation
 
## Ram Disks: 1TB Total Mem on ejfat-fs, 0.5 TB others
 
## Ram Disks: 1TB Total Mem on ejfat-fs, 0.5 TB others
## Turn machines over to sys admin/Puppet - '''except for ejfat-fs'''
+
## Lustre storage for EJFAT
## AirMettle SBIR partnership request
 
## [https://jeffersonlab-my.sharepoint.com/:b:/g/personal/baldin_jlab_org/EaFbLl6TB9BBkA7bG7SwxAwB1_V5HB2rcPNH9KKY846NkQ?e=T1m0q5 SC poster submission]
 
 
# Resources:
 
# Resources:
 
## [https://jeffersonlab.sharepoint.com/sites/HPDF HPDF]
 
## [https://jeffersonlab.sharepoint.com/sites/HPDF HPDF]
 
## [https://jeffersonlab.sharepoint.com/:b:/r/sites/SciComp/Shared%20Documents/EPSCI/EJFAT/E2SAR.drawio.pdf?csf=1&web=1&e=rejQL5 '''EJFAT API''']
 
## [https://jeffersonlab.sharepoint.com/:b:/r/sites/SciComp/Shared%20Documents/EPSCI/EJFAT/E2SAR.drawio.pdf?csf=1&web=1&e=rejQL5 '''EJFAT API''']
 +
## [https://wiki.jlab.org/epsciwiki/index.php/EJFAT#EJFAT_System_Status EJFAT Status]
 
# AOT
 
# AOT
 
<hr>
 
<hr>

Latest revision as of 18:59, 14 August 2024

The meeting time is 2:30pm.

Connection Info:


Agenda:

  1. Previous meeting
  2. Announcements:
    1. ejfat-1 LB up and should behave the same as ESnet Stable LB
  3. Topics
    1. IRI Test Development:
      1. LB version = ESnet Stable version
      2. Data Source:
        1. JLAB, CLAS12, pre-triggered events - 1 channel
      3. Data Sink:
        1. Perlmutter - 40 nodes
        2. ORNL/ESnet/JLab IRI Testbed / Defiant - 4 nodes allocated
        3. JLab - 7 nodes available
        4. ERSAP
      4. Test Plans - JLab, ESnet, NERSC:
      5. Prometheus Dashboards
    2. JLab FEG/SRO - will use interim UDP solution for event sync - heats up in August (?)
    3. E2SAR - e2sar/ibaldin:0.1.0a6 available - segmentation done, reassembly not completed yet
    4. IB
  4. System / Administrivia
    1. ejfat-3 - needs configuration corrections for networking, routing, mounting /daqfs - should't be used until further notice
    2. ACAT 2024 Paper submission deadline extended to Aug 18, 2024
    3. Awaiting TLS cert install across cluster - Done
    4. Turn machines over to sys admin/Puppet - except for ejfat-fs
    5. SC poster submitted
    6. SSD drives on ejfat-fs - 20TB used of 28TB - mount for EJFAT farm access probably ZFS/Raid for data preservation
    7. Ram Disks: 1TB Total Mem on ejfat-fs, 0.5 TB others
    8. Lustre storage for EJFAT
  5. Resources:
    1. HPDF
    2. EJFAT API
    3. EJFAT Status
  6. AOT