Difference between revisions of "EJFAT Group Meeting May. 5, 2022"

From epsciwiki
Jump to navigation Jump to search
(Created page with " The meeting time is 11:00am. === Connection Info: === <div class="toccolours mw-collapsible mw-collapsed"> You can connect using [https://jlab-org.zoomgov.com/j/1610125238?p...")
 
 
(10 intermediate revisions by the same user not shown)
Line 32: Line 32:
 
<!-------------------------------------------------------------------------------------------------->
 
<!-------------------------------------------------------------------------------------------------->
 
=== Agenda: ===
 
=== Agenda: ===
* [[EJFAT Group Meeting Apr. 21, 2022 | Previous meeting]]
+
* [[EJFAT Group Meeting Apr. 28, 2022 | Previous meeting]]
 
*:
 
*:
 
* Situation:
 
* Situation:
** <s>Testing with End-to-end EJFAT ERSAP solution on FPGA LB</s>
+
** '''Rec'd new f/w build 28 April'''
** <s>Jumbo Frames - indra-s2,s3, alkaid, fpga</s>
+
*** [https://docs.google.com/document/d/1ssw8sye7jExtPCJVejloe8hNkyWOcxEQzVmm45xs5-w/edit#heading=h.mqilsqsmmpek Specs]
** Linux IP stack buffer size increased
+
*** Restores Jumbo Frames
 +
*** arp, ping - working
 +
*** Port entropy field - Passed Test for data_id stream horizontal reassembly with 10 streams
 
** Using script based LB Control Plane
 
** Using script based LB Control Plane
 
** ERSAP feed end bottleneck needs investigation; Timmer's blaster may provide relief
 
** ERSAP feed end bottleneck needs investigation; Timmer's blaster may provide relief
** EJFAT subnet
+
** EJFAT subnet VLAN 937 172.19.22.0/24
*** VLAN 937 172.19.22.0/24
+
** EJFAT equip inventory:
*** 10Gbs NIC, <s>but '''need cables''' </s>(copper twinX will do)
+
*** Loaners:
** EJFAT loaner equip inventory:
+
**** (2) switches:  ejfat-sw, daq-cc-f109-sw-1
*** (2) switches:  ejfat-sw, daq-cc-f109-sw-1
+
**** (3) DAQ dev machines ''indra-s[1-3]'' 129.57.29/109.23[0-2]
*** (3) DAQ dev machines indra-s[1-3]
+
**** (4) DAQ Farm machines ''dafarm6[1-4]'' currently on 129.57.29.17[1-4]
*** (4) DAQ Farm machines ''dafarm61 dafarm62 dafarm63 dafarm64'' currently on 129.57.29.0/24 subnet w/ additional i/f available for EJFAT subnet
+
**** (17) Hall-D machines - ''gluon120-36'' 129.57.172.1[20-36]
*** (17) Hall-D machines - gluon120-36
+
**** 10Gbs NICs
** Testing newly received FPGA load
+
*** On Order:
*** [https://docs.google.com/document/d/1ssw8sye7jExtPCJVejloe8hNkyWOcxEQzVmm45xs5-w/edit#heading=h.mqilsqsmmpek Specs]
+
**** Compute Equip.- ETA 1 June - '''sans 100Gbs NICs''' (ETA 1 July)
*** arp, ping - working
+
**** Networking Equip. - ETA <s>1 July</s> 5 October
*** Issues:
 
**** Port entropy field - currently broken, update pending
 
**** MTU size restricted to max 1500, update pending
 
**** Possibly incorrect control message responses being investigated
 
*** '''Rec'd new f/w build 28 April'''
 
 
* Pending:
 
* Pending:
** Compute Equip.- ETA 1 June - '''sans 100Gbs NICs''' (ETA 1 July)
+
** Support C libraries for LB Host Control Plane - in unit test
** Networking Equip. - ETA <s>1 July</s> 5 October
+
** ESnet smartnic open-source GitHub repo (May)
** Support C libraries for LB Host Control Plane
+
** ESnet private, forkable Jlab P4 and simulations GitHub repo (May)
** ESnet smartnic open-source GitHub repo (April)
 
** ESnet private, forkable Jlab P4 and simulations GitHub repo (April)
 
 
* To Do:
 
* To Do:
 
** Near Term:
 
** Near Term:
 
*** Hall-B FT calorimeter and hodoscope streaming readout test - Pending OK from Sergey B.
 
*** Hall-B FT calorimeter and hodoscope streaming readout test - Pending OK from Sergey B.
 
**** May be able to use Abbott's indra-s1 setup
 
**** May be able to use Abbott's indra-s1 setup
**** May be to use new VTP f/w with  Hall-B VTP's - (Ben Raydo)
+
**** May be able to use new VTP f/w with  Hall-B VTP's - (Ben Raydo)
 
**** CODA 3.10 + ERSAP for new VTP f/w
 
**** CODA 3.10 + ERSAP for new VTP f/w
 
**** CODA 2.0 for old VTP f/w
 
**** CODA 2.0 for old VTP f/w
*** [https://www.epj-conferences.org/articles/epjconf/abs/2021/05/epjconf_chep2021_04005/epjconf_chep2021_04005.html HOSS Hall-D EJFAT  use case]
+
**** [https://jeffersonlab-my.sharepoint.com/:p:/r/personal/goodrich_jlab_org/Documents/EJFAT/hall-b_test.pptx?d=w31891fd52c1a420ea2b29efcdf5f9ed2&csf=1&web=1&e=JGyxHO Diagram]
 +
*** [https://www.epj-conferences.org/articles/epjconf/abs/2021/05/epjconf_chep2021_04005/epjconf_chep2021_04005.html HOSS]
 
**** parallelize writing of raw data files
 
**** parallelize writing of raw data files
 
**** distribute raw data across multiple compute nodes for calibration skims
 
**** distribute raw data across multiple compute nodes for calibration skims
 
**** 1 Gbs at hi-luminosity
 
**** 1 Gbs at hi-luminosity
 
**** Control Plane
 
**** Control Plane
***** Feedback from Compute hosts
+
***** Will interact with SLURM
***** Control Plane daemon for compute host
+
***** Python based (?)
 +
***** Control Plane daemon for compute host (?)
 
***** Demonstrate CP based flexibility/elasticity
 
***** Demonstrate CP based flexibility/elasticity
 
**** Hall-D comms with DAQ 109 subnet require network customization; DAQ 29 subnet available
 
**** Hall-D comms with DAQ 109 subnet require network customization; DAQ 29 subnet available
**** [https://docs.google.com/presentation/d/1m3rFm-1GymYv8zGimlAjL1NmWtXVfyIQdGzhx_j_BKE/edit?usp=sharing Slides]
+
**** [https://docs.google.com/presentation/d/1m3rFm-1GymYv8zGimlAjL1NmWtXVfyIQdGzhx_j_BKE/edit?usp=sharing Hall-D EJFAT use case]
**** [https://jeffersonlab-my.sharepoint.com/personal/bmorris_jlab_org/Documents/Microsoft%20Teams%20Chat%20Files/JLab%20Network%20-%20HallD-to-EJFAT.png Netwok Diagram]
+
**** [https://jeffersonlab-my.sharepoint.com/personal/bmorris_jlab_org/Documents/Microsoft%20Teams%20Chat%20Files/JLab%20Network%20-%20HallD-to-EJFAT.png Hall-D EJFAT Network Diagram]
 
** Downstream:
 
** Downstream:
 
*** [http://www.dpdk.org DPDK]
 
*** [http://www.dpdk.org DPDK]

Latest revision as of 12:20, 6 May 2022

The meeting time is 11:00am.

Connection Info:

You can connect using ZoomGov Video conferencing (ID: 161 012 5238). (Click "Expand" to the right for details -->):

Meeting URL
 https://jlab-org.zoomgov.com/j/1610125238?pwd=QnEvcjV6VFFndWZsQW15SmJKU0RJZz09&from=addon

Meeting ID
161 012 5238

Passcode
503371

Want to dial in from a phone?

Dial one of the following numbers:
US: +1 669 254 5252 or +1 646 828 7666 or +1 551 285 1373 or +1 669 216 1590 or 833 568 8864 (Toll Free)

Enter the meeting ID and passcode followed by #

Connecting from a room system?
Dial: bjn.vc or 199.48.152.152 and enter your meeting ID & passcode

Agenda:

  • Previous meeting
  • Situation:
    • Rec'd new f/w build 28 April
      • Specs
      • Restores Jumbo Frames
      • arp, ping - working
      • Port entropy field - Passed Test for data_id stream horizontal reassembly with 10 streams
    • Using script based LB Control Plane
    • ERSAP feed end bottleneck needs investigation; Timmer's blaster may provide relief
    • EJFAT subnet VLAN 937 172.19.22.0/24
    • EJFAT equip inventory:
      • Loaners:
        • (2) switches: ejfat-sw, daq-cc-f109-sw-1
        • (3) DAQ dev machines indra-s[1-3] 129.57.29/109.23[0-2]
        • (4) DAQ Farm machines dafarm6[1-4] currently on 129.57.29.17[1-4]
        • (17) Hall-D machines - gluon120-36 129.57.172.1[20-36]
        • 10Gbs NICs
      • On Order:
        • Compute Equip.- ETA 1 June - sans 100Gbs NICs (ETA 1 July)
        • Networking Equip. - ETA 1 July 5 October
  • Pending:
    • Support C libraries for LB Host Control Plane - in unit test
    • ESnet smartnic open-source GitHub repo (May)
    • ESnet private, forkable Jlab P4 and simulations GitHub repo (May)
  • To Do:
    • Near Term:
      • Hall-B FT calorimeter and hodoscope streaming readout test - Pending OK from Sergey B.
        • May be able to use Abbott's indra-s1 setup
        • May be able to use new VTP f/w with Hall-B VTP's - (Ben Raydo)
        • CODA 3.10 + ERSAP for new VTP f/w
        • CODA 2.0 for old VTP f/w
        • Diagram
      • HOSS
        • parallelize writing of raw data files
        • distribute raw data across multiple compute nodes for calibration skims
        • 1 Gbs at hi-luminosity
        • Control Plane
          • Will interact with SLURM
          • Python based (?)
          • Control Plane daemon for compute host (?)
          • Demonstrate CP based flexibility/elasticity
        • Hall-D comms with DAQ 109 subnet require network customization; DAQ 29 subnet available
        • Hall-D EJFAT use case
        • Hall-D EJFAT Network Diagram
    • Downstream:
  • AOT

Minutes: