Difference between revisions of "EJFAT Group Meeting Mar. 3, 2022"

From epsciwiki
Jump to navigation Jump to search
 
(20 intermediate revisions by the same user not shown)
Line 40: Line 40:
 
** Awaiting Compute Equip.- ETA 1 June
 
** Awaiting Compute Equip.- ETA 1 June
 
** Awaiting Networking Equip. - ETA 1 July
 
** Awaiting Networking Equip. - ETA 1 July
** <s>End-to-end EJFAT ERSAP solution</s>
+
** End-to-end EJFAT ERSAP solution
 
** Building Interim Test Lab
 
** Building Interim Test Lab
 
*** Use FPGA port #1 for local host subnet testing
 
*** Use FPGA port #1 for local host subnet testing
 
*** Use FPGA port #2 for switch fabric testing
 
*** Use FPGA port #2 for switch fabric testing
*** Use (2) spare/borrowed switches in DAQ data center/lab
+
*** Use (2) spare/borrowed switches
 
*** Using spare 8 nodes (Abbott)
 
*** Using spare 8 nodes (Abbott)
**** <s>Decision on Hall-D spare 10Gbs NICs for compute nodes</s>
+
*** Using Hall-D spare 10Gbs NICs
**** OS install on interim boxes - next week
 
*** <s>Switch config for Jumbo frames</s>
 
 
*** [https://jeffersonlab-my.sharepoint.com/:b:/r/personal/goodrich_jlab_org/Documents/EJFAT/EJFAT%20Network%20Setup.pdf?csf=1&web=1&e=hkUo8k Diagram]
 
*** [https://jeffersonlab-my.sharepoint.com/:b:/r/personal/goodrich_jlab_org/Documents/EJFAT/EJFAT%20Network%20Setup.pdf?csf=1&web=1&e=hkUo8k Diagram]
 
* Pending:
 
* Pending:
** Minor f/w change for 'garbage' packets
+
** <s>Minor f/w change for 'garbage' packets</s>
 
** Support C libraries for LB Host Control Plane
 
** Support C libraries for LB Host Control Plane
 
** ESnet smartnic open-source GitHub repo (April)
 
** ESnet smartnic open-source GitHub repo (April)
Line 58: Line 56:
 
** Connect
 
** Connect
 
*** FPGA port #2 to switch
 
*** FPGA port #2 to switch
*** Melanox NIC port #2 to switch
+
*** Mellanox NIC port #2 to switch
** FPGA routing based on
+
*** Switch config for FPGA
*** Event-Id (Tick) + Data-Id (ROC-Id) ?
+
** CentOS 7 install on interim boxes - next week
*** Multiple Destinations for Event-Id + Data-Id ?
 
 
** C-based control plane
 
** C-based control plane
 
*** Feedback from Compute hosts design
 
*** Feedback from Compute hosts design
*** Control Plane Arp cache / network good citizen
+
*** Control Plane Arp cache / network good citizen - P4 may do
*** SLURM
+
** Control Plane daemon for compute host
** Jumbo Frames - currently limited to 1472B frames - fix is in / not yet installed
+
** Jumbo Frames
 
** IPV6 testing
 
** IPV6 testing
 
** EJFAT Subnet
 
** EJFAT Subnet
** Performance Measures (RT2022 - April 01):
+
** Performance Measures (RT2022 - April 01 submission):
 +
*** Stress Test ERSAP / EJFAT
 
*** Payload Size
 
*** Payload Size
 
*** Reassembly
 
*** Reassembly
 
*** Multiple Back-Ends
 
*** Multiple Back-Ends
 +
*** AOT
 
* [[Test Plans | Test Plan]]
 
* [[Test Plans | Test Plan]]
 
*:
 
*:
 
* AOT
 
* AOT
 
<hr>
 
<hr>
* Actions:
 
** David L, will check on spare 10Gbs NICs from Hall-D
 

Latest revision as of 17:09, 3 March 2022

The meeting time is 11:00am.

Connection Info:

You can connect using ZoomGov Video conferencing (ID: 161 012 5238). (Click "Expand" to the right for details -->):

Meeting URL
 https://jlab-org.zoomgov.com/j/1610125238?pwd=QnEvcjV6VFFndWZsQW15SmJKU0RJZz09&from=addon

Meeting ID
161 012 5238

Passcode
503371

Want to dial in from a phone?

Dial one of the following numbers:
US: +1 669 254 5252 or +1 646 828 7666 or +1 551 285 1373 or +1 669 216 1590 or 833 568 8864 (Toll Free)

Enter the meeting ID and passcode followed by #

Connecting from a room system?
Dial: bjn.vc or 199.48.152.152 and enter your meeting ID & passcode

Agenda:

  • Previous meeting
  • Situation:
    • Testing with ERSAP on FPGA LB
    • Using script based LB Control Plane
    • Awaiting Compute Equip.- ETA 1 June
    • Awaiting Networking Equip. - ETA 1 July
    • End-to-end EJFAT ERSAP solution
    • Building Interim Test Lab
      • Use FPGA port #1 for local host subnet testing
      • Use FPGA port #2 for switch fabric testing
      • Use (2) spare/borrowed switches
      • Using spare 8 nodes (Abbott)
      • Using Hall-D spare 10Gbs NICs
      • Diagram
  • Pending:
    • Minor f/w change for 'garbage' packets
    • Support C libraries for LB Host Control Plane
    • ESnet smartnic open-source GitHub repo (April)
    • ESnet private, forkable Jlab P4 and simulations GitHub repo (April)
  • To Do:
    • Connect
      • FPGA port #2 to switch
      • Mellanox NIC port #2 to switch
      • Switch config for FPGA
    • CentOS 7 install on interim boxes - next week
    • C-based control plane
      • Feedback from Compute hosts design
      • Control Plane Arp cache / network good citizen - P4 may do
    • Control Plane daemon for compute host
    • Jumbo Frames
    • IPV6 testing
    • EJFAT Subnet
    • Performance Measures (RT2022 - April 01 submission):
      • Stress Test ERSAP / EJFAT
      • Payload Size
      • Reassembly
      • Multiple Back-Ends
      • AOT
  • Test Plan
  • AOT