Difference between revisions of "EPSCI Group Meeting Apr. 19, 2021"

From epsciwiki
Jump to navigation Jump to search
(Created page with " The meeting time is 10:00am. === Connection Info: === <div class="toccolours mw-collapsible mw-collapsed"> You can connect using [https://bluejeans.com/253300597 BlueJeans V...")
 
 
(One intermediate revision by the same user not shown)
Line 35: Line 35:
 
# Announcements
 
# Announcements
 
#* Theory Seminar @1pm: Simulating and unfolding LHC events with generative networks ([https://urldefense.proofpoint.com/v2/url?u=https-3A__jlab.us11.list-2Dmanage.com_track_click-3Fu-3D1de9c43e6d5e45ff0ecd664cd-26id-3Dd79203c3ef-26e-3D6b79ff1be4&d=DwMFaQ&c=CJqEzB1piLOyyvZjb8YUQw&r=i0p-C_T1RieVsCZl8ZnUyQ&m=jln4-LFF6Tz2k8VLTzDdhM63Hjyg6VJR__AN95lkr5E&s=PNeyg4qOQJKFgJkNVfJkCBxiRIdLsjPy-Il97lK_qI0&e= bluejeans])
 
#* Theory Seminar @1pm: Simulating and unfolding LHC events with generative networks ([https://urldefense.proofpoint.com/v2/url?u=https-3A__jlab.us11.list-2Dmanage.com_track_click-3Fu-3D1de9c43e6d5e45ff0ecd664cd-26id-3Dd79203c3ef-26e-3D6b79ff1be4&d=DwMFaQ&c=CJqEzB1piLOyyvZjb8YUQw&r=i0p-C_T1RieVsCZl8ZnUyQ&m=jln4-LFF6Tz2k8VLTzDdhM63Hjyg6VJR__AN95lkr5E&s=PNeyg4qOQJKFgJkNVfJkCBxiRIdLsjPy-Il97lK_qI0&e= bluejeans])
#* [[Fortnight Papers|Fortnight paper]] for May. 1st: [https://www.sciencedirect.com/science/article/abs/pii/S0010465521000151 HEP-Frame: Improving the efficiency of pipelined data transformation & filtering for scientific analyses] (delayed?)
+
#* [[Fortnight Papers|Fortnight paper]] for May. 1st: [https://www.sciencedirect.com/science/article/abs/pii/S0010465521000151 HEP-Frame: Improving the efficiency of pipelined data transformation & filtering for scientific analyses]
 
#* Review expectations
 
#* Review expectations
 
#:
 
#:
Line 51: Line 51:
 
#* Scientific Software support
 
#* Scientific Software support
 
#** JLab Common Environment (CE) + SPACK
 
#** JLab Common Environment (CE) + SPACK
#*** EPSCI are now responsible for ROOT builds on CUE
 
#*** CentOS8 support
 
 
#*** ServiceNow [https://jlab.servicenowservices.com/nav_to.do?uri=%2Fincident.do%3Fsys_id%3D8443178a1b782450f0b4dc6ce54bcb80%26sysparm_record_target%3Dincident%26sysparm_record_row%3D2%26sysparm_record_rows%3D3%26sysparm_record_list%3Dactive%3Dtrue%5Ecaller_id%3Djavascript:gs.getUserID()%5EORu_affected_user%3Djavascript:gs.getUserID()%5EORwatch_listCONTAINSjavascript:gs.getUserID()%5EORDERBYDESCopened_at (mapmanager, fputil, fpack, bos, bankdef)]
 
#*** ServiceNow [https://jlab.servicenowservices.com/nav_to.do?uri=%2Fincident.do%3Fsys_id%3D8443178a1b782450f0b4dc6ce54bcb80%26sysparm_record_target%3Dincident%26sysparm_record_row%3D2%26sysparm_record_rows%3D3%26sysparm_record_list%3Dactive%3Dtrue%5Ecaller_id%3Djavascript:gs.getUserID()%5EORu_affected_user%3Djavascript:gs.getUserID()%5EORwatch_listCONTAINSjavascript:gs.getUserID()%5EORDERBYDESCopened_at (mapmanager, fputil, fpack, bos, bankdef)]
 
#** EIC
 
#** EIC
#*** Collaboration with ANL
 
#**** Gaudi -> JANA2
 
#*** ACTS
 
 
#** Offline frameworks (CLARA, JANA2)
 
#** Offline frameworks (CLARA, JANA2)
 
#:
 
#:
#* Data Transport
+
#* EJFAT (ESnet/JLab + FPGA +Accelerated Transport)
#** Meeting with ESnet this afternoon
 
 
#** Status of proposal
 
#** Status of proposal
 
#:
 
#:
 
#* DAQ systems
 
#* DAQ systems
 
#** SRO
 
#** SRO
#*** SAMPA + ERSAP + JANA2 + INDRA-ASTRA = April 1st + 2 weeks
+
#*** SAMPA + ERSAP + JANA2 + INDRA-ASTRA = ?
 
#** CODA (CODA3 support, EVIO-6)
 
#** CODA (CODA3 support, EVIO-6)
 
#:
 
#:
 
#* A.I.
 
#* A.I.
 +
#** Experimental Controls
 
#** Multiple [https://docs.google.com/presentation/d/1lYenr970yuYyzvPz8MXb_pmHxvB8GX2DLPn6Lny8T1U/edit?usp=sharing FOAs] + JLab LDRD
 
#** Multiple [https://docs.google.com/presentation/d/1lYenr970yuYyzvPz8MXb_pmHxvB8GX2DLPn6Lny8T1U/edit?usp=sharing FOAs] + JLab LDRD
 
#*** Collaboration with Theory on MCGen project [https://science.osti.gov/-/media/grants/pdf/foas/2021/SC_FOA_0002493.pdf DE-FOA-0002493]
 
#*** Collaboration with Theory on MCGen project [https://science.osti.gov/-/media/grants/pdf/foas/2021/SC_FOA_0002493.pdf DE-FOA-0002493]
Line 78: Line 73:
 
#*** Amplitude Analysis Inverse Problem (LDRD)
 
#*** Amplitude Analysis Inverse Problem (LDRD)
 
#** Jupyterhub + GPU
 
#** Jupyterhub + GPU
#** Experimental Controls
 
 
#:
 
#:
 
#:
 
#:
 
#* Offsite Computing
 
#* Offsite Computing
 
#** NERSC, PSC, IU
 
#** NERSC, PSC, IU
#*** XSEDE application for PSC bridges-2 being updated for resubmission (due April 15th)
+
#*** XSEDE application for PSC bridges-2 resubmitted
 
#** OSG
 
#** OSG
 
# AOT
 
# AOT
  
<div class="toccolours mw-collapsible mw-collapsed">
 
Message from Bob Michaels officially handing over ROOT responsibilities to EPSCI <font size="-3">(Click "Expand" to the right for details -->):</font>
 
<div class="mw-collapsible-content">
 
<pre>
 
  BTW, I'm officially passing this job (building and maintaining ROOT) to you, now, David.
 
  If you need some help, let me know.  Of course, I can answer questions and help resolve
 
  problems with the old builds.
 
  
  yours
+
<hr>
  Bob
+
 
 +
=== Minutes: ===
 +
 
 +
Attendees: David L., Nathan B., Kishan R., Vardan G., Thomas B., Mike G., Torri J., Graham H.
 +
 
 +
* Licenses
 +
** CLion licenses expired. Quote for renewal received and is in the works.
 +
** GitKraken licenses recently auto-renewed. It seems only 2 people may use GitKraken occasionally. (David will check on adjusting licenses)
 +
** Overleaf licenses are expiring. Will like go to purchasing individual licenses since it does not look like enough use to justify a site license.
 +
 
 +
* EIC
 +
** Nathan has been working with Sylvester and Whit at ANL to understand their GAUDI-based software in order to gauge how difficult it will be to port it into JANA2
 +
*** Fairly complex CI system using Docker to build their software
 +
*** Most of their Docker images not accessible outside ANL. One image copied to dockerhub for Nathan to work with
 +
*** GAUDI uses a "gaudi cmake" that deviates from standard cmake so some effort is required to move to a standard cmake build system
 +
*** One goal is to get a document on migrating from GAUDI to JANA2 that we can post on the JANA2 website.
 +
** Communication from Pukai's mentor listing several issues with eJANA. These mostly appear to be things on Dmitry's side.
 +
 
 +
* CLARA
 +
** New version released for use by CLAS12
 +
*** Includes fix for bug that caused rare occurrences where a file was only partially processed
 +
 
 +
* JANA2
 +
** Nathan working to integrate JANA2 and CLARA (i.e. ERSAP).
  
  Dr. Robert Michaels
+
* EJFAT
  Staff Scientist, Jefferson Lab
+
** Met several times with ESnet for low-level design discussions
</pre>
+
** ESnet presented us with what looks to be a complete design for phase-I
</div>
+
** A single U280 is needed for phase-I with some load balancing handled by that which is already integrated into the switches
</div>
+
** Mike will look into putting a PR in for the U280 (will need to learn the JLab procurement system)
 +
** Existing switches at JLab may not support protocols needed by ESnet software. Dave A. is looking into what we need to purchase.
 +
** ESnet cannot start work on FPGA software development until May 15th so delivery not expected until maybe July.
  
<hr>
+
* SRO
 +
** Contacted Ed J. about SAMPA setup in INDRA lab
 +
*** Gas bottle for GEM changed last week. Needed to flow for a couple of days before turning on HV
 +
*** Ed will go in Thurs. or Fri. to power system up. Vardan volunteered to help if needed.
 +
** VTP testing now includes Data Lakes
 +
*** Increased resident memory usage to 22GB
 +
*** Slight decrease in CPU usage to around 13 cores
 +
** Vardan working on JAVA bindings in ERSAP
 +
 
 +
* CODA
 +
** Dave A. is working on presenting data read from the VTP as a ROC to the standard CODA system
 +
*** This requires updates to jcedit which Vardan is working on
 +
*** This will be deployed as part of CODA 3.11
 +
*** Will be for triggered mode only (SRO support will not be included yet)
 +
 
 +
* AI Experimental Controls
 +
** Torri able to run GlueX CDC calibration codes on farm
 +
** Naomi suggested several tasks involving GARFIELD to allow calculation related to pressure changes
 +
** Suggestion was made to build GARFIELD in central location on the CUE (with spack)
  
=== Minutes: ===
+
* FOAs + LDRD
 +
** 3 ASR, 1 NP, and 1 LDRD
 +
** Graham listed our involvement in several of these:
 +
*** Accelerator operations: Chris T. and Malachi S.
 +
*** Modeling computation and predicting data flows (smart data center): BNL + JLab
 +
*** MCGen : Nobuo + ANL
 +
*** Data reduction/Filtering: JLab + ORNL + GA Tech
 +
*** EJFAT (different funding channel): JLab + ESnet
 +
** LDRD proposals
 +
*** Thomas noted pre-proposal template is short and due May 5th. Full proposals due at end of May.
 +
*** Amplitude Analysis: Thomas
 +
*** Surrogate Models: Kishan, David ,...
  
<!-- Attendees: David L., Carl T., Nathan B., Kishan R., Vardan G., Thomas B., Mike G., Torri J. -->
+
* Offsite Computing
 +
** GlueX proposal to XSEDE for time on PSC Bridges-2 submitted
 +
** OSG
 +
*** Operations improved after unmounting lustre and home from scosg16
 +
*** scosg20 up, but not in production use yet
 +
*** MCWrapper: Thomas working on bundling similar jobs which has the potential of speeding up the submit times by factors of 20-40.

Latest revision as of 15:27, 19 April 2021

The meeting time is 10:00am.

Connection Info:

You can connect using BlueJeans Video conferencing (ID: 253 300 597). (Click "Expand" to the right for details -->):

Meeting URL
 https://bluejeans.com/253300597?src=join_info

Meeting ID
253 300 597

Want to dial in from a phone?

Dial one of the following numbers:
+1.888.240.2560 (US Toll Free)
(see all numbers - https://www.bluejeans.com/premium-numbers)

Enter the meeting ID and passcode followed by #

Connecting from a room system?
Dial: bjn.vc or 199.48.152.152 and enter your meeting ID & passcode

Agenda:

  1. Previous meeting
  2. Announcements
  3. Conferences and Workshops
  4. Ongoing Activities
    • EJFAT (ESnet/JLab + FPGA +Accelerated Transport)
      • Status of proposal
    • DAQ systems
      • SRO
        • SAMPA + ERSAP + JANA2 + INDRA-ASTRA = ?
      • CODA (CODA3 support, EVIO-6)
    • A.I.
      • Experimental Controls
      • Multiple FOAs + JLab LDRD
        • Collaboration with Theory on MCGen project DE-FOA-0002493
        • Collaboration with BNL on AI scheduling DE-FOA-0002482
        • Collaboration with INDRA-ASTRA
        • Collaboration with Sergey F. on AI + FPGA
        • Surrogate Models proposal (NP, ASCR, LDRD?)
        • Amplitude Analysis Inverse Problem (LDRD)
      • Jupyterhub + GPU
    • Offsite Computing
      • NERSC, PSC, IU
        • XSEDE application for PSC bridges-2 resubmitted
      • OSG
  5. AOT



Minutes:

Attendees: David L., Nathan B., Kishan R., Vardan G., Thomas B., Mike G., Torri J., Graham H.

  • Licenses
    • CLion licenses expired. Quote for renewal received and is in the works.
    • GitKraken licenses recently auto-renewed. It seems only 2 people may use GitKraken occasionally. (David will check on adjusting licenses)
    • Overleaf licenses are expiring. Will like go to purchasing individual licenses since it does not look like enough use to justify a site license.
  • EIC
    • Nathan has been working with Sylvester and Whit at ANL to understand their GAUDI-based software in order to gauge how difficult it will be to port it into JANA2
      • Fairly complex CI system using Docker to build their software
      • Most of their Docker images not accessible outside ANL. One image copied to dockerhub for Nathan to work with
      • GAUDI uses a "gaudi cmake" that deviates from standard cmake so some effort is required to move to a standard cmake build system
      • One goal is to get a document on migrating from GAUDI to JANA2 that we can post on the JANA2 website.
    • Communication from Pukai's mentor listing several issues with eJANA. These mostly appear to be things on Dmitry's side.
  • CLARA
    • New version released for use by CLAS12
      • Includes fix for bug that caused rare occurrences where a file was only partially processed
  • JANA2
    • Nathan working to integrate JANA2 and CLARA (i.e. ERSAP).
  • EJFAT
    • Met several times with ESnet for low-level design discussions
    • ESnet presented us with what looks to be a complete design for phase-I
    • A single U280 is needed for phase-I with some load balancing handled by that which is already integrated into the switches
    • Mike will look into putting a PR in for the U280 (will need to learn the JLab procurement system)
    • Existing switches at JLab may not support protocols needed by ESnet software. Dave A. is looking into what we need to purchase.
    • ESnet cannot start work on FPGA software development until May 15th so delivery not expected until maybe July.
  • SRO
    • Contacted Ed J. about SAMPA setup in INDRA lab
      • Gas bottle for GEM changed last week. Needed to flow for a couple of days before turning on HV
      • Ed will go in Thurs. or Fri. to power system up. Vardan volunteered to help if needed.
    • VTP testing now includes Data Lakes
      • Increased resident memory usage to 22GB
      • Slight decrease in CPU usage to around 13 cores
    • Vardan working on JAVA bindings in ERSAP
  • CODA
    • Dave A. is working on presenting data read from the VTP as a ROC to the standard CODA system
      • This requires updates to jcedit which Vardan is working on
      • This will be deployed as part of CODA 3.11
      • Will be for triggered mode only (SRO support will not be included yet)
  • AI Experimental Controls
    • Torri able to run GlueX CDC calibration codes on farm
    • Naomi suggested several tasks involving GARFIELD to allow calculation related to pressure changes
    • Suggestion was made to build GARFIELD in central location on the CUE (with spack)
  • FOAs + LDRD
    • 3 ASR, 1 NP, and 1 LDRD
    • Graham listed our involvement in several of these:
      • Accelerator operations: Chris T. and Malachi S.
      • Modeling computation and predicting data flows (smart data center): BNL + JLab
      • MCGen : Nobuo + ANL
      • Data reduction/Filtering: JLab + ORNL + GA Tech
      • EJFAT (different funding channel): JLab + ESnet
    • LDRD proposals
      • Thomas noted pre-proposal template is short and due May 5th. Full proposals due at end of May.
      • Amplitude Analysis: Thomas
      • Surrogate Models: Kishan, David ,...
  • Offsite Computing
    • GlueX proposal to XSEDE for time on PSC Bridges-2 submitted
    • OSG
      • Operations improved after unmounting lustre and home from scosg16
      • scosg20 up, but not in production use yet
      • MCWrapper: Thomas working on bundling similar jobs which has the potential of speeding up the submit times by factors of 20-40.