Difference between revisions of "EPSCI Group Meeting Apr. 19, 2021"

From epsciwiki
Jump to navigation Jump to search
 
Line 86: Line 86:
 
=== Minutes: ===
 
=== Minutes: ===
  
<!-- Attendees: David L., Carl T., Nathan B., Kishan R., Vardan G., Thomas B., Mike G., Torri J. -->
+
Attendees: David L., Nathan B., Kishan R., Vardan G., Thomas B., Mike G., Torri J., Graham H.
 +
 
 +
* Licenses
 +
** CLion licenses expired. Quote for renewal received and is in the works.
 +
** GitKraken licenses recently auto-renewed. It seems only 2 people may use GitKraken occasionally. (David will check on adjusting licenses)
 +
** Overleaf licenses are expiring. Will like go to purchasing individual licenses since it does not look like enough use to justify a site license.
 +
 
 +
* EIC
 +
** Nathan has been working with Sylvester and Whit at ANL to understand their GAUDI-based software in order to gauge how difficult it will be to port it into JANA2
 +
*** Fairly complex CI system using Docker to build their software
 +
*** Most of their Docker images not accessible outside ANL. One image copied to dockerhub for Nathan to work with
 +
*** GAUDI uses a "gaudi cmake" that deviates from standard cmake so some effort is required to move to a standard cmake build system
 +
*** One goal is to get a document on migrating from GAUDI to JANA2 that we can post on the JANA2 website.
 +
** Communication from Pukai's mentor listing several issues with eJANA. These mostly appear to be things on Dmitry's side.
 +
 
 +
* CLARA
 +
** New version released for use by CLAS12
 +
*** Includes fix for bug that caused rare occurrences where a file was only partially processed
 +
 
 +
* JANA2
 +
** Nathan working to integrate JANA2 and CLARA (i.e. ERSAP).
 +
 
 +
* EJFAT
 +
** Met several times with ESnet for low-level design discussions
 +
** ESnet presented us with what looks to be a complete design for phase-I
 +
** A single U280 is needed for phase-I with some load balancing handled by that which is already integrated into the switches
 +
** Mike will look into putting a PR in for the U280 (will need to learn the JLab procurement system)
 +
** Existing switches at JLab may not support protocols needed by ESnet software. Dave A. is looking into what we need to purchase.
 +
** ESnet cannot start work on FPGA software development until May 15th so delivery not expected until maybe July.
 +
 
 +
* SRO
 +
** Contacted Ed J. about SAMPA setup in INDRA lab
 +
*** Gas bottle for GEM changed last week. Needed to flow for a couple of days before turning on HV
 +
*** Ed will go in Thurs. or Fri. to power system up. Vardan volunteered to help if needed.
 +
** VTP testing now includes Data Lakes
 +
*** Increased resident memory usage to 22GB
 +
*** Slight decrease in CPU usage to around 13 cores
 +
** Vardan working on JAVA bindings in ERSAP
 +
 
 +
* CODA
 +
** Dave A. is working on presenting data read from the VTP as a ROC to the standard CODA system
 +
*** This requires updates to jcedit which Vardan is working on
 +
*** This will be deployed as part of CODA 3.11
 +
*** Will be for triggered mode only (SRO support will not be included yet)
 +
 
 +
* AI Experimental Controls
 +
** Torri able to run GlueX CDC calibration codes on farm
 +
** Naomi suggested several tasks involving GARFIELD to allow calculation related to pressure changes
 +
** Suggestion was made to build GARFIELD in central location on the CUE (with spack)
 +
 
 +
* FOAs + LDRD
 +
** 3 ASR, 1 NP, and 1 LDRD
 +
** Graham listed our involvement in several of these:
 +
*** Accelerator operations: Chris T. and Malachi S.
 +
*** Modeling computation and predicting data flows (smart data center): BNL + JLab
 +
*** MCGen : Nobuo + ANL
 +
*** Data reduction/Filtering: JLab + ORNL + GA Tech
 +
*** EJFAT (different funding channel): JLab + ESnet
 +
** LDRD proposals
 +
*** Thomas noted pre-proposal template is short and due May 5th. Full proposals due at end of May.
 +
*** Amplitude Analysis: Thomas
 +
*** Surrogate Models: Kishan, David ,...
 +
 
 +
* Offsite Computing
 +
** GlueX proposal to XSEDE for time on PSC Bridges-2 submitted
 +
** OSG
 +
*** Operations improved after unmounting lustre and home from scosg16
 +
*** scosg20 up, but not in production use yet
 +
*** MCWrapper: Thomas working on bundling similar jobs which has the potential of speeding up the submit times by factors of 20-40.

Latest revision as of 15:26, 19 April 2021

The meeting time is 10:00am.

Connection Info:

You can connect using BlueJeans Video conferencing (ID: 253 300 597). (Click "Expand" to the right for details -->):

Meeting URL
 https://bluejeans.com/253300597?src=join_info

Meeting ID
253 300 597

Want to dial in from a phone?

Dial one of the following numbers:
+1.888.240.2560 (US Toll Free)
(see all numbers - https://www.bluejeans.com/premium-numbers)

Enter the meeting ID and passcode followed by #

Connecting from a room system?
Dial: bjn.vc or 199.48.152.152 and enter your meeting ID & passcode

Agenda:

  1. Previous meeting
  2. Announcements
  3. Conferences and Workshops
  4. Ongoing Activities
    • EJFAT (ESnet/JLab + FPGA +Accelerated Transport)
      • Status of proposal
    • DAQ systems
      • SRO
        • SAMPA + ERSAP + JANA2 + INDRA-ASTRA = ?
      • CODA (CODA3 support, EVIO-6)
    • A.I.
      • Experimental Controls
      • Multiple FOAs + JLab LDRD
        • Collaboration with Theory on MCGen project DE-FOA-0002493
        • Collaboration with BNL on AI scheduling DE-FOA-0002482
        • Collaboration with INDRA-ASTRA
        • Collaboration with Sergey F. on AI + FPGA
        • Surrogate Models proposal (NP, ASCR, LDRD?)
        • Amplitude Analysis Inverse Problem (LDRD)
      • Jupyterhub + GPU
    • Offsite Computing
      • NERSC, PSC, IU
        • XSEDE application for PSC bridges-2 resubmitted
      • OSG
  5. AOT



Minutes:

Attendees: David L., Nathan B., Kishan R., Vardan G., Thomas B., Mike G., Torri J., Graham H.

  • Licenses
    • CLion licenses expired. Quote for renewal received and is in the works.
    • GitKraken licenses recently auto-renewed. It seems only 2 people may use GitKraken occasionally. (David will check on adjusting licenses)
    • Overleaf licenses are expiring. Will like go to purchasing individual licenses since it does not look like enough use to justify a site license.
  • EIC
    • Nathan has been working with Sylvester and Whit at ANL to understand their GAUDI-based software in order to gauge how difficult it will be to port it into JANA2
      • Fairly complex CI system using Docker to build their software
      • Most of their Docker images not accessible outside ANL. One image copied to dockerhub for Nathan to work with
      • GAUDI uses a "gaudi cmake" that deviates from standard cmake so some effort is required to move to a standard cmake build system
      • One goal is to get a document on migrating from GAUDI to JANA2 that we can post on the JANA2 website.
    • Communication from Pukai's mentor listing several issues with eJANA. These mostly appear to be things on Dmitry's side.
  • CLARA
    • New version released for use by CLAS12
      • Includes fix for bug that caused rare occurrences where a file was only partially processed
  • JANA2
    • Nathan working to integrate JANA2 and CLARA (i.e. ERSAP).
  • EJFAT
    • Met several times with ESnet for low-level design discussions
    • ESnet presented us with what looks to be a complete design for phase-I
    • A single U280 is needed for phase-I with some load balancing handled by that which is already integrated into the switches
    • Mike will look into putting a PR in for the U280 (will need to learn the JLab procurement system)
    • Existing switches at JLab may not support protocols needed by ESnet software. Dave A. is looking into what we need to purchase.
    • ESnet cannot start work on FPGA software development until May 15th so delivery not expected until maybe July.
  • SRO
    • Contacted Ed J. about SAMPA setup in INDRA lab
      • Gas bottle for GEM changed last week. Needed to flow for a couple of days before turning on HV
      • Ed will go in Thurs. or Fri. to power system up. Vardan volunteered to help if needed.
    • VTP testing now includes Data Lakes
      • Increased resident memory usage to 22GB
      • Slight decrease in CPU usage to around 13 cores
    • Vardan working on JAVA bindings in ERSAP
  • CODA
    • Dave A. is working on presenting data read from the VTP as a ROC to the standard CODA system
      • This requires updates to jcedit which Vardan is working on
      • This will be deployed as part of CODA 3.11
      • Will be for triggered mode only (SRO support will not be included yet)
  • AI Experimental Controls
    • Torri able to run GlueX CDC calibration codes on farm
    • Naomi suggested several tasks involving GARFIELD to allow calculation related to pressure changes
    • Suggestion was made to build GARFIELD in central location on the CUE (with spack)
  • FOAs + LDRD
    • 3 ASR, 1 NP, and 1 LDRD
    • Graham listed our involvement in several of these:
      • Accelerator operations: Chris T. and Malachi S.
      • Modeling computation and predicting data flows (smart data center): BNL + JLab
      • MCGen : Nobuo + ANL
      • Data reduction/Filtering: JLab + ORNL + GA Tech
      • EJFAT (different funding channel): JLab + ESnet
    • LDRD proposals
      • Thomas noted pre-proposal template is short and due May 5th. Full proposals due at end of May.
      • Amplitude Analysis: Thomas
      • Surrogate Models: Kishan, David ,...
  • Offsite Computing
    • GlueX proposal to XSEDE for time on PSC Bridges-2 submitted
    • OSG
      • Operations improved after unmounting lustre and home from scosg16
      • scosg20 up, but not in production use yet
      • MCWrapper: Thomas working on bundling similar jobs which has the potential of speeding up the submit times by factors of 20-40.