Difference between revisions of "EPSCI Group Meeting Aug. 10, 2020"
Jump to navigation
Jump to search
(One intermediate revision by the same user not shown) | |||
Line 49: | Line 49: | ||
#* JLab Common Environment (CE) + SPACK | #* JLab Common Environment (CE) + SPACK | ||
#* SRO | #* SRO | ||
− | #** EIC SRO Meeting today @ | + | #** EIC SRO Meeting today @14:00 |
− | #** | + | #** ERSAP |
− | |||
#** Hall-B/D TriDAS | #** Hall-B/D TriDAS | ||
#* Offsite Computing | #* Offsite Computing | ||
Line 65: | Line 64: | ||
=== Minutes: === | === Minutes: === | ||
Attendees: David L.(chair), Carl T., Nathan B., Thomas B., Vardan G., Kishan R., Graham H. | Attendees: David L.(chair), Carl T., Nathan B., Thomas B., Vardan G., Kishan R., Graham H. | ||
+ | |||
+ | '''Announcements''' | ||
+ | * Fortnight paper discussions will be moved to a separate meeting bi-weekly on Mondays. These will be at the same time, but alternate with the bi-weekly SRO meetings. | ||
+ | |||
+ | '''JANA2''' | ||
+ | * Most work over the past week has been related to A.I. support | ||
+ | * Some communication with Ole on JANA2. (In consideration for SoLID) | ||
+ | * A.I. support | ||
+ | ** Kishan was able to get python interpreter to work with JANA2 including event processor (no GPU inference yet) | ||
+ | ** Working with Nathan and testing interface (looking at calling interpreter from JFactory.) | ||
+ | |||
+ | '''A.I.''' | ||
+ | * Kishan will try accessing the sciml190X nodes with the RTX titan GPUs this week | ||
+ | * It was noted that he will need to generate a certificate (instructions on scicomp website) | ||
+ | * epsci group does not have automatic access to scicomp systems like experimental halls do. | ||
+ | ** David will ask about this. | ||
+ | |||
+ | '''EVIO-6''' | ||
+ | * Testing underway. Clearing seg. faults | ||
+ | * Could use a second set of eyes with more C++ experience to review API and give comments | ||
+ | ** Nathan will do it once Carl sends info. | ||
+ | |||
+ | '''SPACK''' | ||
+ | * Thomas has been communicating with Wouter who has put a lot of effort into implementing this for EIC | ||
+ | ** Working to minimize duplication | ||
+ | * "ground is broken" as disk space and web access has been set up by (others in) the Computer Center | ||
+ | |||
+ | '''SRO''' | ||
+ | * SRO helps minimize blind spots in physics data due to hardware triggers that must operate on limited information | ||
+ | * Vardan is studying data lakes for low latency data movement | ||
+ | ** Single node prototyping being down at moment to investigate data performance. | ||
+ | ** Initial prototyping with Redis | ||
+ | *** Single threaded so multi-process must be used to get full speed(ignore earlier minutes reporting multi-threaded capability) | ||
+ | *** Bookkeeping becomes cumbersome it multi-process model is needed. | ||
+ | *** Original purpose for text data. | ||
+ | *** Tested 2 streams per redis process (12 streams total running simultaneously) | ||
+ | *** Will look at a few other possibilities (MongoDB seems popular) | ||
+ | ** Rapid prototyping progress shown last week at EIC SRO monthly meeting | ||
+ | ** Additional testing done last week | ||
+ | * Ben R. has put together crate of fADC's in INDRA to use for SRO testing | ||
+ | ** Some question on cooling capacity of room since fADC's generate a lot of heat. | ||
+ | ** Possibility of moving some compute nodes to Computer Center room which has high cooling capacity. | ||
+ | |||
+ | '''OSG''' | ||
+ | * 3 prongs of effort: oasis, xrootd, stash cache | ||
+ | ** oasis is in production and documentation has been given to Bryan. | ||
+ | * Running OSG Jobs at JLab: | ||
+ | ** The head node has been stood up | ||
+ | ** No accurate timescale on when system will be open for jobs (main developers tied up with numerous other projects) | ||
+ | * Primary OSG collector for JLab VO now running at JLab. Backup is UCSD | ||
+ | * OSG category on ServiceNow has been added | ||
+ | * Richard J. (UConn) has had some issues with OSG + Compute Canada | ||
+ | * Graham: With broader use of OSG by multiple customers at JLab, we need a backup for Thomas in case he is unavailable | ||
+ | ** Thomas: Right now that would most likely be Bryan since he knows which developers are involved |
Latest revision as of 20:27, 10 August 2020
The meeting time is 10:00am.
Connection Info:
You can connect using BlueJeans Video conferencing (ID: 253 300 597). (Click "Expand" to the right for details -->):
Meeting URL https://bluejeans.com/253300597?src=join_info Meeting ID 253 300 597 Want to dial in from a phone? Dial one of the following numbers: +1.888.240.2560 (US Toll Free) (see all numbers - https://www.bluejeans.com/premium-numbers) Enter the meeting ID and passcode followed by # Connecting from a room system? Dial: bjn.vc or 199.48.152.152 and enter your meeting ID & passcode
Agenda:
- Previous meeting
- Announcements
- Beam delivery for physics ongoing (scheduled to end around Sep. 8)
- David on shift 8/18, 8/19, 8/28, 8/29, 9/11, 9/12
- Thomas on shift 8/20, 8/21, 8/30, 8/31, 9/9, 9/10
- Streaming Data Scientist job posting (almost..?)
- Fortnight Papers -> Move to separate meeting? Alternating Mondays with SRO?
- NERSC User's Group Meeting 8/17/2020 @ 11am-6pm
- Beam delivery for physics ongoing (scheduled to end around Sep. 8)
- Graham's Project
- Ongoing Activities
- JANA2
- GlueX port
- A.I. support
- A.I.
- ENP + CST Meeting week of 8/24 projects list
- GlueX-EIC-PANDA ML workshop Sep. 21-25
- EVIO-6
- JLab Common Environment (CE) + SPACK
- SRO
- EIC SRO Meeting today @14:00
- ERSAP
- Hall-B/D TriDAS
- Offsite Computing
- NERSC, PSC
- OSG
- JANA2
- GUI for Calorimeter calibration scripts (Hall-D Request)
- Publications
- AOT
Minutes:
Attendees: David L.(chair), Carl T., Nathan B., Thomas B., Vardan G., Kishan R., Graham H.
Announcements
- Fortnight paper discussions will be moved to a separate meeting bi-weekly on Mondays. These will be at the same time, but alternate with the bi-weekly SRO meetings.
JANA2
- Most work over the past week has been related to A.I. support
- Some communication with Ole on JANA2. (In consideration for SoLID)
- A.I. support
- Kishan was able to get python interpreter to work with JANA2 including event processor (no GPU inference yet)
- Working with Nathan and testing interface (looking at calling interpreter from JFactory.)
A.I.
- Kishan will try accessing the sciml190X nodes with the RTX titan GPUs this week
- It was noted that he will need to generate a certificate (instructions on scicomp website)
- epsci group does not have automatic access to scicomp systems like experimental halls do.
- David will ask about this.
EVIO-6
- Testing underway. Clearing seg. faults
- Could use a second set of eyes with more C++ experience to review API and give comments
- Nathan will do it once Carl sends info.
SPACK
- Thomas has been communicating with Wouter who has put a lot of effort into implementing this for EIC
- Working to minimize duplication
- "ground is broken" as disk space and web access has been set up by (others in) the Computer Center
SRO
- SRO helps minimize blind spots in physics data due to hardware triggers that must operate on limited information
- Vardan is studying data lakes for low latency data movement
- Single node prototyping being down at moment to investigate data performance.
- Initial prototyping with Redis
- Single threaded so multi-process must be used to get full speed(ignore earlier minutes reporting multi-threaded capability)
- Bookkeeping becomes cumbersome it multi-process model is needed.
- Original purpose for text data.
- Tested 2 streams per redis process (12 streams total running simultaneously)
- Will look at a few other possibilities (MongoDB seems popular)
- Rapid prototyping progress shown last week at EIC SRO monthly meeting
- Additional testing done last week
- Ben R. has put together crate of fADC's in INDRA to use for SRO testing
- Some question on cooling capacity of room since fADC's generate a lot of heat.
- Possibility of moving some compute nodes to Computer Center room which has high cooling capacity.
OSG
- 3 prongs of effort: oasis, xrootd, stash cache
- oasis is in production and documentation has been given to Bryan.
- Running OSG Jobs at JLab:
- The head node has been stood up
- No accurate timescale on when system will be open for jobs (main developers tied up with numerous other projects)
- Primary OSG collector for JLab VO now running at JLab. Backup is UCSD
- OSG category on ServiceNow has been added
- Richard J. (UConn) has had some issues with OSG + Compute Canada
- Graham: With broader use of OSG by multiple customers at JLab, we need a backup for Thomas in case he is unavailable
- Thomas: Right now that would most likely be Bryan since he knows which developers are involved