Core Minutes 2/10/2015
ScienceTools: (Jim) See Science Tools Development Notes for details of these and several other updates.
FSSC: (Joe? Alex?)
Weekend debacle: (Tom et al.) Starting around 2 AM on Sunday, many jobs failed because they could not connect to the MySQL calibrations metadata database. The cause was too many connections: the MySQL server connection limit is set to 800, and it was being exceeded. This is not normally an issue, since connections are released a few events into a job, but on Sunday many jobs were holding on to connections for hours. Ultimately, by looking at the log of one such job, it was determined that an entirely different part of the code, having nothing to do with MySQL, was throwing an exception, and the exception handling was such that the job would hang. The exception was being thrown from within PtValsAlg because the event timestamp was past the latest time (namely the start of 2015) for which it could compute a magnetic field solution without extrapolation. A few weeks ago Jim had made a new version of the file in astro that throws the exception, to allow the computation to proceed for times before 2020, but this new version was not in use by the reconstruction code. For now, Warren has set the environment variable IGNORE_IGRF_BOUNDARY in these recon jobs, which causes the code to continue without throwing the exception no matter what the year is, thus eliminating the hangs and the MySQL connection issues.
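For readers unfamiliar with the failure mode, the following is a minimal Python sketch of the two ideas involved: a validity-range check that can be bypassed with an environment variable, and a connection that is released even when such a check raises. It is not the actual reconstruction or astro code. Only the variable name IGNORE_IGRF_BOUNDARY and the start-of-2015 boundary come from the notes above. The class and function names, the toy connection pool, and the assumption that the variable merely needs to be set (to any value) are all hypothetical.

```python
import os
from contextlib import contextmanager
from datetime import datetime, timezone

# Boundary taken from the minutes: no field solution without extrapolation
# from the start of 2015 onward (for the old version of the file).
MODEL_VALID_UNTIL = datetime(2015, 1, 1, tzinfo=timezone.utc)


class FieldModelRangeError(ValueError):
    """Hypothetical exception for timestamps outside the model's range."""


def check_field_model_range(event_time, valid_until=MODEL_VALID_UNTIL, environ=None):
    """Return True if the caller must extrapolate, False otherwise.

    Raises FieldModelRangeError for out-of-range timestamps unless
    IGNORE_IGRF_BOUNDARY is set. ASSUMPTION: any value counts as "set".
    """
    env = os.environ if environ is None else environ
    if event_time < valid_until:
        return False
    if "IGNORE_IGRF_BOUNDARY" in env:
        return True
    raise FieldModelRangeError(
        f"{event_time.isoformat()} is past {valid_until.isoformat()}"
    )


class ConnectionPool:
    """Toy stand-in for a database server with a connection limit."""

    def __init__(self, limit):
        self.limit = limit
        self.in_use = 0

    def acquire(self):
        if self.in_use >= self.limit:
            raise RuntimeError("too many connections")
        self.in_use += 1

    def release(self):
        self.in_use -= 1


@contextmanager
def calibration_connection(pool):
    """Hold one connection and release it even if the body raises."""
    pool.acquire()
    try:
        yield pool
    finally:
        pool.release()
```

With this structure, an exception raised while a connection is held propagates out of the `with` block after the connection has been returned, so a failing job does not keep counting against the limit. The jobs in the incident hung rather than exiting, so this illustrates a general defensive pattern and not the fix that was applied.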
To be explored:
C & A: (Leon) A couple of issues from the last C & A meeting:
Meanwhile, Michael has been doing some fine work profiling GR using tools built into Gaudi. See JIRA LPATE-181 for details; it's quite illuminating.
ScienceTools and Anaconda python: (JIRA STGEN-148) Anaconda python was installed in GLAST_EXT for the Linux platforms and a new test build of ST was created. The FSSC requests no upgrade until the regression testing has been updated to test python changes. It is unclear who will be working on that. Jim attempted to perform an FSSC ST install at SLAC but was unable to, because some of the packages require being "root" to run "make". Joe has a workaround and plans to send it to Jim (as of 2/17/2015 that hasn't happened yet). Jim also asked that in the future we try to keep ST updated with the latest cfitsio, to avoid the migration issues that come from waiting too long. We have both signed up for the heasoft mailing list, to be alerted about new releases.
Other externals: (Heather) cfitsio 3.37 and ape 2.8 were built in GLAST_EXT and a new SciTools build was started. As it turns out, tip's unit test failed, as did the other packages that use cfitsio. There was also at least one failure associated with ape. The FSSC has been alerted and will work to find the problems. In the meantime, GGR and ST will not upgrade, and it is unclear when our next opportunity will arise.