November 30, 2004

Brief Meeting Minutes - November 30, 2004

JoshM and Charlie met briefly; the current list looks like:

JoshM


  • Install and try out NAMD on bazaar and cairo.
  • Build a new bazaar image (with Dawit) using SuSE 9.1.
  • Build a new cairo image using Yellow Dog 4.0 (including ccache and distcc).
  • Continue working on F@C code items.
  • Work with John on testing and the MidWife.

John


  • Complete testing matrix using a1 for all molecules (GMXBench plus others), bazaar and cairo, 2-12 processors.
  • The MidWife.

Charlie


  • Update prose for two SIAM CSE poster submissions. (done)
  • SIAM CSE conference registration for JM, JS, CP; JH are you going?
  • Hotel and air reservations for SIAM CSE

  • Organize F@C a2 release.

Posted by charliep at 02:26 PM | Comments (0)

November 14, 2004

Work Log

Folding@Clusters
  • The MPI_Recv to MPI_Irecv problem highlighted in the mother is a non-problem. This is expected behavior between the mother and child. This can be taken off the ToDo list.
  • I added the ability to correctly detect the number of CPUs on a Linux machine in COSM. I tested it on Bazaar and Cairo.
  • I cleaned up the "step" messages generated by mdrun. These should be collected by the nanny and transmitted to the mother for a status display.
    Finished Step 10 of 1000, remaining runtime: 90 s
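    As a rough illustration of the two items above, here is a minimal C sketch. It is not the COSM or nanny code: count_online_cpus(), parse_step_line(), and struct step_status are hypothetical names, and the sketch assumes a glibc-style sysconf() on Linux and the exact message format shown in the sample line.

```c
#include <stdio.h>
#include <unistd.h>

/* Hypothetical container for one mdrun progress report. */
struct step_status {
    long step;        /* steps finished so far */
    long total;       /* total steps in the run */
    long remaining_s; /* estimated seconds left */
};

/* Number of CPUs currently online; falls back to 1 if the query fails.
 * _SC_NPROCESSORS_ONLN is a glibc/Linux extension, not core POSIX. */
long count_online_cpus(void)
{
    long n = sysconf(_SC_NPROCESSORS_ONLN);
    return (n < 1) ? 1 : n;
}

/* Parse a line such as
 *   "Finished Step 10 of 1000, remaining runtime: 90 s"
 * Returns 0 on success, -1 if the line does not match the format
 * or the numbers are inconsistent. */
int parse_step_line(const char *line, struct step_status *out)
{
    long step, total, remaining;
    int matched = sscanf(line,
                         " Finished Step %ld of %ld, remaining runtime: %ld s",
                         &step, &total, &remaining);

    if (matched != 3 || total <= 0 || step < 0 || step > total)
        return -1;

    out->step = step;
    out->total = total;
    out->remaining_s = remaining;
    return 0;
}
```

    A nanny-like process could call parse_step_line() on each line it reads from mdrun and forward only the three numbers to the mother, rather than the raw text.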
General
  • Created a new feel for the website. Let me know what you think, and if I should push it.
    See the link in the email; let me know if you need it again. [I don't publish that private workspace, to keep it from being sniffed by spiders...]
Posted by hursejo at 09:48 PM | Comments (0)

November 12, 2004

Meeting Notes

Folding@Clusters
  • Short term code fixes (joshm)
    • Divorcing child and mdrun.
    • diff the configure.ac, makefile.am, and (?) to see changes and to preserve our FATC flag.
    • See if altivec issue is patched in the latest version of gromacs. If not, fix it.
    • gromacs version - is the highest production 3.2.1?
    • Don't do another code root; just roll back changes in CVS.
    • Changes to make in gromacs: remove new main in mdrun, linking instructions, base level Makefile.am will have to change (has dependencies on the child binary).
    • We need new build instructions.
    • Tabs and 80 cols.
  • Run multi-node tests on all platforms. (john)
  • Update node usage doc by giving all non-NFS nodes to John.
Posters
  • Mount on board like the physics posters?
  • Print poster content at appropriate size.
  • Option: Laminate the poster?
  • Two new generic frameworks need to be created: FATC, Numerical Methods for SIAM
  • MERCK - Tuesday the 16th at 6-9p(?)
  • Dr. Dobb's Nov 2005 issue - write article about FATC. Due in mid-July. We need to start writing very soon.
Numerical Methods
  • Test a1 binary on c12-c15. Run as a test user.
  • Results table.
Plumbing
  • Making the cluster space usable:
    • Ask John Howell about shelves.
    • Mount screen and projector.
    • Vacuum
  • Data Visualization Console: temp file flooding and errors when running.
  • Midwife write-up (Charlie)
General
  • HHMI annual accounting and report (Charlie)
  • Talk to Mary Lou about travel logistics for SIAM. Start a calendar, get details down, talk to Joshh.
Posted by mccoyjo at 11:13 AM | Comments (0)

November 10, 2004

Code Review Notes

NOTES: 11/10/2004
  • separate mdrun from child (like grompp)
    • Start with latest version of GROMACS.
    • keep define f-at-c (for stderr->stdout, verbosity of prints...)
    • Patch for altivec
    • Change build script
    • Take out include of gromacs header for f-at-c
  • Remove HTTPD from mother
    • Turn this into MPI async communication.
    • Turn off Threads in COSM
    • No longer need to send the mother hostname and port to the nanny
  • CHILD_MULT should be dynamic (load balancing)
  • File System Resources needs to be altered for dynamic files per pande stuff
  • Documentation:
    • Header file has complete documentation for functions, and globals
    • Use special tags for start and stop of comments.
  • Print function: Reconcile N-different.
  • Remove debug counter [duplicate: debugCounter, debugTimes??]
  • Functional abstraction in main() - mother/nanny/child
  • MPI Send and Recv file function()
  • Remove OpenLog()
    • Now we can use Print everywhere and should!
  • Stdout redirection toggle [log | display]
  • Route progress of child
    • caught by nanny
    • sent to the mother async. [disjoint from checkpointing mech.]
  • Clear documentation of *all* variables.
  • Trap, check, & react - return codes from all functions when appropriate.
  • Use PrintHeader(x,x) in [-v] version return.
  • Write a ParseSpec function to get applicable file names from assignment server.
  • Pre-while loop set child_process
  • Where is nanny_host_conf_file set in mother.c?
  • convert int -> %u to something reasonable (signed)
  • If conf file is not set properly, loose default values pre spawn.
  • When waiting for the child to return a result, should use Irecv instead of Recv
  • check_code() function may be redundant.
  • "Recover #" print messages, should be more meaningful.
  • "hold" variable may be lost since it doesn't *normally* do anything meaningful.
  • "flag" should be equal to some defines in the header. clarity issue
  • Testing Rubric Item: Test stopping mother with a signal
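A few of the smaller items above (named defines for "flag", signed values printed with a matching format, and checking return codes) can be sketched together in C. This is only an illustration under assumptions: the enum values, child_flag_name(), and format_status() are hypothetical and are not taken from the F@C header.

```c
#include <stdio.h>

/* Hypothetical flag names; the notes do not list the real values. */
enum child_flag {
    CHILD_FLAG_IDLE = 0,
    CHILD_FLAG_RUNNING = 1,
    CHILD_FLAG_DONE = 2
};

/* Map a flag value to a label for log output. */
const char *child_flag_name(int flag)
{
    switch (flag) {
    case CHILD_FLAG_IDLE:    return "idle";
    case CHILD_FLAG_RUNNING: return "running";
    case CHILD_FLAG_DONE:    return "done";
    default:                 return "unknown";
    }
}

/* Format a status line into buf.
 * %d matches a signed int; passing an int to %u is a type mismatch
 * and typically shows -1 as a large positive number.
 * Returns 0 on success, -1 on truncation or error, so the caller
 * can trap the failure and react to it. */
int format_status(char *buf, size_t len, int rank, int flag)
{
    int n = snprintf(buf, len, "rank %d: %s", rank, child_flag_name(flag));

    if (n < 0 || (size_t)n >= len)
        return -1;
    return 0;
}
```

Comparing against CHILD_FLAG_DONE rather than a bare 2 addresses the clarity issue, and the return code from format_status() gives the caller something concrete to check.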
Posted by hursejo at 10:29 PM | Comments (0)

November 04, 2004

Meeting Minutes - November 5, 2004

Folding@Clusters
Design handout for SC2004

Discuss legends for clusters on the diagram

Numerical Methods

Plumbing

Papers and Presentations

Items from JoshM's email of Wed

Posted by charliep at 05:25 AM | Comments (0)