Done: 1) Add HOSTALLOW_ issues to HAD manual (Ralph Filch's problem) 2) Add Hungarian notation to all class data members 3) PRIV_UNKNOWN -> PRIV_USER_FINAL + add a bit of documentation about the problem we underwent 4) Support for m_isPrimary inside Version structure - done with coding/decoding inside 'Version' class 5) Accept fix in uploader: ACCEPT_TIMEOUT inside DownloadReplicaTransferer::transferFileCommand - done 6) Merge HadCommands.h and ReplicationCommands.h into condor_commands.h - done before 7) Add 'Return value' clause to 'Version::synchronize' documentation - done 8) 'safeUnlinkFile' - if the file does not exist return true without trying 3 times - done 9) List of files in downloader/uploader - done 10) Support for overflow of version logical clock - done 11) 'ctime_r' on Alpha computers - done, changed with 'ctime' 12) Talk to Nick about HAD_LIST and REPLICATION_LIST issue as part of code review - done, for this release, we leave it as is, enforcing the same order of entries in REPLICATION_LIST like in HAD_LIST 13) Automatic nightly testing - scheduled, RH8.0 each day and all the platforms at Saturday 14) Talk about configuration parameters to Nick - done 15) HadMonitoringSystem - done, deployed in DSL 15.1) Run in a forever loop or synchronize a crontab job - done 15.2) Save all the logs - done 15.3) Absolute clocks - deferred 15.4) Garbage collection - done 15.5) Once in a while report to the administrator - done 15.6) Report to the administrator in case of error - done 15.7) Replication monitoring system - in next release 16) Prepare documentation of monitoring system - done 17) Prepare nmi-submit document (build and test submission) - done 18) Prepare nmi-automatic testing document (testing system description, interaction with NMI) - done 19) Code merge with Nick - done 20) Problem with #handlers in ReplicationLog on ds-ibm1 - fixed and merged in main branch of CVS 21) Remind Max about .old logs in ds-ibm1,picasso - done 22) Split the HAD Monitoring System mailing list into two: daily and error - done 23) Add Max and Eran to the HAD Monitoring System error recipients' list - done 24) Extending sanity check script - done, included check for REPLICATION_LIST and REPLICATION_ARGS 25) Merged Condor manual for 6.7.18 release - sent the updated chapters to Karen Miller 26) Big replication/HAD manual test to submit from NMI - done, debugging in DSL 27) Testing of merged code (6.7.18) in DSL - done 28) PERMISSION DENIED issue of Daemon core - done, it can happen when the NFS server is down 29) HA6 branch creation - done by Nick 30) WAN delays measurements - done 31) Adding replication daemon logs analysis to HAD Monitoring System - done 32) 'condor_examples' extension by examples of HAD-enabled central manager and client - done by Nick 33) WAN delays graphs - done 34) Talk with guys in Wisconsin considering the last problem, we had with collector (SEC_DEFAULT_NEG=NEVER) - described the problem to Nick 35) HAD scalability test (20 computers in DSL) - done 36) HAD manual section rewriting - done 37) Performance test of Coda - done 38) Uncheck the "use as default screensaver" checkbox from the installation, choose the 'Single User type' and check the 'Agree to license' checkbox + update on the detailed info page - done 39) Reproduce the statistics of Spinhenge@home on our site (stripcharts) - done 40) Move Superlink@home and Superlink@Technion to local FS - done by Mark 41) Reflect the number of CPU days utilized on the front page - done 42) Front page charts' datafiles rotation via logrotate - done 43) Add stderr output and time stamp to progress log - done 44) Compile new gnuplot with support for filling graphs - done 45) Check why Superlink stays in the memory after BOINC stops the execution - done 46) Graphs plotting for cplan - done 47) EGEE submission platform revival - done 48) Upload existing users photos - done 49) Superlink checkpoints - done (Mark provided the locations of false checkpoints in the source code) 50) Showing computers details - done 51) New stripchart for cplan (monitored clients) - done 52) Log rotation for charts datafiles of cplan project + do not forget to sample the last year entries in postrotate - done 53) RSS validation - done, problem resolved by the user 54) Realistic floating point estimations for the Superlink@Technion project - done 55) Promotion to a member from a new member for old users - done (within 14 days since the account creation) 56) Purge time - done (increased to 7 days) 57) Export statistics using db_dump on a daily basis, register to BOINC combined statistics - done 58) Merge Q&A and message board sections - done 59) Test BOINC/Superlink installers on Windows 98, Me, 2000 and Vista - done by Superlink@Technion users 60) System requirements on the front Superlink@Technion page plus decrease the font of the introductory words - done 61) Transactional submission to BOINC and to monitoring tables - done (takes less than 5 minutes for 60,000 jobs to get created) 62) Database backup in Superlink@Technion and Superlink@EGEE queue - done (dumps are checked into CVS on a daily basis at 23:57) 63) Create account through a web page in BOINC - done 64) Load monitoring daemon - done (for both Linux and Windows; deposited on the developers' site) 65) Create visual distribution of users according to their geographic location with their contribution in GFlops - done 66) Superlink, compiled on Mac-PowerPC, new platform for Superlink@Technion and new forum for Macintosh - done 67) Science forum on Superlink@Technion - done 68) Merging hosts - done for both projects, Superlink@Technion features both administrative and user merging interface 69) Killing dangling monitoring jobs on EGEE submission platform - done, adding commands that tend to get stuck from time to time to the list of monitored commands 70) Apache/PHP eAccelerator installation (cbl-link02, cbl-link07) - done 71) Transitioner denies uploaded output files that contain partial information - done 72) PreParallelDag.sh compliance to BOINC-free environment - done (RUN_BOINC option has been added to the local configuration) 73) BOINC client step-by-step installation documentation - done 74) EGEE submission, adjusting when the number of idle jobs is high - done (jobs, idle for more than an hour, are safe-cancelled and stop being monitored; a new threshold for currently running jobs) 75) Firewall on cbl-boinc-server{1,2} with entries for DNS/SMTP/SSH/HTTP/Condor range - done 76) Abstract for the 3rd EGEE User Forum - done 77) Install existing statistics on cbl-boinc-server1 and add per-location statistics - done (12-Nov-2007) 78) Access to cbl-link0{4,5,6} by ssh from within CBL only, any other access restricted - done (12-Nov-2007) 79) Recaptcha on Superlink@Technion to minimize automatic spam installed - done (13-Nov-2007) 80) EGEE submission machine in CBL pool and submission platform installation - done (15-Nov-2007) 81) Moving CVS repositories of all the backups to SVN - done (20-Nov-2007) 82) Enabling setting maximal number of running BOINC clients on submission platform(s) - done (22-Nov-2007) 83) Construct a platform for Superlink multi-platform builds - done for Linux i686/Linux x86_64/PowerPC/Windows (27-Nov-2007) 84) Distributing system components to different machines: MySQL node, Pedtool nodes, BOINC front-end nodes, NFS - done (05-Dec-2007) 85) PEDTool reconstruction, pruning and estimation algorithms - done (19-Feb-2008) 86) Test that suspension/quitting of Superlink in Superlink@clusters works - done (20-Feb-2008) 87) Configure DSL like the assistant account and add a support for assistant-like users in 'loadd' - done (20-Feb-2008) 88) Activate Condor pool at Technion with hard limit = 80 - done (20-Feb-2008) 89) Support for new statistics for all the projects - done (21-Feb-2008) 90) Stripcharts for DSL and LCCN - done (21-Feb-2008) 91) Merge the code for Condor and EGEE submission machines - done (21-Feb-2008) 92) Attach Madison Condor pool to Superlink@clusters - done (21-Feb-2008) 93) Synchronize charts on Superlink@Technion and Superlink@clusters - done (27-Feb-2008) 94) Numeric analysis of the Superlink round error - done (02-Mar-2008) 95) WU name -> XML - done (10-Apr-2008) 96) Migrate the new back-end (with support for statistics logging into the database) to Superlink@Technion and update the stripcharts mechanism accordingly (28-Apr-2008) 97) BOINC setup for Windows machines in Madison Condor pool - done (04-May-2008) 98) Generate usage breakdown by users on Superlink@clusters and integrate it into the stripcharts framework - done (04-May-2008) 99) Fault tolerance of the EGEE submission platform - each job is tried several times, before it stops being monitored - done 100) Reliable scheduling to smoothen the trail phenomenon - done 101) Reroute jobs from BOINC to Condor with the same priority in Pedtool - done 102) Annotation for the architecture diagram - done 103) Segregate stripcharts data processing according to the time intervals: long intervals updated less often, short intervals updated more often - done TODO: Condor leftovers: 1) Windows compilation 2) Additional checker for condor_reconfig 3) Document about Windows compilation BOINC/EGEE leftovers (in descending priority order): * Upper bound on the running time of Condor jobs * Register to http://dc-vault.com/ * Pandora - introduce the notion of daily restart to various checkers * Deposit TODO list for Superlink@Technion on the front page - after Mark returns from US * Superlink directory structure (and, perhaps, overall) man-like documentation * MyProxy and security issues in EGEE - in progress * Challenge/response in Superlink@Technion * Seek for CPU/memory balancer in the web * Create users' profiles upon account creation + UOTD modification * Join all Technion users to one Israeli team