Parallel Workloads Archive: SDSC DataStar

The San Diego Supercomputer Center (SDSC) DataStar log

System: 184-node IBM eServer pSeries 655/690
Duration: Mar 2004 thru Mar 2005
Jobs: 96,089

A log covering about a year of activity. It contains information on the requested and used nodes and time, CPU time, submit, wait and run times, and user.

The original log, with all the available information, is (was) available directly from the NPACI JOBLOG repository, in a special format described on that site. The copy available here is in the Standard Workload Format, which loses some data as specified below.

The NPACI JOBLOG repository is made available by Victor Hazlewood. If you use this log in your work, please use a similar acknowledgment.

This log is subject to the NPACI JOBLOG Repository Data Usage Agreement:
This Job Trace Repository is brought to you by the Allocated Systems and Security Technologies group of the San Diego Supercomputer Center (SDSC).
The JOBLOG and job trace data is Copyright 2000-2005 The Regents of the University of California All Rights Reserved
Permission to use, copy, modify and distribute any part of the JOBLOG data or job trace information from systems at the San Diego Supercomputer Center for educational, research and non-profit purposes, without fee, and without a written agreement is hereby granted, provided that this copyright notice is preserved in all copies or directories containing the data and all works based on use or analysis of this data is properly referenced in any written or electronic publication.

Downloads:

SDSC-DS-2004-2.swf 1.6 MB gz converted log
SDSC-DS-2004-2.1-cln.swf 1.6 MB gz cleaned log -- RECOMMENDED, see usage notes
SDSC-DS-2004-1.swf 1.6 MB gz OLD VERSION of converted log (replaced 6 Dec 2011)

Papers Using this Log:

This log was used in the following papers: [sotomayor06] [feitelson07a] [feitelson08] [folling09] [aida09] [liuz10] [lindsay12] [krakov12] [zakay12] [zakay13] [chen13] [rajbhandary13] [sheikhalishahi14] [zakay14] [zakay14b] [feitelson14] [lic14] [ntakpe17] [hai20]

System Environment

The total machine size is 184 nodes.

The nodes are of two types: p655, an 8-way SMP with 1.5 GHz processors and 16 GB of memory, and p690, a 32-way SMP with 1.7 GHz processors and 128 GB of memory. These nodes are divided into 3 partitions as follows:
#  Partition    p655  p690
1  interactive     5     1
2  batch         171     0
3  batch           0     7
(the p690 node in the interactive partition has only 64 GB of memory.)
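
As a quick consistency check on the partition table, the node counts can be summed and converted to processor counts. This is only a sketch: it assumes that each "8-way" p655 node contributes 8 processors and each "32-way" p690 node contributes 32, and the names `PARTITIONS` and `CPUS_PER_NODE` are illustrative, not part of the log.

```python
# Processors per node, assumed from the "8-way" / "32-way" SMP description.
CPUS_PER_NODE = {"p655": 8, "p690": 32}

# Node counts per partition, copied from the partition table.
PARTITIONS = [
    {"id": 1, "name": "interactive", "p655": 5, "p690": 1},
    {"id": 2, "name": "batch", "p655": 171, "p690": 0},
    {"id": 3, "name": "batch", "p655": 0, "p690": 7},
]


def partition_nodes(part):
    """Number of nodes of both types in one partition."""
    return part["p655"] + part["p690"]


def partition_cpus(part):
    """Number of processors in one partition under the assumption above."""
    return sum(part[kind] * cpus for kind, cpus in CPUS_PER_NODE.items())


total_nodes = sum(partition_nodes(p) for p in PARTITIONS)
total_cpus = sum(partition_cpus(p) for p in PARTITIONS)
print(total_nodes, total_cpus)
```

The node total agrees with the stated machine size of 184 nodes; under the stated assumption this corresponds to 1664 processors.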

Jobs are submitted to a set of queues. The main ones are

Name           Nodes  Time limit  Node limit  When
interactive    p655
interactive32  p690
express        p655   2 hr        8 nodes     weekdays
high           p655   18 hr       164 nodes
normal         p655   18 hr       164 nodes
express32      p690   2 hr        7 nodes     weekdays
high32         p690   18 hr       7 nodes
normal32       p690   18 hr       7 nodes
expressL       p690   2 hr        1 node      weekdays
highL          p690   18 hr       1 node
normalL        p690   18 hr       1 node
TGexpress      p690   2 hr        7 nodes     weekdays
TGhigh         p690   18 hr       7 nodes
TGnormal       p690   18 hr       7 nodes
The TG queues are for TeraGrid jobs.
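
For analysis scripts it can be convenient to hold the queue limits in a small lookup table. The following is a minimal sketch, not something provided by the archive: `QueueLimit`, `QUEUES`, and `fits` are hypothetical names, and blank table cells are stored as `None` and treated as "no limit known" rather than as a confirmed absence of limits.

```python
from typing import NamedTuple, Optional


class QueueLimit(NamedTuple):
    node_type: str
    max_hours: Optional[int]  # None where the table lists no time limit
    max_nodes: Optional[int]  # None where the table lists no node limit
    when: str                 # "weekdays" or "" as given in the table


# Values transcribed from the queue table.
QUEUES = {
    "interactive": QueueLimit("p655", None, None, ""),
    "interactive32": QueueLimit("p690", None, None, ""),
    "express": QueueLimit("p655", 2, 8, "weekdays"),
    "high": QueueLimit("p655", 18, 164, ""),
    "normal": QueueLimit("p655", 18, 164, ""),
    "express32": QueueLimit("p690", 2, 7, "weekdays"),
    "high32": QueueLimit("p690", 18, 7, ""),
    "normal32": QueueLimit("p690", 18, 7, ""),
    "expressL": QueueLimit("p690", 2, 1, "weekdays"),
    "highL": QueueLimit("p690", 18, 1, ""),
    "normalL": QueueLimit("p690", 18, 1, ""),
    "TGexpress": QueueLimit("p690", 2, 7, "weekdays"),
    "TGhigh": QueueLimit("p690", 18, 7, ""),
    "TGnormal": QueueLimit("p690", 18, 7, ""),
}


def fits(queue, nodes, hours):
    """True if a request is within the listed limits of the named queue.

    Limits stored as None are not checked.
    """
    limit = QUEUES[queue]
    if limit.max_nodes is not None and nodes > limit.max_nodes:
        return False
    if limit.max_hours is not None and hours > limit.max_hours:
        return False
    return True
```

For example, `fits("express", 8, 2)` is true while `fits("express", 9, 1)` is false. How the queue numbers recorded in the SWF file correspond to these queue names is not stated here, so the sketch is keyed by name only.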

For more information see the NPACI user guide.

Log Format

The original log is available from the NPACI JOBLOG repository, which also includes a description of its format.

Conversion Notes

The converted log is available as SDSC-DS-2004-2.swf (SDSC-DS-2004-1.swf is the old version, replaced 6 Dec 2011). The conversion from the original format to SWF was done by a log-specific parser in conjunction with a more general converter module.

Usage Notes

The log has a cleaned version available as SDSC-DS-2004-2.1-cln.swf. It is recommended that this version be used.
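
A minimal sketch of reading such a file is shown below. It relies on the general Standard Workload Format layout (header comment lines starting with `;`, then one job per line with 18 whitespace-separated numeric fields, where -1 marks a missing value). The field names in `SWF_FIELDS` and the function `parse_swf` are illustrative choices, and the sample lines are made up rather than taken from this log.

```python
SWF_FIELDS = [
    "job_number", "submit_time", "wait_time", "run_time",
    "allocated_processors", "average_cpu_time", "used_memory",
    "requested_processors", "requested_time", "requested_memory",
    "status", "user_id", "group_id", "executable_number",
    "queue_number", "partition_number", "preceding_job", "think_time",
]


def _num(text):
    """Parse a field as int when possible, otherwise as float."""
    try:
        return int(text)
    except ValueError:
        return float(text)


def parse_swf(lines):
    """Yield one dict per job record, skipping blank and ';' comment lines."""
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";"):
            continue
        values = line.split()
        if len(values) != len(SWF_FIELDS):
            raise ValueError(f"expected 18 fields, got {len(values)}: {line!r}")
        yield dict(zip(SWF_FIELDS, (_num(v) for v in values)))


# Made-up sample records for illustration only (not from the real log).
SAMPLE = """\
; Made-up header comment
1 0 10 3600 8 -1 -1 8 7200 -1 1 3 -1 -1 1 2 -1 -1
2 60 0 120.5 16 -1 -1 16 600 -1 1 4 -1 -1 2 2 -1 -1
"""

jobs = list(parse_swf(SAMPLE.splitlines()))
```

With a downloaded gzip-compressed copy, the same function could be fed from `gzip.open(path, "rt")`, where `path` is wherever the file was saved locally.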

The cleaning consisted of removing the first 20 jobs, as they seem to represent activity from long before the actual logging started. In particular, the first 10 jobs occur about 3 weeks before the bulk of the work.
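
The effect of this cleaning step can be approximated on the converted log with a few lines of code. This is only an illustration under stated assumptions: the tool actually used by the archive is not described here, `drop_first_jobs` is a hypothetical helper, and the sketch does not renumber the remaining jobs or adjust their submit times.

```python
def drop_first_jobs(lines, n=20):
    """Return the lines with the first n job records removed.

    Comment lines (starting with ';') and blank lines are always kept.
    """
    kept = []
    skipped = 0
    for line in lines:
        stripped = line.strip()
        is_job = bool(stripped) and not stripped.startswith(";")
        if is_job and skipped < n:
            skipped += 1
            continue
        kept.append(line)
    return kept


# Made-up miniature example with n=2 instead of 20.
sample = ["; header", "1 0 ...", "2 5 ...", "3 9 ..."]
cleaned = drop_first_jobs(sample, n=2)
```

Here `cleaned` keeps the header line and only the third job line.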

The Log in Graphics

File SDSC-DS-2004-2.1-cln.swf

[Plots: weekly cycle; daily cycle; burstiness and active users; job size and runtime histograms; job size vs. runtime scatterplot; utilization; offered load; performance]

