December 14, Tuesday
12:00 – 14:00
This talk will describe our efforts to evaluate various scheduling strategies is a dynamic, realistic environment. Of the various parameters that affect job scheduling performance, workload and implementation play a pivotal role. Most studies either employ simulations and/or simplistic workloads, which contain many assumptions, including unknown ones. Instead, we developed a scheduling framework that implements several existing and novel algorithms on various cluster architectures of up to hundreds of nodes. This framework was used to produce the first experimental evaluation of several job scheduling strategies in a dynamic workload environment, using synthetic and scientific MPI applications. This talk will discuss the challenges involved in evaluating job scheduling strategies, and the approaches we chose to address them. An analysis will be presented of three factors affecting scheduling systems running dynamic workloads: multiprogramming level, time quantum, and the use of backfilling for queue management – and how they depend on offered load.
Joint work with Dror Feitelson (Hebrew U.), Fabrizio Petrini (LANL), and Juan Fernandez (Murcia U.)
Bio: Eitan Frachtenberg is a postdoctoral fellow at Los Alamos National Laboratory. He received his Ph.D (2003), M.Sc (2001) and B.Sc (1993) from the Hebrew University, all in computer science. His research interests include most aspects of high-performance computing, including scalable system software, job scheduling algorithms, interconnection design and protocols, cluster computing,and large-scale resource management.