International recognition for Australian supercomputer debuggers
26 Mar 2010
Researchers from Monash University's eScience and Grid Engineering Lab are attracting international attention and funding for their unique expertise in debugging supercomputer software.
With conventional error-detecting programs ill-equipped to deal with the large and complex data sets of parallel distributed supercomputers, revolutionary debugging software developed by the research team has garnered interest from agencies around the world.
The research team, led by the Lab's director, Professor David Abramson, recently received funding support from the United States Department of Energy, an agency leading an international supercomputer R&D consortium that includes IBM. The team also has a commercialisation agreement with supercomputer manufacturing giant Cray.
Professor Abramson said the funding support from the US Department of Energy, together with an Australian Research Council Linkage grant with Cray, would enable the research team to "develop debuggers that scale to millions of processors. It will also allow us to leverage state-of-the-art software development environments to improve programmer productivity."
The uniqueness of the research team's expertise lies in a novel approach to the debugging process.
"While traditional debuggers work by comparing program variables with user expectations, our 'relative' debugging operates by comparing data in one program with data in another that is known to be correct. So it works by detecting where the codes differ rather than from the principle of how the code should be," Professor Abramson said.
"The debugging software which we have developed -- and which is a commercial application of research we have been conducting for several years -- efficiently weeds out glitches in supercomputers through a process that could be described as the technical equivalent of a 'spot the difference' puzzle."
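To make the "spot the difference" idea concrete, the following is a minimal Python sketch of the comparison Professor Abramson describes. It is an illustration only and does not represent the team's software: the function names (first_divergence, reference_sum, suspect_sum), the trace format and the tolerance value are all assumptions made for this example. Each program records its variables at every step, and the comparison reports the first point at which the suspect program's data differs from data produced by the version known to be correct.

```python
import math


def first_divergence(reference_trace, suspect_trace, tolerance=1e-9):
    """Return (step, name, reference_value, suspect_value) for the first
    variable that differs between the two traces, or None if they agree.

    Each trace is a list of dicts mapping variable names to numbers,
    one dict per recorded step (a hypothetical format for this sketch).
    """
    for step, (ref_vars, sus_vars) in enumerate(zip(reference_trace, suspect_trace)):
        for name in sorted(ref_vars):
            ref_value = ref_vars[name]
            sus_value = sus_vars.get(name)
            if sus_value is None or not math.isclose(
                ref_value, sus_value, rel_tol=tolerance, abs_tol=tolerance
            ):
                return step, name, ref_value, sus_value
    return None


def reference_sum(values):
    """Version assumed to be correct: records the running total at each step."""
    trace = []
    total = 0.0
    for value in values:
        total += value
        trace.append({"total": total})
    return trace


def suspect_sum(values):
    """Version under test, with a deliberate bug: the third value is negated."""
    trace = []
    total = 0.0
    for index, value in enumerate(values):
        if index == 2:
            value = -value  # deliberate bug for illustration
        total += value
        trace.append({"total": total})
    return trace


data = [1.0, 2.0, 3.0, 4.0]
result = first_divergence(reference_sum(data), suspect_sum(data))
print(result)  # (2, 'total', 6.0, 0.0)
```

Here the comparison needs no statement of what the running total should be at any step. It only reports that the two programs first disagree at step 2, which points the programmer to where the codes differ.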