| Preface |
|
vii | |
| Invited Papers |
|
1 | (44) |
|
Coordination models and languages for parallel programming |
|
|
3 | (15) |
|
|
|
|
|
|
|
|
|
|
|
Explosive advances in computational chemistry---Applications of parallel computing in biomedical and material science research |
|
|
18 | (14) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The challenge of massively parallel computing |
|
|
32 | (13) |
|
|
|
|
|
| Applications |
|
45 | (188) |
|
A communication library to couple simulation codes on distributed systems for multi-physics computations |
|
|
47 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Cellular automata model for parallel simulation of contamination processes by oil in porous soils |
|
|
56 | (7) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
HIPERPLAST: An HPCN simulator for reinforced thermoplastics injection processes |
|
|
63 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Parallel non linear electromagnetic modelling with FEM |
|
|
71 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
HIPERCIR: A scalable PC-based parallel system for medical imaging |
|
|
79 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Bayesian image restoration: Parallel implementation on a SGI origin multiprocessor |
|
|
87 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A framework for parallel multithreaded implementation of domain decomposition methods |
|
|
95 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Performance evaluation of a FD-TD parallel code for microwave ovens design |
|
|
103 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
MPEG1 and MPEG2 compression based on a workstation cluster |
|
|
112 | (8) |
|
|
|
|
|
|
|
|
|
|
|
Parallel implementation of a 3D BJT device simulator |
|
|
120 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Implementation and performance evaluation for a computation-intensive climate simulation application |
|
|
128 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Modelling head biomechanics on parallel platforms |
|
|
136 | (8) |
|
|
|
|
|
|
HIPERWATER: A high performance computing demonstrator for water network analysis |
|
|
144 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Parallel ground water flow modelling |
|
|
152 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Parallel computational magneto-fluid dynamics |
|
|
160 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Parallel processing of natural language parsers |
|
|
168 | (8) |
|
|
|
|
|
|
Three-dimensional direct numerical simulation of flow problems with electromagnetic control on parallel systems |
|
|
176 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Using PVM on computer network to perform fast pre-processing of large medical data set |
|
|
185 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Fine grain parallelization of multibody system equation of motion |
|
|
193 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Vehicle routing with time windows and stochastic demand |
|
|
201 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
209 | (8) |
|
|
|
|
|
|
|
|
|
|
|
Hybrid scheduling for realistic image synthesis |
|
|
217 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A parallel architecture for interactive FEM computations in a surgery simulator |
|
|
225 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Algorithms |
|
233 | (176) |
|
HIPERBUILD: An efficient parallel software for 3D structural analysis of buildings |
|
|
235 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Parallelization of the umbrella Monte Carlo algorithm |
|
|
243 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A two-dimensional parallel quadtree finite element mesh generator |
|
|
251 | (8) |
|
|
|
|
|
|
|
|
|
|
|
Dynamic multi-partitioning for parallel finite element applications |
|
|
259 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Factorized approximate inverse preconditioning of a parallel sparse eigensolver |
|
|
267 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A parallel finite element surface fitting algorithm for data mining |
|
|
275 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
External selective orthogonalization for the Lanczos algorithm in distributed memory environments |
|
|
283 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Implementation of parallel one-sided block Jacobi methods for the symmetric eigenvalue problem |
|
|
291 | (8) |
|
|
|
|
|
|
|
|
|
|
|
Communication overhead for parallel sparse Cholesky factorization on a reconfigurable network |
|
|
299 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Towards a fast parallel sparse matrix-vector multiplication |
|
|
308 | (8) |
|
|
|
|
|
|
|
|
|
|
|
Parallel adaptive 3-D wavelet analysis for fast and efficient video coding |
|
|
316 | (8) |
|
|
|
|
|
|
|
|
|
|
|
UG --- A parallel software tool for unstructured adaptive multigrids |
|
|
324 | (9) |
|
|
|
|
|
|
A framework for analyzing and designing parallel algorithms for tridiagonal systems |
|
|
333 | (9) |
|
|
|
|
|
|
Left-looking strategy for the sparse modified Cholesky factorization on NUMA multiprocessors |
|
|
342 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A parallel triangle operator for noise removal in true colour images |
|
|
350 | (8) |
|
|
|
|
|
|
A new scalable array processor for two-dimensional discrete Fourier transform |
|
|
358 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A generic all-pairs cluster-computing pipeline and its applications |
|
|
366 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Management of distributed dynamic data with algorithmic skeletons |
|
|
375 | (8) |
|
|
|
|
|
|
Experiments in parallel evolutionary partitioning |
|
|
383 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
MG --- A toolbox for parallel grid adaption and implementing multigrid solvers unstructured |
|
|
391 | (9) |
|
|
|
|
|
|
|
|
|
|
|
STW: Switch Time Warp. A model for rollback reduction in optimistic PDES |
|
|
400 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| System Software and Hardware Architecture |
|
409 | (338) |
|
Using optimal partition strategies for skeleton allocation |
|
|
411 | (8) |
|
|
|
|
|
|
|
|
|
|
|
SWC: A small framework for webcomputing |
|
|
419 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
On the memory performance of pure and impure, strict and non-strict functional programs |
|
|
427 | (8) |
|
|
|
|
|
|
|
|
|
|
|
Advanced visualization and data distribution steering in an HPF parallelization environment |
|
|
435 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Active I/O streams for heterogeneous high performance computing |
|
|
443 | (8) |
|
|
|
|
|
|
|
|
|
|
|
User level storage I/O: An experimental study of two storage protocols using the VI architecture |
|
|
451 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Dynamic run time support for skeletons |
|
|
460 | (8) |
|
|
|
|
|
|
Estimating IPC of a block structured instruction set architecture in an early design stage |
|
|
468 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Linux-clusters for lattice field theory |
|
|
476 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Novel highly parallel and systolic architectures using quantum dot-based hardware |
|
|
484 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A cellular environment for steering high performance scientific applications |
|
|
493 | (8) |
|
|
|
|
|
|
|
|
|
|
|
Implementing a functional approach for parallel resolution of irregular problems on distributed multiprocessors |
|
|
501 | (9) |
|
|
|
|
|
|
|
|
|
|
|
Distributed high-speed computing of multimedia data |
|
|
510 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Increasing the efficiency of value prediction in future processors by predicting less |
|
|
518 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Expressiveness versus optimizability in coordinating parallelism |
|
|
526 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Dynamic load balancing with self-organizing maps |
|
|
534 | (8) |
|
|
|
|
|
|
A framework for nesting algorithmic skeletons |
|
|
542 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Register allocation in hyper-block for EPIC processors |
|
|
550 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Study of data locality for iterative methods |
|
|
558 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
SUMA: A scientific metacomputer |
|
|
566 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
COMET: A communication-efficient load balancing strategy for multi-agent cluster computing |
|
|
574 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Performance comparison of four software architectures for distributed computations |
|
|
582 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Distributed high performance computing with OpusJava |
|
|
590 | (8) |
|
|
|
|
|
|
Malleable tasks: An efficient model for solving actual parallel applications |
|
|
598 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
File mapping in shared virtual memory using a parallel file system |
|
|
606 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Scalable simultaneous multithreading (ScSMT) |
|
|
615 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Achieving multiprogramming scalability of parallel programs on Intel SMP platforms: Nanothreading in the Linux kernel |
|
|
623 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Performances of hole based, chaotic and minimal fully-adaptive routing algorithms under constant resource constraint |
|
|
631 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A tool for SPMD application development with support for load balancing |
|
|
639 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Effective performance problem detection of MPI programs on MPP systems: From the global view to the details |
|
|
647 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Execution replay for an MPI-based multi-threaded runtime system |
|
|
656 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A practical methodology for defining histograms for predictions and scheduling |
|
|
664 | (8) |
|
|
|
|
|
|
Workload characteristics and effective scheduling in large parallel systems |
|
|
672 | (9) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A message oriented reliable multicast protocol for J.I.V.E. |
|
|
681 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Evaluation of file access patterns using realistic I/O workloads for a cluster environment |
|
|
689 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Parallel program model and environment |
|
|
697 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The asynchronous object-oriented programming model for parallel systems |
|
|
705 | (9) |
|
|
|
|
|
|
|
|
|
|
|
Compiling for fast state capture of mobile agents |
|
|
714 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A scalable multithreaded compiler front-end |
|
|
722 | (8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A 3D-Java tool to visualize loop-carried dependences |
|
|
730 | (8) |
|
|
|
|
|
|
Bubble-driven optimization of instruction level parallel programs |
|
|
738 | (9) |
|
|
|
|
|
|
|
|
|
|
| Industrial Perspective |
|
747 | (16) |
|
COMPAQ and QSW scalable scientific computing |
|
|
749 | (14) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Extended Abstracts |
|
763 | (4) |
|
Parallel inexact Newton and interior point methods |
|
|
765 | (1) |
|
|
|
|
|
|
|
|
|
|
|
Parallel SAR processing on Linux PCs enables operational radar remote sensing |
|
|
766 | (1) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Author Index |
|
767 | |