Scheduling in grid: rescheduling MPI applications using a fault-tolerant MPI implementation
Abstract
Advances in grid technologies allow resources spread across the globe to be accessed using standard, general-purpose protocols. Simulations and scientific experiments that were once restricted by limited resource availability are now carried out extensively on the grid. Grid environments are dynamic: their resources are heterogeneous and are not under central control, which makes scheduling in a grid complex.
The initial schedule obtained for an application may not remain good, because it involves selecting resources for use at a future time, and resource characteristics such as CPU availability, memory availability, and network bandwidth keep changing. Rescheduling becomes necessary under these conditions. Of the many rescheduling methods in the literature, process migration is one.
This thesis uses the fault-tolerant functionality of MPICH-V2 to migrate MPI processes. Load-balancing modules that decide when and where to migrate a process are added to the MPICH-V2 system. Simulations show that process migration is a viable rescheduling technique for computationally intensive applications. The thesis also briefly describes some existing fault-tolerant MPI implementations.
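To make the "when and where" decision concrete, the following is a minimal illustrative sketch of a threshold-based migration policy. It is not the algorithm used in the thesis and does not call any MPICH-V2 interface: the `Node` type, the `choose_migration_target` function, and the `min_cpu` and `min_gain` thresholds are all hypothetical names and values chosen for this example.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Node:
    """Hypothetical view of one grid host, as a load monitor might report it."""
    name: str
    cpu_available: float  # fraction of CPU currently free, 0.0 to 1.0


def choose_migration_target(
    current: Node,
    candidates: Sequence[Node],
    min_cpu: float = 0.5,   # assumed threshold: below this, the host is overloaded
    min_gain: float = 0.2,  # assumed margin meant to outweigh the migration cost
) -> Optional[Node]:
    """Return the node to migrate to, or None if the process should stay."""
    # "When": consider migration only if the current host is short of CPU.
    if current.cpu_available >= min_cpu:
        return None

    others = [n for n in candidates if n.name != current.name]
    if not others:
        return None

    # "Where": pick the candidate with the most free CPU.
    best = max(others, key=lambda n: n.cpu_available)

    # Migrate only if the improvement exceeds the assumed margin.
    if best.cpu_available - current.cpu_available < min_gain:
        return None
    return best
```

A real policy would also weigh memory availability, network bandwidth, and the cost of checkpointing and restarting the process; this sketch considers CPU availability only.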