Edge computing has recently emerged as a way to execute jobs efficiently by pushing cloud capabilities to the edge of the network, which improves the quality of service for latency-sensitive Internet of Things (IoT) applications compared with cloud computing. By using current smart devices as edge nodes, edge computing can provide elastic resources that allow distributed data processing in a decentralized way. However, these smart devices are resource constrained and tend to face a higher failure rate than traditional distributed systems, so implementing a fault-tolerant system that ensures reliability and application availability becomes a key requirement. In this article, we propose a fault-tolerance methodology for edge computing based on checkpointing and replication. Our proposed system uses smart checkpointing for IoT application tasks executing in a distributed edge network and replicates the checkpoint files on alternative edge nodes in the vicinity to increase system reliability. The experimental results show that our approach is effective in terms of the reliability and availability of tasks executing in the edge network while meeting the deadlines of an IoT application.
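
To make the general idea of checkpointing with replication concrete, the following is a minimal Python sketch, not the system proposed in the article. It rests on several assumptions that do not come from the abstract: edge nodes are modelled as local directories, the checkpoint is a small JSON file, the task is a toy running sum, and a checkpoint is taken at a fixed step interval (the article's "smart" checkpointing policy is not described here and is not reproduced). All function names (`save_checkpoint`, `load_checkpoint`, `run_task`) are hypothetical.

```python
from __future__ import annotations

import json
from pathlib import Path


def save_checkpoint(state: dict, primary: Path, replicas: list[Path]) -> None:
    """Write the task state on the primary node and copy it to each replica.

    Assumption: every node is just a directory on the local file system.
    """
    payload = json.dumps(state)
    for node_dir in [primary, *replicas]:
        node_dir.mkdir(parents=True, exist_ok=True)
        tmp = node_dir / "checkpoint.json.tmp"
        tmp.write_text(payload)
        # Rename over the old file so a reader never sees a half-written one.
        tmp.replace(node_dir / "checkpoint.json")


def load_checkpoint(nodes: list[Path]) -> dict | None:
    """Return the most advanced checkpoint found on any reachable node."""
    best = None
    for node_dir in nodes:
        path = node_dir / "checkpoint.json"
        if not path.exists():  # node failed or never held a replica
            continue
        state = json.loads(path.read_text())
        if best is None or state["step"] > best["step"]:
            best = state
    return best


def run_task(total_steps: int, primary: Path, replicas: list[Path],
             interval: int = 3, fail_at: int | None = None) -> int:
    """Toy task: sum 0..total_steps-1, checkpointing every `interval` steps.

    `fail_at` simulates a node crash just before the given step is processed.
    """
    # Resume from the latest checkpoint if one survives anywhere.
    state = load_checkpoint([primary, *replicas]) or {"step": 0, "acc": 0}
    while state["step"] < total_steps:
        if fail_at is not None and state["step"] == fail_at:
            raise RuntimeError("simulated node failure")
        state["acc"] += state["step"]
        state["step"] += 1
        if state["step"] % interval == 0:
            save_checkpoint(state, primary, replicas)
    return state["acc"]
```

In this sketch, if the primary node's directory is lost after a crash, calling `run_task` again with a new primary and the same replica list resumes from the replicated checkpoint instead of restarting from step 0. A fixed interval is the simplest possible policy; an adaptive policy would trade checkpoint overhead against the amount of work lost on failure.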
               