Skip to main content

Monitoring Distributed Systems under Partial Synchrony

Publication Type
Year of Publication
2020
Conference/Journal Name
The 25th International Conference on Principles of Distributed Systems (OPODIS)
Page Numbers
20:1-20:17
Publisher
LIPICS
Abstract
In this paper, we study the problem of runtime verification of distributed applications that donotshare a global clock with respect to specifications in the linear temporal logics (LTL). Our proposed method distinguishes from the existing work in three novel ways. First, we make a practical assumption that the distributed system under scrutiny is augmented with a clock synchronization algorithm that guarantees bounded clock skew among all processes. Second, we do not make any assumption about the structure of predicates that form LTL formulas. This relaxation allows us to monitor a wide range of applications that was not possible before. Subsequently, we propose a distributed monitoring algorithm by employing SMT solving techniques. Third, given the fact that distributed applications nowadays run on massive cloud services, we extend our solution to a parallel monitoring algorithm to utilize the available computing infrastructure. We report on rigorous synthetic as well as real-world case studies and demonstrate that scalable online monitoring of distributed applications is within our reach.