A rule-based domain specific language for fault management


Thesis Type: Doctorate

Institution: Middle East Technical University, Faculty of Engineering, Department of Computer Engineering, Turkey

Thesis Approval Date: 2014

Student: ÖZGÜR KAYA

Advisor: ALİ HİKMET DOĞRU

Abstract:

A fault management framework has been developed, together with a rule-based event processing language that improves on existing approaches in terms of time responsiveness. Reference architectures were developed for the fault management domain, including fault avoidance capabilities. Fault avoidance means taking precautionary action before a fault occurs, whereas most fault tolerance techniques are intended to detect a fault after it has occurred and therefore use the available time less efficiently. Such measures target high availability for mission-critical systems. The need for this study became apparent when a family of products was planned to provide different mission-critical systems with different Fault Management subsystems. A real-time, rule-based event processing language and its processing tools were identified as a requirement during this work, and the final contribution was dedicated to this area. The requirements for the language did not place high demands on inference capabilities or very high-level declarative logic; however, its responsiveness in an environment that deals with events at the computational platform level was important. The language and its processor were validated with tests and, compared with similar results available in the literature, performed better, especially considering the language's specific purpose.