Tezin Türü: Doktora
Tezin Yürütüldüğü Kurum: Orta Doğu Teknik Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü, Türkiye
Tezin Onay Tarihi: 2014
Öğrenci: ÖZGÜR KAYA
Danışman: ALİ HİKMET DOĞRU
Özet:A fault management framework has been developed where a rule-based event processing language is also developed that provides improvement to the existing approaches in terms of time responsiveness. Reference architectures were developed for the fault management domain including fault avoidance capabilities. Such capability is for taking precautionary actions before the fault happens, while most of the fault tolerance techniques are intended for detecting a fault after it happens, hence utilizing the time with less efficiently. High availability is targeted through such measures for missioncritical systems. The need for this study was realized when a family of products were planned for different mission-critical systems to support them by different Fault Management subsystems. A real-time event-processing rule-based language and its processing tools were defined as a requirement during this work and final contribution was dedicated to this area. Requirements for the language did not include high demands on the inference capabilities and very high-level declarative logic, however, its response in the environment that deals with events at the computational platform level was important. The language and its processor were validated with tests and based on the available similar results in the literature, performed superior especially considering its specific purpose.