Secure, Intermittent, and Model-Free Learning Framework for Autonomous Cyber Physical Systems
Author(s)
Connelly, Prachi P.
Advisor(s)
Editor(s)
Collections
Supplementary to:
Permanent Link
Abstract
Cyber-physical systems (CPS) integrate humans and computational systems to control physical plants using sensing signals and feedback loops. The interconnection, modularity, and exposure to human environments, reveal CPS to attacks from adversaries. In an adversarial, information-constrained, and model-agnostic environment, reinforcement learning (RL) methods design optimal control policies using reward signals that possess decentralized learning and deployment protocols. Increased continuous data-sharing between sensors, actuators, and the cloud adds complexities to the network leading to degradation of communication effectiveness and increased the possibility of reinforcement signal dropouts, leaving the learning framework with access to a sparse set of rewards or even state measurements.
This work presents a Moving Target Defense (MTD) framework, to switch between allowable actuator combinations. MTD allows the input-generation framework to maintain controllability, enhance performance, and safeguard the CPS from. A data-driven controllability algorithm identifies allowable actuator combinations to guarantee controllability. During sensing breakdown, lack of complete state measurements dictates an intermittent framework to approximate value functions and tune embedded neural networks. When continuous communication between the reinforcement channel and the controller is challenged, the proposed framework will develop an self learning algorithm that will provide conditions wherein the internal reinforcement signal is rich enough to allow learning of the optimal control policy and when a trade-off between internal and external reinforcements is utilized. Finally, bionic, human brain-centric learning for CPS is formalised using B.F. Skinner's operant conditioning re-contextualised for learning in CPS, as well as sparse evolutionary training of neuronal configurations informed with physics of the system to approximate solutions to the non-linear Hamilton-Jacobi-Bellman (HJB) is presented.
Sponsor
Date
2023-12-15
Extent
Resource Type
Text
Resource Subtype
Dissertation