Policy Graphs for Byzantine Fault Detection in Communicating Multi-Agent Reinforcement Learning
Multi-agent RL systems that coordinate through learned communication graphs are vulnerable to Byzantine agents transmitting strategically corrupted observations. We propose BARD-MARL, a framework that augments Bayesian latent graph inference with policy-graph-based behavioral verification. Each agent's trajectory is mapped to a discrete directed policy graph capturing its decision-making topology. Spectral and behavioral features extracted from this graph are fed into an Isolation Forest for unsupervised anomaly detection. On a 25-agent traffic signal control environment, our detection pipeline achieves F1 = 0.60 at a 20% Byzantine fraction, three times the random baseline.
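The abstract does not specify BARD-MARL's feature extraction or detector configuration; the following is a minimal illustrative sketch of the kind of pipeline it describes (empirical policy-graph construction from a trajectory, spectral and behavioral features, Isolation Forest scoring). All function names, the trajectory format `(state, action, next_state)`, and the particular feature choices are assumptions for illustration, not the authors' method.

```python
import numpy as np
from collections import Counter
from sklearn.ensemble import IsolationForest

def policy_graph_features(trajectory, n_states):
    """Map one agent's trajectory to a small feature vector.

    trajectory: list of (state, action, next_state) tuples (assumed format).
    Returns [top-3 eigenvalue magnitudes, action entropy, edge count].
    """
    # Empirical policy graph: nodes are discrete states, edge weights
    # count observed state -> next_state transitions.
    A = np.zeros((n_states, n_states))
    for s, _, s_next in trajectory:
        A[s, s_next] += 1
    # Row-normalize into a transition matrix (unvisited states stay zero).
    row_sums = A.sum(axis=1, keepdims=True)
    P = np.divide(A, row_sums, out=np.zeros_like(A), where=row_sums > 0)
    # Spectral features: largest eigenvalue magnitudes of the graph.
    spectral = np.sort(np.abs(np.linalg.eigvals(P)))[::-1][:3]
    # Behavioral features: entropy of the action distribution, edge count.
    counts = np.array(list(Counter(a for _, a, _ in trajectory).values()),
                      dtype=float)
    probs = counts / counts.sum()
    entropy = -(probs * np.log(probs)).sum()
    n_edges = (A > 0).sum()
    return np.concatenate([spectral, [entropy, n_edges]])

def random_traj(rng, n_states=5, n_actions=3, T=200, byzantine=False):
    # Toy stand-in for rollouts: honest agents act uniformly at random;
    # the "Byzantine" agent follows a rigid deterministic cycle.
    traj, s = [], 0
    for _ in range(T):
        a = 0 if byzantine else int(rng.integers(n_actions))
        s_next = (s + 1) % n_states if byzantine else int(rng.integers(n_states))
        traj.append((s, a, s_next))
        s = s_next
    return traj

rng = np.random.default_rng(0)
feats = np.array(
    [policy_graph_features(random_traj(rng), 5) for _ in range(20)]
    + [policy_graph_features(random_traj(rng, byzantine=True), 5) for _ in range(5)]
)
# Unsupervised anomaly detection over per-agent feature vectors;
# predict() labels anomalous agents -1 and inliers +1.
labels = IsolationForest(random_state=0).fit(feats).predict(feats)
```

In this sketch each agent contributes one feature vector, so detection reduces to flagging outliers in a 25-point feature space; the deterministic "Byzantine" trajectories have zero action entropy and far fewer edges than the random ones, which is the kind of behavioral signature an Isolation Forest can isolate.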

