At window 3, every metric looked perfect.
RWSS = 1.000. Output probabilities unchanged. No labels moved.
Everything said “all clear.”
Then the alert fired anyway.
Window 3: severity=warning RWSS=1.000 fired=True ← FIDI Z fires here
The model’s predictions didn’t know anything was wrong yet.
But the symbolic layer did.
This is what actually happened in the experiment — and why it matters for anyone running fraud models in production.
Full code: https://github.com/Emmimal/neuro-symbolic-drift-detection
TL;DR: What You Will Get From This Article
- FIDI Z-Score detects concept drift in 5 of 5 seeds, sometimes before F1 drops, with zero labels required
- RWSS alone missed 3 of 5 seeds. A Z-score extension of FIDI is what makes it work
- Covariate drift is a complete blind spot. It needs a separate raw-feature monitor
- The alert system is ~50 lines of code and the difference between a scheduled retrain and an emergency one
Not familiar with the series? Hybrid Neuro-Symbolic Fraud Detection: Guiding Neural Networks with Domain Rules covers the architecture. How a Neural Network Learned Its Own Fraud Rules: A Neuro-Symbolic AI Experiment explains how the model discovers its own rules. This is the drift detection chapter.
The Story So Far
This is Part 3 of a series. New here? One paragraph is all you need.
A HybridRuleLearner trains two parallel paths: an MLP for detection and a rule path that learns symbolic IF-THEN conditions from the same data. The rule path found V14 on its own across two seeds, without being told to look for it. That learned rule (IF V14 < −1.5σ → Fraud) is now the thing being monitored. This article asks what happens when V14 starts behaving differently.
Can the rules act as a canary? Can neuro-symbolic concept drift monitoring work at inference time, without labels?
Three Ways Fraud Can Change
Concept drift fraud detection is harder than it sounds because only one of the three common drift types actually changes what the model’s learned associations mean. The experiment simulates three types of drift on the Kaggle Credit Card Fraud dataset (284,807 transactions, 0.17% fraud rate) across 8 progressive windows each [9].
Covariate drift. The input feature distributions shift. V14, V4, and V12 move by up to +3.0σ progressively. Fraud patterns stay the same. The world just looks a little different.
Prior drift. The fraud rate increases from 0.17% toward 2.0%. Features are unchanged. Fraud becomes more common.
Concept drift. The sign of V14 is gradually flipped for fraud cases across 8 windows. By the end, the transactions the model learned to flag as fraud now look like legitimate ones. The rule IF V14 < −1.5σ → Fraud is now pointing in the wrong direction.
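For readers who want to reproduce the setup, a sign-flip injection along these lines is straightforward. The function below is an illustrative sketch, not the repo's exact code; `v14_idx`, the seed, and the rounding choice are assumptions:

```python
import numpy as np

def inject_concept_drift(X, y, v14_idx, window, n_windows=8, seed=42):
    """Gradually flip V14's sign for fraud rows: no flips at window 0,
    every fraud row flipped by the final window."""
    rng = np.random.default_rng(seed)
    X = X.copy()
    flip_fraction = window / (n_windows - 1)          # 0.0 at w0 -> 1.0 at w7
    fraud_rows = np.where(y == 1)[0]
    n_flip = int(round(len(fraud_rows) * flip_fraction))
    flip_rows = rng.choice(fraud_rows, size=n_flip, replace=False)
    X[flip_rows, v14_idx] *= -1.0                     # reverse the learned relationship
    return X
```

By the final window, the transactions the rule was trained to flag look like their mirror image along V14.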
That third one is the one that should worry you in production. With covariate and prior drift, there are external signals. Input distributions shift, or fraud rates visibly change. You can monitor those independently. Concept drift leaves no such footprint. The only thing that changes is what the model’s learned associations mean. You will not know until F1 starts falling.
Unless something sees it first.
The Problem With the First Three Metrics
The model from How a Neural Network Learned Its Own Fraud Rules: A Neuro-Symbolic AI Experiment produced three label-free monitoring signals as a by-product of the symbolic layer. The idea: if the rules are learning fraud patterns, changes in how those rules fire should reveal when fraud patterns are shifting.
I expected the first one to be the early warning. It was not.
The problem is specific to how this model trains. All five seeds converged between epochs 3 and 10 (Val PR-AUC: 0.7717, 0.6915, 0.6799, 0.7899, 0.7951), when temperature τ is still between 3.5 and 4.0. At that temperature, rule activations are soft. Every input produces a near-identical activation score regardless of its actual features. In plain terms: the rules were firing almost the same way on every transaction, clean or drifted. A similarity metric on near-constant vectors returns 1.000 almost all the time. The first signal, RWSS, only fired in 2 of 5 seeds for concept drift, and in both cases it was the same window as F1 or later.
Why high temperature makes monitoring harder
The LearnableDiscretizer uses a sigmoid gated by temperature τ: σ((x − θ) / τ). At τ = 5.0 (epoch 0), that sigmoid is nearly flat: every feature value produces an activation close to 0.5 regardless of where it sits relative to the learned threshold. As τ anneals toward 0.1, the sigmoid sharpens into a near-binary step. Early stopping fires at τ ≈ 3.5–4.0, before the rules have fully crystallised. The result: activation vectors are near-constant across all inputs, so any similarity metric between them stays near 1.000 even when fraud patterns are genuinely shifting.
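The flattening is easy to verify numerically. The threshold θ and sample values below are illustrative:

```python
import numpy as np

def soft_activation(x, theta=0.0, tau=1.0):
    """Temperature-gated sigmoid, as in a LearnableDiscretizer-style gate."""
    return 1.0 / (1.0 + np.exp(-(x - theta) / tau))

x = np.array([-3.0, -1.0, 0.0, 1.0, 3.0])   # feature values in σ units

# At τ = 4.0 (the early-stopped regime) everything hugs 0.5
print(soft_activation(x, tau=4.0))   # ≈ [0.32, 0.44, 0.50, 0.56, 0.68]

# At τ = 0.1 (fully annealed) the gate is a near-binary step
print(soft_activation(x, tau=0.1))   # ≈ [0.00, 0.00, 0.50, 1.00, 1.00]
```

A feature three standard deviations from the threshold barely moves the activation at high τ, which is exactly why similarity metrics on those vectors saturate at 1.000.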
The second signal had the opposite problem. The absolute change in any feature’s contribution is tiny (values in the 0.001–0.005 range) because the rule weights themselves are small at an early-stopped checkpoint. In plain terms: the signal was real but invisible at the scale we were measuring it. A fixed absolute threshold of 0.02 never fires.
Here is what those three original signals are:
- RWSS (Rule Weight Stability Score): cosine similarity between the baseline mean rule activation vector and the current one. In simple terms: are the rules still firing the same way they did on clean data?
- FIDI (Feature Importance Drift Index): how much each feature’s contribution to rule activations has changed from the baseline. In simple terms: has any specific feature become more or less important to the rules?
- RFR (Rule Firing Rate): what fraction of transactions fire each rule.
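As a point of reference, RWSS itself reduces to a few lines. This sketch assumes activations arrive as an (n_samples, n_rules) array; the repo's exact implementation may differ:

```python
import numpy as np

def compute_rwss(baseline_acts, current_acts):
    """Cosine similarity between the baseline and current mean
    rule-activation vectors. 1.0 means the rules fire exactly as
    they did on clean data."""
    b = baseline_acts.mean(axis=0)
    c = current_acts.mean(axis=0)
    return float(np.dot(b, c) / (np.linalg.norm(b) * np.linalg.norm(c) + 1e-12))
```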
That diagnosis led to the right question. Instead of asking “has FIDI changed by more than X?”, the right question is “has FIDI changed by more than X standard deviations from its own history?”
That question has a different answer. And the answer is V14.
The Metrics: Building a Label-Free Drift Detection System
Three new metrics joined the original three.
RWSS Velocity measures the per-window rate of change: RWSS[w] − RWSS[w−1]. A sudden drop of more than 0.03 per window fires an alert even before the absolute value crosses the threshold. If RWSS is falling at −0.072 in one step, that is a signal regardless of where it started.
FIDI Z-Score is the one that actually worked. Rather than a brand new signal, it is a simple extension of FIDI using Z-score normalisation against the feature's own window history. Instead of asking whether the absolute change crosses a fixed threshold, it asks whether the change is anomalous relative to what that feature has been doing.

Unlike traditional drift detection methods that rely on input distributions or output labels, this approach operates purely on the symbolic layer, which means it works at inference time, with no ground truth required. It builds on differentiable rule-learning work including ∂ILP [3], FINRule [4], RIFF [5], and Neuro-Symbolic Rule Lists [6], extending those representations with Z-score normalisation rather than fixed thresholds.

V14's contribution to rule activations during the clean baseline windows is small and flat. Near zero, stable, predictable. When concept drift begins at window 3, it shifts. Not by much in absolute terms. But by 9.53 standard deviations relative to the history it built during stable windows. That is an enormous relative anomaly, and no threshold calibration is needed to catch it.
PSI on Rule Activations was designed to catch distributional shift in the symbolic layer before the MLP’s compensation masks it at the output level. It did not work here. The soft activations from early-stopped training (τ ≈ 3.5–4.0 at the saved checkpoint) cluster near 0.5, producing near-uniform distributions that PSI cannot distinguish. PSI_rules = 0.0049 throughout the entire experiment. PSI_rules never fired. It is in the codebase for when models with fully crystallised rules (τ < 0.5) are available. In this experiment it contributed nothing.
The intended detection order, from earliest to latest:
RWSS Velocity → FIDI Z-Score → PSI(rules) → RWSS absolute → F1 (label-based)
Here is what actually happened.
Results: What Each Metric Did
Concept Drift
| Seed | F1 fires | RWSS fires | VEL fires | FIDIZ fires | PSIR fires |
|---|---|---|---|---|---|
| 42 | W3 | W4 (1w late) | W4 (1w late) | W3 (simultaneous) | — |
| 0 | W3 | — | — | W3 (simultaneous) | — |
| 7 | W4 | W4 (simultaneous) | W4 (simultaneous) | W3 (+1w early) | — |
| 123 | W3 | — | — | W3 (simultaneous) | — |
| 2024 | W4 | — | — | W3 (+1w early) | — |
FIDI Z-Score fires in 5 of 5 seeds, always at window 3. F1 fires at W3 in three seeds and W4 in two. The mean FIDIZ detection lag is +0.40 windows, meaning it leads F1 on average. In seeds 7 and 2024 it fires one full window before F1 drops. In the remaining three seeds it fires simultaneously. It never fires after F1 for concept drift. Not once.
Across all drift types, FIDI Z-Score is the only metric that detected concept drift in every seed and never lagged behind F1. For label-free drift detection fraud systems, that is the headline result.
RWSS fires in 2 of 5 seeds and in both cases simultaneously with or after F1. Velocity matches RWSS exactly, same window, every time. PSI on rule activations never fires at all.
Concept Drift vs Covariate Drift: Why Symbolic Monitoring Has Blind Spots
Covariate drift is where the symbolic layer goes completely silent.
Every symbolic metric: 0 of 5 seeds. Not one signal. Not one window. F1 eventually fires in 4 of 5 seeds at W6 or W7, slowly and late, and the symbolic layer had nothing to do with it. This is not a gap that better tuning will close. It is a fundamental property of what the symbolic layer measures.
The reason is mechanical. When V14, V4, and V12 shift by +3.0σ, the shift is uniform across all samples. The learnable discretizer computes thresholds relative to the data. Each sample still lands in roughly the same threshold bin relative to its neighbours. Rules fire on approximately the same proportion of transactions. Nothing in the activation pattern changes. Cosine similarity of mean activations stays at 1.0.
In simple terms: if every transaction shifts by the same amount, the rules still see the same relative picture. Transaction A was above the threshold before. It is still above the threshold after. The fraud-vs-legitimate ordering is preserved. RWSS measures that ordering, not the absolute values. Think of it as a tide that lifts all boats equally. The boats stay in the same order. RWSS only measures the order.
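The tide analogy can be checked numerically. This toy assumes, as described above, that the discretizer's threshold is effectively relative to the data, modelled here by standardising before the sigmoid:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(0.0, 1.0, size=1000)     # clean V14-like feature
x_shifted = x + 3.0                     # uniform +3.0σ covariate shift

def activations(v, tau=0.5):
    """Sigmoid gate with a data-relative threshold (here, the mean)."""
    z = (v - v.mean()) / v.std()        # every sample keeps its relative position
    return 1.0 / (1.0 + np.exp(-z / tau))

a_clean = activations(x)
a_drift = activations(x_shifted)

# Same relative picture: mean activations essentially identical
print(np.abs(a_clean.mean() - a_drift.mean()))   # ~0.0
```

The shift cancels out inside the standardisation, so the symbolic layer sees nothing, which is precisely the blind spot.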
If covariate drift is a concern in your deployment, you need a separate input-space monitor: PSI on raw features, a KS test on V14, or a data quality check. The symbolic layer cannot help you there. Symbolic layer drift monitoring has one blind spot, and covariate shift is it.

Prior Drift
FIDIZ fires in 5 of 5 seeds, always at W3. But prior drift causes F1 to drop at W0 (seed 123) or W2 (seed 2024) in the two seeds where F1 fires at all. FIDIZ detection lag for prior drift: −2.00 windows. It fires two windows after F1.
This is not a calibration problem. FIDIZ needs a minimum of 3 clean windows to build a history before its Z-score is meaningful. Prior drift that causes an immediate fraud rate jump is already visible in F1 before FIDIZ can even start computing. A rolling fraud rate counter will always be faster here.

The Alert Demo: Window 3
Here is the moment the whole system was built for.
DriftAlertSystem is built once from the validation set immediately after training. It stores the baseline. Then .check() is called on each new window. No labels. No retraining. This is inference-only drift detection: the system reads the symbolic layer and nothing else.
Seed 42, concept drift, 8 windows:
Window 0: severity=none RWSS=0.999 fired=False
Window 1: severity=none RWSS=0.999 fired=False
Window 2: severity=none RWSS=0.999 fired=False
Window 3: severity=warning RWSS=1.000 fired=True ← FIDI Z fires here
Window 4: severity=critical RWSS=0.928 fired=True ← RWSS absolute confirms
Window 5: severity=warning RWSS=0.928 fired=True
Window 6: severity=warning RWSS=0.928 fired=True
Window 7: severity=warning RWSS=0.928 fired=True
At window 3, RWSS is exactly 1.000. The activation pattern is perfectly identical to baseline. Output probabilities have not changed. Nothing in the standard monitoring stack has moved.
And the alert fires at WARNING severity.
The reason is V14. Its Z-score is −9.53. That means V14’s contribution to rule activations has shifted to nearly 10 standard deviations below the baseline it established during clean windows. The model’s output does not know yet. The MLP is compensating. But the rule path cannot compensate. It was trained to express a fixed symbolic relationship. It is screaming.
One window later, the MLP stops holding. RWSS drops to 0.928. Velocity falls 0.072 in one step. Severity escalates to CRITICAL.
═══════════════════════════════════════════════════════
DRIFT ALERT | severity: CRITICAL
Earliest signal: VELOCITY
═══════════════════════════════════════════════════════
── Early-Warning Layer ─────────────────────────────
RWSS Velocity : -0.0720 [threshold -0.03] ⚠ FIRED
FIDI Z-Score : ⚠ FIRED
V14 Z = -9.53
PSI (rules) : 0.0049 [moderate≥0.10] stable
── Confirmed Layer ─────────────────────────────────
RWSS absolute : 0.9276 [threshold 0.97] ⚠ FIRED
Rules gone silent: 0 OK
Mean RFR change : -0.001
Recommended action:
→ Retrain immediately. Do not deploy.
═══════════════════════════════════════════════════════
The report names VELOCITY as the earliest layer. That is a priority order in the internal logic. In actual window timing, FIDI Z-Score fired one window earlier at W3. The W3 WARNING is the earlier human-facing alert. The one that gives you time to act before the CRITICAL fires.

Why FIDI Z-Score Sees It Before F1 Does
The model has two paths running in parallel from the same input.
The MLP path carries 88.6% of the final output (mean α = 0.886 across seeds; α is the learned blend weight; 0.886 means the neural network does 88.6% of the prediction work and the symbolic rules do the remaining 11.4%). When concept drift gradually reverses V14’s relationship to fraud labels, the MLP, trained on 284,000 transactions, partially absorbs that change. Its internal representations shift. Output probabilities stay roughly stable for at least one window. This is the MLP compensating.
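For concreteness, the blend described above is a convex mixture. The function name here is illustrative, not the repo's API:

```python
def blended_output(p_mlp, p_rules, alpha=0.886):
    """Final fraud probability: learned mixture of the two paths."""
    return alpha * p_mlp + (1.0 - alpha) * p_rules

# Because the MLP holds steady under early concept drift, even a large
# rule-path swing of 0.4 moves the blended output by only ~0.046
print(blended_output(0.90, 0.80) - blended_output(0.90, 0.40))
```

This is why the output probabilities look stable at window 3: the 11.4% rule path can be screaming while the 88.6% MLP path drowns it out.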
The rule path carries 11.4%. It was trained to express the MLP’s knowledge in symbolic form: V14 below a threshold means fraud [2]. That relationship is fixed and explicit. When V14 flips sign for fraud cases, the rule’s V14 contribution does not adjust. It simply stops working. The bit activations for V14 change direction. The rule starts firing on the wrong transactions.
The neural network adapts. The symbolic layer does not. And that is exactly why the symbolic layer detects the drift first.
That asymmetry is what FIDI Z-Score exploits.
The absolute change in V14’s contribution is tiny (values in the 0.001 to 0.005 range) because rule weights are small at an early-stopped checkpoint. A fixed absolute threshold never catches it.

But V14's contribution history through the clean windows is equally flat: near zero, stable, predictable. So when concept drift moves it at window 3, the Z-score is −9.53. The same pattern as before: near-zero absolute change, extreme relative shift.
The symbolic layer compensates less than the MLP, so it shows the drift first. FIDI Z-Score makes the signal visible by comparing each feature not to a fixed threshold, but to its own history.
But this only holds for one of the three drift types. The other two are a different story entirely.
What This System Cannot Do
A system that claims early warning invites overstatement. Here is what the data actually says. This is label-free anomaly detection fraud monitoring, which means the constraints are structural, not tunable.
Covariate drift is a complete blind spot. 0 of 5 seeds. The mechanism is explained in the Results section above. Use PSI on raw features or a KS test on V14 instead.
FIDIZ fires late on prior drift by design. When the fraud rate jumps, F1 reacts at W0 or W2. FIDIZ structurally cannot fire before W3. It needs history that does not yet exist. A rolling fraud rate monitor responds faster.
PSI on rule activations produced nothing. PSI_rules = 0.0049 throughout every window of every seed. Soft activations from early-stopped training cluster near 0.5, and PSI on near-uniform distributions is insensitive regardless of what is actually happening. This metric is in the codebase and may work with fully annealed models (τ < 0.5). In this experiment it was silent.
5 seeds is evidence, not proof. FIDIZ fires at W3 for concept drift across all 5 seeds. That is consistent and encouraging. It is not the same as reliable in production across datasets, fraud types, and drift severities you have not tested. 5 seeds is a starting point, not a conclusion. More seeds, more drift configurations, and real-world validation are needed before strong deployment claims.
Results Summary
The pattern is clearest when stated plainly first. Think of this as an early warning concept drift system with three distinct modes depending on what is changing. Covariate drift: the symbolic layer saw nothing, F1 caught it slowly. Prior drift: the symbolic layer fired after F1, not before. Concept drift: FIDI Z-Score fired in every single seed, always at or before F1, averaging +0.40 windows of lead time.
| Drift type | F1 fired | RWSS fired | FIDIZ fired | FIDIZ mean lag |
|---|---|---|---|---|
| Covariate | 4/5 | 0/5 | 0/5 | — |
| Prior | 2/5 | 0/5 | 5/5 | −2.00w (late) |
| Concept | 5/5 | 2/5 | 5/5 | +0.40w (early) |
Lag = windows before F1 alert. Positive = FIDIZ fires first. Negative = F1 fires first.

Building It
The system is designed to be used in production, not just in a notebook.
```python
# Once, immediately after training
X_val_t = torch.FloatTensor(X_val)
alert_system = DriftAlertSystem.from_trained_model(model, X_val_t, feature_names)
alert_system.save("results/drift_alert_baseline_seed42.pkl")

# Every scoring run — weekly, daily, per-batch
alert_system = DriftAlertSystem.load("results/drift_alert_baseline_seed42.pkl")
alert = alert_system.check(model, X_this_week)
if alert.fired:
    print(alert.report())
```
No labels. No retraining. No infrastructure beyond saving a pickle file next to the model checkpoint. The .check() call computes RWSS velocity, FIDI Z-Score, PSI on activations, and RWSS absolute in that order, using PyTorch [7] and scikit-learn [8]. Severity escalates from none to warning to critical based on how many fire and how far RWSS has dropped.
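That escalation can be sketched in a few lines. This is a hedged reconstruction, not the repo's exact logic; the 0.97 RWSS threshold matches the report shown earlier, the rest is assumed:

```python
def drift_severity(vel_fired, fidi_z_fired, psi_fired, rwss_abs):
    """Map fired signals to none / warning / critical severity."""
    early = sum([vel_fired, fidi_z_fired, psi_fired])   # early-warning layer
    if rwss_abs < 0.97 and early >= 1:
        return "critical"      # confirmed layer agrees with an early signal
    if early >= 1 or rwss_abs < 0.97:
        return "warning"       # one layer fired on its own
    return "none"

print(drift_severity(False, True, False, 1.000))   # 'warning'  (window 3)
print(drift_severity(True, True, False, 0.928))    # 'critical' (window 4)
```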
The three early-warning computations are each a few lines.
RWSS Velocity: rate of change per window.
```python
from typing import List

def compute_rwss_velocity(rwss_history: List[float]) -> float:
    """Per-window rate of change: RWSS[w] - RWSS[w-1]."""
    if len(rwss_history) < 2:
        return 0.0
    return float(rwss_history[-1] - rwss_history[-2])

# Alert fires when drop > 0.03 per window
vel_fired = rwss_velocity < -0.03
```
FIDI Z-Score: normalise feature contribution anomaly against history.
```python
import numpy as np

def compute_fidi_zscore(fidi_history, current_fidi, min_history=3):
    """Z-score of each feature's FIDI against its own window history."""
    if len(fidi_history) < min_history:
        return {k: 0.0 for k in current_fidi}   # not enough history yet
    z_scores = {}
    for feat_idx, current_val in current_fidi.items():
        history_vals = [h.get(feat_idx, 0.0) for h in fidi_history]
        mean_h = np.mean(history_vals)
        std_h = np.std(history_vals)
        z_scores[feat_idx] = (current_val - mean_h) / std_h if std_h > 1e-8 else 0.0
    return z_scores

# Alert fires when any feature Z > 2.5
fidi_z_fired = any(abs(z) > 2.5 for z in z_scores.values())
```
PSI on Rule Activations: distributional shift in the symbolic layer (included for completeness).
```python
import numpy as np

def compute_psi_rules(baseline_acts, current_acts, n_bins=10):
    """Mean Population Stability Index across rule activation columns."""
    bins = np.linspace(0, 1, n_bins + 1)
    psi_per_rule = []
    for r in range(baseline_acts.shape[1]):
        b = np.histogram(baseline_acts[:, r], bins=bins)[0] + 1e-6
        c = np.histogram(current_acts[:, r], bins=bins)[0] + 1e-6
        b /= b.sum()
        c /= c.sum()
        psi_per_rule.append(float(np.sum((c - b) * np.log(c / b))))
    return np.mean(psi_per_rule)
```
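Its silence is easy to reproduce. With soft, early-stopped activations clustered near 0.5, almost all histogram mass lands in the same two bins in both batches, so PSI stays near zero. The distribution parameters below are illustrative:

```python
import numpy as np

rng = np.random.default_rng(1)
# Soft, early-stopped activations: both batches cluster near 0.5
baseline = np.clip(rng.normal(0.5, 0.05, size=5000), 0, 1)
current = np.clip(rng.normal(0.5, 0.05, size=5000), 0, 1)

bins = np.linspace(0, 1, 11)
b = np.histogram(baseline, bins=bins)[0] + 1e-6
c = np.histogram(current, bins=bins)[0] + 1e-6
b, c = b / b.sum(), c / c.sum()
psi = float(np.sum((c - b) * np.log(c / b)))
print(psi)   # well below the 0.10 "moderate" line
```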
V14: Three Articles, One Feature
This is the part I did not plan. But V14 concept drift behaviour turns out to be the thread that ties all three articles together.
Guiding Neural Networks with Domain Rules: I wrote rules about large transaction amounts and anomalous PCA norms. Reasonable intuitions. Nothing to do with V14.
How a Neural Network Learned Its Own Fraud Rules: The model found V14 anyway. Given 30 anonymised features and no guidance, the gradient landed on the one feature with the highest absolute correlation to fraud. Twice, across two independent seeds.
This article: I deliberately made V14 break. I flipped its sign for fraud cases, gradually, across 8 windows. And FIDI Z-Score registered the collapse at −9.53 standard deviations while RWSS was still 1.000 and F1 had not moved.

The same feature, three different roles: ignored, discovered, then monitored as the first thing to fail. That coherence was not engineered. It is what reproducible multi-seed evaluation on a consistent dataset keeps producing.
What to Do With This
Use FIDI Z-Score for concept drift detection without labels. It fires in 5 of 5 seeds, requires only 3 windows of history, never fires after F1, and needs no labels. Keep the Z-score threshold at 2.5 and minimum history at 3 windows.
Add a separate input-space monitor for covariate drift. PSI on raw features or a KS test on critical features like V14. The symbolic layer is blind to distributional shifts that preserve relative activation order.
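A minimal version of that KS monitor, using `scipy.stats.ks_2samp`. The function name and the conventional 0.05 significance cutoff are assumptions, not part of the repo:

```python
import numpy as np
from scipy.stats import ks_2samp

def covariate_drift_check(baseline_v14, current_v14, alpha=0.05):
    """KS test on a single critical feature; fires when distributions differ."""
    stat, p_value = ks_2samp(baseline_v14, current_v14)
    return p_value < alpha, stat

rng = np.random.default_rng(0)
clean = rng.normal(0.0, 1.0, 5000)
drifted = clean + 3.0            # the +3.0σ shift the symbolic layer misses

fired, stat = covariate_drift_check(clean, drifted)
print(fired, stat)               # fires on the raw feature immediately
```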
Use a rolling fraud rate counter for prior drift. FIDIZ structurally cannot fire before W3. A label-based rate counter fires at W0.
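That counter is the simplest monitor in the stack. A sketch, with the window size and the 2x-baseline trigger as illustrative assumptions; note it is label-based, unlike everything else here:

```python
from collections import deque

class RollingFraudRate:
    """Label-based prior-drift monitor: fires when the recent fraud rate
    exceeds a multiple of the baseline rate."""

    def __init__(self, baseline_rate=0.0017, window=10_000, factor=2.0):
        self.baseline_rate = baseline_rate
        self.factor = factor
        self.labels = deque(maxlen=window)   # rolling window of recent labels

    def update(self, label):
        self.labels.append(int(label))
        rate = sum(self.labels) / len(self.labels)
        return rate > self.factor * self.baseline_rate, rate
```

Feed it confirmed labels (chargebacks, analyst decisions) as they arrive; it responds as soon as the rate jumps, with no warm-up windows required.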
Build the alert baseline immediately after training. Not after drift is suspected. Do it after training. If you wait, you have already lost your clean reference point. Save it alongside the checkpoint file.
One window of early warning is real. Whether it is one week or one day depends on your scoring cadence. For most production fraud teams, the difference between a scheduled retrain and an emergency one is measured in exactly those units.
Three Things That Will Catch You Using This Concept Drift Early Warning System
The 3-window blind period. FIDIZ has no history to work with for the first 3 windows after deployment. You are monitoring with RWSS and RFR only during that time. Plan for it explicitly.
Soft activations will silence PSI_rules. If your best checkpoint arrives when τ ≥ 1.0 (which happens whenever early stopping fires before training is complete), rule activations cluster near 0.5 and PSI_rules returns noise. Check τ at your saved checkpoint. In this experiment τ was still 3.5–4.0 at convergence. That is why PSI_rules was silent throughout.
Retrain means re-audit. This system is a fraud model retraining trigger, not a retrain replacement. After retraining, the rules change. V14 may no longer dominate, or new features may have entered. The compliance sign-off from the previous model does not carry forward. Build the audit into the retrain process, not as a step after, but as the step that closes the loop.
Closing
Three articles. One feature kept appearing.
Guiding Neural Networks with Domain Rules: I ignored it. How a Neural Network Learned Its Own Fraud Rules: The gradient found it. Article 3: When it broke, the symbolic layer noticed before the output layer did.
The experiment has a specific, honest scope: FIDI Z-Score detects concept drift in 5 of 5 seeds, sometimes one window before F1, never after it, entirely without labels. For covariate drift it is blind. For prior drift it is late. Those are not caveats added at the end to soften the claim. They are findings that tell you exactly where to use this and where not to.
A neuro-symbolic model gives you two channels. The MLP is better at prediction. The symbolic layer is better at knowing when prediction is about to go wrong. They are not redundant. They are watching different aspects of the same problem.
The MLP compensates. The symbolic layer cannot. That is its weakness. In this experiment, it turned out to also be its earliest warning.
Disclosure
This article is based on independent experiments using publicly available data (Kaggle Credit Card Fraud dataset, CC-0 Public Domain) and open-source tools (PyTorch, scikit-learn). No proprietary datasets, company resources, or confidential information were used. The results and code are fully reproducible as described. The views and conclusions expressed here are my own and do not represent any employer or organisation.
References
[1] Dal Pozzolo, A. et al. (2015). Calibrating Probability with Undersampling for Unbalanced Classification. IEEE SSCI. Dataset: https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud (CC-0)
[2] Alexander, E. P. (2026). Hybrid Neuro-Symbolic Fraud Detection: Guiding Neural Networks with Domain Rules. Towards Data Science. https://towardsdatascience.com/hybrid-neuro-symbolic-fraud-detection-guiding-neural-networks-with-domain-rules/
[3] Evans, R., & Grefenstette, E. (2018). Learning Explanatory Rules from Noisy Data. JAIR, 61, 1–64. https://arxiv.org/abs/1711.04574
[4] Wolfson, B., & Acar, E. (2024). Differentiable Inductive Logic Programming for Fraud Detection. arXiv:2410.21928. https://arxiv.org/abs/2410.21928
[5] Martins, J. L., Bravo, J., Gomes, A. S., Soares, C., & Bizarro, P. (2024). RIFF: Inducing Rules for Fraud Detection from Decision Trees. In RuleML+RR 2024. arXiv:2408.12989. https://arxiv.org/abs/2408.12989
[6] Xu, S., Walter, N. P., & Vreeken, J. (2024). Neuro-Symbolic Rule Lists. arXiv:2411.06428. https://arxiv.org/abs/2411.06428
[7] Paszke, A. et al. (2019). PyTorch: An Imperative Style, High-Performance Deep Learning Library. NeurIPS 32. https://pytorch.org
[8] Pedregosa, F. et al. (2011). Scikit-learn: Machine Learning in Python. JMLR, 12, 2825–2830. https://scikit-learn.org
[9] Gama, J. et al. (2014). A Survey on Concept Drift Adaptation. ACM Computing Surveys, 46(4). https://dl.acm.org/doi/10.1145/2523813
Code: https://github.com/Emmimal/neuro-symbolic-drift-detection
If you work with production models: what drift type worries you most? Concept drift where the patterns quietly change, covariate shift in your input features, or something else? I am curious what monitoring gaps people are actually running into in real deployments.