Reference

System Architecture

CHAPAL operates on a dual-layer detection system to balance speed and semantic understanding.

High-Level Flow

User Query

⬇

[Layer 1] Deterministic Guard (Local)

- Spike Detection (DDoS)

- Regex / Pattern Matching (PII, Keywords)

⬇ (if Safe)

Gemini 1.5 Flash (Generation)

- Generates initial response buffer

⬇

[Layer 2] Semantic Auditor (Groq / Llama 3.1)

- Contextual Analysis (Hallucinations, Tone)

- Medical/Psychological Evaluation

⬇

Decision Gate

➡ Safe: Deliver to User

➡ Anomaly: Block & Route to Admin

Layer 1, the Deterministic Guard, runs locally on the server before any model is called. It handles high-volume, pattern-based threats that can be identified without deep AI understanding.

Spike Detection: Blocks users who send more than 4 messages in rapid succession (Anti-DDoS).
Pattern Matching: Uses strict regular expressions to catch SSNs, credit card numbers, and known prompt-injection strings.
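The two Layer 1 checks above can be sketched roughly as follows. This is a minimal illustration, not CHAPAL's actual code: the function names, the 10-second window used to interpret "rapid succession", and the specific patterns are all assumptions. Only the "more than 4 messages" threshold comes from the description above.

```typescript
// Hypothetical sketch of the Layer 1 checks. Everything except the
// "more than 4 messages" threshold is an assumption for illustration.

const MAX_MESSAGES = 4;   // from the text: more than 4 messages counts as a spike
const WINDOW_MS = 10_000; // assumed interpretation of "rapid succession"

// In-memory, per-process store of recent message timestamps per user.
const recentMessages: Record<string, number[]> = {};

// Sliding-window check: records this message and reports whether the user
// has now sent more than MAX_MESSAGES within the last WINDOW_MS.
function isSpike(userId: string, now: number = Date.now()): boolean {
  const cutoff = now - WINDOW_MS;
  const timestamps = (recentMessages[userId] || []).filter((t) => t > cutoff);
  timestamps.push(now);
  recentMessages[userId] = timestamps;
  return timestamps.length > MAX_MESSAGES;
}

// Illustrative patterns only; a real deployment would maintain a longer list.
const PATTERNS: { name: string; pattern: RegExp }[] = [
  { name: "ssn", pattern: /\b\d{3}-\d{2}-\d{4}\b/ },
  // 13 to 19 digits, optionally separated by spaces or hyphens.
  { name: "creditCard", pattern: /\b(?:\d[ -]?){12,18}\d\b/ },
  { name: "promptInjection", pattern: /ignore (?:all )?(?:previous|prior) instructions/i },
];

// Returns the names of every pattern that matches the text.
function matchPatterns(text: string): string[] {
  return PATTERNS.filter((p) => p.pattern.test(text)).map((p) => p.name);
}

interface Layer1Result {
  safe: boolean;
  reasons: string[];
}

// Runs both checks; the query moves on to generation only if `safe` is true.
function layer1Check(userId: string, text: string, now?: number): Layer1Result {
  if (isSpike(userId, now)) {
    return { safe: false, reasons: ["spike"] };
  }
  const reasons = matchPatterns(text);
  return { safe: reasons.length === 0, reasons };
}
```

Because the timestamp store lives in process memory, this sketch would only hold up on a single server instance. A multi-instance deployment would presumably need a shared store instead.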

Layer 2, the Semantic Auditor, is powered by Llama 3.1 8B running on Groq LPU hardware. It "reads" the conversation the way a human auditor would review the chat.

Hallucination Check: Compares the Gemini response against known facts or internal context.
Tone Analysis: Detects subtle hostility or manipulation that regex misses.
Medical/Legal Safety: Flags responses in which the AI gives advice it should not provide (e.g., specific dosage instructions).
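One way the auditor's output could feed the Decision Gate is sketched below. The verdict schema, prompt wording, and function names are all hypothetical, and the call to the model on Groq is deliberately left out: the sketch covers only building the audit prompt, parsing the reply, and choosing between "Deliver to User" and "Block & Route to Admin".

```typescript
// Hypothetical sketch: the verdict schema, prompt wording, and names below
// are assumptions for illustration, not CHAPAL's actual implementation.

type Category = "hallucination" | "tone" | "medical_legal";

interface Verdict {
  safe: boolean;
  categories: Category[];
}

type Decision =
  | { action: "deliver"; response: string }
  | { action: "block"; routeTo: "admin"; categories: Category[] };

const KNOWN_CATEGORIES: Category[] = ["hallucination", "tone", "medical_legal"];

// Builds the instruction that would be sent to the auditor model.
function buildAuditPrompt(userQuery: string, draft: string): string {
  return [
    "You are auditing an AI chat response.",
    "Check for hallucinations, hostile or manipulative tone, and medical or legal advice.",
    'Reply with JSON only: {"safe": boolean, "categories": string[]}.',
    "User query: " + userQuery,
    "Draft response: " + draft,
  ].join("\n");
}

// Parses the auditor's raw reply. Malformed output is treated as an anomaly
// (fail closed), so a broken audit never lets a response through unchecked.
function parseVerdict(raw: string): Verdict {
  try {
    const data = JSON.parse(raw);
    if (typeof data.safe !== "boolean" || !Array.isArray(data.categories)) {
      return { safe: false, categories: [] };
    }
    const categories = KNOWN_CATEGORIES.filter(
      (c) => data.categories.indexOf(c) !== -1
    );
    // A reply that claims "safe" but still lists a category is not trusted.
    return { safe: data.safe && categories.length === 0, categories };
  } catch {
    return { safe: false, categories: [] };
  }
}

// Decision Gate: deliver safe responses, block and route anomalies to an admin.
function decisionGate(draft: string, verdict: Verdict): Decision {
  if (verdict.safe) {
    return { action: "deliver", response: draft };
  }
  return { action: "block", routeTo: "admin", categories: verdict.categories };
}
```

Failing closed on unparseable output is a design choice in this sketch, not something the description above specifies. It trades occasional false blocks for the guarantee that an unaudited response is never delivered.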

Next.js 15: Framework
Gemini: Chat Model
Groq: Inference Engine
Prisma: ORM / DB