Sycophancy Python Packages

gauntlet-cli

Behavioral reliability under pressure. Test how LLMs behave when things get hard.

1K 6 0

rho-eval

Behavioral auditing toolkit for LLMs — audit any model across 8 dimensions (factual, toxicity, bias, sycophancy, reasoning, refusal, deception, over-refusal) using teacher-forced confidence probes.

524 4 0

knowledge-fidelity

Compress LLMs while auditing whether they still know truth vs myths. SVD compression + false-belief detection in one toolkit.

328 4 0

relational-memory

A relationship-aware memory layer for LLM chatbots — models the relationship, not just facts

112 2 0