PeptCheck v8.6 safety-axis benchmark.
The multi-axis gate benchmark — 56 scored peptide-drug entries with 100.0% full agreement across axes.
PeptCheck v8.6 adds source-backed noncanonical positive controls without moving a threshold.
The current reference gate scores 56/56 on the expanded PeptCheck matrix: 100.0% calibration, 100.0% held-out validation, and 100.0% full benchmark agreement. v8.5 resolved lepirudin by separating commercial withdrawal status from label-documented anti-hirudin antibody/anaphylactoid concern; v8.6 (PeptCheck v1.9) promotes cetrorelix, ganirelix, icatibant, and difelikefalin to held-out positive controls with exact noncanonical-residue maps. No safety-axis threshold moved.
# v8.4 addition: plecanatide pediatric boxed-warning row
{
  "name": "plecanatide_pediatric_boxed_warning",
  "outcome": "pediatric_approved_with_boxed_warning",
  "sequence": "NDECELCVNVACTGCL",
  "structuredModifications": [
    {"kind": "cyclization", "form": "disulfide", "residues": [4, 12]},
    {"kind": "cyclization", "form": "disulfide", "residues": [7, 15]}
  ],
  "populationContext": {"ageBand": "under_6"},
  "endpointContext": "TRULANCE boxed warning: serious dehydration"
}
gi_tolerability_score = 0.48 > threshold 0.45
= baseline_class_b_nausea 0.18
+ pediatric_gc_c_dehydration_warning 0.18
+ compact GC-C secretagogue context
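The plecanatide row and its GI score can be sanity-checked with a few lines of Python. This is a minimal sketch, not the audited scorer: the function names are hypothetical, the row is trimmed to the fields the check reads, residue indices are assumed to be 1-based, and scores are held in hundredths so the comparison is exact. The value of the third term is not stated in the breakdown; the sketch only reports what the stated total implies for it.

```python
import json

# Row trimmed from the v8.4 addition; residue indices are assumed to be 1-based.
ROW = json.loads("""
{ "name": "plecanatide_pediatric_boxed_warning",
  "sequence": "NDECELCVNVACTGCL",
  "structuredModifications": [
    {"kind": "cyclization", "form": "disulfide", "residues": [4, 12]},
    {"kind": "cyclization", "form": "disulfide", "residues": [7, 15]}
  ] }
""")

# Scores in hundredths (0.48 -> 48) so all arithmetic is exact.
GI_SCORE = 48
GI_THRESHOLD = 45
STATED_TERMS = {
    "baseline_class_b_nausea": 18,
    "pediatric_gc_c_dehydration_warning": 18,
}


def disulfides_hit_cysteines(row):
    """Return True if every disulfide residue index points at a 'C' in the sequence."""
    seq = row["sequence"]
    for mod in row["structuredModifications"]:
        if mod.get("form") != "disulfide":
            continue
        for pos in mod["residues"]:
            if not 1 <= pos <= len(seq) or seq[pos - 1] != "C":
                return False
    return True


def implied_remainder(total, stated):
    """Return the part of the total not covered by the explicitly stated terms."""
    return total - sum(stated.values())


print(disulfides_hit_cysteines(ROW))             # True
print(GI_SCORE > GI_THRESHOLD)                   # True
print(implied_remainder(GI_SCORE, STATED_TERMS)) # 12
```

Under these assumptions, both disulfide bridges land on cysteines, the score clears the 0.45 threshold, and the two stated terms leave 0.12 attributable to the compact GC-C secretagogue context.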
# v8.5 taxonomy closure: lepirudin concern-bearing withdrawal
benchmark label: approved_then_withdrawn_with_documented_concern
reference gate verdict: PASS_with_documented_concern
reason: FDA/EMA anti-hirudin antibody + anaphylactoid evidence
withdrawal context: EMA commercial withdrawal, not safety-driven

Closed-loop stability and off-loop safety are separate biological questions. The reference gate keeps four Lyapunov axes separate from seven transverse safety axes so every refusal has a visible cause and every pass remains claim-bounded.
- Lyapunov — glucose, calcium-lowering, calcium-raising, and BP-raising controller stability.
- Safety — immunogenicity, cardiac, GI, renal, hepatic, CNS, and metabolic off-target scoring.
- Mechanism-fit — weak perturbation evidence is carried as a protocol-level check instead of being hidden inside a sequence score.
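One way to picture the axis separation is a gate that evaluates each axis on its own and reports the failing axes by name. The sketch below is illustrative only: `AxisResult`, `gate`, and the `PASS`/`REFUSE` strings are hypothetical rather than the reference gate's API, and it assumes a safety axis fails when its score exceeds its threshold. The mechanism-fit check is left out because it operates at the protocol level.

```python
from dataclasses import dataclass

# Axis names follow the list above; the identifiers themselves are hypothetical.
LYAPUNOV_AXES = ("glucose", "calcium_lowering", "calcium_raising", "bp_raising")
SAFETY_AXES = ("immunogenicity", "cardiac", "gi", "renal", "hepatic", "cns", "metabolic")


@dataclass(frozen=True)
class AxisResult:
    family: str   # "lyapunov" or "safety"
    axis: str
    passed: bool


def gate(lyapunov_stable, safety_scores, thresholds):
    """Evaluate each supplied axis independently and name every failing axis."""
    results = []
    for axis in LYAPUNOV_AXES:
        if axis in lyapunov_stable:
            results.append(AxisResult("lyapunov", axis, lyapunov_stable[axis]))
    for axis in SAFETY_AXES:
        if axis in safety_scores:
            # Assumption: a score above the threshold fails that axis.
            ok = safety_scores[axis] <= thresholds[axis]
            results.append(AxisResult("safety", axis, ok))
    causes = [f"{r.family}:{r.axis}" for r in results if not r.passed]
    return ("REFUSE" if causes else "PASS"), causes


# Only the GI threshold (0.45) and score (0.48) come from the benchmark text.
print(gate({"glucose": True}, {"gi": 0.48}, {"gi": 0.45}))
```

Because the verdict is assembled from per-axis results instead of a single blended score, a refusal always carries the list of axes that caused it.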
The safety axes are checked with native_decide to match the audited scorer: rebuilt in CI, not asserted. Three (metabolic, immunogenicity, CNS) prove the exact score in integer arithmetic; the four with continuous terms (cardiac, renal, hepatic, GI) prove the verdict via exact integer inequalities. Marquee rejections carry formal robustness guarantees: pasireotide (metabolic, 6 flipping edits), ecallantide (anaphylaxis-Kunitz, provably robust), ziconotide (CNS), caspofungin (hepatic), nesiritide (renal), and ulimorelin (motilin cardiac).

Running lake build in Packages/PathwayLean re-checks every receipt. Source: PathwayLean/Examples/SafetyAxis/*Reflection.lean.

Benchmark sources: experiments/peptcheck-v1/ and experiments/lyapunov-prototype/benchmark/. Every verdict is reproducible by running python3 experiments/peptcheck-v1/runner.py, and its robustness is reproducible by running python3 experiments/lyapunov-prototype/perturbation_robustness.py.
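To illustrate what proving a verdict "via exact integer inequalities" can look like, here is a minimal Lean 4 sketch. It is not taken from PathwayLean: the definition names are hypothetical, and the values are the plecanatide GI score and threshold scaled to hundredths.

```lean
-- Hypothetical names; values are 0.48 and 0.45 expressed in hundredths.
def giScoreHundredths : Nat := 48
def giThresholdHundredths : Nat := 45

-- The verdict stated as a decidable integer inequality and checked by evaluation.
example : giScoreHundredths > giThresholdHundredths := by native_decide
```

Scaling to integers keeps floating-point rounding out of the proof obligation; whether the real receipts use this scaling is an assumption here.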