Peptiter · DiscoveryLab

Find the drug combination that controls a disease — and prove it before the wet lab.

DiscoveryLab designs interventions across modalities — peptides, small molecules, designed proteins, and microbiome-mediated routes — then finds the combination that holds a disease variable in its target state across the full dosing cycle, even when every agent on its own falls short. Every result ships with a machine-checked stability proof and a pre-registered wet-lab experiment whose kill-criterion is named in advance — so a hypothesis is testable, not just plausible.

Strongest in peptide discovery — receptor-aware families distilled from a ~10¹³ sequence space — but the verification layer is modality-general, and combinations span all three paths.

Honest scope These are certified designs with documented surrogate efficacies — in-silico, not clinical claims. Every proof binds to an orthogonally measurable endpoint; one external wet-lab factorial is the validation gate.

See the combination controllers →Epigenome reprogramming Read the case studies

Verified controllers: 6 axes; Lean-checked, PK-aware
Search reduction: 10¹³ → 10²; peptide programs
PeptCheck: 100%; 56 scored

FIG-01 · DiscoveryLab · 5-workspace program viewObesity / GLP1R program

DiscoveryLab application showing comparative BioScout view with peptide candidate hypotheses, scores, and contextual tutor for an Obesity / GLP1R program

Why we're different

Most drug-AI asserts mechanism. We machine-check it.

Every competitor ships a model that scores and a story about why. We ship a proof. DiscoveryLab encodes its core claims in Lean 4 — the same proof assistant used to verify mathematics and safety-critical systems — and recompiles every one of them in CI. A claim that doesn't pass doesn't ship. This is the moat: not a better score, a checkable guarantee.

01 · Stability

Lyapunov-stable controllers

A drug acting on a disease variable is modeled as a feedback controller; Lean proves the closed loop descends — V̇(x) ≤ −ε‖x‖² for every state — in exact integer arithmetic. Generic 2-, 3-, and n-state framework.

02 · Combination

The cross-modal certificate

One theorem certifies a combination is stable at the dosing-cycle peak AND trough, in-band, below the over-suppression ceiling, and strictly better than any monotherapy. Each receipt also carries an 81-world ±15% potency × PK-trough robustness margin, so the verdict doesn't hinge on the surrogate numbers.

03 · Safety

Safety gates, proven faithful

Each safety axis (CNS, cardiac, hepatic, renal, GI, immunogenicity, metabolic) is re-encoded in Lean and proven to compute the exact same verdict as its audited scorer — pinned to the source hash, so drift breaks the build.

04 · Mechanism

Signed mechanism attribution

Mechanism graphs are checked path-by-path with signs, catching the cancellation-hides-signal failure an unsigned check misses — the formal backbone of every attributed claim.

6: verified combination controllers; C5a · IL-6 · glucose · neuro · cortisol · rabeximod-RA — each a CI-recompiled Lean receipt
7: safety axes proven to spec; Lean model = audited scorer, on every PeptCheck entry
0: proof holes; no sorry, no admit — stdlib-only Lean 4, fully discharged
CI: re-checked every commit; lake build recompiles every proof; a broken proof fails the build

The boundary is the product

A receipt proves the math is correct given the model and its documented surrogate constants — it is an in-silico certificate, not a clinical claim, and the only bridge to the clinic is a pre-registered wet-lab factorial with its kill-criterion named in advance. We state that line plainly because it is exactly what a serious reviewer checks — and being able to state it is the difference between a verified design and a generated one.

That same discipline runs at generation time: designed sequences are screened against configured select-agent and toxin deny-lists before any wet-lab handoff, hits are blocked, and the screen fails closed if its database cannot load — an operational, rank-only gate, not a comprehensive biorisk clearance.

On the roadmap

next receipts

The framework is general — new indications are new instantiations, not new mathematics. Receipts we are composing next:

Epigenome SIRT6 / NAD⁺ controller
Compose the peptide + NAD⁺ + reader-probe levers into a certified fidelity controller on the longevity axis.
Patient-twin controllers
Re-certify each receipt against individual PK/PD twins, not just the population-nominal plant.
More disease axes
The n-state framework generalizes the receipt to new indications — each new axis is a new instantiation, not new math.

See a verified controller →Open the safety-axis proofs

Next-level push: a complex-trained receptor inverse generator, learned receptor-first design, repair-peptide curation, and Mac-local model runtime.

DiscoveryLab now has a learned peptide generator behind the receptor-first lane: receptor-peptide PDB complex pairs train the sequence model, receptor context conditions candidate generation, and no-wet-lab runs can rehearse the feedback loop with clearly marked MockLab simulations.

BioFoundation adapter

8 providers

Biological foundation-model outputs now enter the evidence ledger as bounded scorecards.

Build 2 adds a BioFoundation Adapter Layer for deterministic, TranscriptFormer, Geneformer, scVI, AIDO.Cell, Arc State, AlphaGenome, and BioHub ESM-style provider contracts. It turns cell-state, protein, variant, perturbation, and execution-plan context into explicit scorecard JSON instead of untracked model notes.

experiments/biofoundation-adapter-v0/

Evidence graph bridge

8 claims

Build 1 can now ingest BioFoundation biological-context scorecards.

evidence_graph.py build accepts repeatable --biofoundation scorecards and creates biofoundation_context, calibration-state, organ-state, target-cell, protein, variant, perturbation, and execution-plan claims with benchmark next actions when outputs remain rank-only.

experiments/agentic-evidence-graph-v0/evidence_graph.py

Verified combinations

Lean ✓

Cross-domain combination therapies are now machine-checked as stable controllers.

Peptide + designed-protein + small-molecule combinations are certified as Lyapunov-stable feedback controllers that are control-adequate where no monotherapy is — proven in Lean (lake build, CI-recompiled), robustness-checked over an ±15% surrogate-error grid, and shipped with a pre-registered factorial. Three worked examples (C5a, IL-6, glucose); reproduce with peptiter-reproduce-combination.

experiments/multimodal-combination/METHOD.md

Complex model

GRU v2

A receptor-first inverse generator is now trained on complex labels.

receptor_inverse_complex_gru_v2 learns from 12,643 PepBDB receptor-peptide PDB complex pairs, conditions on receptor sequence and hotspot context, runs locally on Apple Silicon MPS, and emits learned receptor-first peptide hypotheses.

backbone/artifacts/weights/receptor_inverse_generator/receptor_inverse_complex_gru.pt

Current push

Learned RF

Receptor-first design can now consume learned generator artifacts.

The Swift lane still keeps the deterministic residue-preference fallback, but it can now ingest generated_candidates.json, rank model-backed RIG candidates, carry model provenance, and export assay-gated LabCandidate batches.

Packages/PeptiterDiscovery/Sources/PeptiterDiscovery/ReceptorFirstDesign.swift

Feedback gate

audited

Wet-lab retraining is wired and refuses simulated labels.

The feedback retraining path now looks for measured returned assay rows, writes an audit report, and deliberately skips model training when only MockLab, fixture, or absent labels are available.

backbone/artifacts/weights/wetlab_feedback_retraining/feedback_retraining_report.json

MockLab loop

270 rows

No-wet-lab dry runs now return random good/bad feedback.

The receptor-inverse candidates can flow through a MockLab simulator that emits simulated outcomes across 9 assay axes, then trains sandbox-only heads behind an explicit --allow-simulated gate.

backbone/artifacts/data_cache/mock_wetlab_feedback/lab_feedback.csv

Closed loop

R2 batch

Mock active learning now proposes and tests the next dry-run batch.

Round 1 MockLab outcomes rank the 30 receptor-inverse candidates, classify mock promote/revisit/reject decisions, generate 12 second-round analogs, and send them through a second simulated feedback/retraining pass.

backbone/artifacts/active_learning/mock_round2/mock_active_learning_round2.json

LabSpace bridge

CSV gate

Returned provider results now flow into the same guarded feedback schema.

LabSpace batches can export synthesis, purification, stability, binding, activity, and toxicity-style readouts into lab_feedback.csv. MockLab exports remain simulated-only, while non-MockLab provider rows become measured retraining inputs.

backbone/scripts/export_labspace_feedback.py

Loop audit

1 cmd

LabSpace feedback export and retraining gates now run together.

The new loop runner exports LabSpace returns, executes the measured-label retraining gate, optionally trains simulated sandbox heads, and writes a single promotion-status report so MockLab rehearsal cannot be confused with wet-lab evidence.

backbone/scripts/run_labspace_feedback_loop.py

Repair panel

5 seeds

Repair-peptide panel is now curated, scored, and claim-bounded.

GHK-Cu, MOTS-c for the NAD/SIRT axis, a BRC4-like RAD51 peptide, and a BLM DDR fragment now have sequence-level descriptor cards. IK14800 is kept as a metadata hold until a source-confirmed sequence is attached.

experiments/wetlab-prep/repair_peptide_panel.json

Seed library

4 + 1

Sequence-defined repair seeds entered the natural peptide registry.

The Swift library now registers GHK, MOTS-c, RAD51/BRC-like, and BLM fragment seeds, while IK14800 is explicitly blocked from analog generation because the repository has no primary sequence.

Packages/PeptiterDiscovery/Sources/PeptiterDiscovery/RepairPeptideSeedPanel.swift

Pathway KG

3 layers

Assertions are versioned, licensed, contextual, and claim-bounded.

The default registry tracks Reactome, GO-CAM, and Open Targets source releases, evidence namespaces, redistribution policy, layer/status counts, blocked claims, and linked executable islands.

model/list_world_model_sources

Body models

3 cards

Federated model cards replace broad whole-body claims.

Each card records context of use, calibration state, update cadence, uncertainty method, validation endpoints, governance boundaries, allowed claims, and blocked clinical claims.

model/body_twin_model_card

MCP tools

3 new

Agents can inspect sources, assertions, and model-card boundaries.

PeptiterModelMCPServer now exposes list_world_model_sources, inspect_pathway_assertions, and body_twin_model_card so external agents see the same governance contract as the tests.

Packages/PeptiterDiscovery/Sources/PeptiterDiscovery/MCP/

Branch

origin/main

Open issues

BioFoundation

adapter v0

Providers

8 contracts

Graph claims

8 new kinds

Validation

4 suites / 20 tests

Generator

complex-trained on MPS

Receptor-first

learned + fallback

Wet-lab loop

mock + measured gates

LabSpace bridge

exportable feedback

Loop audit

one-command gate

Mock labels

378 rows / 2 rounds

Round 2

12 analogs

Repair panel

4 scored + 1 hold

Verification

Swift + Python + Node + build

What Bio-JEPA means now

A trainable peptide backbone with runtime inference hooks, not just static roadmap copy.

BioFoundation bridge

The adapter layer now accepts cell atlas, protein sequence, regulatory variant, perturbation, and external execution-plan contexts and emits scorecards with provider provenance, calibration state, caveats, and bounded claim language.

Provider contracts

deterministic, transcriptformer, geneformer, scvi, aido-cell, arc-state, alphagenome, and biohub-esm are registered as explicit contracts so frontier biology priors can be planned or imported without pretending every provider has already run.

Evidence graph ingestion

BioFoundation scorecards now attach to candidate graphs beside ChemCheck-ESA, creating biological-context claims and benchmark next actions while blocking rank-only outputs from becoming go/no-go evidence.

Representation

ESM-2 embeddings remain the common 2560-dimensional candidate state consumed by task heads, benchmarks, and Swift scoring.

Training

The current run retrained six heads with 1024-wide MLPs, scaffold splits, leakage reports, PyTorch checkpoints, source and compiled Core ML artifacts, and repository predictions.

Selection

ActiveLearningBatchExporter can now delegate expected-improvement or upper-confidence-bound acquisition to a BoTorch Python bridge before exporting wet-lab batches.

Audit boundary

The site now separates infrastructure progress from biology claims: trained heads and pathway priors can prioritize candidates, but assay results still decide biological truth.

World model

Pathway assertions now carry source release, license, context, evidence, knowledge status, confidence, executable-island links, and explicit blocked claims.

Mac execution

Local scoring prefers committed .mlmodelc bundles, requests all available Core ML compute units, and shares one Float32 MLMultiArray across all loaded heads for a candidate embedding.

Repair panel

The requested DNA-repair and skin-repair peptides are now separated into scored sequence records and evidence holds, with assay gates for stability, permeability, toxicity, synthesis, pathway fit, and perturbation evidence.

Receptor-first

ReceptorHotspotMap, ResiduePreferenceMatrix, ReceptorConditionedPeptideDesigner, NegativeDesignPanel, and ReceptorFirstBatchExporter now turn a target interface into ranked peptide hypotheses with residue-level rationales.

Generative model

receptor_inverse_complex_gru_v2 trained on 12,643 receptor-peptide PDB complex pairs, beat a smoothed unigram amino-acid baseline by 71.8% on validation perplexity and 72.7% on test perplexity, and generated 30 exact-novel quality-filtered candidates.

Wet-lab loop

Measured feedback retraining now has an executable audit gate; it will train only from returned assay labels and currently records skipped_no_measured_labels instead of promoting simulated data.

No-wet-lab mode

MockLab now turns receptor-inverse candidates into 270 simulated good/bad assay rows and a sandbox-only retraining run, while the measured-feedback path still rejects those rows by default.

Closed loop

The mock active-learning planner ranks first-round outcomes, proposes 12 second-round analogs, simulates 108 more assay rows, and retrains a second sandbox model without touching measured-feedback claims.

LabSpace feedback

Provider-returned LabSpace batches can now be exported to the same backbone lab_feedback.csv contract, preserving MockLab simulation labels and letting real provider rows enter the measured retraining gate.

Feedback loop audit

A single command now chains LabSpace export, measured retraining, optional simulated sandbox retraining, and a promotion-status report with separate artifact directories for measured and simulated outputs.

BioFoundation Adapter Layer

Frontier biology priors are now ingested as scorecards with calibration boundaries.

The new adapter converts frozen-model or deterministic biological context into an auditable scorecard before the Agentic Evidence Graph is allowed to reason over it. The graph records provider provenance, calibration state, caveats, and next benchmark actions instead of treating model output as assay evidence.

Adapter

biofoundation-adapter-v0

Providers

deterministic + 7 external contracts

Contexts

organ · cell state · protein · variant · perturbation

Execution plans

TranscriptFormer and provider handoffs

Graph CLI

repeatable --biofoundation scorecards

Claim types

8 BioFoundation graph claims

Examples

liver · heart · immune · striatum · protein · variant

Validation

5 adapter tests + 6 graph tests

Artifacts: experiments/biofoundation-adapter-v0/, docs/BIOFOUNDATION_ADAPTER.md, and experiments/agentic-evidence-graph-v0/examples/mesdopetam_with_biofoundation_graph.json

Learned generator proof

receptor_inverse_complex_gru_v2 is a trained local generative model, not another rules-only fallback.

The proof run used 12,643 PepBDB receptor-peptide PDB complex pairs, trained on Apple Silicon MPS, selected epoch 12, reached 5.382 validation perplexity versus a 19.058 unigram baseline, and generated 30 exact-novel quality-filtered candidates across RAD51, GLP1R, and BRD4 contexts.

Complex pairs

12,643

Unique sequences

7,769

Best epoch

Val PPL

5.382

Val baseline

19.058

Test PPL

5.206

Test baseline

19.076

Exact novel

30 / 100%

Artifacts: backbone/artifacts/weights/receptor_inverse_generator/training_report.json and backbone/artifacts/weights/receptor_inverse_generator/generated_candidates.json

MockLab rehearsal

The no-wet-lab loop now produces explicit simulated feedback and sandbox retraining.

A dry run sampled random good/bad outcomes for all 30 receptor-inverse candidates across binding, activity, stability, permeability, synthesis, purification, aggregation, and toxicity-style gates. The normal retraining path skipped those rows; only the explicit sandbox run trained from them.

Candidates

Feedback rows

270

Task axes

Passed / failed

158 / 112

Sandbox heads

Dry-run promoted

Default retrain

skipped

Opt-in status

trained_simulated

Artifacts: backbone/artifacts/data_cache/mock_wetlab_feedback/, backbone/artifacts/weights/mock_wetlab_feedback_audit/, and backbone/artifacts/weights/mock_wetlab_feedback_retraining/

Active-learning loop

Round 2 now closes the dry-run feedback loop without claiming wet-lab evidence.

The planner scored the first 30 candidates from simulated feedback, kept five mock promotes, generated 12 exploit/explore analogs, ran MockLab again, and trained a second sandbox model from 108 additional simulated rows.

Ranked parents

Mock promote

Mock revisit

Mock reject

R2 analogs

R2 rows

108

R2 passed / failed

66 / 42

R2 sandbox heads

Artifacts: backbone/artifacts/active_learning/mock_round2/, backbone/artifacts/data_cache/mock_wetlab_feedback_round2/, and backbone/artifacts/weights/mock_wetlab_feedback_round2_retraining/

Governance updates

Area

Artifact

Status

Splits

scaffold leakage audit

clean

Negatives

synthetic-only provenance

marked

CoreML

6 source + compiled heads

ready

Pathway KG

release/license/context registry

valid

Body twins

VVUQ model cards

bounded

Receptor-first

learned generator artifact

wired

Wet-lab feedback

measured-label retraining gate

audited

LabSpace

provider-result feedback export

wired

Loop audit

export + retraining report

wired

BioFoundation

provider contracts + scorecard ingestion

wired

Biology priors

rank-only calibration next actions

bounded

MockLab

simulation-only feedback gate

sandboxed

Active learning

mock round-2 analog plan

closed

Repair peptides

5-entity claim-bound panel

processed

Optional heads

real-label gate

held

The earlier Phase 2 issue-closure push remains recorded: Branch origin/main; Issues closed #132-#136.

Task-head held-out metrics

Task

Test MAE

Corr

binding

0.477

0.065

solubility

0.005

0.999

stability

0.002

0.999

immunogenicity

0.082

0.556

aggregation

0.007

0.998

activity

0.263

0.321

Claim boundaries

-BioFoundation scorecards provide biological context, not proof of target engagement, efficacy, safety, pathogenicity, or clinical actionability.
-Planned external provider runs are execution plans only; imported or computed provider outputs remain rank-only until benchmark-calibrated with source-audited evidence.
-receptor_inverse_complex_gru_v2 is a newly trained receptor-peptide complex model, but its outputs are sequence hypotheses, not binding, activity, safety, or mechanism evidence.
-Wet-lab feedback retraining is implemented and audited, but no feedback model was trained because no measured returned-label rows exist in the cache.
-MockLab labels are random simulated dry-run outcomes for pipeline rehearsal only; they are excluded from measured retraining unless --allow-simulated is explicit.
-Mock active-learning round-2 candidates are analog proposals from simulated feedback, not measured optimization or evidence of improved biology.
-LabSpace export is an ingestion bridge: MockLab rows stay simulated, and non-MockLab rows still require assay context, provenance, and review before they support model claims.
-The LabSpace loop runner reports promotion status, but only a trained measured-feedback run can promote model claims; sandbox heads remain rehearsal artifacts.
-The pathway registry is a living, release-aware knowledge substrate with executable islands; it is not a complete quantitative simulator of all biology.
-Body-model support is a federation of process twins and executable islands with model cards; it is not clinical decision support or a continuously updated whole-body digital twin.
-KEGG, BioCyc, patient, EHR, and LabSpace-derived evidence remain source-isolated until explicit licensing, consent, and export rules permit use.
-The repo commits normalized training artifacts and provenance; upstream raw corpora remain download/cache inputs, not literal in-repo source mirrors.
-Synthetic shuffled negatives are marked synthetic-only and are not presented as experimental inactivity.
-Lower-is-better heads report held-out metrics in raw label space; app-facing scores still expose utility semantics.
-Optional translational heads stay unexported until real serum, protease, permeability, toxicity, synthesis, or purification labels exist.
-CoreML export now commits source .mlmodel artifacts plus precompiled .mlmodelc directories for local macOS; Swift falls back to compile-and-cache when needed.
-GHK-Cu copper coordination and BLM phosphorylation are modification claims; plain peptide sequence scores do not validate those modified states.
-IK14800 remains unscored until a primary amino-acid sequence and IL-12/UV-damage assay context are attached.
-Receptor-first design produces residue constraints and multiple hypotheses; it does not prove binding, activation, or off-target absence.

Overview graph

One story, seven pages: from biological intent to recursive evidence improvement.

DiscoveryLab is organized around a compact navigation spine: discovery inputs, product workflow, mechanism proof, recursive discovery, epigenome rescue, platform handoff, and competitor comparison. Each route preserves the detailed sections while giving readers a clear task-oriented path.

Node 01→

Condition

Start with phenotype, contraindications, pathway context, and testable therapeutic intent.

Open section →

Node 02→

Seeds

Move from natural parent peptides and evidence-backed source systems into bounded scaffold families.

Open section →

Node 03→

Search

Reduce combinatorial sequence space with receptor, motif, stability, synthesis, and family constraints.

Open section →

Node 04→

Workflow

Carry each program through Program, Discover, Evidence, Portfolio, and Handoff workspaces.

Open section →

Node 05→

Proof

Check pathway reachability, blockade, safety, conservation, perturbation support, and Lean receipts.

Open section →

Node 06↺

Learning loop

Turn lab returns, proof failures, reviewer corrections, and eval regressions into gated improvement proposals.

Open section →