Peptiter / DiscoveryLab
Peptiter · DiscoveryLab

Find the drug combination that controls a disease — and prove it before the wet lab.

DiscoveryLab designs interventions across modalities — peptides, small molecules, designed proteins, and microbiome-mediated routes — then finds the combination that holds a disease variable in its target state across the full dosing cycle, even when every agent on its own falls short. Every result ships with a machine-checked stability proof and a pre-registered wet-lab experiment whose kill-criterion is named in advance — so a hypothesis is testable, not just plausible.

Strongest in peptide discovery — receptor-aware families distilled from a ~10¹³ sequence space — but the verification layer is modality-general, and combinations span all three paths.

Honest scope These are certified designs with documented surrogate efficacies — in-silico, not clinical claims. Every proof binds to an orthogonally measurable endpoint; one external wet-lab factorial is the validation gate.

Verified controllers
6 axes
Lean-checked, PK-aware
Search reduction
10¹³ → 10²
peptide programs
PeptCheck
100%
56 scored
FIG-01 · DiscoveryLab · 5-workspace program viewObesity / GLP1R program
DiscoveryLab application showing comparative BioScout view with peptide candidate hypotheses, scores, and contextual tutor for an Obesity / GLP1R program
Why we're different

Most drug-AI asserts mechanism. We machine-check it.

Every competitor ships a model that scores and a story about why. We ship a proof. DiscoveryLab encodes its core claims in Lean 4 — the same proof assistant used to verify mathematics and safety-critical systems — and recompiles every one of them in CI. A claim that doesn't pass doesn't ship. This is the moat: not a better score, a checkable guarantee.

01 · Stability

Lyapunov-stable controllers

A drug acting on a disease variable is modeled as a feedback controller; Lean proves the closed loop descends — V̇(x) ≤ −ε‖x‖² for every state — in exact integer arithmetic. Generic 2-, 3-, and n-state framework.

02 · Combination

The cross-modal certificate

One theorem certifies a combination is stable at the dosing-cycle peak AND trough, in-band, below the over-suppression ceiling, and strictly better than any monotherapy. Each receipt also carries an 81-world ±15% potency × PK-trough robustness margin, so the verdict doesn't hinge on the surrogate numbers.

03 · Safety

Safety gates, proven faithful

Each safety axis (CNS, cardiac, hepatic, renal, GI, immunogenicity, metabolic) is re-encoded in Lean and proven to compute the exact same verdict as its audited scorer — pinned to the source hash, so drift breaks the build.

04 · Mechanism

Signed mechanism attribution

Mechanism graphs are checked path-by-path with signs, catching the cancellation-hides-signal failure an unsigned check misses — the formal backbone of every attributed claim.

6
verified combination controllers
C5a · IL-6 · glucose · neuro · cortisol · rabeximod-RA — each a CI-recompiled Lean receipt
7
safety axes proven to spec
Lean model = audited scorer, on every PeptCheck entry
0
proof holes
no sorry, no admit — stdlib-only Lean 4, fully discharged
CI
re-checked every commit
lake build recompiles every proof; a broken proof fails the build
The boundary is the product

A receipt proves the math is correct given the model and its documented surrogate constants — it is an in-silico certificate, not a clinical claim, and the only bridge to the clinic is a pre-registered wet-lab factorial with its kill-criterion named in advance. We state that line plainly because it is exactly what a serious reviewer checks — and being able to state it is the difference between a verified design and a generated one.

That same discipline runs at generation time: designed sequences are screened against configured select-agent and toxin deny-lists before any wet-lab handoff, hits are blocked, and the screen fails closed if its database cannot load — an operational, rank-only gate, not a comprehensive biorisk clearance.

On the roadmap
next receipts

The framework is general — new indications are new instantiations, not new mathematics. Receipts we are composing next:

  • Epigenome SIRT6 / NAD⁺ controller
    Compose the peptide + NAD⁺ + reader-probe levers into a certified fidelity controller on the longevity axis.
  • Patient-twin controllers
    Re-certify each receipt against individual PK/PD twins, not just the population-nominal plant.
  • More disease axes
    The n-state framework generalizes the receipt to new indications — each new axis is a new instantiation, not new math.
Latest results

Next-level push: a complex-trained receptor inverse generator, learned receptor-first design, repair-peptide curation, and Mac-local model runtime.

DiscoveryLab now has a learned peptide generator behind the receptor-first lane: receptor-peptide PDB complex pairs train the sequence model, receptor context conditions candidate generation, and no-wet-lab runs can rehearse the feedback loop with clearly marked MockLab simulations.

BioFoundation adapter
8 providers

Biological foundation-model outputs now enter the evidence ledger as bounded scorecards.

Build 2 adds a BioFoundation Adapter Layer for deterministic, TranscriptFormer, Geneformer, scVI, AIDO.Cell, Arc State, AlphaGenome, and BioHub ESM-style provider contracts. It turns cell-state, protein, variant, perturbation, and execution-plan context into explicit scorecard JSON instead of untracked model notes.

experiments/biofoundation-adapter-v0/
Evidence graph bridge
8 claims

Build 1 can now ingest BioFoundation biological-context scorecards.

evidence_graph.py build accepts repeatable --biofoundation scorecards and creates biofoundation_context, calibration-state, organ-state, target-cell, protein, variant, perturbation, and execution-plan claims with benchmark next actions when outputs remain rank-only.

experiments/agentic-evidence-graph-v0/evidence_graph.py
Verified combinations
Lean ✓

Cross-domain combination therapies are now machine-checked as stable controllers.

Peptide + designed-protein + small-molecule combinations are certified as Lyapunov-stable feedback controllers that are control-adequate where no monotherapy is — proven in Lean (lake build, CI-recompiled), robustness-checked over an ±15% surrogate-error grid, and shipped with a pre-registered factorial. Three worked examples (C5a, IL-6, glucose); reproduce with peptiter-reproduce-combination.

experiments/multimodal-combination/METHOD.md
Complex model
GRU v2

A receptor-first inverse generator is now trained on complex labels.

receptor_inverse_complex_gru_v2 learns from 12,643 PepBDB receptor-peptide PDB complex pairs, conditions on receptor sequence and hotspot context, runs locally on Apple Silicon MPS, and emits learned receptor-first peptide hypotheses.

backbone/artifacts/weights/receptor_inverse_generator/receptor_inverse_complex_gru.pt
Current push
Learned RF

Receptor-first design can now consume learned generator artifacts.

The Swift lane still keeps the deterministic residue-preference fallback, but it can now ingest generated_candidates.json, rank model-backed RIG candidates, carry model provenance, and export assay-gated LabCandidate batches.

Packages/PeptiterDiscovery/Sources/PeptiterDiscovery/ReceptorFirstDesign.swift
Feedback gate
audited

Wet-lab retraining is wired and refuses simulated labels.

The feedback retraining path now looks for measured returned assay rows, writes an audit report, and deliberately skips model training when only MockLab, fixture, or absent labels are available.

backbone/artifacts/weights/wetlab_feedback_retraining/feedback_retraining_report.json
MockLab loop
270 rows

No-wet-lab dry runs now return random good/bad feedback.

The receptor-inverse candidates can flow through a MockLab simulator that emits simulated outcomes across 9 assay axes, then trains sandbox-only heads behind an explicit --allow-simulated gate.

backbone/artifacts/data_cache/mock_wetlab_feedback/lab_feedback.csv
Closed loop
R2 batch

Mock active learning now proposes and tests the next dry-run batch.

Round 1 MockLab outcomes rank the 30 receptor-inverse candidates, classify mock promote/revisit/reject decisions, generate 12 second-round analogs, and send them through a second simulated feedback/retraining pass.

backbone/artifacts/active_learning/mock_round2/mock_active_learning_round2.json
LabSpace bridge
CSV gate

Returned provider results now flow into the same guarded feedback schema.

LabSpace batches can export synthesis, purification, stability, binding, activity, and toxicity-style readouts into lab_feedback.csv. MockLab exports remain simulated-only, while non-MockLab provider rows become measured retraining inputs.

backbone/scripts/export_labspace_feedback.py
Loop audit
1 cmd

LabSpace feedback export and retraining gates now run together.

The new loop runner exports LabSpace returns, executes the measured-label retraining gate, optionally trains simulated sandbox heads, and writes a single promotion-status report so MockLab rehearsal cannot be confused with wet-lab evidence.

backbone/scripts/run_labspace_feedback_loop.py
Repair panel
5 seeds

Repair-peptide panel is now curated, scored, and claim-bounded.

GHK-Cu, MOTS-c for the NAD/SIRT axis, a BRC4-like RAD51 peptide, and a BLM DDR fragment now have sequence-level descriptor cards. IK14800 is kept as a metadata hold until a source-confirmed sequence is attached.

experiments/wetlab-prep/repair_peptide_panel.json
Seed library
4 + 1

Sequence-defined repair seeds entered the natural peptide registry.

The Swift library now registers GHK, MOTS-c, RAD51/BRC-like, and BLM fragment seeds, while IK14800 is explicitly blocked from analog generation because the repository has no primary sequence.

Packages/PeptiterDiscovery/Sources/PeptiterDiscovery/RepairPeptideSeedPanel.swift
Pathway KG
3 layers

Assertions are versioned, licensed, contextual, and claim-bounded.

The default registry tracks Reactome, GO-CAM, and Open Targets source releases, evidence namespaces, redistribution policy, layer/status counts, blocked claims, and linked executable islands.

model/list_world_model_sources
Body models
3 cards

Federated model cards replace broad whole-body claims.

Each card records context of use, calibration state, update cadence, uncertainty method, validation endpoints, governance boundaries, allowed claims, and blocked clinical claims.

model/body_twin_model_card
MCP tools
3 new

Agents can inspect sources, assertions, and model-card boundaries.

PeptiterModelMCPServer now exposes list_world_model_sources, inspect_pathway_assertions, and body_twin_model_card so external agents see the same governance contract as the tests.

Packages/PeptiterDiscovery/Sources/PeptiterDiscovery/MCP/
Branch
origin/main
Open issues
0
BioFoundation
adapter v0
Providers
8 contracts
Graph claims
8 new kinds
Validation
4 suites / 20 tests
Generator
complex-trained on MPS
Receptor-first
learned + fallback
Wet-lab loop
mock + measured gates
LabSpace bridge
exportable feedback
Loop audit
one-command gate
Mock labels
378 rows / 2 rounds
Round 2
12 analogs
Repair panel
4 scored + 1 hold
Verification
Swift + Python + Node + build
What Bio-JEPA means now

A trainable peptide backbone with runtime inference hooks, not just static roadmap copy.

BioFoundation bridge

The adapter layer now accepts cell atlas, protein sequence, regulatory variant, perturbation, and external execution-plan contexts and emits scorecards with provider provenance, calibration state, caveats, and bounded claim language.

Provider contracts

deterministic, transcriptformer, geneformer, scvi, aido-cell, arc-state, alphagenome, and biohub-esm are registered as explicit contracts so frontier biology priors can be planned or imported without pretending every provider has already run.

Evidence graph ingestion

BioFoundation scorecards now attach to candidate graphs beside ChemCheck-ESA, creating biological-context claims and benchmark next actions while blocking rank-only outputs from becoming go/no-go evidence.

Representation

ESM-2 embeddings remain the common 2560-dimensional candidate state consumed by task heads, benchmarks, and Swift scoring.

Training

The current run retrained six heads with 1024-wide MLPs, scaffold splits, leakage reports, PyTorch checkpoints, source and compiled Core ML artifacts, and repository predictions.

Selection

ActiveLearningBatchExporter can now delegate expected-improvement or upper-confidence-bound acquisition to a BoTorch Python bridge before exporting wet-lab batches.

Audit boundary

The site now separates infrastructure progress from biology claims: trained heads and pathway priors can prioritize candidates, but assay results still decide biological truth.

World model

Pathway assertions now carry source release, license, context, evidence, knowledge status, confidence, executable-island links, and explicit blocked claims.

Mac execution

Local scoring prefers committed .mlmodelc bundles, requests all available Core ML compute units, and shares one Float32 MLMultiArray across all loaded heads for a candidate embedding.

Repair panel

The requested DNA-repair and skin-repair peptides are now separated into scored sequence records and evidence holds, with assay gates for stability, permeability, toxicity, synthesis, pathway fit, and perturbation evidence.

Receptor-first

ReceptorHotspotMap, ResiduePreferenceMatrix, ReceptorConditionedPeptideDesigner, NegativeDesignPanel, and ReceptorFirstBatchExporter now turn a target interface into ranked peptide hypotheses with residue-level rationales.

Generative model

receptor_inverse_complex_gru_v2 trained on 12,643 receptor-peptide PDB complex pairs, beat a smoothed unigram amino-acid baseline by 71.8% on validation perplexity and 72.7% on test perplexity, and generated 30 exact-novel quality-filtered candidates.

Wet-lab loop

Measured feedback retraining now has an executable audit gate; it will train only from returned assay labels and currently records skipped_no_measured_labels instead of promoting simulated data.

No-wet-lab mode

MockLab now turns receptor-inverse candidates into 270 simulated good/bad assay rows and a sandbox-only retraining run, while the measured-feedback path still rejects those rows by default.

Closed loop

The mock active-learning planner ranks first-round outcomes, proposes 12 second-round analogs, simulates 108 more assay rows, and retrains a second sandbox model without touching measured-feedback claims.

LabSpace feedback

Provider-returned LabSpace batches can now be exported to the same backbone lab_feedback.csv contract, preserving MockLab simulation labels and letting real provider rows enter the measured retraining gate.

Feedback loop audit

A single command now chains LabSpace export, measured retraining, optional simulated sandbox retraining, and a promotion-status report with separate artifact directories for measured and simulated outputs.

BioFoundation Adapter Layer

Frontier biology priors are now ingested as scorecards with calibration boundaries.

The new adapter converts frozen-model or deterministic biological context into an auditable scorecard before the Agentic Evidence Graph is allowed to reason over it. The graph records provider provenance, calibration state, caveats, and next benchmark actions instead of treating model output as assay evidence.

Adapter
biofoundation-adapter-v0
Providers
deterministic + 7 external contracts
Contexts
organ · cell state · protein · variant · perturbation
Execution plans
TranscriptFormer and provider handoffs
Graph CLI
repeatable --biofoundation scorecards
Claim types
8 BioFoundation graph claims
Examples
liver · heart · immune · striatum · protein · variant
Validation
5 adapter tests + 6 graph tests
Artifacts: experiments/biofoundation-adapter-v0/, docs/BIOFOUNDATION_ADAPTER.md, and experiments/agentic-evidence-graph-v0/examples/mesdopetam_with_biofoundation_graph.json
Learned generator proof

receptor_inverse_complex_gru_v2 is a trained local generative model, not another rules-only fallback.

The proof run used 12,643 PepBDB receptor-peptide PDB complex pairs, trained on Apple Silicon MPS, selected epoch 12, reached 5.382 validation perplexity versus a 19.058 unigram baseline, and generated 30 exact-novel quality-filtered candidates across RAD51, GLP1R, and BRD4 contexts.

Complex pairs
12,643
Unique sequences
7,769
Best epoch
12
Val PPL
5.382
Val baseline
19.058
Test PPL
5.206
Test baseline
19.076
Exact novel
30 / 100%
Artifacts: backbone/artifacts/weights/receptor_inverse_generator/training_report.json and backbone/artifacts/weights/receptor_inverse_generator/generated_candidates.json
MockLab rehearsal

The no-wet-lab loop now produces explicit simulated feedback and sandbox retraining.

A dry run sampled random good/bad outcomes for all 30 receptor-inverse candidates across binding, activity, stability, permeability, synthesis, purification, aggregation, and toxicity-style gates. The normal retraining path skipped those rows; only the explicit sandbox run trained from them.

Candidates
30
Feedback rows
270
Task axes
9
Passed / failed
158 / 112
Sandbox heads
9
Dry-run promoted
7
Default retrain
skipped
Opt-in status
trained_simulated
Artifacts: backbone/artifacts/data_cache/mock_wetlab_feedback/, backbone/artifacts/weights/mock_wetlab_feedback_audit/, and backbone/artifacts/weights/mock_wetlab_feedback_retraining/
Active-learning loop

Round 2 now closes the dry-run feedback loop without claiming wet-lab evidence.

The planner scored the first 30 candidates from simulated feedback, kept five mock promotes, generated 12 exploit/explore analogs, ran MockLab again, and trained a second sandbox model from 108 additional simulated rows.

Ranked parents
30
Mock promote
5
Mock revisit
17
Mock reject
8
R2 analogs
12
R2 rows
108
R2 passed / failed
66 / 42
R2 sandbox heads
9
Artifacts: backbone/artifacts/active_learning/mock_round2/, backbone/artifacts/data_cache/mock_wetlab_feedback_round2/, and backbone/artifacts/weights/mock_wetlab_feedback_round2_retraining/
Governance updates
Area
Artifact
Status
Splits
scaffold leakage audit
clean
Negatives
synthetic-only provenance
marked
CoreML
6 source + compiled heads
ready
Pathway KG
release/license/context registry
valid
Body twins
VVUQ model cards
bounded
Receptor-first
learned generator artifact
wired
Wet-lab feedback
measured-label retraining gate
audited
LabSpace
provider-result feedback export
wired
Loop audit
export + retraining report
wired
BioFoundation
provider contracts + scorecard ingestion
wired
Biology priors
rank-only calibration next actions
bounded
MockLab
simulation-only feedback gate
sandboxed
Active learning
mock round-2 analog plan
closed
Repair peptides
5-entity claim-bound panel
processed
Optional heads
real-label gate
held
The earlier Phase 2 issue-closure push remains recorded: Branch origin/main; Issues closed #132-#136.
Task-head held-out metrics
Task
Test MAE
Corr
binding
0.477
0.065
solubility
0.005
0.999
stability
0.002
0.999
immunogenicity
0.082
0.556
aggregation
0.007
0.998
activity
0.263
0.321
Claim boundaries
  • -BioFoundation scorecards provide biological context, not proof of target engagement, efficacy, safety, pathogenicity, or clinical actionability.
  • -Planned external provider runs are execution plans only; imported or computed provider outputs remain rank-only until benchmark-calibrated with source-audited evidence.
  • -receptor_inverse_complex_gru_v2 is a newly trained receptor-peptide complex model, but its outputs are sequence hypotheses, not binding, activity, safety, or mechanism evidence.
  • -Wet-lab feedback retraining is implemented and audited, but no feedback model was trained because no measured returned-label rows exist in the cache.
  • -MockLab labels are random simulated dry-run outcomes for pipeline rehearsal only; they are excluded from measured retraining unless --allow-simulated is explicit.
  • -Mock active-learning round-2 candidates are analog proposals from simulated feedback, not measured optimization or evidence of improved biology.
  • -LabSpace export is an ingestion bridge: MockLab rows stay simulated, and non-MockLab rows still require assay context, provenance, and review before they support model claims.
  • -The LabSpace loop runner reports promotion status, but only a trained measured-feedback run can promote model claims; sandbox heads remain rehearsal artifacts.
  • -The pathway registry is a living, release-aware knowledge substrate with executable islands; it is not a complete quantitative simulator of all biology.
  • -Body-model support is a federation of process twins and executable islands with model cards; it is not clinical decision support or a continuously updated whole-body digital twin.
  • -KEGG, BioCyc, patient, EHR, and LabSpace-derived evidence remain source-isolated until explicit licensing, consent, and export rules permit use.
  • -The repo commits normalized training artifacts and provenance; upstream raw corpora remain download/cache inputs, not literal in-repo source mirrors.
  • -Synthetic shuffled negatives are marked synthetic-only and are not presented as experimental inactivity.
  • -Lower-is-better heads report held-out metrics in raw label space; app-facing scores still expose utility semantics.
  • -Optional translational heads stay unexported until real serum, protease, permeability, toxicity, synthesis, or purification labels exist.
  • -CoreML export now commits source .mlmodel artifacts plus precompiled .mlmodelc directories for local macOS; Swift falls back to compile-and-cache when needed.
  • -GHK-Cu copper coordination and BLM phosphorylation are modification claims; plain peptide sequence scores do not validate those modified states.
  • -IK14800 remains unscored until a primary amino-acid sequence and IL-12/UV-damage assay context are attached.
  • -Receptor-first design produces residue constraints and multiple hypotheses; it does not prove binding, activation, or off-target absence.
Overview graph

One story, seven pages: from biological intent to recursive evidence improvement.

DiscoveryLab is organized around a compact navigation spine: discovery inputs, product workflow, mechanism proof, recursive discovery, epigenome rescue, platform handoff, and competitor comparison. Each route preserves the detailed sections while giving readers a clear task-oriented path.