REACH-Ab | Dynamic Interface Edge Field

Key Scientific Question

Can epitope contacts guide paratope formation before the paratope geometry is reliable?

Epitope-conditioned antibody design is not simply CDR loop generation. The generated CDRs must become an antibody-like paratope whose contact pattern is compatible with a specified antigenic surface.

The core difficulty is temporal: many generators build cross-interface edges from the current provisional CDR geometry. Antigen information is therefore used, but often only after the model has already proposed a paratope hypothesis. REACH-Ab asks whether interface contacts can become active generative variables earlier, while the paratope is still being formed.

Core Innovation

Dynamic Interface Edge Field

REACH-Ab treats cross-interface edges as learnable, evolving relation variables rather than passive kNN or distance-threshold links. Candidate epitope-paratope relations can exist before the current geometry makes them obvious, accumulate evidence across refinement rounds, and feed selected relation states back into sequence and structure updates.

Half-edge handshake Two interface sides propose compatible relation evidence.

Edge memory Candidate relations keep history across generation steps.

Lifecycle scoring Relations can be born, maintained, suppressed, or marked uncertain.

Preparing structure view...

Scientific Problem

Epitope guidance lag: when the right information arrives after the generative decision has begun

The central claim of REACH-Ab is not that previous methods ignore the antigen. The sharper problem is that antigen information often reaches CDR generation through edges induced by a provisional paratope geometry. This makes epitope guidance posterior to the current CDR hypothesis, even though the epitope should help organize that hypothesis from the beginning.

Core contradiction

Paratope formation needs epitope contacts for guidance, but epitope contacts are often constructed from a paratope geometry that has not yet been reliably formed.

1. The task

Design is interface organization, not only loop plausibility

Without a target epitope, many CDR sequences and conformations can look locally reasonable. With a target epitope, the model must generate a paratope that realizes a binding-compatible cross-interface contact pattern. A plausible CDR that misses the antigen-facing organization is still a failure for epitope-conditioned design.

2. The hidden assumption

Nodes are persistent, while interface edges are often derived

Many geometric generators maintain residues, atoms, frames, or coordinates as the main evolving objects. Cross-interface edges are then rebuilt from current distances, kNN neighborhoods, pairwise scores, or attention-like links. These edges can pass antigen information, but they remain consequences of the current node hypothesis rather than independent contact hypotheses.

3. The failure mode

Early geometry becomes the gatekeeper of epitope information

When the CDR is still uncertain, the model most needs epitope contacts to constrain generation. Yet if contact edges are induced only from provisional geometry, then early local shape decides which epitope residues are visible, active, and able to influence the next update. Useful future contacts can be delayed or excluded before the paratope has had a fair chance to form.

4. The consequence

Generation becomes path-dependent

Incorrect but geometrically close contacts can be reinforced, while correct but not-yet-close contacts may be ignored. Later refinement may still improve local structure, but the search has already been narrowed by early provisional geometry. This is why final loop plausibility can coexist with weak contact recovery, high interface RMSD, or low DockQ.

5. The evidence

The problem must be observed along the trajectory

Because epitope guidance lag is a timing problem, final metrics alone are not enough. The process should reveal whether correct contacts are visible early, whether they become active edges early, whether they emerge before the final geometry is settled, and whether wrong active edges persist across refinement.

6. The missing capability

Edges need anticipation, persistence, and feedback

A useful interface edge should be able to participate before the paratope is stable, keep memory across refinement steps, and carry relation content back into node updates. Conventional dynamic graphs may change over time, but if every change is still induced by current node geometry, the edges remain reactive rather than generative.

Node-centric timing

Generate or update CDR nodes → induce interface edges from provisional geometry → pass antigen information.

REACH-Ab timing

Maintain candidate interface edges early → update edge states with memory and compatibility → feed selected edge states back into generation.

Result

Quantitative evidence from the main paper and supplementary experiments

The tables below collect the experimental result tables from the manuscript and supplementary material. Bold cells mark the best result in the corresponding comparison, while underlined cells mark the second-best result when reported in the paper.

Main Paper

Task-level performance

CDR-H3 generation on RAbD

Generation quality and interface-sensitive evaluation for epitope-conditioned CDR-H3 design.

Model	Generation			Interface
Model	AAR ↑	TM-score ↑	LDDT ↑	CAAR ↑	RMSD ↓	DockQ ↑
RosettaAb	32.14%	0.9416	0.8002	16.79%	17.7735	0.197
HERN	31.59%	--	--	20.09%	11.1451	0.057
DiffAb	26.46%	0.6087	0.5665	20.07%	19.1944	0.119
MEAN	33.50%	0.7855	0.5616	22.11%	19.4989	0.132
ADesigner	34.40%	0.6321	0.5154	21.90%	10.7468	0.225
dyMEAN	35.13%	0.8579	0.6668	20.24%	11.5814	0.325
dyAb	35.91%	0.8593	0.6557	22.45%	10.8222	0.330
REACH-Ab	40.76%	0.9787	0.8750	25.31%	7.7024	0.431

Interpretation

CDR-H3 is the most variable antigen-facing loop, so this benchmark tests whether a model can improve local loop generation while also forming the correct antigen-facing interface. Several baselines recover plausible H3 structures, but their CAAR and DockQ remain limited, showing that a reasonable loop shape can still miss the intended contact pattern.

REACH-Ab leads both the generation metrics and the interface-sensitive metrics. This is the key point: the edge field does not simply trade local CDR quality for docking quality. Instead, active interface hypotheses constrain H3 refinement inside an antigen-aware contact space, which improves sequence, structure, and interface organization at the same time.

Antibody-antigen complex structure prediction

Pipeline-based methods are separated from end-to-end deep learning models.

Method	TM-score ↑	LDDT ↑	RMSD ↓	DockQ ↑
IgFold->Hdock	0.9706	0.5398	17.65	0.228
IgFold->HERN	0.9719	0.5488	16.16	0.222
dyMEAN	0.8919	0.6366	10.97	0.392
dyAb	0.9079	0.6610	10.91	0.412
REACH-Ab	0.9698	0.8593	9.18	0.475

Interpretation

Complex prediction separates antibody fold recovery from antibody-antigen assembly. The IgFold-based pipelines maintain strong TM-score, yet their LDDT, RMSD, and DockQ are much weaker, indicating that a correct antibody fold alone does not guarantee a correct binding interface.

REACH-Ab achieves the strongest local and interface-sensitive scores among the compared methods. This supports the timing argument behind the method: candidate interface edges are maintained during refinement, selected edge states are fed back into node updates, and unstable contact hypotheses can be suppressed before they distort the final complex.

Affinity optimization under mutation budgets

Mutation budgets are 1, 2, 4, 8, and all mutable positions.

Metric	Model	1	2	4	8	All
ΔΔG ↓	MEAN	0.00	0.00	-0.16	-3.29	-4.01
	ADesigner	-0.89	-1.46	-1.9558	-2.47	-2.64
	dyMEAN	-3.01	-3.59	-3.7599	-4.05	-4.95
	dyAb	-3.15	-4.74	-4.819	-5.19	-5.76
	REACH-Ab	-3.25	-3.43	-3.63	-3.70	-3.81
ΔL	MEAN	0.00	0.00	0.75	7.91	10.89
	ADesigner	0.78	1.18	3.54	7.17	10.24
	dyMEAN	0.97	1.53	3.45	6.55	9.02
	dyAb	0.91	1.75	3.45	7.79	8.81
	REACH-Ab	1.00	1.51	2.67	5.49	8.80

Interpretation

Affinity optimization is a different regime from generation or complex prediction. It rewards useful deviation from an existing complex, while still requiring the mutated antibody to preserve a credible binding interface. For this reason, both predicted affinity change and mutation aggressiveness matter.

REACH-Ab is competitive under the one-mutation budget, but its affinity gain saturates as the budget increases and its mutation distance remains comparatively conservative. This exposes a useful boundary of the current mechanism: persistent edge states are strong at preserving coherent interface geometry, but affinity search may require controlled exploration away from the original contact basin.

Mechanism inset from edge-field-guided generation

Process-level metrics compare REACH-Ab with a geometry-induced kNN baseline.

Metric	Baseline	REACH-Ab	Gain
CVG ↑	44.5%	100.0%	+55.5 pp
CAG ↑	44.5%	60.6%	+16.0 pp
CEL ↑	71.6%	81.0%	+9.4 pp
FPRR ↓	67.1%	47.5%	+19.6 pp

Interpretation

Final structure scores show whether the output is good, but they do not reveal when epitope guidance becomes available during generation. This inset therefore evaluates the process itself: whether contact evidence is visible early, becomes active early, emerges earlier across refinement, and avoids persistent false-positive reinforcement.

REACH-Ab improves contact visibility, contact activation, and cumulative contact emergence while also reducing false-positive persistence. This combination is important: the broad edge field is not merely adding more noisy edges. It keeps useful candidates available, selects a better active subset, and prevents unsupported contacts from dominating message passing.

Key ablation on epitope-conditioned CDR-H3 generation

Variants remove edge-field components while keeping parameter scale comparable where possible.

Generation metrics

Variant	AAR ↑	TM-score ↑	LDDT ↑
REACH-Ab	40.76%	0.9787	0.8750
w/o edge field	38.75%	0.9731	0.8665
w/o broad space	38.36%	0.9271	0.8604
w/o half-edge	39.02%	0.9363	0.8101
w/o edge feedback	38.52%	0.9565	0.8105

Interface metrics

Variant	CAAR ↑	RMSD ↓	DockQ ↑
REACH-Ab	25.31%	7.7024	0.431
w/o edge field	23.37%	8.3241	0.422
w/o broad space	24.16%	8.9556	0.407
w/o half-edge	23.34%	9.1534	0.398
w/o edge feedback	24.01%	10.2957	0.326

Interpretation

The full model performs best across both generation and interface metrics, which indicates that the improvement comes from the edge-field mechanism as a system rather than from a single isolated score adjustment. Removing the full edge field mainly weakens interface organization, even when some structure metrics remain strong, so the lost capability is not just backbone plausibility but antigen-aware contact formation.

The component ablations clarify what each part contributes. Removing the broad candidate space weakens geometry-sensitive metrics, showing that early interface search cannot be reduced to current-distance neighborhoods. Removing half-edge probing lowers LDDT and DockQ, suggesting endpoint compatibility is needed for stable interface geometry. Removing edge-state feedback gives the sharpest DockQ and RMSD degradation, which means selected edges must carry compatibility, uncertainty, and memory into message passing rather than serving only as binary links.

Supplementary Material

Extended result tables

Seed sensitivity analysis

Mean and standard deviation across seeds for epitope-conditioned CDR-H3 generation.

Model	Generation mean ± std			Interface mean ± std
Model	AAR ↑	TM-score ↑	LDDT ↑	CAAR ↑	RMSD ↓	DockQ ↑
RosettaAb	32.14%	0.9416	0.8002	16.79%	17.7735	0.1968
HERN	31.64% ± 0.62%	--	--	18.21% ± 1.53%	10.49 ± 0.34	0.051 ± 0.004
DiffAb	29.32% ± 2.33%	0.6359 ± 0.0181	0.5692 ± 0.0057	18.71% ± 1.32%	17.74 ± 0.92	0.137 ± 0.013
MEAN	35.22% ± 0.91%	0.8035 ± 0.0140	0.5554 ± 0.0144	20.94% ± 0.73%	18.28 ± 0.85	0.112 ± 0.011
ADesigner	34.23% ± 1.78%	0.6508 ± 0.0101	0.5042 ± 0.0066	20.73% ± 1.75%	10.11 ± 0.45	0.246 ± 0.012
dyMEAN	35.52% ± 0.39%	0.8554 ± 0.0035	0.6618 ± 0.0055	20.05% ± 0.12%	11.77 ± 0.16	0.317 ± 0.005
dyAb	36.22% ± 0.41%	0.8836 ± 0.0240	0.6662 ± 0.0151	23.22% ± 0.56%	10.21 ± 0.46	0.329 ± 0.010
REACH-Ab	39.67% ± 0.62%	0.9724 ± 0.0079	0.8608 ± 0.0088	24.28% ± 0.79%	7.990 ± 0.17	0.424 ± 0.008

Interpretation

This table separates average performance from seed sensitivity. A method that improves only under a favorable initialization would show large variance, especially on docking-oriented metrics where small H3 geometry changes can strongly affect interface quality.

REACH-Ab remains best on the mean generation metrics and on the interface metrics, with modest standard deviations. This stability is important for the mechanism claim: the improvement is not a lucky final correction, but a repeatable effect of constraining refinement through persistent interface hypotheses.

Six-CDR co-design

Per-CDR AAR and global structure/interface metrics when all CDRs are generated together.

Metric	Region	dyMEAN	dyAb	REACH-Ab
AAR ↑	H1	56.32%	48.68%	70.44%
	H2	42.39%	32.86%	63.57%
	H3	33.51%	23.50%	39.69%
	L1	42.91%	19.29%	60.68%
	L2	45.86%	41.69%	77.27%
	L3	42.43%	20.11%	42.31%
TM-score ↑	--	0.8868	0.6667	0.9639
LDDT ↑	--	0.7849	0.6234	0.8341
DockQ ↑	--	0.4192	0.2416	0.4428

Interpretation

Six-CDR co-design is harder than single-H3 generation because the model must organize a full paratope instead of one dominant loop. As all CDRs become mutable, antigen-conditioned information has to be distributed across multiple loops that may compete for interface contacts.

REACH-Ab improves sequence recovery on five of the six CDRs and remains comparable on L3, while also achieving the best TM-score, LDDT, and DockQ. The result suggests that the dynamic edge field does not overfit to H3-specific behavior. It can maintain a shared set of epitope-paratope contact hypotheses and use selected edge states to coordinate the whole paratope during refinement.

Trajectory-level metrics for epitope guidance lag

Definitions used to quantify when native-like contact evidence becomes visible and active.

Metric	Meaning	Reported quantity	Direction
CVG	Contact visibility	\(V_{\mathrm{R}}^0 - V_{\mathrm{K}}^0\)	Higher is better
CAG	Contact fraction	\(A_{\mathrm{R}}^0 - A_{\mathrm{K}}^0\)	Higher is better
CEL	Cumulative contact emergence lead	\(\frac{1}{T}\sum_t(E_{\mathrm{R}}^t - E_{\mathrm{K}}^t)\)	Higher is better
FPRR	False active-edge persistence rate	Method-specific rate	Lower is better

How to read this table

These metrics turn epitope guidance lag into observable process-level quantities. CVG asks whether correct contact evidence is available in the candidate field at the beginning of refinement; CAG asks whether such evidence is already selected into the active graph and can transmit messages; CEL measures whether useful contacts appear earlier across the trajectory.

FPRR is the complementary failure check. A broad candidate field would be risky if it simply reinforced incorrect active edges, so lower FPRR indicates that REACH-Ab can keep candidate contacts visible without forcing noisy contacts to persist. Together, the four metrics evaluate timing, activation, and stability rather than only final output quality.

Edge-state intervention study

All variants keep selected topology fixed and modify only edge attributes used in message passing.

Variant	AAR ↑	CAAR ↑	TM-score ↑	LDDT ↑	RMSD ↓	DockQ ↑
Full REACH-Ab	40.76%	25.31%	0.9787	0.8750	7.7024	0.431
Topology only	36.24%	20.75%	0.9431	0.8617	15.1983	0.156
Shuffled edge state	38.54%	22.77%	0.9429	0.8696	10.6439	0.357
Random edge state	33.19%	22.64%	0.9560	0.8684	9.7629	0.380

Interpretation

This intervention isolates whether REACH-Ab benefits only from selecting a better active topology, or whether the learned edge states themselves carry generative information. All variants keep the selected active edge set fixed and modify only the edge attributes passed into message passing.

The topology-only setting produces the largest diagnostic drop, with DockQ falling from 0.431 to 0.156 and RMSD increasing from 7.70 to 15.20. Shuffled and random edge states also degrade performance, showing that useful feedback requires edge states that are learned and matched to the active contact hypothesis they represent. In other words, REACH-Ab is not only choosing which residues may interact; it is also passing dynamic relation content back into sequence and structure generation.

Mechanism Figures

How REACH-Ab turns epitope guidance into evolving interface relations

These figures summarize the scientific story behind the viewer: REACH-Ab first defines the antibody design task at the complex level, then maintains candidate epitope-paratope relations as dynamic states, and finally feeds selected relation states back into local message passing.

Framework of REACH-Ab with input initialization, global graph, local graph, iterative edge-node co-evolution, and outputs — Framework Overview

From masked CDR residues to generated antibody sequence and complex structure

The framework initializes masked CDR residues from an antibody-antigen complex, builds global and local graphs, and iteratively couples SE(3) message passing with a local Dynamic Interface Edge Field. The key idea is that candidate interface edges are explored, updated, and selected before they guide local coordinate refinement and sequence generation.

Schematic of the Dynamic Interface Edge Field moving from candidate edge exploration to stable interface edge formation — Core Concept

A dense relation field becomes a sparse active interface graph

REACH-Ab maintains many possible epitope-paratope contacts instead of committing only to current geometric neighbors. During refinement, half-edge evidence and lifecycle control let some relations stabilize, some compete, and some die out. This is the mechanism that reduces epitope guidance lag: antigen-facing contact hypotheses remain available before the paratope geometry is fully reliable.

Detailed implementation flow of candidate edge construction, half-edge probing, dynamic memory, lifecycle heads, active edge selection, and message passing — Implementation Flow

Candidate edges carry memory, compatibility, uncertainty, and lifecycle signals

At each refinement step, residue states and anchors construct a directed candidate edge pool. Pair-level features are updated through source-side and destination-side half-edge probing, then fused into persistent edge memory. Distance, occupancy, birth, death, persistence, and uncertainty heads produce an active edge score; selected edge states then become attributes in local message passing and are carried to the next refinement step.

Interactive Mechanism Playground

Feel the epitope guidance lag before reading about it

This toy generation sandbox turns the central mechanism into something visitors can manipulate. The left side rebuilds interface edges only from the current node geometry, while the right side maintains a broader edge field with memory and compatibility evidence before selecting active message-passing edges.

Early coordinates control geometry-kNN edges immediately; REACH-Ab keeps candidate contact hypotheses available while compatibility and memory evidence accumulates.

Process-Level Readout

Edges should guide nodes before nodes are reliable.

Early CDR geometry is noisy, so nearest-neighbor edges can expose the wrong antigen contacts. The edge field keeps native-like candidates visible until they can become active edges.

Native contact visibility -

Active native edges -

False-positive reinforcement -

Guidance timing -

CDR geometry uncertainty Edge memory strength Show faint candidate field

Edge-Field-Guided Generation

Watching contact hypotheses form during generation

The main-paper mechanism claim is temporal: REACH-Ab should expose useful epitope-paratope contact hypotheses before the final CDR geometry is settled. These 57 trajectories compare a geometry-kNN proxy with the REACH-Ab edge field for the same benchmark cases shown in the PDB viewer.

Animated comparison of geometry-kNN proxy edges and REACH-Ab edge-field edges for 4FQJ — 4FQJ edge-field trajectory

The left panel reconstructs edges after provisional node geometry appears. The right panel shows REACH-Ab maintaining and activating candidate edge states during refinement, so contact evidence can participate while the paratope is still being formed.

Mechanism Reading

What the trajectory is meant to prove

If epitope guidance lag is real, the important evidence is not only the final contact map. The process should show whether native-like contacts become visible, active, and stable early enough to guide the next node update.

Contact visibility Whether native epitope-paratope pairs remain accessible in the candidate field.

Contact activation Whether useful candidate edges become message-passing edges during refinement.

Contact emergence Whether stable contacts arise before the final paratope geometry is fully determined.

False-positive control Whether unsupported geometry-induced edges are prevented from reinforcing wrong contacts.

The gray dashed edges on the left are reactive: geometry first, edge afterward. The purple edges on the right are generative variables: candidate relations accumulate compatibility, memory, lifecycle, and uncertainty evidence, then feed selected edge states back into CDR sequence and structure updates.

Paper to Code

Method equations linked to their implementation

This section merges the main-paper REACH-Ab method with the supplementary Model Architecture and Implementation Details and Theoretical Analysis. Each formula below is clickable: selecting it opens the concrete PyTorch implementation used by the current codebase.

Architecture Thread

REACH-Ab starts from a broad directed epitope-paratope candidate pool, encodes pair geometry and endpoint states, injects two-sided half-edge evidence, stores candidate-level memory, predicts lifecycle and uncertainty signals, then selects a sparse active interface graph for local message passing.

Theoretical Thread

The supplementary analysis explains why this matters: a broad candidate space preserves early contact accessibility, vector edge states are more expressive than scalar scores, half-edge probing generalizes one-sided scoring, and memory allows delayed activation from repeated weak evidence.

Implementation

Epitope guidance lag: when the right information arrives after the generative decision has begun

Design is interface organization, not only loop plausibility

Nodes are persistent, while interface edges are often derived

Early geometry becomes the gatekeeper of epitope information

Generation becomes path-dependent

The problem must be observed along the trajectory

Edges need anticipation, persistence, and feedback

Quantitative evidence from the main paper and supplementary experiments

Task-level performance

CDR-H3 generation on RAbD

Antibody-antigen complex structure prediction

Affinity optimization under mutation budgets

Mechanism inset from edge-field-guided generation

Key ablation on epitope-conditioned CDR-H3 generation

Generation metrics

Interface metrics

Extended result tables

Seed sensitivity analysis

Six-CDR co-design

Trajectory-level metrics for epitope guidance lag

Edge-state intervention study

How REACH-Ab turns epitope guidance into evolving interface relations

From masked CDR residues to generated antibody sequence and complex structure

A dense relation field becomes a sparse active interface graph

Candidate edges carry memory, compatibility, uncertainty, and lifecycle signals

Feel the epitope guidance lag before reading about it

Step 0: uncertain paratope hypothesis

Watching contact hypotheses form during generation

4FQJ edge-field trajectory

Method equations linked to their implementation

Architecture Thread

Theoretical Thread