Computational Drug Discovery

This application area focuses on using computational methods to design, prioritize, and optimize therapeutic candidates—proteins, small molecules, and binders—before they reach the wet lab. It integrates structure prediction, virtual screening, and generative design to explore vast chemical and structural spaces far more quickly than traditional experimental workflows. By predicting protein structures (including hard-to-resolve or intrinsically disordered proteins) and modeling their conformations, these tools enable more rational target selection and structure-based design when experimental data are missing or incomplete. For organizations in biopharma and adjacent sectors, this dramatically compresses early R&D timelines, reduces the number of physical experiments required, and increases the probability of finding viable hits and leads. AI and physics-based models work together to propose and prioritize candidate molecules or miniprotein binders, guide synthesis planning, and improve virtual screening hit rates. The result is faster, cheaper, and more targeted discovery pipelines that expand the druggable target space and de‑risk investment in new therapeutic programs.

The Problem

“Accelerate computational drug discovery for dual-use biosecurity and medical countermeasure programs”

Organizations face these key challenges:

Limited or missing structural information for priority targets

Biological evidence is fragmented across sequence, assay, omics, literature, and internal notebooks

Virtual screening pipelines produce too many false positives without integrated ranking

Generative design outputs are hard to constrain for synthesizability, developability, and safety

Notebook-heavy workflows are difficult to standardize, reproduce, and govern

Regulatory expectations for AI model updates and lifecycle controls are hard to operationalize

Cross-functional teams lack a shared platform connecting computational predictions to experimental validation

High compute demand for structure prediction, docking, and model training complicates deployment

Impact When Solved

Reduce target-to-hit cycle time by 30-60% through structure inference and virtual prioritizationCut low-value wet-lab experiments by focusing on top-ranked compounds, binders, and targetsIncrease hit rates in virtual screening by combining learned representations with docking and ADMET filtersExpand discovery against structurally unresolved or intrinsically disordered targetsCreate traceable evidence packages for regulatory, quality, and program review boardsSupport rapid response pipelines for emerging biological threats and medical countermeasure programs

The Shift

Before AI~85% Manual

Human Does

•Select targets based on limited structural/functional evidence; commission or run structure determination
•Design molecules manually using SAR intuition; triage ideas in meetings
•Plan syntheses with chemists and CROs; manage queues and rework when routes fail
•Interpret assay data and decide next compounds across slow iteration loops

Automation

•Rule-based filtering (Lipinski/PAINS), basic docking on a single structure, and spreadsheet-driven prioritization
•Simple QSAR models trained on small internal datasets with limited generalization

With AI~75% Automated

Human Does

•Define program constraints (mechanism, ADMET/Tox, selectivity, safety/handling, manufacturing constraints, mission timelines)
•Review AI-proposed candidate lists and rationales; approve a small set for synthesis and testing
•Design focused experiments to validate binding/mechanism and feed results back to models (active learning)

AI Handles

•Predict target structures and conformational ensembles (including hard-to-resolve/disordered regions) for docking and design
•Run large-scale virtual screening and rank candidates using ML + physics scoring; flag liabilities (reactivity, tox risks, off-targets)
•Generate novel molecules/miniprotein binders optimized for binding + constraints; propose synthesizable routes and alternatives
•Continuously learn from assay outcomes to refine prioritization and suggest next-round compounds (closed-loop optimization)

Operating Intelligence

How Computational Drug Discovery runs once it is live

AI runs the first three steps autonomously.

Humans own every decision.

The system gets smarter each cycle.

Confidence91%

ArchetypeRecommend & Decide

Shape6-step converge

Human gates1

Autonomy

67%AI controls 4 of 6 steps

Who is in control at each step

Each column marks the operating owner for that step. AI-led actions sit above the divider, human decisions and feedback loops sit below it.

Loop shapeconverge

Step 1

Assemble Context

Step 2

Analyze

Step 3

Recommend

Step 4

Human Decision

Step 5

Execute

Step 6

Feedback

AI lead

Autonomous execution

1AI

2AI

3AI

5AI

gate

Human lead

Approval, override, feedback

4Human

6↺ Loop

AI-led step

Human-controlled step

Feedback loop

TL;DR

AI handles assembly, analysis, and execution. The human gate sits at the decision point. Every cycle refines future recommendations.

The Loop

6 steps

1AI

Assemble Context

Combine the relevant records, signals, and constraints.

instant

2AI

Analyze

Evaluate options, risk, and likely outcomes.

instant

3AI

Recommend

Present a ranked recommendation with supporting rationale.

instant

4Human checkpoint

Human Decision

A human accepts, edits, or rejects the recommendation.

hours to days

Authority gates · 1

The system must not authorize synthesis, testing, or advancement of a therapeutic candidate without named scientist approval [S6][S8][S12].

Why this step is human

The decision carries real-world consequences that require professional judgment and accountability.

5AI

Execute

Carry out the approved action in the operating workflow.

instant

6Feedback

Feedback

Outcome data improves future recommendations.

continuous

1 operating angles mapped

Operational Depth

Technologies

Technologies commonly used in Computational Drug Discovery implementations:

drugforge-spectrumOther

2 mentions

Key Players

Companies actively working on Computational Drug Discovery solutions:

AI drug discovery firms Insilico Medicine Recursion BenevolentAI Schrödinger, Inc.

+2 more companies(sign up to see all)

Real-World Use Cases

Functional cellular validation of YTHDC2 inhibition in disease-relevant models

After confirming a molecule binds the target, researchers test whether it changes the target's RNA-control job inside cells and whether that reduces harmful cell behavior.

causal mechanism validation in cellular systemspreclinical cellular validation.

10.0

Predetermined Change Control Plan for AI/ML-enabled medical device software

A manufacturer tells regulators in advance what AI software updates it expects to make, how it will test them, and how it will keep the device safe after release.

governed model lifecycle managementproposed regulatory workflow described in fda draft guidance; not final policy in the source.

10.0

Predetermined postmarket update planning for AI-enabled devices

The maker plans ahead for how the AI device may be updated after launch, so improvements can happen in a controlled way instead of causing surprises.

Adaptive predictive system with controlled update loopemerging but concrete regulatory operating model, reinforced by final fda guidance on predetermined change control plans.

10.0

Low-code molecular graph preprocessing and feature extraction from SMILES

Give the system a text recipe for a molecule, and it turns it into a machine-readable graph with useful chemistry features.

structured data transformation for graph ML inputsproposed and implemented as reusable library functionality; suitable as a building block in downstream applications.

10.0

Protein structure inference from sequence and fitness data

The toolkit can work from protein sequence and fitness information to infer structures, helping researchers study targets even when experimental structures are limited.

Biological inference from sequence and fitness signals to structural hypotheses.deployed as a named capability within a pre-alpha toolkit.

10.0

+7 more use cases(sign up to see all)