Computational Drug Discovery

This application area focuses on using computational methods to design, prioritize, and optimize therapeutic candidates—proteins, small molecules, and binders—before they reach the wet lab. It integrates structure prediction, virtual screening, and generative design to explore vast chemical and structural spaces far more quickly than traditional experimental workflows. By predicting protein structures (including hard-to-resolve or intrinsically disordered proteins) and modeling their conformations, these tools enable more rational target selection and structure-based design when experimental data are missing or incomplete. For organizations in biopharma and adjacent sectors, this dramatically compresses early R&D timelines, reduces the number of physical experiments required, and increases the probability of finding viable hits and leads. AI and physics-based models work together to propose and prioritize candidate molecules or miniprotein binders, guide synthesis planning, and improve virtual screening hit rates. The result is faster, cheaper, and more targeted discovery pipelines that expand the druggable target space and de‑risk investment in new therapeutic programs.

The Problem

Accelerate computational drug discovery for dual-use biosecurity and medical countermeasure programs

Organizations face these key challenges:

1

Limited or missing structural information for priority targets

2

Biological evidence is fragmented across sequence, assay, omics, literature, and internal notebooks

3

Virtual screening pipelines produce too many false positives without integrated ranking

4

Generative design outputs are hard to constrain for synthesizability, developability, and safety

5

Notebook-heavy workflows are difficult to standardize, reproduce, and govern

6

Regulatory expectations for AI model updates and lifecycle controls are hard to operationalize

7

Cross-functional teams lack a shared platform connecting computational predictions to experimental validation

8

High compute demand for structure prediction, docking, and model training complicates deployment

Impact When Solved

Reduce target-to-hit cycle time by 30-60% through structure inference and virtual prioritizationCut low-value wet-lab experiments by focusing on top-ranked compounds, binders, and targetsIncrease hit rates in virtual screening by combining learned representations with docking and ADMET filtersExpand discovery against structurally unresolved or intrinsically disordered targetsCreate traceable evidence packages for regulatory, quality, and program review boardsSupport rapid response pipelines for emerging biological threats and medical countermeasure programs

The Shift

Before AI~85% Manual

Human Does

  • Select targets based on limited structural/functional evidence; commission or run structure determination
  • Design molecules manually using SAR intuition; triage ideas in meetings
  • Plan syntheses with chemists and CROs; manage queues and rework when routes fail
  • Interpret assay data and decide next compounds across slow iteration loops

Automation

  • Rule-based filtering (Lipinski/PAINS), basic docking on a single structure, and spreadsheet-driven prioritization
  • Simple QSAR models trained on small internal datasets with limited generalization
With AI~75% Automated

Human Does

  • Define program constraints (mechanism, ADMET/Tox, selectivity, safety/handling, manufacturing constraints, mission timelines)
  • Review AI-proposed candidate lists and rationales; approve a small set for synthesis and testing
  • Design focused experiments to validate binding/mechanism and feed results back to models (active learning)

AI Handles

  • Predict target structures and conformational ensembles (including hard-to-resolve/disordered regions) for docking and design
  • Run large-scale virtual screening and rank candidates using ML + physics scoring; flag liabilities (reactivity, tox risks, off-targets)
  • Generate novel molecules/miniprotein binders optimized for binding + constraints; propose synthesizable routes and alternatives
  • Continuously learn from assay outcomes to refine prioritization and suggest next-round compounds (closed-loop optimization)

Operating Intelligence

How Computational Drug Discovery runs once it is live

AI runs the first three steps autonomously.

Humans own every decision.

The system gets smarter each cycle.

Confidence91%
ArchetypeRecommend & Decide
Shape6-step converge
Human gates1
Autonomy
67%AI controls 4 of 6 steps

Who is in control at each step

Each column marks the operating owner for that step. AI-led actions sit above the divider, human decisions and feedback loops sit below it.

Loop shapeconverge

Step 1

Assemble Context

Step 2

Analyze

Step 3

Recommend

Step 4

Human Decision

Step 5

Execute

Step 6

Feedback

AI lead

Autonomous execution

1AI
2AI
3AI
5AI
gate

Human lead

Approval, override, feedback

4Human
6 Loop
AI-led step
Human-controlled step
Feedback loop
TL;DR

AI handles assembly, analysis, and execution. The human gate sits at the decision point. Every cycle refines future recommendations.

The Loop

6 steps

1 operating angles mapped

Operational Depth

Technologies

Technologies commonly used in Computational Drug Discovery implementations:

Key Players

Companies actively working on Computational Drug Discovery solutions:

+2 more companies(sign up to see all)

Real-World Use Cases

Functional cellular validation of YTHDC2 inhibition in disease-relevant models

After confirming a molecule binds the target, researchers test whether it changes the target's RNA-control job inside cells and whether that reduces harmful cell behavior.

causal mechanism validation in cellular systemspreclinical cellular validation.
10.0

Predetermined Change Control Plan for AI/ML-enabled medical device software

A manufacturer tells regulators in advance what AI software updates it expects to make, how it will test them, and how it will keep the device safe after release.

governed model lifecycle managementproposed regulatory workflow described in fda draft guidance; not final policy in the source.
10.0

Predetermined postmarket update planning for AI-enabled devices

The maker plans ahead for how the AI device may be updated after launch, so improvements can happen in a controlled way instead of causing surprises.

Adaptive predictive system with controlled update loopemerging but concrete regulatory operating model, reinforced by final fda guidance on predetermined change control plans.
10.0

Low-code molecular graph preprocessing and feature extraction from SMILES

Give the system a text recipe for a molecule, and it turns it into a machine-readable graph with useful chemistry features.

structured data transformation for graph ML inputsproposed and implemented as reusable library functionality; suitable as a building block in downstream applications.
10.0

Protein structure inference from sequence and fitness data

The toolkit can work from protein sequence and fitness information to infer structures, helping researchers study targets even when experimental structures are limited.

Biological inference from sequence and fitness signals to structural hypotheses.deployed as a named capability within a pre-alpha toolkit.
10.0
+7 more use cases(sign up to see all)

Free access to this report