1

Dataset Overview

9,992
DTSM Compounds Profiled
96
Mean EGFR Activity
101
Mean PKCα Activity
95
Mean CDK2 Activity
899
CDK2 Very High Activity (≥150)
1,064
CDK2-Selective Compounds

The DTSM project compound library has been profiled across three kinase assays: EGFR (Epidermal Growth Factor Receptor), PKCα (Protein Kinase C alpha), and CDK2 (Cyclin-Dependent Kinase 2). All assay values are reported as percentage activity where higher values indicate greater potency. The dataset covers 9,992 unique DTSM compounds, each fully profiled in all three assays. CDK2 shows the broadest dynamic range (min: -3, max: 1130), confirming strong and diverse hit matter within this series.

2

Activity Distributions Across Assays

Compound Count by Activity Bin
Activity Profile (% of Compounds)
Molecular Weight vs CDK2 Activity
cLogP vs CDK2 Activity
AssayMeanMedianStd DevMinMax Inactive (<80)Low (80–100)Moderate (100–120)High (≥120)
EGFR 96.498.517.5 -0.3196.6 1,253 (12.5%)4,170 (41.7%)4,112 (41.2%)457 (4.6%)
PKCα 101.5102.537.6 -215.01346.6 1,461 (14.6%)2,932 (29.3%)4,106 (41.1%)1,493 (14.9%)
CDK2 95.487.047.7 -3.41129.9 3,529 (35.3%)3,822 (38.3%)1,153 (11.5%)1,488 (14.9%)
3

Structure–Activity Relationship Trends

Molecular Weight Trend  (r = +0.13)

There is a modest positive trend between molecular weight and CDK2 activity. Compounds in the 350–550 Da range show the highest median CDK2 activity (~90–104 units) compared to lighter molecules (<250 Da, median 82.6). This suggests that additional molecular complexity generally benefits CDK2 engagement, though the top CDK2 hits span a wide MW range (200–593 Da), indicating MW alone is not the primary driver.

Lipophilicity (cLogP) Trend  (r = +0.14)

A mild positive correlation exists between cLogP and CDK2 activity. Compounds with cLogP 4–6 show median CDK2 of 91.0, and >6 show 92.3, versus 84.4 for the 0–2 range. This is consistent with increased hydrophobic contacts in the CDK2 ATP-binding pocket, though many highly potent CDK2 hits carry cLogP < 2 — indicating hydrogen-bond-rich scaffolds can also achieve excellent CDK2 activity.

Polar Surface Area  (r = –0.02)

No significant correlation was observed between PSA and CDK2 activity, suggesting the CDK2 binding mode accommodates a wide polarity range. Several drug-like top hits fall in the 50–110 Ų window, which is expected to support good cell permeability.

⚠ EGFR vs CDK2 Orthogonality  (r = +0.12)

Very low correlation between CDK2 and EGFR activity confirms that EGFR inhibition is largely independent of CDK2 activity across this library. CDK2-active hits are not systematically EGFR-active, making selectivity readily achievable. Only 19 compounds (0.19%) were pan-active across all three assays.

MW Bins vs CDK2 Activity
MW RangeNMedian CDK2Mean CDK2Trend
<250 Da2,35882.686.4
250–350 Da3,81486.893.8
350–450 Da2,34790.3100.4
450–550 Da1,00889.6104.4
>550 Da46592.3110.1
cLogP Bins vs CDK2 Activity
cLogP RangeNMedian CDK2Trend
<075987.1
0–22,13784.4
2–43,97685.4
4–62,30491.0
>681692.3
4

Selectivity Landscape Analysis

Compounds were classified using a threshold of ≥120 activity units per assay, representing the top ~15% per assay.

1,064
CDK2-Selective
CDK2 ≥120, others <120
360
EGFR-Selective
EGFR ≥120, others <120
1,094
PKCα-Selective
PKCα ≥120, others <120
78
CDK2 + EGFR
Both ≥120, PKCα <120
327
CDK2 + PKCα
Both ≥120, EGFR <120
19
Pan-Active
All three ≥120
7,050
Low Activity
All assays <120
Selectivity Profile Distribution

CDK2 Selectivity Summary

1,064 compounds (10.6%) are CDK2-selective with no significant EGFR or PKCα cross-reactivity — a strong and drug-friendly hit rate. The near-equal proportion of CDK2-selective and PKCα-selective compounds, combined with only 19 pan-active (0.19%), confirms that the DTSM series has excellent capacity for kinase selectivity optimisation.

5

Top 10 Diverse CDK2-Selective Compounds for Progression

Selected via a two-stage algorithm: (1) ranking by combined CDK2 activity + selectivity score, then (2) greedy maximum-diversity filter using Morgan fingerprint Tanimoto similarity (threshold <0.40), ensuring structural diversity. Only compounds with CDK2 ≥ 120 and MW ≤ 700 Da were considered (1,476 candidates).

# Structure Compound ID & Properties CDK2 Activity EGFR Activity PKCα Activity Selectivity
(CDK2 – max off-target)
CDK2 / Avg Ratio
1
DTSM99-0004386 SPO-24560
MW: 593 Da  |  cLogP: -0.07
PSA: 245.4 Ų  |  HBD: 8  |  HBA: 12
C23H36N4O10S2
1129.9 88.8 73.7 1041.1 13.91×
2
DTSM96-01346 SPO-23153
MW: 334 Da  |  cLogP: 4.45
PSA: 52.4 Ų  |  HBD: 0  |  HBA: 3
C15H12BrNO3
997.5 89.7 108.2 889.3 10.08×
3
DTSM96-02653 SPO-22692
MW: 268 Da  |  cLogP: 1.91
PSA: 57.6 Ų  |  HBD: 1  |  HBA: 4
C15H16N4O
631.3 80.8 92.6 538.7 7.28×
4
DTSM96-02990 SPO-25840
MW: 408 Da  |  cLogP: 5.57
PSA: 25.8 Ų  |  HBD: 0  |  HBA: 2
C20H13IN2
509.8 88.9 115.3 394.5 4.99×
5
DTSM96-00430 SPO-25007
MW: 313 Da  |  cLogP: -2.21
PSA: 109.2 Ų  |  HBD: 0  |  HBA: 6
C12H9ClN2O4S
474.5 103.4 71.0 371.1 5.44×
6
DTSM96-00887 SPO-27974
MW: 481 Da  |  cLogP: 6.76
PSA: 54.0 Ų  |  HBD: 0  |  HBA: 5
C31H28O5
455.8 83.2 93.8 362.1 5.15×
7
DTSM96-02525 SPO-26017
MW: 478 Da  |  cLogP: 5.30
PSA: 68.3 Ų  |  HBD: 0  |  HBA: 4
C30H16F2O4
434.8 76.8 117.9 316.9 4.47×
8
DTSM99-0004490 SPO-21263
MW: 200 Da  |  cLogP: 1.29
PSA: 63.3 Ų  |  HBD: 2  |  HBA: 2
C9H10ClNO2
404.7 119.5 59.7 285.2 4.52×
9
DTSM96-08784 SPO-28741
MW: 482 Da  |  cLogP: 8.53
PSA: 37.0 Ų  |  HBD: 2  |  HBA: 2
C32H23N3S
385.5 100.3 104.5 281.0 3.77×
10
DTSM96-01696 SPO-26902
MW: 249 Da  |  cLogP: 1.59
PSA: 68.0 Ų  |  HBD: 1  |  HBA: 6
C11H11N3O2S
396.4 100.7 145.5 250.9 3.22×

Badge key

≥300 Very High CDK2   120–299 High CDK2   ≥120 Off-target concern (≥120)   <120 Acceptable off-target

6

Summary & Recommendations

Key Findings & Recommended Next Steps

  • Strong CDK2 hit rate: 1,488 compounds (14.9%) show high CDK2 activity (≥120), with 899 at very high activity (≥150). This is an excellent hit rate for a 10,000-compound kinase screen.
  • Selectivity is achievable: 1,064 CDK2-selective compounds with no EGFR/PKCα co-activity demonstrate that kinase selectivity can be built into the series without sacrificing CDK2 potency.
  • Key SAR drivers: Modest MW (350–550 Da) and moderate lipophilicity (cLogP 4–6) slightly favour CDK2 activity, though multiple chemotypes with varied physicochemical profiles are viable CDK2 scaffolds.
  • Top 10 progression compounds span CDK2 activity 385–1,130 units with selectivity margins of 250–1,041 units over the best off-target, covering diverse chemical space across 5+ structural classes.
  • Priority lead — DTSM99-0004386: CDK2 = 1,129.9 with EGFR and PKCα both below 90 (selectivity margin = 1,041). MW 593 and high PSA (245 Ų) warrant ADME assessment; however the exceptional CDK2 selectivity makes this the top candidate.
  • Recommended next steps: Confirm dose-response (IC₅₀) for all 10 compounds; assess ADME/PK (solubility, microsomal stability, Caco-2 permeability); run expanded 50-kinase selectivity panel; prioritise 3–5 scaffolds for medicinal chemistry optimisation using the SAR vectors identified in this report.