Professional Documents
Culture Documents
Analysis Data
Model (ADaM)
Annamaria Muraro
Helsinn Healthcare
Analysis Datasets
(=ADaM data)
Datasets used in the
analysis, restructured and
contain additional
information (derived
variables, flags, etc.)
Source: raw datasets
Two sets of data
Each with a specific purpose
2
FDA requirements
SDTM
Source data
Vertical
No redundancy
Character variables
Each domain is specific to
itself
Dates are ISO8601 character
strings
Two chars for dataset name
Data transfer
Interoperability
ADaM
Derived data
Structure may not necessarily by
vertical
Redundancy is needed for easy
analysis
Numeric variables
Combines variables across
multiple domains
Dates are formatted as numeric
(e.g. SAS dates) to allow
manipulation
Dataset Name: ADXXXX
Analytic & graphical analysis
Clear communication of statistical
analysis and related decision
Identifiers
Analysis Population Indicators
Analysis Date Variables
Analysis Study Day Variables
Visit time Variables
Numeric Code Variables
Analysis Treatment Variables
10
11
ADSL, Example
Studyid
USUBJID
SAFETY
ITT
PPROT
COMPLT
XX0001
0001-1
XX0001
0001-2
DSREAS
AGE
AGEGRP
30
21-35
ADVERSE EVENT
38
36-50
AGEGRPN
SEX
RACE
RACEN
TRTP
TRTPN
HEIGHTBL
WEIGHTBL
BMIBL
WHITE
DRUG A
170
63.5
21.97
ASIAN
PLACEBO
183
86.2
25.74
Dataset named
ADxxxxxx
SDTM variable
with no changes
ADaM Treatment
Variable
12
Demographic
variables
Treatment
variables
Analysis
Population
Variable
Name
STUDYID
USUBJID
Variable Label
Type
Source
Char
Char
Controlled Terms
or Format
$15.
$30.
Study Identifier
Unique Subject Identifier
SUBJID
SITEID
VSBLFL
Char
Char
Char
$5.
$5.
Y or Null
ADSL.SUBJID
ADSL.SITEID
VS.VSBLFL (where VS.VSTESTCD in ('DIABP' 'SYSBP' 'HR'))
VISITNUM
VISIT
Visit Number
Visit Name
Num
Char
3.
$100.
VS.VISITNUM
VS.VISIT
Num
Num
Num
5.1
5.1
5.1
HR_BASE
HR_VAL
HR_CHG
Num
Num
Num
3.
3.
3.
SBP_BASE
SBP_VAL
3.
3.
SBP_CHG
.......
AGE
3.
ADVS.SBP_VAL - ADVS.SBP_BASE
Num
3.
ADSL.AGE
AGEU
SEX
SEXN
Age Units
Sex
Sex Numeric
Char
Char
Num
years
F,M,U
1=Male, 2=Female
ADSL.AGEU
ADSL.SEX
ADSL.SEXN
RACE
Race
Char
ADSL.RACE
RACEN
Race Numeric
Num
...........
TRTP
Char
TRTPN
TRTA
TRTAN
SAFETY
FULLSET
PPROT
Safety Set
Full Analysis Set
Per-Protocol Set
Char
Char
Char
VS.STUDYID
VS.USUBJID
ADSL is the
source
ADSL.RACEN
ADSL.TRTP
ADSL.TRTPN
ADSL.TRTA
ADSL.TRTAN
Y, N
Y, N
Y, N
ADSL.SAFETY
ADSL.FULLSET
ADSL.PPROT
13
Add numeric
variables
Variable
Name
Variable Label
Type
STUDYID
Study Identifier
Char
Variable
Name
Variable Label
Type
USUBJID
Char
AEPRE
Char
SUBJID
Char
AETRTEM
Char
SITEID
Char
AEPOST
Char
AESEQ
Sequence Number
Num
AETERM
Char
HEIGHTBL
Num
AEDECOD
Dictionary-Derived Term
Char
WEIGHTBL
Num
Char
AGE
Num
AESEV
Severity/Intensity
Char
AGEU
Age Units
Char
AESEVN
Severity/Intensity Numeric
Num
SEX
Sex
SEXN
Sex Numeric
Num
AESER
Serious Event
Char
RACE
Race
Char
AEACN
Char
RACEN
Race Numeric
Num
AEREL
Causality
Char
AERELN
Causality Numeric
Num
RACEOTH
AEOUT
Char
AEOUTN
Num
.....
Add derived
variables
Add demographic
Char
variables from ADSL
.....
Char
Add treatment
variables from ADSL
TRTP
Char
TRTPN
Num
AESTDT
Num
TRTA
Char
AESTDY
Num
TRTAN
Num
SAFETY
Safety Set
AERELAT
Char
AEDUR
Num
....
Add population
flag from ADSL
Char
14
15
16
17
ADSL / 2
18
ADSL / 3
19
A unique identifier
for the analysis
Analysis name
Description
Reason
Analysis pre-specified in
SAP
Analysis pre-specified in
SAP
Analysis pre-specified in
SAP
Summary of demographic data by country for Analysis pre-specified in
full analysis set
SAP
Summary of demographic data by gender for Analysis pre-specified in
full analysis set
SAP
Documentation
ADSL
select records with FULLSET=Y
ADSL
select records with PPROT=Y
ADSL
select records with SAFETY=Y
ADSL
select records with FULLSET=Y
ADSL
select records with FULLSET=Y
SAP Section XX
SAP Section XX
SAP Section XX
SAP Section XX
SAP Section XX
20
Parallel method
SDTM
CDMS
ADaM
Linear method
CDMS
ADaM
Draft
SDTM
ADaM
Hybrid method
CDMS
SDTM
SDTM
Other approaches
21
Datasets
Variables
Variables in SDTM SUPPQUAL merged back to the original domain (ex. Race, other)
Common set of variables in each dataset (age, gender, race, stratifications variables,
treatment planned/actual)
Analysis population flag: added to each dataset
Numeric variables: added as needed for the analysis (dates, numeric version of categorical
variables)
Add dataset specific variables (analysis day, TE, change from baseline etc.)
22
Benefits
(even if you are not working on a submission)
23
- Categorical Analysis
- Adverse Events
24
Questions
25
Clearly distinguish:
derivations & decision rules specified a priori
decisions that were data-driven
Key issues:
derived variables documentation: algorithms
handling of missing data
data item specific derivations, i.e change to a data value for a
specific observation
27
Should contain
ANALYSIS NAME: A unique identifier for this analysis. May include a table
number or other sponsor-specific reference.
DESCRIPTION: A text description documenting the analysis performed.
REASON: The reason for performing this analysis. Examples may include Prespecified, Exploratory, and Regulatory Request.
DATASET: the name of the analysis dataset used for this analysis. The column
may also include specific selection criteria (e.g. where SAFETY=Y)
DOCUMENTATION: information about how the analysis was performed (text
description, link to another document or the analysis generation program)
28