Project Lead:

Analysis Methodology

Note: The analysis that produced this target list is available to all registered AMP PD users in Terra and can be reused in your work.

Introduction

The AMP-PD Target Explorer uses transcriptomic data described here.

3 Studies were included in this analysis:

Number of Samples Successfully Processed
BioFIND 165
PDBP 3296
PPMI 4622

Methods:

Outliers and covariates

Outlier samples

  • Principal components were correlated to metadata and QC metrics. RIN and PCT USABLE BASES (Picard RnaSeqMetrics) were used to set cut offs for outlier detection.
  • Tools:
    • R

Covariates

  • To help identify covariates, principal components were correlated to metadata and QC metrics. variancePartition was used to validate potential covariates. Finally a neutrophil score was developed to fill the gap in CBC data.
  • Tools:
    • R
    • variancePartition
  • Input:
  • Output:
    • default covariates: sex + plate + age + neutPerc + lymphPerc

Expression

Differential expression

  • Cohorts for DE were created by filtering metadata for case/control status, visit month, and genetic mutation status. DE was performed with and without neutrophil score as a covariate.
  • Tools:
  • Input:
    • matrix.featureCounts.tsv - raw count data (output from Subread featureCounts)
    • fullTableRNAseqMeta_TNT.csv - table of relevant sample metadata
  • Output:
    • _limmaResults.tsv - differential expression table including fold change, p-values, and average expression
    • Plots (volcano and mean difference)

Underlying Analyses

Analysis Type:

Differential Expression

Target Type:

Gene, Transcript

Description:

Results of PD case versus HC differential expression analysis. Differential expression was assessed using linear modeling and empirical Bayes moderation (employing default settings) with limma+voom, using sex, age, plate, derived neutrophil and lymphocyte percentages in the design formula in addition to disease status. Genes with an adjusted p < 0.05 and a log2 fold change > 0.1 are considered differentially expressed. Multiple hypothesis testing was performed with the decideTests() function in limma using the default adjustment method (Benjamini-Hochberg).

Tissue Type:

Whole Blood

Source Data Type:

Transcriptomics

Source Data Cohorts:

BioFIND, Parkinson's Disease Biomarkers Program (PDBP), Parkinson's Progression Markers Initiative (PPMI)


Data Dictionary

Field NameField Name ExpandedShort Description (optional)
ensembl_gene_idENSEMBL Gene ID
hgnc_symbolHGNC symbol
gene_typeGene type from GENCODE
logfcLog2 Fold Change
AveExprAverage Expression
tT-test statistic
p_valp-Value
adj_p_valAdjusted p-value

Nominated Targets

OLAH (gene)

ensembl_id: ENSG00000152463
hgnc_symbol: OLAH
gene_type: protein_coding
logfc:
0.2432115523
AveExpr:
-1.473925406
t:
7.479148263
p_val:
8.44e-14
adj_p_val:
3.75e-11
gene_name:
OLAH

KIR2DL4 (gene)

ensembl_id: ENSG00000189013
hgnc_symbol: KIR2DL4
gene_type: protein_coding
logfc:
0.2196040597
AveExpr:
-1.752118928
t:
7.47105836
p_val:
8.97e-14
adj_p_val:
3.9e-11
gene_name:
KIR2DL4

BOK (gene)

ensembl_id: ENSG00000176720
hgnc_symbol: BOK
gene_type: protein_coding
logfc:
0.2258142416
AveExpr:
-1.281519973
t:
6.631860673
p_val:
3.57e-11
adj_p_val:
6.06e-9
gene_name:
BOK

OLA1P2 (gene)

ensembl_id: ENSG00000213671
hgnc_symbol: OLA1P2
gene_type: processed_pseudogene
logfc:
0.2276290027
AveExpr:
-2.321197406
t:
6.566382948
p_val:
5.54e-11
adj_p_val:
8.33e-9
gene_name:
OLA1P2

SLC26A5 (gene)

ensembl_id: ENSG00000170615
hgnc_symbol: SLC26A5
gene_type: protein_coding
logfc:
0.2165369306
AveExpr:
-1.243071028
t:
5.703192167
p_val:
1.23e-8
adj_p_val:
7.25e-7
gene_name:
SLC26A5

HYDIN (gene)

ensembl_id: ENSG00000157423
hgnc_symbol: HYDIN
gene_type: protein_coding
logfc:
0.2692465381
AveExpr:
-1.792202495
t:
5.698817976
p_val:
1.26e-8
adj_p_val:
7.34e-7
gene_name:
HYDIN

ARHGAP44 (gene)

ensembl_id: ENSG00000006740
hgnc_symbol: ARHGAP44
gene_type: protein_coding
logfc:
0.237621232
AveExpr:
-2.043915818
t:
5.600169224
p_val:
2.23e-8
adj_p_val:
0.00000118
gene_name:
ARHGAP44

ADAMTS2 (gene)

ensembl_id: ENSG00000087116
hgnc_symbol: ADAMTS2
gene_type: protein_coding
logfc:
0.2525335063
AveExpr:
-2.067700072
t:
5.398997899
p_val:
6.93e-8
adj_p_val:
0.00000296
gene_name:
ADAMTS2

CEACAM22P (gene)

ensembl_id: ENSG00000230666
hgnc_symbol: CEACAM22P
gene_type: transcribed_unprocessed_pseudogene
logfc:
0.2770785409
AveExpr:
-2.425268918
t:
5.392987049
p_val:
7.17e-8
adj_p_val:
0.00000304
gene_name:
CEACAM22P

SHANK2 (gene)

ensembl_id: ENSG00000162105
hgnc_symbol: SHANK2
gene_type: protein_coding
logfc:
0.2173924312
AveExpr:
-1.809445391
t:
4.756746229
p_val:
0.00000201
adj_p_val:
0.0000451
gene_name:
SHANK2

CSMD2 (gene)

ensembl_id: ENSG00000121904
hgnc_symbol: CSMD2
gene_type: protein_coding
logfc:
0.2378988925
AveExpr:
-2.23949969
t:
4.273533268
p_val:
0.0000195
adj_p_val:
0.000284
gene_name:
CSMD2

KIF1A (gene)

ensembl_id: ENSG00000130294
hgnc_symbol: KIF1A
gene_type: protein_coding
logfc:
0.2077414804
AveExpr:
-2.533304925
t:
3.846671668
p_val:
0.000121
adj_p_val:
0.001237814175
gene_name:
KIF1A

DUX4L26 (gene)

ensembl_id: ENSG00000236138
hgnc_symbol: DUX4L26
gene_type: processed_pseudogene
logfc:
0.2057827939
AveExpr:
-1.900596894
t:
3.834454669
p_val:
0.000127
adj_p_val:
0.001288476817
gene_name:
DUX4L26

HMCN2 (gene)

ensembl_id: ENSG00000148357
hgnc_symbol: HMCN2
gene_type: protein_coding
logfc:
0.2071515434
AveExpr:
-2.047573397
t:
3.790507898
p_val:
0.000152
adj_p_val:
0.001492078964
gene_name:
HMCN2

ZFP57 (gene)

ensembl_id: ENSG00000204644
hgnc_symbol: ZFP57
gene_type: protein_coding
logfc:
0.2907951024
AveExpr:
-1.912883679
t:
3.63455441
p_val:
0.000281
adj_p_val:
0.002427717311
gene_name:
ZFP57

LRRN3 (gene)

ensembl_id: ENSG00000173114
hgnc_symbol: LRRN3
gene_type: protein_coding
logfc:
-0.2144334069
AveExpr:
3.036942532
t:
-8.053781808
p_val:
9.43e-16
adj_p_val:
1.18e-12
gene_name:
LRRN3

KRT1 (gene)

ensembl_id: ENSG00000167768
hgnc_symbol: KRT1
gene_type: protein_coding
logfc:
-0.2291220873
AveExpr:
5.214187084
t:
-7.317559794
p_val:
2.82e-13
adj_p_val:
1.08e-10
gene_name:
KRT1

LINC01013 (gene)

ensembl_id: ENSG00000228495
hgnc_symbol: LINC01013
gene_type: lincRNA
logfc:
-0.2066209357
AveExpr:
-0.07104315524
t:
-6.980144839
p_val:
3.23e-12
adj_p_val:
8.64e-10
gene_name:
LINC01013

NEBL (gene)

ensembl_id: ENSG00000078114
hgnc_symbol: NEBL
gene_type: protein_coding
logfc:
-0.3067501758
AveExpr:
0.9300496481
t:
-5.620829588
p_val:
1.98e-8
adj_p_val:
0.00000107
gene_name:
NEBL

FAM3B (gene)

ensembl_id: ENSG00000183844
hgnc_symbol: FAM3B
gene_type: protein_coding
logfc:
-0.330617395
AveExpr:
-2.278354988
t:
-5.06435642
p_val:
4.21e-7
adj_p_val:
0.0000132
gene_name:
FAM3B