Craig Lab, Department of Translational Genomics, Keck School of Medicine of USC

The Craig Lab at USC has been heavily involved in AMP-PD, beginning with the PPMI project and continues to be a valuable contributing member of the community. Their expertise is in RNAseq analysis and visualization as exemplified in the AMP-PD RNA-Seq Explorer, found here on Terra.

Mission Statement: We are deeply committed to excellence in translational genomics research, bringing to bear vast experience and expertise in molecular genetics, genome science, biomedical informatics, translational science, and molecular medicine. Our ultimate goal is to serve the Keck community by bridging basic and clinical research through discovery and validation of novel diagnostics and therapeutics for earlier diagnosis and smarter treatments.

Project Lead: David Craig, PhD
For information, contact: http://dtg.usc.edu

Analysis Methodology

Note: The analysis that produced this target list is available to all registered AMP PD users in Terra and can be reused in your work.

Introduction

The AMP-PD Target Explorer uses transcriptomic data described here.

3 Studies were included in this analysis:

Number of Samples Successfully Processed
BioFIND 165
PDBP 3296
PPMI 4622

Methods:

Outliers and covariates

Outlier samples

  • Principal components were correlated to metadata and QC metrics. RIN and PCT USABLE BASES (Picard RnaSeqMetrics) were used to set cut offs for outlier detection.
  • Tools:
    • R

Covariates

  • To help identify covariates, principal components were correlated to metadata and QC metrics. variancePartition was used to validate potential covariates. Finally a neutrophil score was developed to fill the gap in CBC data.
  • Tools:
    • R
    • variancePartition
  • Input:
  • Output:
    • default covariates: sex + plate + age + neutPerc + lymphPerc

Expression

Differential expression

  • Cohorts for DE were created by filtering metadata for case/control status, visit month, and genetic mutation status. DE was performed with and without neutrophil score as a covariate.
  • Tools:
  • Input:
    • matrix.featureCounts.tsv - raw count data (output from Subread featureCounts)
    • fullTableRNAseqMeta_TNT.csv - table of relevant sample metadata
  • Output:
    • _limmaResults.tsv - differential expression table including fold change, p-values, and average expression
    • Plots (volcano and mean difference)

Underlying Analyses

Analysis Type:

Differential Expression

Target Type:

Gene, Transcript

Description:

Results of PD case versus HC differential expression analysis. Differential expression was assessed using linear modeling and empirical Bayes moderation (employing default settings) with limma+voom, using sex, age, plate, derived neutrophil and lymphocyte percentages in the design formula in addition to disease status. Genes with an adjusted p < 0.05 and a log2 fold change > 0.1 are considered differentially expressed. Multiple hypothesis testing was performed with the decideTests() function in limma using the default adjustment method (Benjamini-Hochberg).

Tissue Type:

Whole Blood

Source Data Type:

Transcriptomics

Source Data Cohorts:

BioFIND, Parkinson's Disease Biomarkers Program (PDBP), Parkinson's Progression Markers Initiative (PPMI)


Data Dictionary

Field NameField Name ExpandedShort Description (optional)
ensembl_gene_idENSEMBL Gene ID
hgnc_symbolHGNC symbol
gene_typeGene type from GENCODE
logfcLog2 Fold Change
AveExprAverage Expression
tT-test statistic
p_valp-Value
adj_p_valAdjusted p-value

Nominated Targets

OLAH (gene)

ensembl_id: ENSG00000152463
hgnc_symbol: OLAH
gene_type: protein_coding
logfc:
0.2432115523
AveExpr:
-1.473925406
t:
7.479148263
p_val:
8.44e-14
adj_p_val:
3.75e-11
gene_name:
OLAH

KIR2DL4 (gene)

ensembl_id: ENSG00000189013
hgnc_symbol: KIR2DL4
gene_type: protein_coding
logfc:
0.2196040597
AveExpr:
-1.752118928
t:
7.47105836
p_val:
8.97e-14
adj_p_val:
3.9e-11
gene_name:
KIR2DL4

BOK (gene)

ensembl_id: ENSG00000176720
hgnc_symbol: BOK
gene_type: protein_coding
logfc:
0.2258142416
AveExpr:
-1.281519973
t:
6.631860673
p_val:
3.57e-11
adj_p_val:
6.06e-9
gene_name:
BOK

OLA1P2 (gene)

ensembl_id: ENSG00000213671
hgnc_symbol: OLA1P2
gene_type: processed_pseudogene
logfc:
0.2276290027
AveExpr:
-2.321197406
t:
6.566382948
p_val:
5.54e-11
adj_p_val:
8.33e-9
gene_name:
OLA1P2

SLC26A5 (gene)

ensembl_id: ENSG00000170615
hgnc_symbol: SLC26A5
gene_type: protein_coding
logfc:
0.2165369306
AveExpr:
-1.243071028
t:
5.703192167
p_val:
1.23e-8
adj_p_val:
7.25e-7
gene_name:
SLC26A5

HYDIN (gene)

ensembl_id: ENSG00000157423
hgnc_symbol: HYDIN
gene_type: protein_coding
logfc:
0.2692465381
AveExpr:
-1.792202495
t:
5.698817976
p_val:
1.26e-8
adj_p_val:
7.34e-7
gene_name:
HYDIN

ARHGAP44 (gene)

ensembl_id: ENSG00000006740
hgnc_symbol: ARHGAP44
gene_type: protein_coding
logfc:
0.237621232
AveExpr:
-2.043915818
t:
5.600169224
p_val:
2.23e-8
adj_p_val:
0.00000118
gene_name:
ARHGAP44

ADAMTS2 (gene)

ensembl_id: ENSG00000087116
hgnc_symbol: ADAMTS2
gene_type: protein_coding
logfc:
0.2525335063
AveExpr:
-2.067700072
t:
5.398997899
p_val:
6.93e-8
adj_p_val:
0.00000296
gene_name:
ADAMTS2

CEACAM22P (gene)

ensembl_id: ENSG00000230666
hgnc_symbol: CEACAM22P
gene_type: transcribed_unprocessed_pseudogene
logfc:
0.2770785409
AveExpr:
-2.425268918
t:
5.392987049
p_val:
7.17e-8
adj_p_val:
0.00000304
gene_name:
CEACAM22P

SHANK2 (gene)

ensembl_id: ENSG00000162105
hgnc_symbol: SHANK2
gene_type: protein_coding
logfc:
0.2173924312
AveExpr:
-1.809445391
t:
4.756746229
p_val:
0.00000201
adj_p_val:
0.0000451
gene_name:
SHANK2

CSMD2 (gene)

ensembl_id: ENSG00000121904
hgnc_symbol: CSMD2
gene_type: protein_coding
logfc:
0.2378988925
AveExpr:
-2.23949969
t:
4.273533268
p_val:
0.0000195
adj_p_val:
0.000284
gene_name:
CSMD2

KIF1A (gene)

ensembl_id: ENSG00000130294
hgnc_symbol: KIF1A
gene_type: protein_coding
logfc:
0.2077414804
AveExpr:
-2.533304925
t:
3.846671668
p_val:
0.000121
adj_p_val:
0.001237814175
gene_name:
KIF1A

DUX4L26 (gene)

ensembl_id: ENSG00000236138
hgnc_symbol: DUX4L26
gene_type: processed_pseudogene
logfc:
0.2057827939
AveExpr:
-1.900596894
t:
3.834454669
p_val:
0.000127
adj_p_val:
0.001288476817
gene_name:
DUX4L26

HMCN2 (gene)

ensembl_id: ENSG00000148357
hgnc_symbol: HMCN2
gene_type: protein_coding
logfc:
0.2071515434
AveExpr:
-2.047573397
t:
3.790507898
p_val:
0.000152
adj_p_val:
0.001492078964
gene_name:
HMCN2

ZFP57 (gene)

ensembl_id: ENSG00000204644
hgnc_symbol: ZFP57
gene_type: protein_coding
logfc:
0.2907951024
AveExpr:
-1.912883679
t:
3.63455441
p_val:
0.000281
adj_p_val:
0.002427717311
gene_name:
ZFP57

LRRN3 (gene)

ensembl_id: ENSG00000173114
hgnc_symbol: LRRN3
gene_type: protein_coding
logfc:
-0.2144334069
AveExpr:
3.036942532
t:
-8.053781808
p_val:
9.43e-16
adj_p_val:
1.18e-12
gene_name:
LRRN3

KRT1 (gene)

ensembl_id: ENSG00000167768
hgnc_symbol: KRT1
gene_type: protein_coding
logfc:
-0.2291220873
AveExpr:
5.214187084
t:
-7.317559794
p_val:
2.82e-13
adj_p_val:
1.08e-10
gene_name:
KRT1

LINC01013 (gene)

ensembl_id: ENSG00000228495
hgnc_symbol: LINC01013
gene_type: lincRNA
logfc:
-0.2066209357
AveExpr:
-0.07104315524
t:
-6.980144839
p_val:
3.23e-12
adj_p_val:
8.64e-10
gene_name:
LINC01013

NEBL (gene)

ensembl_id: ENSG00000078114
hgnc_symbol: NEBL
gene_type: protein_coding
logfc:
-0.3067501758
AveExpr:
0.9300496481
t:
-5.620829588
p_val:
1.98e-8
adj_p_val:
0.00000107
gene_name:
NEBL

FAM3B (gene)

ensembl_id: ENSG00000183844
hgnc_symbol: FAM3B
gene_type: protein_coding
logfc:
-0.330617395
AveExpr:
-2.278354988
t:
-5.06435642
p_val:
4.21e-7
adj_p_val:
0.0000132
gene_name:
FAM3B