Tests input parameters for functions

validate_input_parameters_power(
  SCE = "placeholder",
  range_downsampled = "placeholder",
  output_path = "placeholder",
  inpath = "placeholder",
  sampled = "placeholder",
  sampleID = "placeholder",
  design = "placeholder",
  sexID = "placeholder",
  celltypeID = "placeholder",
  assay_name = "placeholder",
  coef = "placeholder",
  fdr = "placeholder",
  nom_pval = "placeholder",
  Nperms = "placeholder",
  N_randperms = "placeholder",
  N_subsetpairs = "placeholder",
  y = "placeholder",
  region = "placeholder",
  control = "placeholder",
  pval_adjust_method = "placeholder",
  rmv_zero_count_genes = "placeholder"
)

Arguments

SCE: the input data (should be an SCE object)
range_downsampled: vector or list of values at which the SCE will be downsampled, in ascending order
output_path: base path in which outputs will be stored
inpath: base path where downsampled DGE analysis output is stored (taken to be output_path if not provided)
sampled: the unit on which downsampling is based (either "individuals" or "cells")
sampleID: sample ID
design: the design formula, of class formula. This is the equation used to fit the generalised linear model, e.g. expression ~ sex + pmi + disease
sexID: sex ID
celltypeID: cell type ID
assay_name: the name of the assay in the SCE object to be used for the analysis (default is "counts")
coef: which coefficient to carry out DE analysis with respect to
fdr: the cut-off False Discovery Rate below which to select DEGs
nom_pval: the cut-off nominal P-value below which to select DEGs (as an alternative to FDR)
Nperms: number of subsets created when downsampling at each level
N_randperms: number of randomised permutations of the dataset (based on sex) to be correlated
N_subsetpairs: number of pairs of subsets of the dataset to be correlated
y: the column name in the SCE object for the response variable, e.g. "diagnosis" (case or control). Default is the last variable in the design formula. y can be discrete (logistic regression) or continuous (linear regression)
region: the column name in the SCE object for the study region. If there are multiple regions in the study (for example, two brain regions), pseudobulk values can be derived separately for each. Default is "single_region", which will not split by region.
control: character string specifying the control level for the differential expression analysis, e.g. in a case/control/other study, use "control" if that is the level in the y column to compare against. Only needs to be specified if y has more than two groups; leave as the default for two groups or a continuous y. Default is NULL.
pval_adjust_method: the adjustment method for the p-value in the differential expression analysis. Default is Benjamini-Hochberg ("BH"). See stats::p.adjust for available options.
rmv_zero_count_genes: whether genes with no count values in any cell should be removed. Default is TRUE.

Checks that all power analysis parameters are specified correctly.
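
The kinds of checks implied by the argument descriptions above can be sketched in base R. This is an illustrative sketch only, not the package's implementation: the helper name check_power_params is hypothetical, it covers only a handful of the arguments, and the requirement that fdr lies strictly between 0 and 1 is an assumption.

```r
# Hypothetical helper (not part of the package): illustrates checks that the
# argument descriptions imply for a few of the parameters.
check_power_params <- function(range_downsampled,
                               sampled = "individuals",
                               design = NULL,
                               fdr = 0.05,
                               pval_adjust_method = "BH") {
  # range_downsampled: numeric values in ascending order
  values <- unlist(range_downsampled)
  if (!is.numeric(values) || is.unsorted(values)) {
    stop("range_downsampled must be numeric and in ascending order")
  }
  # sampled: one of the two documented options
  if (!sampled %in% c("individuals", "cells")) {
    stop("sampled must be either \"individuals\" or \"cells\"")
  }
  # design: must be a formula when supplied
  if (!is.null(design) && !inherits(design, "formula")) {
    stop("design must be of class formula")
  }
  # fdr: assumed here to be a single number strictly between 0 and 1
  if (!is.numeric(fdr) || length(fdr) != 1 || fdr <= 0 || fdr >= 1) {
    stop("fdr must be a single number between 0 and 1")
  }
  # pval_adjust_method: any method accepted by stats::p.adjust
  if (!pval_adjust_method %in% stats::p.adjust.methods) {
    stop("pval_adjust_method must be one of stats::p.adjust.methods")
  }
  invisible(TRUE)
}

# Valid inputs pass silently and return TRUE
ok <- check_power_params(
  range_downsampled = c(10, 20, 40),
  sampled = "cells",
  design = expression ~ sex + pmi + disease
)

# A descending range is rejected; capture the error message
bad <- tryCatch(
  check_power_params(range_downsampled = c(40, 10)),
  error = function(e) conditionMessage(e)
)
```

Stopping at the first failed check keeps the error message specific to one argument, which is usually easier to act on than a combined report.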
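
The default for y and the roles of pval_adjust_method and fdr can be illustrated with a small base R sketch. The gene names and p-values are made up for illustration, and taking the last variable of the formula with all.vars is an assumption about how the default might be derived, not a statement of the package's internals.

```r
# Design formula from the argument description above
design <- expression ~ sex + pmi + disease

# Assumption: the default y is the last variable named in the design formula
default_y <- utils::tail(all.vars(design), 1)

# Hypothetical nominal p-values for five genes
nominal_p <- c(geneA = 0.001, geneB = 0.008, geneC = 0.02,
               geneD = 0.2, geneE = 0.6)

# Adjust with the Benjamini-Hochberg method (the documented default, "BH")
adjusted_p <- stats::p.adjust(nominal_p, method = "BH")

# Select genes whose adjusted p-value falls below the FDR cut-off
fdr <- 0.05
degs <- names(nominal_p)[adjusted_p < fdr]
```

Swapping method = "BH" for any other entry of stats::p.adjust.methods changes only the adjustment step; the cut-off comparison stays the same.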