module config.models
¶
Configuration models.
class CustomBaseModel
¶
Base model that all other models derive from.
class InitUser
¶
User-specific parameters for initialization.
Args:
author
: Name of the person running the analysis.email
: Email address of the person running the analysis.logo
: Path or URL pointing to image file to be used as a logo in the run report.url
: A URL with additional information about the author or their affiliation.
Attributes:
author
: Name of the person running the analysis.email
: Email address of the person running the analysis.logo
: Path or URL pointing to image file to be used as a logo in the run report.url
: A URL with additional information about the author or their affiliation.
class InitRun
¶
Run-specific parameters for initialization.
Args:
cores
: Cores to be used by the workflow engine.dependency_embedding
: Dependency embedding strategy to use.execution_mode
: Execution mode to use.genome_assemblies_map
: Genome assemblies mapping file.profile
: Snakemake profile for ZARP workflow.resources_version
: Version of Ensembl genome resources to use when resources are not provided.rule_config
: ZARP rule configuration.working_directory
: Root directory for all runs.zarp_directory
: Root directory of the ZARP repository.
Attributes:
cores
: Cores to be used by the workflow engine.dependency_embedding
: Dependency embedding strategy to use.execution_mode
: Execution mode to use.genome_assemblies_map
: Genome assemblies mapping file.profile
: Snakemake profile for ZARP workflow.resources_version
: Version of Ensembl genome resources to use when resources are not provided.rule_config
: ZARP rule configuration.working_directory
: Root directory for all runs.zarp_directory
: Root directory of the ZARP repository.
class InitSample
¶
Sample-specific parameters for initialization.
Args:
fragment_length_distribution_mean
: Mean of the fragment length distribution.fragment_length_distribution_sd
: Standard deviation of the fragment length distribution.
Attributes:
fragment_length_distribution_mean
: Mean of the fragment length distribution.fragment_length_distribution_sd
: Standard deviation of the fragment length distribution.
class InitConfig
¶
ZARP-cli user default configuration set during initialization.
Args:
run
: Run-specific parameters.sample
: Sample-specific parameters.user
: User-specific parameters.
Attributes:
run
: Run-specific parameters.sample
: Sample-specific parameters.user
: User-specific parameters.
class ConfigUser
¶
User-specific parameters.
class ConfigRun
¶
Run-specific parameters.
Args:
description
: Run description.genome_assemblies_map
: Genome assemblies mapping file.identifier
: Unique identifier for a run.zarp_directory
: Root directory of the ZARP repository.
Attributes:
description
: Run description.genome_assemblies_map
: Genome assemblies mapping file.identifier
: Unique identifier for a run.zarp_directory
: Root directory of the ZARP repository.
classmethod get_identifier
¶
get_identifier(identifier: str) → str
Get default identifier.
class ConfigSample
¶
Sample-specific parameters.
Args:
adapter_3p
: Tuple of adapter sequences to truncate from the 3'-ends of single-end/first and second mate libraries, respectively.adapter_5p
: Tuple of adapter sequences to truncate from the 5'-ends of single-end/first and second mate libraries, respectively.adapter_poly_3p
: Tuple of polynucleotide stretch sequences to truncate from the 3'-ends of single-end/first and second mate libraries, respectively.adapter_poly_5p
: Tuple of polynucleotide stretch sequences to truncate from the 3'-ends of single-end/first and second mate libraries, respectively.annotations
: Path to GTF file containing gene annotations for thereference_sequences
.read_orientation
: Orientation of reads in sequencing library. Cf.https
: //salmon.readthedocs.io/en/latest/library_type.html.reference_sequences
: Path to FASTA file containing reference sequences to align reads against, typically chromosome sequences.source
: Origin of the sample as either a NCBI taxonomy database identifier, e.g,9606
for humans, or the corresponding full name, e.g., "Homo sapiens".star_sjdb_overhang
: Overhang length for splice junctions in STAR ( parametersjdbOverhang
). Ideally the maximum read length minus 1. Lower values may result in decreased mapping accuracy, while higher values may result in longer processing times. Cf.https
: //github.com/alexdobin/STAR/blob/3ae0966bc604a944b1993f49aaeb597e809eb5c9/doc/STARmanual.pdfsalmon_kmer_size
: Size of k-mers for building the Salmon index. The default value typically works fine for reads of 75 bp or longer. Consider using lower values if dealing with shorter reads. Cf.https
: //salmon.readthedocs.io/en/latest/salmon.html#preparing-transcriptome-indices-mapping-based-mode
Attributes:
adapter_3p
: Tuple of adapter sequences to truncate from the 3'-ends of single-end/first and second mate libraries, respectively.adapter_5p
: Tuple of adapter sequences to truncate from the 5'-ends of single-end/first and second mate libraries, respectively.adapter_poly_3p
: Tuple of polynucleotide stretch sequences to truncate from the 3'-ends of single-end/first and second mate libraries, respectively.adapter_poly_5p
: Tuple of polynucleotide stretch sequences to truncate from the 3'-ends of single-end/first and second mate libraries, respectively.annotations
: Path to GTF file containing gene annotations for thereference_sequences
.read_orientation
: Orientation of reads in sequencing library. Cf.https
: //salmon.readthedocs.io/en/latest/library_type.html.reference_sequences
: Path to FASTA file containing reference sequences to align reads against, typically chromosome sequences.source
: Origin of the sample as either a NCBI taxonomy database identifier, e.g,9606
for humans, or the corresponding full name, e.g., "Homo sapiens".star_sjdb_overhang
: Overhang length for splice junctions in STAR ( parametersjdbOverhang
). Ideally the maximum read length minus 1. Lower values may result in decreased mapping accuracy, while higher values may result in longer processing times. Cf.https
: //github.com/alexdobin/STAR/blob/3ae0966bc604a944b1993f49aaeb597e809eb5c9/doc/STARmanual.pdfsalmon_kmer_size
: Size of k-mers for building the Salmon index. The default value typically works fine for reads of 75 bp or longer. Consider using lower values if dealing with shorter reads. Cf.https
: //salmon.readthedocs.io/en/latest/salmon.html#preparing-transcriptome-indices-mapping-based-mode
class Config
¶
ZARP-cli main configuration.
Args:
ref
: References to individual sequencing libraries by local file path or read archive identifiers OR paths to ZARP sample tables; see documentation for details.run
: Run-specific parameters.sample
: Sample-specific parameters.user
: User-specific parameters.
Attributes:
ref
: References to individual sequencing libraries by local file path or read archive identifiers OR paths to ZARP sample tables; see documentation for details.run
: Run-specific parameters.sample
: Sample-specific parameters.user
: User-specific parameters.
class SampleReference
¶
Sample reference information.
Args:
identifier
: Read archive identifier.lib_paths
: Path (single-ended) or paths (paired-ended) to files containing sequencing reads.name
: Sample name.ref
: References to individual sequencing libraries by local file path or read archive identifiers OR paths to ZARP sample tables; see documentation for details.table_path
: Path to ZARP sample table.type
: Type of sample reference.
Attributes:
identifier
: Read archive identifier.lib_paths
: Path (single-ended) or paths (paired-ended) to files containing sequencing reads.name
: Sample name.ref
: References to individual sequencing libraries by local file path or read archive identifiers OR paths to ZARP sample tables; see documentation for details.table_path
: Path to ZARP sample table.type
: Type of sample reference.
class Sample
¶
Sample-specific parameters.
Args:
type
: Type of sample, local (single/paired) or remote.identifier
: Read archive identifier.name
: Sample name; if not provided, a sample name will be set based on the input file name or read archive identifier.paths
: Path (single-ended) or paths (paired-ended) to files containing sequencing reads.
Attributes:
type
: Type of sample, local (single/paired) or remote.identifier
: Read archive identifier.name
: Sample name; if not provided, a sample name will be set based on the input file name or read archive identifier.paths
: Path (single-ended) or paths (paired-ended) to files containing sequencing reads.
class ConfigFileContent
¶
Snakemake configuration file content.
class ConfigFileSRA
¶
SRA download workflow configuration file content.
Args:
cluster_log_dir
: Path to cluster log directory.log_dir
: Path to log directory.outdir
: Path to output directory.samples
: Path to sample table.samples_out
: Path to output sample table.
Attributes:
cluster_log_dir
: Path to cluster log directory.log_dir
: Path to log directory.outdir
: Path to output directory.samples
: Path to sample table.samples_out
: Path to output sample table.
class ConfigFileHTSinfer
¶
HTSinfer workflow configuration file content.
Args:
cluster_log_dir
: Path to cluster log directory.log_dir
: Path to log directory.samples
: Path to sample table.outdir
: Path to output directory.samples_out
: Path to output sample table.
Attributes:
cluster_log_dir
: Path to cluster log directory.log_dir
: Path to log directory.samples
: Path to sample table.outdir
: Path to output directory.samples_out
: Path to output sample table.
class ConfigFileZARP
¶
ZARP workflow configuration file content.
Args:
samples
: Path to sample table.output_dir
: Path to output directory.log_dir
: Path to log directory.cluster_log_dir
: Path to cluster log directory.kallisto_indexes
: Path to kallisto indexes.salmon_indexes
: Path to Salmon indexes.star_indexes
: Path to STAR indexes.alfa_indexes
: Path to ALFA indexes.rule_config
: Path to ZARP rule configuration.report_description
: Run report description.report_logo
: Path or URL pointing to image file to be used as a logo for the run report.report_url
: URL to link to from the run report.author_name
: Name of the person running the analysis.author_email
: Email address of the person running the analysis.
Attributes:
samples
: Path to sample table.output_dir
: Path to output directory.log_dir
: Path to log directory.cluster_log_dir
: Path to cluster log directory.kallisto_indexes
: Path to kallisto indexes.salmon_indexes
: Path to Salmon indexes.star_indexes
: Path to STAR indexes.alfa_indexes
: Path to ALFA indexes.report_description
: Run report description.report_logo
: Path or URL pointing to image file to be used as a logo for the run report.report_url
: URL to link to from the run report.author_name
: Name of the person running the analysis.author_email
: Email address of the person running the analysis.