Skip to content

Commit

Permalink
Update docs
Browse files Browse the repository at this point in the history
  • Loading branch information
alexmanuele committed Mar 18, 2022
1 parent cb0f783 commit 46d9706
Show file tree
Hide file tree
Showing 2 changed files with 23 additions and 8 deletions.
10 changes: 9 additions & 1 deletion CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,15 @@
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/)
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## v1.2 dev - January 19,2021
## v.1.2.1 dev - March 18, 2022

### `Added`
- Add CheckM nf-core module
- Add `assembly_qc` workflow; runs QC software on existing software

### `Fixed`
- Fixed MultiQC not finding QUAST report (again)
## v1.2.0 dev - January 19,2022
Major refactoring to align with Nextflow DSL2 syntax.
### `Added`
- Add SNPsites module
Expand Down
21 changes: 14 additions & 7 deletions docs/usage.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,10 +4,11 @@
## Introduction

The ARETE pipeline can is designed as an end-to-end workflow manager for genome assembly, annotation, and phylogenetic analysis, beginning with read data. However, in some cases a user may wish to stop the pipeline prior to annotation or use the annotation features of the work flow with pre-existing assemblies. Therefore, ARETE allows users three use cases:
The ARETE pipeline can is designed as an end-to-end workflow manager for genome assembly, annotation, and phylogenetic analysis, beginning with read data. However, in some cases a user may wish to stop the pipeline prior to annotation or use the annotation features of the work flow with pre-existing assemblies. Therefore, ARETE allows users four use cases:
1. Run the full pipeline end-to-end..
2. Input a set of reads and stop after assembly.
3. Input a set of assemblies and perform annotation and taxonomic analyses.
3. Input a set of assemblies and perform QC.
4. Input a set of assemblies and perform annotation and taxonomic analyses.

This document will describe how to perform each workflow.
## Samplesheet input
Expand Down Expand Up @@ -45,14 +46,14 @@ TREATMENT_REP3,AEG588A6_S6_L004_R1_001.fastq.gz,
An [example samplesheet](../assets/samplesheet.csv) has been provided with the pipeline.

### Annotation only samplesheet
The ARETE pipeline allows users to provide pre-existing assemblies to make use of the annotation and reporting features of the workflow. Users are expected to independently quality check these assemblies prior to use.
The ARETE pipeline allows users to provide pre-existing assemblies to make use of the annotation and reporting features of the workflow. Users may use the `assembly_qc` entry point to perform QC on the assemblies. **Note that the QC workflow does not automatically filter low quality assemblies, it simply generates QC reports!** `assembly` and `assembly_qc` workflows accept the same format of sample sheet.

The sample sheet must be a 2 column, comma-seperated CSV file with header.

| Column | Description |
|----------------|-----------------------------------------------------------------------------------------------------------------------------|
| `sample` | Custom sample name. This entry will be identical for multiple sequencing libraries/runs from the same sample. |
|`fna_file_path` | Full path to FASTA or fna file for assembly or genome. File has to be gzipped and have the extension ".fna.gz" or ".fasta.gz". |
|`fna_file_path` | Full path to fna file for assembly or genome. File must have `.fna` file extension. |
## Reference Genome

For full workflow or assembly, users may provide a path to a reference genome in fasta format for use in assembly evaluation.
Expand Down Expand Up @@ -84,15 +85,21 @@ results # Finished results (configurable, see below)
# Other nextflow hidden files, eg. history of pipeline runs and old logs.
```

As written above, the pipeline also allows users to execute only assembly or only annotation. To execute assembly:
As written above, the pipeline also allows users to execute only assembly or only annotation. To execute assembly (reference genome optional):
```bash
nextflow run arete -entry assembly --input_sample_table samplesheet.csv --reference_genome ref.fasta -profile docker
```

To execute annotation:
To execute QC on pre-existing assemblies (reference genome optional):

```bash
nextflow run fmaguire/arete -entry annotation --input_sample_table samplesheet.csv -profile docker
nextflow run arete -entry assembly_qc --input_sample_table samplesheet.csv --reference_genome ref.fasta -profile docker
```

To execute annotation of pre-existing assemblies:

```bash
nextflow run arete -entry annotation --input_sample_table samplesheet.csv -profile docker
```

## Updating the pipeline
Expand Down

0 comments on commit 46d9706

Please sign in to comment.