Transcriptome Pipeline Welcome Basket
- Presentation on how Trinity works, and an introduction to the pipeline (from MDIBL Environmental Genomics 2018)
- Presentation focused more on the pipeline (from PAG 2018)
- Get the pipeline project folder
- The pipeline is currently set up for SLURM and TORQUE job handlers, with current set up being for IU Carbonate Cluster and PSC Bridges. If you want to run this on your own hardware, we can help you convert the scripts to run on your machine.
- READ THE READMEs. There are READMEs in every folder.
- There are two commands you must run first that are in that file. The first will set the email address so that you get notification when the jobs are done. The second will set the current directory as the working directory for the project.
- Place your input in the input_sequences folder. These should be trimmed and quality controlled sequences. This pipeline does not deal with that step, as it is very context and project dependent. Concatonate all your left reads into one file called left.fq; combine your right reads into right.fq.
- For each assembler folder, run each set of steps. If there are run files with the same prefix number (e.g. 1a and 1b), these can be run concurrently. Simply submit both with qsub. See README in each file for more information.
- For description, documentation, and licence for each program, go here. You can also use “module display $NAME” where $NAME is the name of the module listed in the run files.
- After you have finished the assemblies, run the ./Combine.sh script. This will combine the kmers, label them with the kmer and the assembler, and output them to the final_assemblies folder
- After all assemblers are done and there are <ASSEMBLER>.fa files for each in the final_assemblies files, run the final_assemblers/Combine.sh first, then run RunEvigenes. See the README for details on the output and next steps.
- If you have questions, feel free to email us at firstname.lastname@example.org.