No taxonId provided.
stringtie align.bam \ -G .improvedAnno.gtf \ -e \ -o transcriptome.gtf \ -A gene.abundance.tab Note: -G a reference gene model annotation in gtf format. It refers to the ".improvedAnno.gtf" file. -e switch off discovery of novel transcripts. -o output a transcriptome file in gtf format containing transcript expression information. -A output a gene abundance file in tsv format.
perl gather.expression.for.co-analysis.pl \ --python /usr/bin/python \ --prepDE /opt/software/stringtie-2.1.4/prepDE.py \ --readLen 101 \ --transcriptomeFile transcriptome.gtf \ --geneAbundanceFile gene.abundance.tab \ --outputGeneExprFile gene.expr.tsv \ --outputTrsptExprFile transcript.expr.tsv Note: --python a python program with full path. --prepDE a python script named "prepDE.py" to extract the read count information directly from the "transcriptome.gtf" file generated by StringTie (run with the -e parameter). prepDE.py can be downloaded from the stringTie2 offical website. If your python is Python 3 version, only the "prepDE.py3" script is available. --readLen a number corresponding to RNA-seq read length. --transcriptomeFile a transcriptome file from stringTie2 in the previous step. --geneAbundanceFile a gene abundance file from stringTie2 in the previous step. --outputGeneExprFile output file containing gene expression data that is available for co-analysis. --outputTrsptExprFile output file containing transcript expression data that is available for co-analysis.
python rmats.py \ --gtf .improvedAnno.gtf \ --b1 b.txt \ --readLength 101 \ --fixed-event-set .fromGTF \ --statoff \ --od outputDir \ --tmp /tmp / Note: --gtf reference gene model annotation in gtf format. --b1 a text file containing a comma separated list of the BAM files for RNA-seq sample, for example "/tmp/align.bam" --readLength a number corresponding to RNA-seq read length, for example 101. --fixed-event-set a directory where the five alternative splicing event annotation files exist. The five alternative splicing files are downloaded in the previous step and must be named "fromGTF.A3SS.txt", "fromGTF.A5SS.txt", "fromGTF.RI.txt", "fromGTF.MXE.txt" and "fromGTF.SE.txt". --statoff switch off statistic model. --od a directory where rMATS outputs the result files including JCEC.raw.input.[asType].txt and JC.raw.input.[asType].txt. --tmp a temporary directory for work.
perl gather.splicing.for.co-analysis.pl \ --asIdToNumericId .asId.to.numId.tsv \ --jcecA3SS outDir/JCEC.raw.input.A3SS.txt \ --jcecA5SS outDir/JCEC.raw.input.A5SS.txt \ --jcecMXE outDir/JCEC.raw.input.MXE.txt \ --jcecRI outDir/JCEC.raw.input.RI.txt \ --jcecSE outDir/JCEC.raw.input.SE.txt \ --jcA3SS outDir/JC.raw.input.A3SS.txt \ --jcA5SS outDir/JC.raw.input.A5SS.txt \ --jcMXE outDir/JC.raw.input.MXE.txt \ --jcRI outDir/JC.raw.input.RI.txt \ --jcSE outDir/JC.raw.input.SE.txt \ --outputJCECtsv jcec.tsv \ --outputJCtsv jc.tsv Note: --asIdToNumericId a mapping file from alternative spicing event to numeric identifer. It is downloaded in the previous 4th step. --jcec[asType] the read count files of rMATS JCEC outputs in the previous 5th step. --jc[asType] the read count files of rMATS JC outputs in the previous 5th step. --outputJCECtsv the combined file for the 5 JCEC read count files. --outputJCtsv the combined file for the 5 JC read count files.