A flexible, open-access workflow to facilitate the identification of cell clones that have desirable CRISPR-Cas9-induced gene edits.
Running the analysis below will take about an hour on a laptop.
Create a project, upload the submission spreadsheet associated with this project from data/submission_spreadsheets/demo.xlsx into the WebApp.
conda activate geneditid
cd ~/GenEditID/
./scripts/start_webapp.sh
Go to http://localhost:8080
Create a new project, note the GEPID, if it is your first project it will be GEP00001.
The active environment, the one you are currently using, is shown in parentheses () or brackets [] at the beginning of your command prompt.
(geneditid) $
if not, activate it using conda activate geneditid before running the next steps.
While waiting for the submission spreadsheet to load, open a new terminal and download the public fastq files. Don’t forget to activate conda’s environment using conda activate geneditid before merging the reads.
cd ~/GenEditID/
./scripts/get_data_demo.sh
All the fastq files related to this project will be stored in the PROJECTS/demo folder. Combine paired-end reads by merging reads to generate .fqjoin.gz files for amplicount analysis (seqkit needs to be installed).
cd PROJECTS/demo
../../scripts/run_mergereads.sh
Move the joined fastq files into the PROJECTS/GEPID/fastq/ replacing GEPID with the identifier of this project.
mv *.fqjoin.gz ../GEP00001/fastq/.
amplicount analysiscd ~/GenEditID
source venv/bin/activate
cd PROJECTS/GEP00001
geneditid_run_amplicount
Replace GEPID with the identifier of this project.
View the output of the analysis in the output result file GenEditID/PROJECTS/GEP00001/amplicount.csv
./scripts/start_webapp.sh
GenEditID/PROJECTS/GEP00001/geneditid_plots/coverage.html
GenEditID/PROJECTS/GEP00001/geneditid_plots/impacts.html
GenEditID/PROJECTS/GEP00001/geneditid_plots/koscores.html
GenEditID/PROJECTS/GEP00001/geneditid_plots/targeted_search.html
To analyse project SRP198941_SLX15021_GEP00005, use this submission spreadsheet data/submission_spreadsheets/SRP198941_SLX15021_GEP00005.xlsx, download the fastq files using ./scripts/get_data_SRP198941_SLX15021_GEP00005.sh and go to the cd PROJECTS/SRP198941_GEP00005 project to combine paired-end reads.
To analyse project SRP199742_SLX15026_GEP00009, use this submission spreadsheet data/submission_spreadsheets/SRP199742_SLX15026_GEP00009.xlsx, download the fastq files using ./scripts/get_data_SRP199742_SLX15026_GEP00009.sh and go to the cd PROJECTS/SRP199742_GEP00009 project to combine paired-end reads.
To analyse project SRP199742_SLX15025_GEP00010, use this submission spreadsheet data/submission_spreadsheets/SRP199742_SLX15025_GEP00010.xlsx, download the fastq files using ./scripts/get_data_SRP199742_SLX15025_GEP00010.sh and go to the cd PROJECTS/SRP199742_GEP000010 project to combine paired-end reads.