A flexible, open-access workflow to facilitate the identification of cell clones that have desirable CRISPR-Cas9-induced gene edits.
Running the analysis below will take about an hour on a laptop.
Create a project, upload the submission spreadsheet associated with this project from data/submission_spreadsheets/demo.xlsx
into the WebApp.
conda activate geneditid
cd ~/GenEditID/
./scripts/start_webapp.sh
Go to http://localhost:8080
Create a new project, note the GEPID
, if it is your first project it will be GEP00001
.
The active environment, the one you are currently using, is shown in parentheses () or brackets [] at the beginning of your command prompt.
(geneditid) $
if not, activate it using conda activate geneditid
before running the next steps.
While waiting for the submission spreadsheet to load, open a new terminal and download the public fastq files. Don’t forget to activate conda’s environment using conda activate geneditid
before merging the reads.
cd ~/GenEditID/
./scripts/get_data_demo.sh
All the fastq files related to this project will be stored in the PROJECTS/demo
folder. Combine paired-end reads by merging reads to generate .fqjoin.gz
files for amplicount
analysis (seqkit
needs to be installed).
cd PROJECTS/demo
../../scripts/run_mergereads.sh
Move the joined fastq files into the PROJECTS/GEPID/fastq/
replacing GEPID
with the identifier of this project.
mv *.fqjoin.gz ../GEP00001/fastq/.
amplicount
analysiscd ~/GenEditID
source venv/bin/activate
cd PROJECTS/GEP00001
geneditid_run_amplicount
Replace GEPID
with the identifier of this project.
View the output of the analysis in the output result file GenEditID/PROJECTS/GEP00001/amplicount.csv
./scripts/start_webapp.sh
GenEditID/PROJECTS/GEP00001/geneditid_plots/coverage.html
GenEditID/PROJECTS/GEP00001/geneditid_plots/impacts.html
GenEditID/PROJECTS/GEP00001/geneditid_plots/koscores.html
GenEditID/PROJECTS/GEP00001/geneditid_plots/targeted_search.html
To analyse project SRP198941_SLX15021_GEP00005, use this submission spreadsheet data/submission_spreadsheets/SRP198941_SLX15021_GEP00005.xlsx
, download the fastq files using ./scripts/get_data_SRP198941_SLX15021_GEP00005.sh
and go to the cd PROJECTS/SRP198941_GEP00005
project to combine paired-end reads.
To analyse project SRP199742_SLX15026_GEP00009, use this submission spreadsheet data/submission_spreadsheets/SRP199742_SLX15026_GEP00009.xlsx
, download the fastq files using ./scripts/get_data_SRP199742_SLX15026_GEP00009.sh
and go to the cd PROJECTS/SRP199742_GEP00009
project to combine paired-end reads.
To analyse project SRP199742_SLX15025_GEP00010, use this submission spreadsheet data/submission_spreadsheets/SRP199742_SLX15025_GEP00010.xlsx
, download the fastq files using ./scripts/get_data_SRP199742_SLX15025_GEP00010.sh
and go to the cd PROJECTS/SRP199742_GEP000010
project to combine paired-end reads.