This page is old and discontinued.
Please use www.e-crisp.org for research.
Florian Heigwer, Marco Breinig, Tianzuo Zhan, Michael Boutros
Programming: Grainne Kerr, Oliver Dreier, Johanna Kratzer
Ideas: Marco Breinig, Tianzuo Zhan, Jan Winter, Dirk Brüggemann
How to cite
|Heigwer, F., Kerr, G. & Boutros, M. E-CRISP: fast CRISPR target site identification. Nat. Methods 11, 122-123 (2014).|
E-CRISP is an online tool to design and evaluate Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR).
E-CRISP has been optimized using fast and accurate algorithms to design CRISPR gRNA sequences to target any nucleotide sequence ranging from single exons to entire genomes. Special emphasis in the design process has been given to usability in experimental applications. E-CRISP not only checks for target specificity of the putative designs but also assesses their genomic context (e.g. exons, transcripts, CpG islands).
|The input options are divided into 6 sections, each dealing with a different aspect of the design process|
|Organism||The organism the designs should be created for. The databases are pre-built for each organism. The genome release is indicated in the dropdown menu.|
|Target Sequence||The sequence the CRISPR should be designed to target. Enter an Ensembl ID, a gene symbol or a sequence in FASTA format. If a FASTA sequence is given, the locus can be stated in the header in the form "chrom:X:1..1000" (here, a sequence that originates from the first 1000 bases of the X chromosome). If the location is not stated in the header (the text after ">"), the program does not check the genomic context.|
|Design Purpose||In this section, the user can specify the experimental purpose of the CRISPR. Depending on the purpose, different regions of the input sequence will be targeted. Purposes include knock-out experiments, N-terminal tagging, C-terminal tagging and CRISPR double nicking.|
|Gene annotation filtering||In this section, the user can filter the output results, based on gene annotation information. For example, all results which do not target an exon can be excluded from the output, or the user can specify which exon to target.|
|Off-target Analysis||In this section, the user can specify parameters to search for off-target effects (regions that the design targets outside of the input query sequence).|
|Output||In this section, the user can specify which output files are produced. If the user expects a lot of CRISPR designs to be returned (e.g. when inputting a large number of sequences at once), the user can switch off the image and the HTML output table.|
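The locus convention for FASTA headers described under "Target Sequence" could be read as in the following minimal sketch. The function name `parse_locus` and the regular expression are assumptions made for illustration; they are not taken from the E-CRISP source code.

```python
import re
from typing import Optional, Tuple

# Assumed pattern for the "chrom:X:1..1000" header convention described above.
LOCUS_RE = re.compile(r"chrom:(?P<chrom>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)")


def parse_locus(header: str) -> Optional[Tuple[str, int, int]]:
    """Return (chromosome, start, end) from a FASTA header, or None if absent."""
    match = LOCUS_RE.search(header.lstrip(">"))
    if match is None:
        # No locus in the header: the genomic context cannot be checked.
        return None
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        return None  # treat an inverted range as "no usable locus" (assumption)
    return match["chrom"], start, end


if __name__ == "__main__":
    print(parse_locus(">chrom:X:1..1000 my sequence"))
    print(parse_locus(">my sequence"))
```

A header without a recognisable locus returns `None`, which corresponds to the case where no genomic context is checked.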
A summary of the design process.
|An HTML table is returned, where each row indicates a CRISPR alignment.|
|Name||The ID of the CRISPR. This is of the form: ID of the input sequence_randomNumber_randomNumber.|
|Nucleotide Sequence||The gRNA sequence|
|Rank||The rank of the CRISPR. A higher rank indicates a more specific design.|
|Target||The gene that is targeted by this gRNA. If a fasta sequence is given as input with no chromosome location information, E-CRISP cannot search the annotation databases, and no target gene will be returned.|
|Match String||A coloured match string, which indicates at a glance how good the alignment is: a green "M" for a match, an "X" for a mismatch and an "I" for an insertion in the gRNA.|
|Number of Hits||The number of locations this CRISPR design targets, or, the number of times this CRISPR appears in the output table (one for each target).|
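The "Match String" column can be illustrated with a small sketch. It assumes that the gRNA and the target are already aligned to equal length and that a "-" in the target marks a position where the gRNA carries an extra base. The function name `match_string` is hypothetical, and the colouring is omitted.

```python
def match_string(grna_aligned: str, target_aligned: str) -> str:
    """Encode an alignment as M (match), X (mismatch) or I (insertion in the gRNA)."""
    if len(grna_aligned) != len(target_aligned):
        raise ValueError("aligned sequences must have equal length")
    symbols = []
    for g, t in zip(grna_aligned.upper(), target_aligned.upper()):
        if t == "-":
            symbols.append("I")  # gap in the target: extra base in the gRNA
        elif g == t:
            symbols.append("M")
        else:
            # Any other difference is reported as a mismatch (assumption:
            # the help text does not define a separate symbol for deletions).
            symbols.append("X")
    return "".join(symbols)


if __name__ == "__main__":
    print(match_string("ACGTA", "AC-TT"))
```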
A genome browser image of the CRISPR designs in their genomic context. This allows the user to visually inspect where the CRISPR designs lie in the input sequence. Off-targets (where the CRISPR targets outside of the input sequence) are not shown in this image.
There is also an option to output a GFF file (http://www.ensembl.org/info/website/upload/gff.html) and an annotated sequence file (GenBank file: http://www.ncbi.nlm.nih.gov/Sitemap/samplerecord.html).
Scoring: All scores are normalized to a maximum reachable score of 100 %.
|Specificity Score (S-score)||Start with 100. For every off-target, subtract (20-mismatches)/iteration.|
|Annotation Score (A-score)||Start with zero. For every hit exon, add 5/exon count. For every hit CpG island, subtract 1. For every start codon hit, add 1. For every stop codon hit, add 1. For every CDS hit, add 5/CDS count. For every gene hit, add 1.|
|Efficacy Score (E-score)||Add 1 if the last 6 bp have a GC content higher than 70 %. Subtract 1 if the entire sequence has a GC content > 80 %. Add 1 if the sequence is preceded by a G. Add 1 if there are GG in front of the target sequence (opposite the PAM). Add the micro-homology score (it is higher when the sequence tends to give out-of-frame deletions).|
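The annotation and efficacy rules above can be sketched as follows. This is a simplified illustration, not the E-CRISP implementation: all function and parameter names are hypothetical, "5/exon count" is read as 5 divided by the number of exons, the "last 6 bp" are taken as the final six bases of the target sequence as written, and the GG rule, the micro-homology term and the normalization to 100 % are left out because the text does not specify them precisely.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty sequence)."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def annotation_score(exon_hits: int, exon_count: int, cds_hits: int,
                     cds_count: int, cpg_hits: int, start_codon_hits: int,
                     stop_codon_hits: int, gene_hits: int) -> float:
    """Raw (un-normalized) A-score following the rules listed above."""
    score = 0.0
    if exon_count:
        score += exon_hits * 5 / exon_count   # 5/exon count per hit exon
    if cds_count:
        score += cds_hits * 5 / cds_count     # 5/CDS count per hit CDS
    score -= cpg_hits                         # each CpG island hit costs 1
    score += start_codon_hits + stop_codon_hits + gene_hits
    return score


def efficacy_score(target: str, upstream_base: str = "") -> int:
    """Raw E-score using only the GC-content and preceding-G rules."""
    score = 0
    if gc_fraction(target[-6:]) > 0.70:
        score += 1
    if gc_fraction(target) > 0.80:
        score -= 1
    if upstream_base.upper() == "G":
        score += 1
    return score


if __name__ == "__main__":
    print(annotation_score(1, 5, 1, 5, 0, 0, 0, 1))
    print(efficacy_score("ATATATATATATATGCGCGC", upstream_base="G"))
```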
E-CRISP not only identifies whether your input sequence has a CRISPR target site, it also annotates this site with genomic annotation information, such as which gene, transcript or exon lies at the targeted site, if any. It also checks for off-targets in the rest of the mRNA, transcripts or chromosomal DNA of that organism. To do this, you must select an organism, so that the correct genomic annotation databases and off-target databases can be searched.
Every putative binding site found is annotated with its genomic context, i.e. whether it is contained within an exon, coding sequence, transcript, gene, CpG island etc. Annotating many putative binding sites requires an efficient search of genome annotation databases. To maximize efficiency and shorten runtime, E-CRISP uses a binary interval tree, which stores all genome annotations for the respective organism.
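To illustrate why an interval structure speeds up annotation lookups, here is a minimal sketch of a centered interval tree that answers "which annotations contain this position?". It is not E-CRISP's own data structure; the class, the function names and the example annotations are hypothetical, and coordinates are assumed to be inclusive.

```python
from typing import List, Optional, Tuple

Interval = Tuple[int, int, str]  # (start, end, label), inclusive coordinates


class Node:
    def __init__(self, center: int, here: List[Interval],
                 left: Optional["Node"], right: Optional["Node"]) -> None:
        self.center = center
        # Intervals that contain the center, in two sort orders.
        self.by_start = sorted(here, key=lambda iv: iv[0])
        self.by_end = sorted(here, key=lambda iv: iv[1], reverse=True)
        self.left = left
        self.right = right


def build(intervals: List[Interval]) -> Optional[Node]:
    """Recursively split intervals around the median endpoint."""
    if not intervals:
        return None
    points = sorted(p for start, end, _ in intervals for p in (start, end))
    center = points[len(points) // 2]
    here = [iv for iv in intervals if iv[0] <= center <= iv[1]]
    left = [iv for iv in intervals if iv[1] < center]
    right = [iv for iv in intervals if iv[0] > center]
    return Node(center, here, build(left), build(right))


def query_point(node: Optional[Node], pos: int) -> List[Interval]:
    """Return all intervals that contain pos."""
    hits: List[Interval] = []
    while node is not None:
        if pos < node.center:
            for iv in node.by_start:      # all end at or after the center
                if iv[0] > pos:
                    break
                hits.append(iv)
            node = node.left
        elif pos > node.center:
            for iv in node.by_end:        # all start at or before the center
                if iv[1] < pos:
                    break
                hits.append(iv)
            node = node.right
        else:
            hits.extend(node.by_start)
            break
    return hits


if __name__ == "__main__":
    annotations = [(100, 500, "gene:A"), (150, 200, "exon:A1"),
                   (300, 350, "exon:A2"), (480, 520, "CpG")]
    tree = build(annotations)
    print(sorted(label for _, _, label in query_point(tree, 160)))
```

Each query descends only one branch of the tree, so it avoids scanning every annotation of the organism for every putative binding site.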
Check if the CRISPR targets any foreign, exogenously introduced sequences. You can select from a list of sequences commonly introduced in the lab, or paste the sequence into the text area.
This is a scheme to rank the CRISPRs according to the specificity of the gRNA alignment and the number (and specificity) of off-targets.
In other words, a sequence with a high rank is likely much more specific and has fewer, if any, off-targets compared to a sequence with a low rank.
A CRISPR is given priority rank if it aligns 100% specifically and has no off-targets. For each off-target with 100% specificity, a penalty of 20 is deducted. The specificity of the off-target should also be considered. A CRISPR may align to an off-target position but map very unspecifically (with a high number of mismatches). If the off-target has mismatches, the impact of the off-target is reduced by scaling the penalty by the number of mismatches.
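The penalty scheme described here can be sketched roughly as follows. The sketch assumes that each off-target costs 20 minus its number of mismatches and that the score is not allowed to drop below zero; the "/iteration" factor of the S-score is not defined in the text and is left out. The function name `specificity_score` is hypothetical.

```python
from typing import Iterable


def specificity_score(off_target_mismatches: Iterable[int]) -> int:
    """Start at 100 and deduct a mismatch-scaled penalty per off-target."""
    score = 100
    for mismatches in off_target_mismatches:
        # A perfect off-target (0 mismatches) costs the full 20 points;
        # each mismatch reduces the penalty by one (assumed scaling).
        score -= max(0, 20 - mismatches)
    return max(0, score)  # clamping at zero is an assumption


if __name__ == "__main__":
    designs = {"design_a": [], "design_b": [0], "design_c": [0, 3]}
    ranked = sorted(designs, key=lambda name: specificity_score(designs[name]),
                    reverse=True)
    print(ranked)
```

Sorting by this score in descending order puts designs without off-targets first, which matches the idea of a "priority rank".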
E-CRISP can only return target information if location information is given in the input. If a gene name or symbol is given, this location information can be retrieved from the pre-built databases. If a FASTA sequence is given, the location must be given in the FASTA header in order to check for genomic context.
If it is an off-target match, this match may lie outside an annotated region, in which case only the chromosome is returned.
Recent publications have shown that the guide RNA target sequence may also be shorter or longer than 20 bp. This can influence the binding affinity and thus the efficiency of the CRISPR construct.
1. Gasiunas, G. & Siksnys, V. RNA-dependent DNA endonuclease Cas9 of the CRISPR system: Holy Grail of genome editing? Trends Microbiol. (2013). doi:10.1016/j.tim.2013.19.001
|Arabidopsis thaliana||TAIR10.22||Ensembl Plants|
|Brachypodium distachyon||v1.0.22||Ensembl Plants|
|Oryza sativa||IRGSP-1.0.22||Ensembl Plants|
|Populus trichocarpa||JGI2.0.21||Ensembl Plants|
|Schizosaccharomyces pombe||ASM294v2.22||Ensembl Fungi|
|Toxoplasma gondii GT1||ToxoDB-10.0||ToxoDB|
|Toxoplasma gondii ME49||ToxoDB-7.1.21||Ensembl|
|Ustilago hordei||MUHDB Ustilago hordei DataBase|
|Ustilago maydis||UM1.21||Ensembl Fungi|
|Zea mays||AGPv3.21||Ensembl Plants|
|22 Jan 2015, version 4.2||1. Many new organisms including livestock, crops, fungi and bacteria have been added to E-CRISP
2. All other databases were updated to the newest version available in Ensembl.
3. Annotations have to overlap with the PAM and not only some part of the sgRNA target
4. Targets can be identified in sequences which do not originate from the organism selected.
5. CpG islands are again properly shown in the result image
|01 August 2014, version 4.0||E-CRISP has been reworked to include the latest scientific results of recent months:
1. The following organisms have been added:
Toxoplasma gondii GT1 (ToxoDB-10.0)
Gasterosteus aculeatus (Three-spined stickleback, BROADS1.75)
Populus trichocarpa (Black cottonwood, JGI2.0.21)
Sus scrofa (Pig, Sscrofa10.2.75)
2. A new, more intuitive scoring system, divided into Specificity, Annotation and Efficiency scores, has been implemented.
Design results are sorted by Specificity, then Annotation and then Efficiency.
3. Three new default options have been added, guiding you quickly to the most sought-after results.
For further details visit the help pages and scroll down to the scoring scheme.
4. Off-target checks are now much more precise, because the PAM region (NAG or NGG) is now truly ambiguous.
An off-target is searched without the PAM but only considered valid if any PAM is present.
|26 May 2014, version 3.1||We are happy to announce a further major update to our E-CRISP web service.
Many new organisms have been added together with big changes in the web front end.
Hence you will find the new forum and many other new things here in the new BETA version 3.1.
|14 April 2014, version 3.0.2||In this minor update different default values for de-novo sgRNA design have been implemented, allowing for more designs to be found.|
|01 April 2014, version 3.0.1||The following organisms have been added to E-CRISP:
Toxoplasma gondii ME49
|20 March 2014, version 3||A new version of E-CRISP has been released (version 3.0). It includes more off-target search options and we implemented speed improvements to enable the design of sgRNAs against up to 200 genes in parallel.|
|The older version of E-CRISP can be reached Here
Boutros lab, E-CRISP-Version 4.2
For suggestions please contact us at firstname.lastname@example.org