CRISPR Genome Analysis Tool

Welcome to the Iowa State University Crop Bioengineering Consortium's CRISPR Genome Analysis Tool.

This tool works in two steps:

Identify potential target sites for CRISPR gene editing in DNA sequences
Optionally, use identified target sequences from step 1 to search a genome of interest for potential off-target matches

Click here for publication citation

Genome

Chromosome

Select Gene ID from list:

Enter Gene ID:

Please enter or select an input sequence

Target Length

Max Allowed Nucleotide Repeats

Minimum GC Content %

Maximum GC Content %

Genome to Search for Off Targets (Optional)

Analyze

Clear Input

receiving {! progressCounter !} of {! progressMax !} search results finished processing {! progressMax !} searches

	strand	length	sequence	unique	position	max repeat	GC Content
Input a sequence to search in the textarea above
Nothing found for provided input sequence and parameters
{! 1 + $index !}	{! match.strand !}	{! match.match.length !}	{! match.normalizedMatch !}	{! match.isUnique ? 'yes' : 'no' !}	{! match.positions[0] !} – {! match.positions[1] !}	{! match.maxRepeats !}	{! match.cgContent !} %

Get CSV

How This Tool Works:

For each genome available to search above, the genome sequence has been parsed in advance for valid CRISPR target sequences. All found target sequences were exported to a SQL database along with some relevant metadata. Additionally, the transcript data for each gene has also been stored in the SQL DB for easy retrieval when a user opts to select the input sequence from a specific gene.
In the tool interface, Javascript is used to parse both the input sequence and its complement for valid CRISPR targets based on the user-provided search parameters (i.e., Target Length, GC Content and Allowed Nucleotide Repeats). The results are rendered in the browser and, for each found target sequence, a request is sent to the webserver to search the specified genome database for potential off-target matches.
For each request sent from the web browser to the webserver in the previous step, the server queries the database for the target genome with the user-provided search parameters.
Search results are filtered and sorted primarily by an identity score between an input subsequence (bases 6-18 for 21 base sequences or bases 6-20 for 23 base sequences) and the corresponding subsequences stored in the database. Additional sorting is performed based on an identity score between the subsequence at bases 2-5 of the input sequence and the corresponding subsequences in the database.
Finally, the webserver returns the search results to the browser which updates the existing table. Clicking any table row reveals more details about the result.

Additional design features under development but not currently available include:

Graphical and tabular genomic position information for target & off-targets
Restriction Enzyme cut sites within targets
Currently the tool searches exclusively for 'G' as the first base on the five prime end of targets, we may consider expanding that to other nucleotides
Additional indexed genomes

Please report any questions or concerns about this tool to the administrator/developer

This tool uses genomic data from The U.S. Department of Energy (DOE) Joint Genome Institute (JGI) Genome Portal

Citation
Vincent A Brazelton Jr, Scott Zarecor, David A Wright, Yuan Wang, Jie Liu, Keting Chen, Bing Yang & Carolyn J Lawrence-Dill (2015)
A quick guide to CRISPR sgRNA design tools, GM Crops & Food, 6:4, 266-276, DOI: 10.1080/21645698.2015.1137690