Lindel

Lindel is a logistic regression model for accurate prediction of indels induced by Cas9 cleavage. It takes a 65 bp sequence (cleavage site at position 30) as input and predicts the frequencies of all possible deletions <30 bp, all 1-2 bp insertions, and insertions larger than 2 bp as a single group.

The scheme below shows the structure of the Lindel model. From the input sequence it extracts sequence features (the one-hot encoded 20 bp target sequence) and microhomology features (all possible 1-4 bp microhomology tracts in the input sequence). The model uses these features to make three predictions: first, the overall ratio of insertions to deletions; then the frequencies of the individual insertions and of the individual deletions.

The predicted insertion and deletion frequencies are then weighted by the overall insertion-to-deletion ratio from the first prediction.
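
As a rough illustration of the feature encoding and the weighting step described above, here is a minimal Python sketch. It is not taken from the Lindel source code: the function names `one_hot` and `combine`, the class labels, and the choice to represent the insertion/deletion ratio as a single `deletion_fraction` between 0 and 1 are all assumptions made for the example.

```python
# Illustrative sketch only; names and data layout are hypothetical,
# not the actual Lindel implementation.

BASES = "ACGT"


def one_hot(seq):
    """One-hot encode a DNA sequence as a flat list of 0/1 values (4 per base)."""
    encoded = []
    for base in seq.upper():
        if base not in BASES:
            raise ValueError(f"unexpected base: {base!r}")
        # One slot per possible base; only the matching slot is set to 1.
        encoded.extend(1 if base == b else 0 for b in BASES)
    return encoded


def combine(deletion_fraction, deletion_freqs, insertion_freqs):
    """Weight per-class frequencies by the overall deletion/insertion split.

    deletion_fraction: assumed share of indels that are deletions (0..1).
    deletion_freqs, insertion_freqs: dicts mapping a class label to its
    frequency, each assumed to sum to 1 within its own group.
    """
    if not 0.0 <= deletion_fraction <= 1.0:
        raise ValueError("deletion_fraction must be between 0 and 1")
    combined = {}
    for label, freq in deletion_freqs.items():
        combined[label] = deletion_fraction * freq
    for label, freq in insertion_freqs.items():
        combined[label] = (1.0 - deletion_fraction) * freq
    return combined


# Toy numbers, chosen only to show the arithmetic.
features = one_hot("ACGTACGTACGTACGTACGT")  # 20 bases -> 80 values
profile = combine(
    0.75,
    {"del_a": 0.5, "del_b": 0.5},
    {"ins_a": 1.0},
)
```

With these toy inputs, the two deletion classes each receive 0.75 × 0.5 and the single insertion class receives 0.25 × 1.0, so the combined profile still sums to 1.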

For more information, please see our paper (Wei Chen et al.).

Lindel modeling layout showing sequence features and microhomology features used to predict indel frequencies