Low complexity

From The School of Biomedical Sciences Wiki
Revision as of 09:31, 26 October 2017 by 160469932 (talk | contribs)
Jump to navigation Jump to search

Low complexity regions (LCRs) in a protein sequence are subsequences of biased composition. There are three main sources of LCRs are cryptic, tandem and interspersed repeats. [1]

Regions with low complexity sequence are characterised as having an unusual composition. This unusal composition can create problems when you are searching for sequence similiarty in BLAST. You can often recognise a low complexity sequence from simple visual insepection. In BLAST there is an option to use a filter to remove low complexity sequences inorder to prevent artifical hits.

Reference  

  1. Alb M.,et al.Detecting cryptically simple protein sequences using the SIMPLE algorithm. Bioinformatics 2002;18:672-678.