Low complexity: Difference between revisions
Jump to navigation
Jump to search
Added links |
No edit summary |
||
(3 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
Low complexity regions (LCRs) in a [[ | Low complexity regions (LCRs) in a [[Protein sequence|protein sequence]] are subsequences of biased composition. There are three main sources of LCRs are [[Cryptic repeats|cryptic]], [[Tandem repeats|tandem]] and [[Interspersed repeats|interspersed repeats]] <ref>Alb M.,et al.Detecting cryptically simple protein sequences using the SIMPLE algorithm. Bioinformatics 2002;18:672-678.</ref>. | ||
Regions with a low complexity sequence are characterised as having an unusual composition and can be recognised by simple visual inspection. This unusual composition can create problems when you are searching for sequence similarity in BLAST. BLAST provides the option to add a filter to remove low complexity sequences. This filter is important as it will prevent the occurrence of artificial hits. | |||
=== Reference === | === Reference === | ||
<references /> | <references /> |
Latest revision as of 10:26, 26 October 2017
Low complexity regions (LCRs) in a protein sequence are subsequences of biased composition. There are three main sources of LCRs are cryptic, tandem and interspersed repeats [1].
Regions with a low complexity sequence are characterised as having an unusual composition and can be recognised by simple visual inspection. This unusual composition can create problems when you are searching for sequence similarity in BLAST. BLAST provides the option to add a filter to remove low complexity sequences. This filter is important as it will prevent the occurrence of artificial hits.
Reference
- ↑ Alb M.,et al.Detecting cryptically simple protein sequences using the SIMPLE algorithm. Bioinformatics 2002;18:672-678.