Low complexity: Difference between revisions

From The School of Biomedical Sciences Wiki
Jump to navigation Jump to search
Nnjm2 (talk | contribs)
Cleaned text.
Nnjm2 (talk | contribs)
No edit summary
 
Line 1: Line 1:
Low complexity regions (LCRs) in a [[Protein sequence|protein sequence]] are subsequences of biased composition. There are three main sources of LCRs are [[Cryptic repeats|cryptic]], [[Tandem repeats|tandem]] and [[Interspersed repeats|interspersed repeats]] <ref>Alb M.,et al.Detecting cryptically simple protein sequences using the SIMPLE algorithm. Bioinformatics 2002;18:672-678.</ref>.
Low complexity regions (LCRs) in a [[Protein sequence|protein sequence]] are subsequences of biased composition. There are three main sources of LCRs are [[Cryptic repeats|cryptic]], [[Tandem repeats|tandem]] and [[Interspersed repeats|interspersed repeats]] <ref>Alb M.,et al.Detecting cryptically simple protein sequences using the SIMPLE algorithm. Bioinformatics 2002;18:672-678.</ref>.  


Regions with a low complexity sequence are characterised as having an unusual composition and can be recognised by simple visual inspection. This unusual composition can create problems when you are searching for sequence similiarty in BLAST. BLAST provides the option to add a filter to remove low complexity sequences. This filter is important as it will prevent the occurrence of artificial hits.  
Regions with a low complexity sequence are characterised as having an unusual composition and can be recognised by simple visual inspection. This unusual composition can create problems when you are searching for sequence similarity in BLAST. BLAST provides the option to add a filter to remove low complexity sequences. This filter is important as it will prevent the occurrence of artificial hits.  


=== Reference   ===
=== Reference ===


<references />
<references />

Latest revision as of 10:26, 26 October 2017

Low complexity regions (LCRs) in a protein sequence are subsequences of biased composition. There are three main sources of LCRs are cryptic, tandem and interspersed repeats [1].

Regions with a low complexity sequence are characterised as having an unusual composition and can be recognised by simple visual inspection. This unusual composition can create problems when you are searching for sequence similarity in BLAST. BLAST provides the option to add a filter to remove low complexity sequences. This filter is important as it will prevent the occurrence of artificial hits.

Reference

  1. Alb M.,et al.Detecting cryptically simple protein sequences using the SIMPLE algorithm. Bioinformatics 2002;18:672-678.