Database
From The School of Biomedical Sciences Wiki
(Difference between revisions)
Line 1: | Line 1: | ||
− | In [[Bioinformatics|Bioinformatics]], databases are used for a number of functions, crucially including finding protein and gene sequence information. They are divided into two groups:<br> | + | In [[Bioinformatics|Bioinformatics]], databases are used for a number of functions, crucially including finding [[protein|protein]] and [[Gene|gene]] sequence information. They are divided into two groups:<br> |
== Primary Databases == | == Primary Databases == | ||
Line 6: | Line 6: | ||
*Databases of Proteins: [http://pir.georgetown.edu/ PIR], [http://mips.helmholtz-muenchen.de/proj/ppi/ MIPS], [http://www.ebi.ac.uk/uniprot SwissProt and TrEMBL] | *Databases of Proteins: [http://pir.georgetown.edu/ PIR], [http://mips.helmholtz-muenchen.de/proj/ppi/ MIPS], [http://www.ebi.ac.uk/uniprot SwissProt and TrEMBL] | ||
− | *Databases of DNA Sequences: [http://www.ebi.ac.uk/ena/ EMBL], [http://www.ncbi.nlm.nih.gov/genbank/ GenBank], [http://www.ddbj.nig.ac.jp/ DDBJ] | + | *Databases of DNA Sequences: [http://www.ebi.ac.uk/ena/ EMBL], [http://www.ncbi.nlm.nih.gov/genbank/ GenBank], [http://www.ddbj.nig.ac.jp/ DDBJ]<br> |
− | + | ||
− | <br> | + | |
== Secondary Databases == | == Secondary Databases == | ||
Contain selected sequences which are compiled by scientists analysing the available Primary Databases. Usually contain sequences chosen for their similarities: for example, [http://prosite.expasy.org/ ProSite] focuses on functional [[Motifs|motifs]]. | Contain selected sequences which are compiled by scientists analysing the available Primary Databases. Usually contain sequences chosen for their similarities: for example, [http://prosite.expasy.org/ ProSite] focuses on functional [[Motifs|motifs]]. |
Latest revision as of 02:24, 15 October 2013
In Bioinformatics, databases are used for a number of functions, crucially including finding protein and gene sequence information. They are divided into two groups:
Primary Databases
Aim to contain all protein or DNA sequence data - more is continually added as new discoveries are made. Include:
- Databases of Proteins: PIR, MIPS, SwissProt and TrEMBL
- Databases of DNA Sequences: EMBL, GenBank, DDBJ
Secondary Databases
Contain selected sequences which are compiled by scientists analysing the available Primary Databases. Usually contain sequences chosen for their similarities: for example, ProSite focuses on functional motifs.