Running a DNA BLAST
Home | Index | Assessment | Blog | Wiki
Clean up
You can now close all the other open windows (try not to close this one!).
Human DNA BLAST
In this section, we will run a DNA BLAST using the system at NCBI in the USA, and restrict the search to the human genome data.
Restricting a search to a given species, be it in a protein or DNA search, can be very useful as it can help reduce the 'noise' from similar or related sequences in other organisms.
Setting up a BLAST search
To run a DNA BLAST search, we need a sequence. For this search we will use:
TGGCTTGAAGTCCCCTCCTCCATCCCGGGGATCACGTTTTGCCTGGGGGAGCTGATCGCAAGACTAGGCAACCTCCAGCCAGTCCCTGGGTCGGGCGGATCCTCCCAGAGGTGGCACAATGGAGCGATCTCCAGGAGAGGGCCCCAGCCCCAGCCCCATGGACCAGCCCTCTGCTCCCTCCGACCCCACTGACCAGCCCCCCGCTGCTCACGCAAAGCCAGACCCAGGTTCTGGGGGCCAACCTGCTGGCCCTGGCGCGGCGGGTGAGGCCCTGGCGGTGCTGACTTCATTCGGGAGGCGGTTGCTGGTGCTGATACCTGTGTATTTGGCCGGGGCAGTGGGACTCAGCGTGGGTTTCGTGCTCTTCGGCCTCGCCCTCTACCTGGGCTGGCGCCGGGTCCGCGACGAGAAAGAACGGAGCCTTCGAGCAGCGAGGCAGCTACTGGACGACGAGGAGCAGCTCACTGCGAAAACTCTCTATATGAGTCATCGAGAGCTACCTGCCTGGGTCAGCTTCCCAGACGTGGAAAAGGCTGAATGGCTCAATAAGATTGTGGCCCAGGTCTGGCCCTTCCTGGGCCAGTATATGGAGAAGCTTCTGGCTGAAACTGTGGCTCCGGCTGTTAGGGGATCTAACCCCCATCTGCAAACATTTACATTTACACGAGTGGAACTGGGTGAAAAGCCATTGCGCATCATTGGAGTCAAGGTTCACCCAGGTCAGAGAAAAGAGCAGATCCTGCTGGACTTGAACATCAGCTATGTAGGTGATGTGCAGATTGATGTGGAAGTGAAGAAATATTTTTGCAAAGCAGGAGTCAAGGGCATGCAGCTACATGGCGTTTTGCGGGTGATACTGGAGCCACTCATTGGGGACCTTCCCTTCGTGGGGGCTGTGTCAATGTTCTTCATCCGACGCCCGACCCTAGACATCAACTGGACAGGGATGACCAACCTGCTGGATATCCCAGGACTTAGCTCACTCTCTGACACCATGATCATGGACTCCATTGCTGCCTTCCTCGTGTTGCCCAACCGATTACTGGTGCCCCTTGTGCCTGACCTTCAAGATGTGGCTCAGTTGCGTTCCCCTCTGCCCAGGGGCATTATTCGAATTCACCTGCTGGCTGCTCGAGGGCTGAGTTCCAAGGACAAATATGTGAAGGGCCTGATTGAGGGCAAGTCAGACCCATATGCACTTGTGCGTTTGGGTACCCAGACATTCTGCAGTCGTGTCATTGATGAAGAACTCAACCCACAGTGGGGAGAGACTTATGAGGTGATGGTACACGAGGTCCCAGGGCAGGAGATTGAAGTGGAGGTGTTCGACAAGGATCCAGATAAAGATGACTTTCTGGGCAGAATGAAGCTGGATGTAGGGAAGGTGTTACAGGCTAGCGTTCTGGATGATTGGTTCCCTCTACAAGGTGGGCAAGGCCAAGTTCACTTGAGGCTAGAATGGCTGTCACTTTTGTCAGATGCAGAGAAACTGGAGCAGGTTCTACAGTGGAATTGGGGAGTCTCCTCTCGACCAGATCCCCCGTCAGCTGCCATCTTAGTTGTCTACCTGGATCGGGCCCAGGATCTTCCTCTGAAGAAGGGGAACAAGGAACCCAACCCTATGGTACAACTGTCAATTCAGGATGTGACTCAGGAGAGCAAGGCTGTCTACAGTACCAACTGCCCAGTGTGGGAGGAAGCGTTCCGGTTCTTCCTACAAGACCCTCAAAGCCAGGAGCTCGATGTGCAAGTGAAGGATGATTCCAGGGCCCTGACTTTAGGAGCACTGACGCTGCCTCTGGCCCGCCTGCTGACTGCCCCAGAACTCATCCTGGACCAGTGGTTCCAGCTCAGCAGCTCTGGTCCAAACTCCAGACTCTATATGAAACTAGTCATGAGGATCCTGTACTTGGATTCATCAGAAATATGCTTCCCCACGGTGCCTGGTTGTCCTGGTGCTTGGGACGTGGACAGTGAGAATCCCCAGAGAGGCAGCAGTGTGGATGCCCCACCTCGACCCTGTCACACGACTCCTGATAGCCAGTTTGGGACTGAGCATGTGCTTCGGATCCATGTATTAGAGGCCCAGGACCTGATTGCCAAAGACCGTTTCTTGGGGGGACTGGTGAAGGGCAAGTCAGACCCCTATGTCAAACTAAAGTTGGCAGGACGAAGCTTCCGGAGCCATGTTGTTCGGGAAGATCTCAATCCCCGCTGGAATGAGGTTTTTGAGGTGATCGTCACATCAGTTCCAGGCCAAGAGCTAGAGGTTGAAGTCTTTGACAAGGACTTGGACAAGGATGATTTTCTGGGCAGGTGTAAAGTGCGTCTCACCACAGTCTTAAACAGTGGCTTCCTTGATGAGTGGCTGACCCTGGAGGATGTCCCATCTGGCCGCCTGCACTTGCGCCTGGAGCGTCTCACCCCCCGTCCCACTGCTGCTGAGTTAGAGGAGGTGCTGCAGGTGAATAGTTTGATCCAGACTCAGAAGAGTGCGGAGCTGGCTGCGGCCCTGCTATCCATCTATATGGAGCGGGCAGAGGACCTCCCGCTGCGAAAAGGCACCAAGCACCTCAGCCCTTATGCTACTCTCACTGTGGGAGATAGTTCTCATAAAACCAAGACTATTTCGCAAACTTCAGCCCCTGTCTGGGATGAGAGTGCCTCCTTTCTCATCAGGAAACCACACACTGAGAGCCTAGAGTTGCAGGTTCGGGGTGAGGGCACTGGCGTGCTGGGCTCATTATCCCTGCCCCTCTCAGAGCTCCTCGTGGCTGACCAGCTCTGCTTGGACCGCTGGTTTACACTCAGCAGTGGTCAGGGGCAGGTGCTACTGAGAGCACAGCTAGGGATCCTGGTGTCCCAGCACTCGGGAGTGGAAGCTCATAGCCACAGCTACAGCCACAGCTCCTCATCGCTGAGTGAAGAACCAGAGCTCTCGGGGGGACCCCCTCACATCACCTCCTCAGCCCCAGAGCTCCGGCAGCGCCTAACACATGTTGACAGTCCCCTTGAGGCTCCAGCCGGGCCTCTGGGCCAGGTGAAACTGACTCTGTGGTACTACAGTGAAGAACGAAAGCTGGTCAGCATTGTTCATGGTTGCCGGTCCCTTCGACAGAATGGACGTGATCCTCCTGATCCCTATGTGTCACTGTTGCTACTGCCAGACAAGAACCGAGGCACCAAGAGGAGGACCTCACAGAAGAAGAGGACCCTGAGTCCTGAATTTAATGAACGGTTTGAGTGGGAACTCCCCCTGGATGAGGCCCAGAGACGAAAGCTGGATGTCTCTGTCAAGTCTAATTCCTCCTTCATGTCAAGAGAGCGTGAGCTGCTGGGGAAGGTGCAGCTGGACCTAGCTGAGACAGACCTTTCCCAGGGTGTAGCCCGGTGGTATGACCTGATGGACAACAAGGACAAGGGCAGCTCCTAGGAGCTGGCGAGTCCCAGCCTGACTGCTCTGTCTTCCTGCCTTCGTCTCGCTCCATCACCGCCTCAATGTGATGAGCCTAAAGCTAGGGTCCAAGGGCAGAGCCTGTGCCCTTCAGCCCTTTCACCTAACAGGCCCATATTCGGGCCTTTGCCTGACCAAAGAGAAGAACCGTATGTTCCCTTTACTGCACGGCCTTTATCCTTCTGGGCCCCTGGGGCGGGGACCTGAGCTGGCTGTTTCCTGCTTTGCCTGCACATTGTTCTCCCTTCCTCCCAACTCCTCAGGGCCTTCTGTATCTGTGCCTGGCCAGTGGCAGCACTAGCAGTGGTATTAGCTTATGCCAAATACAGCTTTGGAAGGATCTTTTTTTCTTTAACTAGATGGTCACCTTCTTCCCTACCACACATGGGTGGGAAGGTGGACAGGCTAACCTCTCCAGCTGTGAGCCTCTTAGACTACTGCATGTAGCAAATGTTCAGCAGCTCAGGCCCCCATGTCCAGTTCTGTCCCCACTGTCCTCAACCCTGTCCTGAAAATTCTACTGCTTTGATGGCTGGGGCCAGTCTCTTGTCACTTTGGAAACTGAGGACGCGTGGATTCTACTCAAGCCTCCAAGTAGTGGCATATCAGTCTTGGAGCTCCTAGCTGGTGATACGGAGAGGGCTTTGGAGGACTTGGGACAGCAGGGCCAATTTTTTTGCCCAAGTGCCTAGGCTGCTAACTCACTGACTAGAACTTAATCTGGTACTTTACAGTTTTGCACCAACTCTGCCAAGCCACTGGATCTTACATTAAACATCATACTCAAACCAGCAAAAAAAA
(To copy the sequence highlight it by double click on it, and then use the copy command - BTW, the reason the sequence is a big long line of letters that goes off the edge of the screen is so you can double-click on it to select it.)
And asked to:
  1. identify the article (paper) in which it was first cloned
  2. establish if the DNA is involved in any diseases
NOTE: NCBI sometimes remembers changes you have made to the default parameters. Changes are highlighted in YELLOW. Therefore, before you run a search make sure the parameters are set to the values you want (N.B. The default expect value is 10, and the default database is 'non-redundant'). To return the page to 'default' click 'Reset page' near the top of the page.
  1. Go to the BLAST site http://www.ncbi.nlm.nih.gov/blast/
  2. On the BLAST page (which should have opened in a new window) in the 'Basic BLAST' section click the 'nucleotide blast' link.
  3. Enter (paste) the sequence into the search box.
  4. Change the database to 'Human genomic plus transcript (Human G+T)'
The completed nucleotide BLAST screen - make sure the database setting is correct
  1. Click on the BLAST button and wait....
BLAST search running - Sunday October 14, 2018 at 1:47:12 pm