Query Sequence 1
Query Sequence 1
Instruction:
In short, you will copy one of the sequences from the data set, use nucleotide (blastn) in BLAST to
identify it, and use the information from your search to answer the question below. Instructions for
copying and pasting sequences are provided with the dataset. Instructions for using BLAST is
provided in your tutorial.
Materials:
An internet connection
Make a bookmark in your web browser to the NCBI website: http://www.ncbi.nih.gov
A guide that shows how to do search and interpret the results is available at
ftp://ftp.ncbi.nlm.nih.gov/pub/factsheets/HowTo_BLASTGuide.pdf
An overview on the BLAST interface is available on
http://blast.ncbi.nlm.nih.gov/about/#home
A set of an "Query Sequences" (download in MyGuru) for you to identify.
Questions: Be sure to include the source of the information along with your answer. In this case, the
source will be the database or web page that provided the information.
1) How long is the sequence that was used to search the database?
Hint: This sequence is called "query" sequence because you used it to ask a question (or query)
of the database.
2661
2) What is the most likely identify of this sequence? What data supports this conclusion?
Hint: Refer to the slide in the BLAST tutorial that discuss the E-value.
Entamoeba histolytica HM-1:IMSS pyruvate phosphate
dikinase.The E-value for this sequeunce is 0.0 and the ident is
100%.
1 | B i o i n f o r m a ti c s T u t o r i a l S B L 1 0 3 3
Hint: Refer to the BLAST tutorial to find an overview of the GenBank nucleotide record. If more
than one organism matches, look at the E- value to determine the most likely match.
Entamoeba histolytica
7) If possible, give the names of three different organisms with significant E-values. If organism
is represented, then write down the name of that organism.
Hint: Refer to the BLAST tutorial slide on E-value for description.
- Entamoeba nuttali P19 pyruvate,phosphate dikinase = 0.0
- Entamoeba dispar SAW670 pyruvate, phosphate dikinase = 0.0
8) Use GenBank, PubMed, Gene, and UniGene records to find the possible function of the
protein that's specified by your DNA sequence. Describe what's known about the role of this
protein in the organism that provided the DNA.
- Carbon metabolism
- Metabolic pathways
2 | B i o i n f o r m a ti c s T u t o r i a l S B L 1 0 3 3
- Pyruvate metabolism
3 | B i o i n f o r m a ti c s T u t o r i a l S B L 1 0 3 3