Sequence Name: Burkholderia mallei GB8 horse 4, whole genome shotgun sequencing

GenBank Accession Number: AAHO00000000.1

GenInfo (GI) Number: 67105661

gi|67105661|ref|AAHO00000000.1| Burkholderia mallei GB8 horse 4, whole genome shotgun. Length: 5804858 bp, GC%: 68.47%

Total: 3 prophage regions have been identified, of which 0 are intact, 1 is incomplete, and 2 are questionable.

Region, Region Length, Completeness, Score, # Total Proteins, Region Position, Most Common Phage, GC %, Details
Region 1: 8Kb, incomplete, score 50, 6 proteins, position 2917208-2925283, most common phage: PHAGE_Burkho_2, GC%: 62.95%
Region 2: 20.6Kb, questionable, score 80, 11 proteins, position 3126171-3146786, most common prophage: PROPHAGE_Xantho_33913, GC%: 65.99%
Region 3: 35.6Kb, questionable, score 80, 24 proteins, position 5015821-5051448, most common prophage: PROPHAGE_Xantho_33913, GC%: 67.35%
Intact (score > 90)
Questionable (score 70-90)
Incomplete (score < 70)
Region: The number assigned to the region.
Region Length: The length of the sequence of that region (in bp).
Completeness: A prediction of whether the region contains an intact or incomplete prophage based on the above criteria.
Score: The score of the region based on the above criteria.
# Total Proteins: The number of ORFs present in the region.
Region Position: The start and end positions of the region on the bacterial chromosome.
Most Common Phage: The phage(s) with the highest number of proteins most similar to those in the region.
GC %: The percentage of GC nucleotides of the region.
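
As a minimal sketch of how the summary rows and the score thresholds fit together, the Python below parses the three region rows as they appear in this report and re-derives the completeness label from the score. The names `Region`, `parse_region`, and `classify` are illustrative assumptions and are not part of the report or of any tool's API. The regular expression assumes the comma-separated row layout used in this document.

```python
import re
from dataclasses import dataclass


@dataclass
class Region:
    number: int
    length_kb: float
    completeness: str
    score: int
    proteins: int
    start: int
    end: int
    most_common: str
    gc_percent: float


# Pattern for the summary rows as laid out in this report
# (an assumption about this text layout, not an official format).
ROW = re.compile(
    r"Region (\d+): ([\d.]+)Kb, (\w+), score (\d+), (\d+) proteins, "
    r"position (\d+)-(\d+), most common (?:pro)?phage: ([^,\s]+), "
    r"GC%: ([\d.]+)%"
)


def parse_region(line: str) -> Region:
    """Turn one summary row into a Region record."""
    m = ROW.match(line)
    if m is None:
        raise ValueError(f"unrecognized region row: {line!r}")
    n, kb, comp, score, prot, start, end, phage, gc = m.groups()
    return Region(int(n), float(kb), comp, int(score), int(prot),
                  int(start), int(end), phage, float(gc))


def classify(score: int) -> str:
    """Apply the thresholds listed above: >90, 70-90, <70."""
    if score > 90:
        return "intact"
    if score >= 70:
        return "questionable"
    return "incomplete"


ROWS = [
    "Region 1: 8Kb, incomplete, score 50, 6 proteins, position 2917208-2925283, most common phage: PHAGE_Burkho_2, GC%: 62.95%",
    "Region 2: 20.6Kb, questionable, score 80, 11 proteins, position 3126171-3146786, most common prophage: PROPHAGE_Xantho_33913, GC%: 65.99%",
    "Region 3: 35.6Kb, questionable, score 80, 24 proteins, position 5015821-5051448, most common prophage: PROPHAGE_Xantho_33913, GC%: 67.35%",
]

regions = [parse_region(row) for row in ROWS]
for r in regions:
    span_bp = r.end - r.start + 1  # inclusive coordinates
    print(r.number, r.completeness, classify(r.score), span_bp)
```

For all three rows the label derived from the score agrees with the Completeness value reported in the row, and the inclusive span computed from Region Position is consistent with the rounded Region Length.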

Criteria for scoring prophage regions (as intact, questionable, or incomplete):
Method 1:

  1. If the number of hits to a particular phage organism in this table is at least 100% of the total number of CDS in the region, the region is given a total score of 150. If it is less than 100%, Methods 2 and 3 are used.
Method 2:
  1. If the number of hits to a particular phage organism in this table is more than 50% of the total number of CDS in the region, that phage organism is considered the major potential phage for the region. The percentage of the region's total proteins that belong to that phage organism in this table is calculated and multiplied by 100, and the percentage of the region's length covered by that phage organism in this table is calculated and multiplied by 50 (the encapsulation capability of the phage head is taken into account).
Method 3:
  1. If any of the specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin') are present, the score will be increased by 10 for each keyword found.
  2. If the size of the region is greater than 30 Kb, the score will be increased by 10.
  3. If there are at least 40 proteins in the region, the score will be increased by 10.
  4. If all of the phage-related proteins and hypothetical proteins constitute more than 70% of the total number of proteins in the region, the score will be increased by 10.
The total score from Method 2 is compared with the total score from Method 3, and the larger of the two is taken as the total score of the region.
If the region's total score is less than 70, it is marked as incomplete; if it is between 70 and 90, it is marked as questionable; if it is greater than 90, it is marked as intact.
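
The Method 3 rules and the final comparison can be sketched in Python as follows. This is an illustration under stated assumptions, not the tool's actual implementation: each keyword is counted once per region, matching is a case-insensitive substring search over the annotation text, "30 Kb" is taken as 30,000 bp, and the caller supplies the count of phage-related plus hypothetical proteins. The Method 2 score is passed in as a number because the text above does not say how its two terms are combined. All function names are hypothetical.

```python
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber",
    "coat", "transposase", "portal", "terminase", "protease", "lysin",
)


def method3_score(annotations, region_length_bp, phage_or_hypothetical_count):
    """Score a region using the four Method 3 rules.

    annotations: one annotation string per protein in the region.
    phage_or_hypothetical_count: number of those proteins that are
    phage-related or hypothetical (supplied by the caller).
    """
    text = " ".join(annotations).lower()
    # Rule 1: +10 per keyword present (assumption: counted once each).
    score = 10 * sum(1 for kw in PHAGE_KEYWORDS if kw in text)
    # Rule 2: +10 if the region is longer than 30 Kb.
    if region_length_bp > 30_000:
        score += 10
    # Rule 3: +10 if the region has at least 40 proteins.
    n_proteins = len(annotations)
    if n_proteins >= 40:
        score += 10
    # Rule 4: +10 if phage-related and hypothetical proteins exceed 70%.
    if n_proteins and phage_or_hypothetical_count / n_proteins > 0.70:
        score += 10
    return score


def total_score(method2, method3):
    """The larger of the two method scores is the region's total score."""
    return max(method2, method3)


def classify(score):
    """Map a total score to a completeness label."""
    if score > 90:
        return "intact"
    if score >= 70:
        return "questionable"
    return "incomplete"


# Toy input, invented for illustration only (not a region from this report).
toy = [
    "phage portal protein",
    "IS1477 transposase",
    "conserved hypothetical protein",
    "putative lipoprotein",
]
m3 = method3_score(toy, region_length_bp=35_628, phage_or_hypothetical_count=3)
print(m3, classify(total_score(0, m3)))
```

Substring matching is the simplest reading of "keywords are present"; it can over-count when a keyword occurs inside an unrelated word, so a stricter tokenized match may be preferable in practice.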

Hits against Virus and Prophage Database
Hits against Bacterial Database or GenBank File

Region 1, total 9 CDS

#, CDS Position, BLAST Hit, E-Value, Sequence
CDS 2: complement(2917586..2918419), PROPHAGE_Xantho_33913: IS1477 transposase; BMAGB8_2807; phage(gi77747809), E-value: 4e-88
CDS 3: complement(2918443..2918706), PROPHAGE_Xantho_33913: IS1477 transposase; BMAGB8_2808; phage(gi21230912), E-value: 6e-28
CDS 4: 2919496..2919810, PHAGE_Burkho_2: gp7, putative addiction module killer protein; BMAGB8_2809; phage(gi134288691), E-value: 3e-54
CDS 5: 2919813..2920169, PHAGE_Burkho_2: gp6, putative addiction module antidote protein; BMAGB8_2810; phage(gi134288715), E-value: 4e-60
CDS 6: complement(2920213..2921268), PHAGE_Burkho_2: gp5, phage portal protein, pbsx family; BMAGB8_2811; phage(gi134288732), E-value: 0.0
CDS 8: complement(2921984..2922505), PROPHAGE_Escher_Sakai: ClpXP protease specificity-enhancing factor; BMAGB8_2813; phage(gi15833355), E-value: 3e-24
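
The "Most Common Phage" column is defined above as the phage(s) with the highest number of similar proteins in the region. As a small sketch of that tally, the Python below counts the Region 1 hits listed above by phage name. The helper name `most_common_phages` is illustrative, and returning ties in sorted order is an assumption.

```python
from collections import Counter

# (CDS number, phage name of the BLAST hit) for Region 1,
# copied from the rows above.
REGION1_HITS = [
    (2, "PROPHAGE_Xantho_33913"),
    (3, "PROPHAGE_Xantho_33913"),
    (4, "PHAGE_Burkho_2"),
    (5, "PHAGE_Burkho_2"),
    (6, "PHAGE_Burkho_2"),
    (8, "PROPHAGE_Escher_Sakai"),
]


def most_common_phages(hits):
    """Return every phage name tied for the highest hit count."""
    counts = Counter(name for _, name in hits)
    if not counts:
        return []
    top = max(counts.values())
    return sorted(name for name, c in counts.items() if c == top)


print(most_common_phages(REGION1_HITS))  # ['PHAGE_Burkho_2']
```

The result matches the most common phage reported for Region 1 in the summary.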

Region 2, total 13 CDS

#, CDS Position, BLAST Hit, E-Value, Sequence
CDS 2: complement(3139998..3140291), PHAGE_Ralsto_phiRSA1: transposase IRSO15-like; BMAGB8_1140; phage(gi145708108), E-value: 2e-42
CDS 3: complement(3140299..3141132), PROPHAGE_Xantho_33913: IS1477 transposase; BMAGB8_1141; phage(gi77747809), E-value: 3e-87
CDS 4: complement(3141156..3141419), PROPHAGE_Xantho_33913: IS1477 transposase; BMAGB8_1142; phage(gi21230912), E-value: 6e-28
CDS 5: 3141515..3142126, PHAGE_Staphy_phiPV83: transposase; BMAGB8_1143; phage(gi9635722), E-value: 9e-12
CDS 6: complement(3142239..3143465), putative lipoprotein; BMAGB8_1144, E-value: 0.0
CDS 7: 3143622..3143753, conserved hypothetical protein; BMAGB8_1145, E-value: 0.0
CDS 8: 3143794..3143994, conserved hypothetical protein; BMAGB8_1146, E-value: 0.0
CDS 9: 3144060..3144323, PROPHAGE_Xantho_33913: IS1477 transposase; BMAGB8_1147; phage(gi21230912), E-value: 6e-28
CDS 10: 3144347..3145180, PROPHAGE_Xantho_33913: IS1477 transposase; BMAGB8_1148; phage(gi77747809), E-value: 4e-88
CDS 11: 3145202..3145525, conserved hypothetical protein; BMAGB8_1149, E-value: 0.0
CDS 12: 3145536..3146738, PHAGE_Equid__9: envelope glycoprotein J; BMAGB8_1150; phage(gi216905924), E-value: 1e-04

Region 3, total 29 CDS

#, CDS Position, BLAST Hit, E-Value, Sequence
CDS 3: 5023306..5026092, PHAGE_Parame_AR158: hypothetical protein AR158_C785L; BMAGB8_A1408; phage(gi157953975), E-value: 1e-42
CDS 4: 5026168..5026911, conserved hypothetical protein; BMAGB8_A1409, E-value: 0.0
CDS 5: complement(5027070..5027314), DNA-binding protein; BMAGB8_A1410, E-value: 0.0
CDS 6: 5027500..5028081, conserved hypothetical protein; BMAGB8_0274, E-value: 0.0
CDS 7: complement(5028249..5029991), PHAGE_Pectob_My1: YadA domain-containing protein; BMAGB8_0275; phage(gi410491156), E-value: 4e-10
CDS 8: 5030122..5030385, PROPHAGE_Xantho_33913: IS1477 transposase; BMAGB8_0276; phage(gi21230912), E-value: 6e-28
CDS 9: 5030409..5031242, PROPHAGE_Xantho_33913: IS1477 transposase; BMAGB8_0277; phage(gi77747809), E-value: 4e-88
CDS 10: complement(5031253..5031861), site-specific recombinase, phage integrase family; BMAGB8_0278, E-value: 0.0
CDS 12: complement(5032124..5032267), hypothetical protein; BMAGB8_0279, E-value: 0.0
CDS 13: complement(5032264..5033886), PHAGE_Burkho_phi1026b: gp59; BMAGB8_0280; phage(gi38707949), E-value: 2e-45
CDS 14: complement(5034145..5035662), PHAGE_Ectoca_1: EsV-1-65; BMAGB8_0281; phage(gi13242537), E-value: 5e-12
CDS 15: complement(5035686..5036507), PHAGE_Ectoca_1: EsV-1-65; BMAGB8_0282; phage(gi13242537), E-value: 5e-06
CDS 16: 5036633..5037703, PROPHAGE_Escher_MG1655: DNA strand exchange and recombination protein with protease and nuclease activity; BMAGB8_0283; phage(gi16130606), E-value: 2e-131
CDS 17: 5037703..5038686, PHAGE_Salmon_1: putative bacteriophage tail fiber protein; Lambda gpN homolog; BMAGB8_0284; phage(gi169257204), E-value: 6e-05
CDS 18: 5038978..5039589, conserved hypothetical protein; BMAGB8_0285, E-value: 0.0
CDS 19: 5039669..5040835, succinyl-CoA synthetase beta chain; BMAGB8_0286, E-value: 0.0
CDS 20: 5040932..5041861, succinate-CoA ligase, alpha subunit subfamily; BMAGB8_0287, E-value: 0.0
CDS 21: 5042012..5042725, integral membrane protein, TerC family; BMAGB8_0288, E-value: 0.0
CDS 22: 5042893..5043462, PHAGE_Deftia_14: hypothetical protein; BMAGB8_0289; phage(gi282599056), E-value: 9e-05
CDS 23: 5043700..5045487, O-antigen polymerase family protein; BMAGB8_0290, E-value: 0.0
CDS 24: 5046026..5046499, PHAGE_Human__2: EBNA-1; BMAGB8_0291; phage(gi139424506), E-value: 8e-06
CDS 25: 5046518..5046739, conserved hypothetical protein; BMAGB8_0292, E-value: 0.0
CDS 26: 5046736..5047101, TonB domain protein; BMAGB8_0293, E-value: 0.0
CDS 28: complement(5047461..5048294), PROPHAGE_Xantho_33913: IS1477 transposase; BMAGB8_0294; phage(gi77747809), E-value: 4e-88

Length: 5804858 bps
Phages: 3

