gi|00000000|ref|CP010151.1|  Escherichia coli strain D8, complete genome. 5130646, gc%: 50.39%

CDS_POSITION                       BLAST_HIT                                                                            EVALUE              prophage_PRO_SEQ
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

#### region 1 ####
267386..267397                     attL                                                                                 N/A                 ATCCGGTAAAAC
complement(269585..270223)         PHAGE_Escher_RCS47_NC_042128: exonuclease A; RG41_01230; phage(gi100018)             1.28e-108           EscherichiacoliMSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNIPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS
complement(270220..271482)         PHAGE_Escher_RCS47_NC_042128: hypothetical protein; RG41_01235; phage(gi100019)      2.25e-169           EscherichiacoliMIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVESQSTGRCGVVSAHVFEQGVKAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGRPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLCQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFFQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS
complement(271479..272387)         PHAGE_Escher_RCS47_NC_042128: hypothetical protein; RG41_01240; phage(gi100020)      3.66e-146           EscherichiacoliMKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNATQVKQVLFGEKGVAQHLKPGTAVMVSSTIASADAQEIATALAGFGLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGSEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS
272583..273350                     PHAGE_Escher_RCS47_NC_042128: DNA helicase; RG41_01245; phage(gi100021)              6.97e-94            EscherichiacoliMIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGRCIYLDAGTTTLAIAQHLTHMEPLTVVTNDFVIADYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKLDLSLLVAKNE
complement(273401..274057)         PHAGE_Escher_RCS47_NC_042128: hypothetical protein; RG41_01250; phage(gi100100)      1.53e-62            EscherichiacoliMPSTRYQKINAHHYRHIWTVGDIHGDYQLLQSRLHQLSFCPETDLLISVGDNIDRGPESLNVLRLLNQPWFISVKGNHEAMALDAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDTIKYVIAHADYPGKEYQFGKEIAESELLWPVDRVQKSLNSELQKINGADFFIFGHMMFDNIQTFANQIYIDTGSPKSGRLSFYKIK
complement(274163..276730)         PHAGE_Bacter_B124_14_NC_016770: putative mismatch repair protein; RG41_01255; phage(gi374671664)     1.66e-12            EscherichiacoliMQESIDKDLSNHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMERQQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLENVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV
276627..276638                     attR                                                                                 N/A                 ATCCGGTAAAAC
276877..278334                     PHAGE_Pseudo_D3_NC_002484: integrase; RG41_01260; phage(gi9635596)                   2.09e-07            EscherichiacoliMKMTLTQSIIINKLSIDVKPDLDQQGRVAYLPNPERKPYIITDNHRDSPVGFGVKISATKKTYIIQRRVASADKRPLTGGKAPKQVIRSTIGNVSDFANIDQARDAARKFVETMKLTRRNPNAVKREAEASELTISEVFAQYRSHLMGRSKPAKPNTLNVLNKAENRLKEWENLRVKDLTGNEILRKFDEIASRARTAAEQTFRWVNVAVRHAIEIEAGNAQTQQRQPTLSYNPFSILTIQKKFRTRSQLEESYRAKGVRNPLSPKDTLGRFLTALHNKRSFNRLGCDYLLLTVLTGARKEETASLCWREALTEEEARTTSYVDLENRIIRFYDTKNRNDHELPICDATKRILEDRRDLVNETEKRADKRKWVFPARSSRSKVGHYSDSKSLREYICQEAGIGKLGMHDLRRTFGRVAEGLTSYSVVKRLLNHRNTTDPTERYAMPDNDRIYEALQSIELHMLMTAPELYNTLLASAKYQPLPTN

#### region 2 ####
complement(1904122..1904415)       PHAGE_Halomo_phiHAP_1_NC_010342: hypothetical protein; RG41_09200; phage(gi167832364)     1.96e-07            EscherichiacoliMDISPLLHALCAVAAQILVGLFTGNWAYGAIAGCTFFIAREHTQAEYRWIEMFGHGKRMNMPWWGGFDPRAWDVASLMDFAVPVVACLLVWLLVNRG
complement(1904428..1904706)       PHAGE_Entero_JenP2_NC_028997: hypothetical protein; RG41_09205; phage(gi971763992)     1.15e-32            EscherichiacoliMKDLTLKFHDKLQFKAFLSSLDWEEDEDLQNKLLVDEIGFTYTETGVTEEGEPVCVRNDGYFVNIRILDDLFDVSVFSDYVVELETPLREWS
1904638..1904651                   attL                                                                                 N/A                 CCAGTCAAGAGATG
complement(1904703..1906763)       PHAGE_Entero_JenP1_NC_029028: tail fiber protein; RG41_09210; phage(gi971767090)     0.0                 EscherichiacoliMAVQISGVLKDGAGKPIQNCTIQLKAKRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGAITVYEDSKPGTLNDFLGAATEDDVRPEALYRFEKMVEEVARNAEAASQSAAAAKKSETAAASSRNAAKTSETNAGNSAKAAASSKTAAQNAATAAERSETNARASEEASADSEEASRRNAESAAENAGVATTKAREAAADATKAGQKKDEALSAATRAEKAADRAEVAAEVTAEPYANIVPPLPDVWIPFNDSLDMIAGFSPGYKKIAIGDDVVQVASDKQVNFSRASTATYINKSGELKTAEINEPRFECDGLLIEGQRTNYMLNSESPASWGKSSNMDVPETGTDSFGFTYGKFVCNDSLVGQTSAINMASIAATKSVDVSGDNKYVTTSCRFKTERQVRLRIRFDKYDGSATTFLGDAYIDTQTLEINMTGGAAGRITARVRKDKTTGWIFAEATIQAIDGELKIGSQIQYSPERGGATVSGDYIYLATPQVENGPCVSSFIISGGSATTRASDLVSIPTRNNLYKLPFTFLLEIHKNWDIAPNAAPRVWDIAAANTGQSAIAAINRGSGKLYMSLSNPSGLYVNSAATDVFAEKTTFGCIAKADGHFHVVTNGKAVNEVYCEYNGVTADKNIRFGGQTNTGERHLFGHIRNFRIWHKELNDRQLKEVV
complement(1906822..1910304)       PHAGE_Entero_DE3_NC_042057: recombination endonuclease subunit; RG41_09215; phage(gi100054)     0.0                 EscherichiacoliMGKGSSKGHTPREAKDNLKSSQMLSVIDAISEGPVEGPVDGLKSVLLNSTPVLDSEGNTNIFGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVVDNLPPRPFNIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQNCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPDNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISIGGRVLAVNNQTRTLTLDREITLPSSGTTLISLADGQGNPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFMLRLTVAADDGSERLVSTARTTETTYRFRQLALGNYRLTVRAVNAWGQQGDPASVSFRIAAPAAPSQIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETTARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAFVEAVGQPSDDASGYLNFFKGEIGKTHLAQELWTQIDNGQLAPDLAEIRTSITGVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGVENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQIFMNEVFLKYLTAPTITSGGNPPTFSLTPDGRLSAKNADISGNVNANSGTLNNVTINQNCRILGKLSANQIEGDIVKTVGKAFPRNGSYASGTITVTVYDDQAFDRQIVVPPVLFRGGKHENFNSNNQQSYWYSTCKLQVLKNGQEIFQQPATDVSRVFSSVIDMPAGHGHVTLTFNVSSYGANNWTPTTSISDLLVVVMKKSTAGISIS
complement(1910365..1910937)       PHAGE_Entero_DE3_NC_042057: hypothetical protein; RG41_09220; phage(gi100053)        1.46e-130           EscherichiacoliMKTGAEAIRALATQLPVFRQKLSDGWYQVRIAGRDVSTSGLTAQLHETLPDGAVIHIVPRVAGAKSGGVFQIVLGAAAIAGSFFTAGATLAAWGAAIGAGGMTGILFSLGASMVLGGVAQMLAPKARTPRTQTTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGSRVVSQEISTADEGDGGQVVVIGR
complement(1910934..1911677)       PHAGE_Entero_mEp460_NC_019716: tail fiber component; RG41_09225; phage(gi428782332)     3.28e-180           EscherichiacoliMTETESAILAHARRCAPAESCGFVVRAPEGERYFPCVNISGEPEAYFRMAPEDWLQAEMQGEIVALVHSHPGGLPWLSEADRRLQVQSDLPWWLVCRGAIHKFRCVPHLTGRRFEHGVTDCYTLFRDAYHLAGIEMPDFHREDDWWRNGQNLYLDNLEATGLYQVPLSAAQPGDVLLCCFGSSVPNHAAIYCGDGELLHHIPEQLSKRERYTDKWQRRTHSLWRHRAWRASAFTGIYNDLAAASTFV
complement(1911683..1912381)       PHAGE_Entero_DE3_NC_042057: hypothetical protein; RG41_09230; phage(gi100051)        1.69e-169           EscherichiacoliMQDIRQETLNECTRAEQSASVVLWEIDLTEVGGERYFFCNEQNEKGEPVTWQGRQYQPYPIQGSSFELNGKGTSTRPTLTVSNLYGMVTGMVEDLQSLVGGTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASFVLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYHGPAVADEYDQPTSDITKDKCSKCLSGCKFRNNVGNFGGFLSINKLSQ
complement(1912381..1912710)       PHAGE_Entero_DE3_NC_042057: hypothetical protein; RG41_09235; phage(gi100050)        4.46e-69            EscherichiacoliMKTFRWKVKPGMDVTSAPSVREVRFGDGYSQRAPAGLNADLKTYSVTLSVSREEATALESFLAEHGGWKAFLWTPPYGYRQIKVTCAKWSSRVSMLRVEFSAEFEQVVN
complement(1912707..1915268)       PHAGE_Entero_DE3_NC_042057: RNA polymerase sigma factor; RG41_09240; phage(gi100049)     0.0                 EscherichiacoliMAEPVGDLVVDLSLDAARFDEQMARVRRHFSGTESDAKKTAAVVEQSLSRQALAAQKAGISVGQYKAAMRMLPAQFTDVATQLAGGQSPWLILLQQGGQVKDSFGGMIPMFRGLAGAITLPMVGATSLAVATGALAYAWYQGNSTLSDFNKTLVLSGNQSGLTADRMLVLSRAGQAAGLTFNQTSESLSALVKAGVSGEAQIASISQSVARFSSASGVEVDKVAEAFGKLTTDPTSGLTAMARQFHNVTAEQIAYVAQLQRSGDESGALQAANEAATKGFDDQTRRLKENMGTLETWADRTARAFKSMWDAVLDIGRPDTAQEMLIKAEAAFKKADDIWNLRKDDYFVNDEARARYWDDREKARLALEAARKKAEQQSQQDKNAQQQSDTEASRLKYTEEAQKAYERLQTPLEKYTARQEELNKALKDGKILQADYNTLMAAAKKDYEATLKKPKQSGVKVSAGDRQEDSAHAALLTLQAELRTLEKHAGANEKISQQRRDLWKAESQFAVLEEAAQRRQLSAQEKSLLAHKDETLEYKRQLAALGDKVTYQERLNALAQQADKFAQQQRAKRAAIDAKSRGLTDRQAEREATEQRLKEQYGDNPLALNNVMSEQKKTWAAEDLLRGNWMAGLKSGWSEWEESATDSMSQVKSAATQTFDGIAQNMAAMLTGSEQNWRSFTRSVLSMMTEILLKQAMVGIVGSIGSAIGGAVGGGASASGGTAIQAAAAKLHFATGGFTGTGGKYEPAGIVHRGEFVFTKEATSRIGVGNLYRLMRGYATGGYVGGTGSPAQMRRSEGIRFEQNNNVVIQNDGTNGLPGPQMMKAVYDMARKGARDEIQAQMRDGGLFSGGGR
complement(1915261..1915695)       PHAGE_Entero_DE3_NC_042057: rnaseH; RG41_09245; phage(gi100048)                      2.55e-97            EscherichiacoliMFDGELSFALKLAREMGRPDWRAMLAGMSSTEYADWHRFYSTHYFHDVLLDMHFSGLTYTVLSLFFSDPDMHPLDFSLLNRREDDEEPEDDVLMQKAAGLTGGVRFDPDGNEVIPASPDMAGMTEDDVMLMTVSEGIAGGVRYG
complement(1915677..1916099)       PHAGE_Entero_DE3_NC_042057: dsDNA binding protein; RG41_09250; phage(gi100047)       9.54e-93            EscherichiacoliMFLKTESFEHNGVTVTLSELSALQRIEHLALMKQQAEQAESDSNRQVTVEDAIRTGAFVVAMSLWHNHPQKTKQPSMNEAVKQIEQEVLTTWPAEAISHAENVVYRLSGMYGFVVNDAPDQAEDSGPAEPVSAGKCSTVS
complement(1916115..1916855)       PHAGE_Entero_DE3_NC_042057: late promoter transcription; RG41_09255; phage(gi100046)     3.70e-169           EscherichiacoliMPTPNPLAPVKGAGTTLWVYNGSGDPYANPLSDNDWSRLAKIKDLTPGELTAESYDDSYLDDEDADWTATGQGQKSAGDTSFTLAWMPGEQGQQALLAWFNEGDTRAYKIRFPNGTVDVFRGWVSSIGKAVTAKEVITRTVKVTNVGRPSMAEDRSTVTAATGMTVTPASSSVVKGQSTTLTVAFQPEGATDKSFRAVSADKTKATVSVSGMTITVNGVAAGKVNIPVVSGNGELAAVAEITVTDS
complement(1916863..1917258)       PHAGE_Entero_DE3_NC_042057: loader of DNA helicase; RG41_09260; phage(gi100045)      3.21e-88            EscherichiacoliMKHTEFRAAVLDALEKHDTGATLFDGRPAVFDEADFPAIAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELDSWMESRIYPAMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM
complement(1917255..1917833)       PHAGE_Entero_DE3_NC_042057: ssDNA binding protein; RG41_09265; phage(gi100044)       5.87e-131           EscherichiacoliMAIKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASLVARETKVRRKLVKERARLKRATVKNPQARIKVNRGDLPVIRLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAVPLTTAFKQNIERIRRERLPKELGYALQHQLRMVIKR
complement(1917845..1918198)       PHAGE_Entero_DE3_NC_042057: RNA polymerase binding protein; RG41_09270; phage(gi100043)     1.63e-48            EscherichiacoliMRDFQNAFDAALAGVDSTIVEVMGISAQFTSGAQRGGEVHGVFDDPESLGFASSGIRIEGSNPSLFVLTDTVCAVRRGDTLTINGEMFWVDRVSPDDGGSCYLWLNRGQPPAASRRR
complement(1918191..1918565)       PHAGE_Entero_HK225_NC_019717: head assembly protein Fi; RG41_09275; phage(gi428782384)     3.04e-05            EscherichiacoliMATKEQNLKRLDELALILGREPDISGSAAEIAQRVAEWEEEMQSSGDDVQVMNMDIRERETAAHDVREETSGALTRIRVLTCLHLCGVDGETGESVELADVGRVILIMSSDAKTHVDGGMAVYA
complement(1918617..1919645)       PHAGE_Entero_DE3_NC_042057: clamp-loader subunit; RG41_09280; phage(gi100041)        7.31e-150           EscherichiacoliMGLFTTRQLLGYTEQKVKFRALFLELFFRRTVNFHTEEVMLDKITGKTPVAAYVSPVVEGKVLRHRGGETRVLRPGYVKPKHEFNYQQAVERLPGEDPSQLNDPAYRRLRIITDNLKQEEHAIVQVEEMQAVNAVLYGKYTMEGDQFEKIEVDFGRSTKNNITQGSGKEWSKQDRDTFDPTHDIDLYCDLASGLVNIAIMDGTVWRLLNGFKLFREKLDTRRGSNSQLETAVKDLGAVVSFKGYYGDLAIVVAKTSYIAEDGIEKRYLPDGMLVLGNTAADGIRCYGAIQDAQALSEGVVASSRYPKHWLTVGDPAREFTMTQSAPLMVLPDPDEFVVVQVK
complement(1919703..1920050)       PHAGE_Entero_DE3_NC_042057: clamp-loader subunit; RG41_09285; phage(gi100040)        4.44e-29            EscherichiacoliMVTKTITEQRAEVRIFAGNDPAHTATGSSGISSPTPALTPLMLDEATGKLVVWDGQKAGSAVGILVLPLEGTETALTYYKSGTFATEAIHWPESVDEHKKANAFAGSALSHAALP
complement(1920087..1921592)       PHAGE_Entero_DE3_NC_042057: RegA; RG41_09290; phage(gi100039)                        2.55e-135           EscherichiacoliMRRNLSHIIAAAFNEPLLLEPAYARVFFCALGREMGAASLSVPQQQVQFDAPGMLAETDEYMAGGKRPARVYRVVNGIAVLPVTGTLVHRLGGMRPFSGMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCNDTACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGNQFEALPAEVRQDMQQRIDAARRMFAEKVAMYTGLSVDAVTGTEAAVFEGQSGIEAGLADELINASDAISVMATALNSNVRGGTMPQLTATEAAAQENQRVMGILTCQEAKGREQLATMLAGQQGMSVEQARAILAAAAPQQPVASTQSEADRIMACEEANGREQLAATLAAMPEMTVEKARPILAASPQADAGPSLRDQIMALDEAKGAEAQAEQLAACPGMTVESARAVLAAGSGKAEPVSASTTAMFEHFMANHSPAAVQGGVAQTSADGDADVKMLMAMP
complement(1921582..1923174)       PHAGE_Entero_DE3_NC_042057: hypothetical protein; RG41_09295; phage(gi100038)        0.0                 EscherichiacoliMKRTPVLIDVNGVPLRESLSYNGGGAGFGGQMAEWLPPAQSADAALLPALRLGNARADDLVRNNGIAANAVALHKDHIVGHMFLISYRPNWRWLGMRETAAKSFVDEVEAAWSEYAEGMFGEIDVEGKRTFTEFIREGVGVHAFNGEIFVQPVWDTETTQLFRTRFKAVSPKRVDTPGHGMGNRFLRAGVEVDRYGRAVAYHICEDDFPFSGSGRWERIPRELPTGRPAMLHIFEPVEDGQTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATIESELDTEKAFEYIAGAPQEQKDNPLINILEKFSSWYDTNNVTLGGVKIPHLFPGDDLKLQTAQDSDNGFSALEQALLRYIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGRRKFIAARLATQMFSCWLEEALLRGIIRPPRARFDFYQARSAWSRAEWIGAGRMAIDGLKEVQESVMRIEAGLSTYEKELALMGEDYQDIFRQQVRESAERQKAGLSRPVWIEQAYQQQIAESRRPEEETTPRET
complement(1923171..1923377)       PHAGE_Entero_HK225_NC_019717: head-tail connector gpW; RG41_09300; phage(gi428782379)     1.67e-16            EscherichiacoliMVTVAELQALRQARLDLLTGKRVVSVQKDGRRIEYTAASLDELNRAINDAESVLGTTRCRRRPLGVRL
complement(1923361..1925289)       PHAGE_Entero_DE3_NC_042057: DNA polymerase; RG41_09305; phage(gi100036)              0.0                 EscherichiacoliMISDAQKAANAAGAIATGLLSLIIPVPLTTVQWANKHYYLPKESSYTPGRWETLPFQVGIMNCMGNDLIRTVNLIKSARVGYTKMLLGVEAYFIEHKSRNSLLFQPTDSAAEDFMKSHVEPTIRDVPALLELAPWFGRKHRDNTLTLKRFSSGVGFWCLGGAAAKNYREKSVDVVCYDELSSFEPDVEKEGSPTLLGDKRIEGSVWPKSIRGSTPKIKGSCQIEKAANESAHFMRFYVPCPHCGEEQYLKFGDDASPFGLKWEKNKPESVFYLCEHHGCVIHQSELDQSNGRWICENTGMWTRDGLMFFSARGDEIPPPRSITFHIWTAYSPFTTWVQIVYDWLDALKDPNGLKTFVNTTLGETWEEAVGEKLDHQVLMDKVVRYTAAVPARVVYLTAGIDSQRNRFEMYVWGWAPGEEAFLVDKIIIMGRPDEEETLLRVDAAINKKYRHADGTEMTISRVCWDIGGIDGEIVYQRSKKHGVFRVLPVKGASVYGKPVITMPKTRNQRGVYLCEVGTDTAKEILYARMKADPTPVDEATSYAIRFPDDPEIFSQTEAQQLVAEELVEKWEKGKMRLLWDNKKRRNEALDCLVYAYAALRVSVQRWQLDLAVLAKSREEETTRPTLKELAAKLSGGVNGYSR
complement(1925261..1925809)       PHAGE_Entero_DE3_NC_042057: RecA-like recombination protein; RG41_09310; phage(gi100035)     7.70e-64            EscherichiacoliMKVNKKRLAEIFNVDPRTIERWQSQGLPCASKGSKGIESVFDTAMAIQWYAQRETDIENEKLRKELDDLRAAAESDLQPGTIDYERYRLTKAQADAQELKNAREDGVVLETELFTFILQRVAQEISGILVRVPLTLQRKYPDISPSHLDVVKTEIAKASNVAAKAGENVGGWIDDFRRAEGS
complement(1925891..1926061)       DNA-packaging protein; RG41_09315                                                    N/A                 EscherichiacoliMLAVSDNYSRFRVLSVDPTGYGAATSRFFTIYENFSGKIVSVLLEYNFLFFLILHP
1926077..1926397                   hypothetical protein; RG41_09320                                                     N/A                 EscherichiacoliMVLAIMCRHKCVVYRHFTQRHHQMKIRNILAISLATSSFSCLAFKSSPNVLPGPTNQLTAVESKIIGHFYAPHSTLPGTTITGTCDASPVPGCTCPFCTMLRSQNR
1926285..1926638                   hypothetical protein; RG41_09325                                                     N/A                 EscherichiacoliMPHTVHYPEQPSQGHVTPPPSRDAPVRFVLCCVAKTDNIRIYLVFHDEFTQRLIEEGKMVSKSKAHCRRMLQALQQTRAGIFDQLENCQHTLPEYIAISSETSATLIHRVPPEKKKK
complement(1926761..1927087)       membrane protein; RG41_09330                                                         N/A                 EscherichiacoliMKRRLLLLFLLSVLAVGCSQQKADEPRQLVTVYPRYPEYAAANYIKGLVEVKFDIGADGTVTRIVFLRSEPHNLFRDEVVKAMAKWRFEKNRPCQGVKRQFIFTPSRP
1927287..1927400                   arginyl-tRNA synthetase; RG41_09335                                                  N/A                 EscherichiacoliMCIVFNYSRTLTQKEFPVGLRSWLMREYGDDTAQLKG
1927568..1927861                   PHAGE_Entero_DE3_NC_042057: hypothetical protein; RG41_09340; phage(gi100032)        3.87e-55            EscherichiacoliMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ
complement(1927893..1928354)       PHAGE_Entero_DE3_NC_042057: hypothetical protein; RG41_09345; phage(gi100031)        1.42e-105           EscherichiacoliMNRVTAIISALVICIIVCLSWAVNHYRDNAITYKAQRDKNARELTLANAAITDIQMRQRDVAALDAKYTKELADAKAENDALRDDVAAGRRRLHIKAVCQSVREATTASGVDNAASPRLADTAERDYFTLRERLITMQKQLEGTQKYINEQCR
complement(1928351..1928848)       PHAGE_Entero_SfI_NC_027339: lysozyme; RG41_09350; phage(gi849250312)                 1.02e-116           EscherichiacoliMPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTKAECKALLNKDLATVARQINPYIKVDIPETMRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRREIEREICLWGQQ
complement(1928848..1929063)       PHAGE_Entero_VT2phi_272_NC_028656: holin; RG41_09355; phage(gi966197762)             5.28e-37            EscherichiacoliMKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE
1929652..1930734                   porin; RG41_09360                                                                    N/A                 EscherichiacoliMKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYARLGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKNDRTDVTEANGDGFGFSTTYEYEGFGVGATYAKSDRTDGQVAYGKSKFNASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFGNNHIANKAQNFEAVAQYQFDFGLRPSVAYLQSKGKDLGVHGDRDLVKYVDVGATYYFNKNMSTFVDYKINLIDDSKFTKTAGIDTDDIVAVGLVYQF
complement(1930923..1931306)       PHAGE_Shigel_Stx_NC_029120: antitermination protein Q; RG41_09365; phage(gi985761278)     2.27e-71            EscherichiacoliMRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVCGMTFMSLASKHCCSDGYIGKRLQKAEGIIEGMLMALDIRLDMDIVANNSN
complement(1931392..1931532)       PHAGE_Entero_mEp237_NC_019704: hypothetical protein; RG41_09370; phage(gi435439319)     9.77e-13            EscherichiacoliMMFEFNMAELLRHRWGRLRLYRFLGSVLTDYRILKNYAKTLTGTGV
complement(1931529..1931891)       PHAGE_Salmon_118970_sal4_NC_030919: membrane-associated initiation of head vertex; RG41_09375; phage(gi100034)     5.08e-38            EscherichiacoliMNTYSITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE
complement(1931911..1932105)       protein ninF; RG41_09380                                                             N/A                 EscherichiacoliMLSPSQSLQYLKESIERASMCTEWILSRFSAYRRLPVKGMPSKSMLHMQKNARWKVWREHRLCG
complement(1932098..1932439)       PHAGE_Salmon_118970_sal4_NC_030919: hypothetical protein; RG41_09385; phage(gi100030)     5.53e-78            EscherichiacoliMDYSQLSDFEINRMVGDIIFKGLWASKPETSGNNTNKWYYGNADTTFEPLNHLPDYCNDPSASWPIIEKYRISILDQLTEWCVDAKGVSPIFDTRPLRAAMIVFLLMQEANNA
complement(1932442..1932618)       PHAGE_Entero_HK106_NC_019768: NinE protein; RG41_09390; phage(gi428783333)           3.53e-34            EscherichiacoliMRRQRRSITDIICENCKYLPTKRSRNKRKLIPKESDVKTFNYTAHLWDIRWLRYRARK
complement(1932615..1933142)       PHAGE_Entero_Sf101_NC_027398: DNA methylase; RG41_09395; phage(gi849122310)          1.33e-127           EscherichiacoliMTIKSNTPAHDKDCWQTPLWLFDALDIEFGFWLDSAASDKNALCAHWLTEDDDALNSEWVSHGAIWNNPPYSNIRPWVEKAAEQCIQQRQTVVMLVPEDMSVGWFSKALESVDEVRIITDGRINFIEPSTGLEKKGNSKGSMLLIWRPFISPRRMFTTVSKAALMAIGQGVRRAA
complement(1933139..1933579)       PHAGE_Entero_Sf101_NC_027398: NinB; RG41_09400; phage(gi849122309)                   2.41e-104           EscherichiacoliMKKLTFEIRSPAHQQNAIHAVQQILPDPTKPIVVTIQERNRSLDQNRKLWACLGDVSRQVEWHGRWLDAESWKCVFTAALKQQDVVPNLAGNGFVVIGQSTSRMRVSEFAELLELIQAFGTERGVKWSDEARLALEWKARWGDRAA
complement(1933653..1933943)       PHAGE_Escher_PA28_NC_041935: replication and recombination DNA helicase; RG41_09405; phage(gi100033)     2.23e-59            EscherichiacoliMTGKEAIIHYLGTHNSFCAPDVAALTGATVTSINQAAAKMARAGLLVIEGKVWRTVYYRFATKEEREGKMSTNLIFKECRQSATMKRILAVYGVKR
complement(1933940..1934641)       PHAGE_Escher_PA28_NC_041935: hypothetical protein; RG41_09410; phage(gi100032)       4.04e-168           EscherichiacoliMKNIAAQMVNFDREQMRRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELTCMTARINRGETIPEPVKQLPVMGGRPLNRVQALAKIAEIKAKLGLKGASV
complement(1934638..1935537)       PHAGE_Entero_DE3_NC_042057: DCTP pyrophosphatase; RG41_09415; phage(gi100024)        0.0                 EscherichiacoliMTNTAKILNFCRGNFAKQERNVADLDDGYARLSNMLLEAYSGADLTKRQFKVLLAILRKTYGWNKPMDRITDSQLSEITKLPVKRCNEAKLELVRMNIIKQQGGMFGPNKNISEWCIPQNEGKSPKTRDKTSLKLGDCYPSKQGDTKDTITKEKRKDYSSENSGESSDQPENDLSVVKPDAAIQSGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTACKPKLDLTNTDWIYGVDL
complement(1935570..1935866)       PHAGE_Stx2_vB_EcoP_24B_NC_027984: CII; RG41_09420; phage(gi937456259)                1.86e-62            EscherichiacoliMERTSYSKLSQRDVDRAETDLLINLSAITQRGLAKMIGCHESKISRTDWRFIASVLCAFGMASDISPISRAFKYALDGLTNKKRPAATERSEQIQMEF
complement(1936008..1936223)       PHAGE_Entero_VT2_Sakai_NC_000902: similar to C2 of bacteriophage L; RG41_09425; phage(gi9633420)     1.00e-47            EscherichiacoliMSNLRKYRESLNISQTTLAKAVGCTQGAIGHWESGRRFPDLKTCRALVACLNKLGAKVSLDDVFPPEHKAA
1936341..1936994                   PHAGE_Stx2_II_NC_004914: CI protein; RG41_09430; phage(gi32171122)                   4.16e-162           EscherichiacoliMKWYELARSRMKELGITQEKLAEELGMTQGGIGHWLRGSRHPSLSDIGVVFKYLGIDNISFNHDGTFSPVGEYSSAPVKKQYEYPVFSHVQAGMFSPELRTFTKGDAERLVSTTKKASDSAFWLEVEGNSMTAPTGSKPSFPDGMLILVDPEQAVEPGDFCIARLGGDEFTFKKLIRDSGQVFLQPLNPQYPMIPCNESCSVVGKVIASQWPEETFG
1937034..1937591                   hypothetical protein; RG41_09435                                                     N/A                 EscherichiacoliMAKIDDYQPSQVEVDKVLYCKKIVNFSGVKWKQKPSRSDMWLQAHIIPLDEDCIPIQGLKFELKWKPDQDSEPDDPISYPKINIIAFYHNKRVFAVDTYHFDKHTNSYKVDHPKYQDIIYGAHYHVYYEEAGYYSDRIAFPIEDDINPDDLVGYWNYFCKHLNITYSGRIPLPLEDESGQMGFGI
1937588..1938340                   PHAGE_Burkho_phi6442_NC_009235: gp47, conserved hypothetical protein; RG41_09440; phage(gi134288670)     2.15e-06            EscherichiacoliMMCSTVISQLGFECHPIGKTLRIISPFTYCDDGEHVGAFIREVNGRYLVSDRCDALMNMEARGISLTKKRLDEIRQLLLKEGAELNARGEIIAWATEKDVGAITSNIIRAGILASTLSLDWYQPVQAEKFESMVIDYLYHTELRDALSLRENVYGLSGHQITVPVTIKTDIPKYVFTSSVKHGGSWNSAYSLLGKLIDLKASSEEYNNRFVVIDSEAIGDQMQQLSLLFHESSQVLPFSKRETWVKRLAA
1938617..1938799                   PHAGE_Entero_YYZ_2008_NC_011356: hypothetical protein; RG41_09450; phage(gi209427747)     2.83e-38            EscherichiacoliMEQTGRLFKQRRLSTTWLKSQITQPHKLWDAMPKQPSQEELRDCIAKVYSGGIHVQKNRI
1938777..1939049                   PHAGE_Entero_VT2_Sakai_NC_000902: antitermination protein; RG41_09455; phage(gi9633417)     3.27e-58            EscherichiacoliMSRKTEFKGTAASRRRARRANLQSQEAISSDKLHRPTPSRVVLQCKLKPAMRAEVITLTTLTRKYEGSTCLPNVALYAAGYRKSKQLTAR
complement(1939066..1939647)       PHAGE_Salmon_118970_sal4_NC_030919: hypothetical protein; RG41_09460; phage(gi100016)     1.06e-104           EscherichiacoliMNNSWWQELMHFFLQGMTLKQLIHMLIILIILIIVMPVSVKEWINLHSPEILPHYWMYYILLFCVSYVLNGVVNSAYHAVTERIEVFAAQKRKSKEEKYVQDLFDSLTLGERAYLAFAVAANNQLQTEKGAHESISLLKKGLLVRRPPAVGYPDTDRFVIPESYRHECYIRFAGKADSLMDELIAQDKHGKNK
1939861..1940061                   PHAGE_Entero_DE3_NC_042057: exonuclease A; RG41_09465; phage(gi100018)               3.73e-43            EscherichiacoliMTTTIDTNQWCGQFKRCNGCKLQSECMVKPEEMFPVMEDGKYVDKWAIRTTAMIARELGKQNNKAA
1940244..1940612                   PHAGE_Entero_DE3_NC_042057: hypothetical protein; RG41_09470; phage(gi100017)        1.27e-85            EscherichiacoliMSNIKKYIIDYDWKASIEIEIDHDVMTEEKLHQINNFWSDSEYRLNKHGSVLNAVLIMLAQHALLIAISSDLNAYGVVCEFDWNDGNGQEGWPPMDGSEGIRITDIDTSGIFDSDDMTIKAA
1940685..1940849                   PHAGE_Escher_PA28_NC_041935: hypothetical protein; RG41_09475; phage(gi100023)       6.94e-34            EscherichiacoliMQYAIAGWPVAGCPSESLLERITRKLRDGWKRLIDILNQPGVPKNGSNNYGYPD
1940818..1940961                   PHAGE_Entero_SfI_NC_027339: host-killing protein; RG41_09480; phage(gi849250261)     2.17e-27            EscherichiacoliMDQTIMAIQTKFTIATFIGDEKMFREAVDAYKKWILILKLRSSKSIH
1941037..1941333                   PHAGE_Escher_PA28_NC_041935: DNA helicase; RG41_09485; phage(gi100021)               1.08e-65            EscherichiacoliMNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYIVETIAHHQADIDSEV
1941339..1942124                   PHAGE_Entero_933W_NC_000924: Bet protein; RG41_09490; phage(gi9632480)               0.0                 EscherichiacoliMSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA
1942121..1942801                   PHAGE_Escher_PA28_NC_041935: hypothetical protein; RG41_09495; phage(gi100019)       6.51e-169           EscherichiacoliMTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARALFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEISFVFGEQWR
1942798..1942980                   PHAGE_Entero_DE3_NC_042057: hypothetical protein; RG41_09500; phage(gi100011)        4.01e-37            EscherichiacoliMTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEKINNKREAVCTKHLLLS
1942953..1943144                   PHAGE_Escher_PA28_NC_041935: exonuclease A; RG41_09505; phage(gi100018)              2.77e-38            EscherichiacoliMHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLATSLSQKLEMMVAKAEADERDQV
1943141..1943224                   hypothetical protein; RG41_09510                                                     N/A                 EscherichiacoliMTTTECIFLAAGFIFCVLMLADMGLVQ
1943221..1943436                   PHAGE_Escher_PA28_NC_041935: hypothetical protein; RG41_09515; phage(gi100017)       1.65e-47            EscherichiacoliMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER
1943535..1943756                   PHAGE_Escher_PA28_NC_041935: hypothetical protein; RG41_09520; phage(gi100016)       1.32e-44            EscherichiacoliMADIIDSASEIEELQRNTAIKMRRLNYQTISATHCCECGDPIDERRRLVVQGCRTCASCQEDLELISKQRGSK
1943753..1944517                   PHAGE_Salmon_SJ46_NC_031129: hypothetical protein; RG41_09525; phage(gi100094)       7.14e-93            EscherichiacoliMSEINYQALREAAEKATCGEWSLEYGESRFDGDYALIHREVAGYIPICRIEGAHPESGFDEDFQMEQQANAEFIAAANPATVLALLDERERNQQYIKRRDQENEDIALTVGKLRVELEETKSKLNEQREYYEGVISDGGKRIAELEKSEEQLINERDHAESALADMYFAATGDEPEWSNWFGFSDAVDAVVDRIADLEAKQPSPVVPEGLVKAVRFYEQVKRENPPVETEAWKDAIDWVLKESCQAVNIDTNGD
1944519..1944773                   PHAGE_Entero_cdtI_NC_009514: Valyl-tRNA synthetase; RG41_09530; phage(gi148609417)     5.48e-45            EscherichiacoliMSTFTDKELIKEIKERISSLDVRDNVERRAYEIALASLEENPVAWLHSDNGLGIPAITRSKNIADSWLSKGWYVQPLYMPSQCQ
1944791..1945324                   PHAGE_Shigel_Ss_VASD_NC_028685: EaA protein; RG41_09535; phage(gi966200433)          5.90e-68            EscherichiacoliMPNPLSMYAVDAVAAIAEVRGWNACRAAMLHAGNFRENANSSTNNFREISETSTNSPVIPGEVLSAILKFARVRADFDDFDGDRRGIGDCLDEAEQELIVTINKYASQLAAEPIATNDVREQQTAVPPVPVIQADVAQAIENLRQKLVECNRYNYCADAVKGVEDACHAAMLQGKRE
1945326..1945535                   PHAGE_Escher_PA28_NC_041935: hypothetical protein; RG41_09540; phage(gi100013)       1.92e-40            EscherichiacoliMAIAASYTMHLYCDCRQCTEGVYPVPDFGEYIGTSWSGCAKEARKDGWRISKDKTRTFAPGHKVLRINT
1945532..1946272                   PHAGE_Entero_UAB_Phi20_NC_031019: hypothetical protein; RG41_09545; phage(gi100055)     3.58e-66            EscherichiacoliMTTITKERIELFIKNPLENGLTRGEQMELARIALASLEREQIRHEHAKWSDSTFGCVGPIGPLKHLSKEALEAAAEPDDLSEWADMQFLLWDAQRRAGISDAEITVAMEDKLKINMERQWPEPKDGEPRLHIKEPGNSPVIPDGLSTVCAEAYQVVGVMADALGVFGDAAVQKVLDNLSQQKLVHRDVLPFSLPVTPDGWVMVPKQVTPEISNAINVVGQRCTCGNCSQRLWDLLLDATQQGVNRG
1946265..1946549                   PHAGE_Entero_ST104_NC_005841: ORF6; RG41_09550; phage(gi46358654)                    6.93e-59            EscherichiacoliMANLQLAVKGEYFDAMIRGEKTEEYRLCNDYWKKRLVNRKHDRLIITKGYPKRDDSSRRVDVPYDGYEVKTITHPHFGDKPVKVYAIKVNIGTE
1946575..1946814                   PHAGE_Entero_SfI_NC_027339: hypothetical protein; RG41_09555; phage(gi849250253)     1.85e-50            EscherichiacoliMFRVIYPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRAIRRK
1946868..1946881                   attR                                                                                 N/A                 CCAGTCAAGAGATG
1946954..1947190                   excisionase; RG41_09560                                                              N/A                 EscherichiacoliMSRLITLQDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRNSRFVGTLAEPQLPINANPKLQRIIADGC
1947180..1948322                   PHAGE_Entero_HK022_NC_002166: integrase; RG41_09565; phage(gi9634144)                3.92e-75            EscherichiacoliMAARPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQVATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDRLQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGHNRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPEWQAIFDSVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQEKTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRGDQVSANTLTTAFKKAREKCGIKWEPGTAPTFHEQRSLSERLYREQGLDTQKLLGHKSRKMTDRYNDDRGKDWIIVDIKTA
complement(1948436..1949686)       isocitrate dehydrogenase; RG41_09570                                                 N/A                 EscherichiacoliMESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGAFKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGDAIIKNM
1949858..1950511                   23S rRNA pseudouridylate synthase; RG41_09575                                        N/A                 EscherichiacoliMRQFIISENTMQKTSFRNHKVKRFSSQRSTRRKPENQPTRVILFNKPYDVLPQFTDEAGRKTLKEFIPVQGVYAAGRLDRDSEGLLVLTNNGALQARLTQPGKRTGKIYYVQVEGIPTQDALEALRNGVTLNDGPTLPAGAEMVEEPEWLWPRNPPIRERKSIPTSWLKITLYEGRNRQVRRMTAHVGFPTLRLIRYAMGNYSLDNLANGEWRDATD
1950521..1950982                   PHAGE_Strept_YDN12_NC_028974: hydrolase; RG41_09580; phage(gi971762176)              1.07e-06            EscherichiacoliMFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAARELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSDIDCCRWVSAEEILQASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTKGVI

#### region 3 ####
complement(3042740..3043666)       PHAGE_Plankt_PaV_LD_NC_016564: ABC transporter; RG41_14905; phage(gi371496158)       3.72e-21            EscherichiacoliMTIALELQQLKKTYPGGVQALRGIDLQVEAGDFYALLGPNGAGKSTTIGIISSLVNKTSGRVSVFGYDLEKDVVNAKRQLGLVPQEFNFNPFETVQQIVVNQAGYYGVERKEAYIRSEKYLKQLDLWGKRNERARMLSGGMKRRLMIARALMHEPKLLILDEPTAGVDIELRRSMWGFLKDLNDKGTTIILTTHYLEEAEMLCRNIGIIQHGELVENTSMKALLAKLKSETFILDLAPKSPLPKLDGYQYRLVDTATLEVEVLREQGINSVFTQLSEQGIQVLSMRNKANRLEELFVSLVNEKQGDRT
3043775..3044437                   carbonic anhydrase; RG41_14910                                                       N/A                 EscherichiacoliMKDIDTLISNNALWSKMLVEEDPGFFEKLAQAQKPRFLWIGCSDSRVPAERLTGLEPGELFVHRNVANLVIHTDLNCLSVVQYAVDVLEVEHIIICGHYGCGGVQAAVENPELGLINNWLLHIRDIWFKHSSLLGEMPQERRLDTLCELNVMEQVYNLGHSTIMQSAWKRGQKVTIHGWAYGIHDGLLRDLDVTATNRETLEQRYRHGISNLKLKHINHK
complement(3044478..3045014)       hypoxanthine phosphoribosyltransferase; RG41_14915                                   N/A                 EscherichiacoliMKHIVEVMIPEAEIKARIAELGRQITERYKDSGSDMVLVGLLRGSFMFMADLCREVQVSHEVDFMTASSYGSGMSTTRDVKILKDLDEDIRGKDVLIVEDIIDSGNTLSKVREILSLREPKSLAICTLLDKPSRREVNVPVEFIGFSIPDEFVVGYGIDYAQRYRHLPYIGKVILLDE
3045080..3045097                   attL                                                                                 N/A                 GTGATACACCGAGTGATA
complement(3045586..3045870)       hypothetical protein; RG41_14920                                                     N/A                 EscherichiacoliMTETDLLKIIRCVTGVCQATGKPEAKQPDTVTAENYARVVAKVMRRDVIELNGVDMRNIRTRVLELLAYRRRSQQRRESAKNTYQWRKPEHLRR
complement(3045876..3047537)       PHAGE_Brucel_BiPBO1_NC_031264: hypothetical protein; RG41_14925; phage(gi100002)     3.25e-104           EscherichiacoliMTAWHEYAEGVKNGKITACKRLKQAVKRYFSDLENSLYTFDPEVVERFIAFSRVCPHVKGAMRGSPIELEPWQQFAFACILGFKVKATGRRKYTSAFIEVPRKNAKSTVAAILANWFLVMENGQQDIYTAAVSRDQARIVFDDARQMCLLSRPLRKRVNIQAHKVIHPKTNSLLKPLAAKAATIEGTNPSLAIVDEYHLHPDNGVYSALELGMGARPEGLLFAITTSGSNVVSACKQHYDYCCQILDGEEMNESMFVLIYELDDESEVDDPAMWIKANPNIDVSVDREKLASTIQKARGIPSQWVEMLTKRFNIWCQGATPWMGNGAWAECAGTFAEADLYGQECYAGLDLSSTSDISSVCYAFPVGKKIMLVSRHYLPEFQLQNPANKNRAIYRQWVKAGWIRTTPGDCIDYDRIRDDIMADAENFNIRLVGFDTWNATHLRTQLQGAGFEVEPFPQTYLRFSPAAKSFEVFVNRKVIVHRGDPVLAWSMSNVVMQSDANANIKPNKKKSSNKIDPSVAALMAFGTFQAEHEEFAFDMSDSQKERLAAFDGV
complement(3047521..3047877)       PHAGE_Psychr_Psymv2_NC_023734: terminase small subunit; RG41_14930; phage(gi593779760)     3.30e-16            EscherichiacoliMARPPKAPAYLDEIAVRQWKEKSRQLSGREDLTPADWSNLELYCVNYSIYRKAVEDLATRGFSIVNSQGSESRNPALSAKADAERIMIKMASLLGFDPVSRRRNPPETEEEDELDRLA
complement(3047997..3048173)       hypothetical protein; RG41_14935                                                     N/A                 EscherichiacoliMTEQEQTRLIRGLIRQRDTWKTQETEHKANRTGRTKLTTAKRLTDRDREVMECFRNRW
complement(3048166..3048606)       PHAGE_Burkho_phiE125_NC_003309: putative class I holin; RG41_14940; phage(gi17975232)     9.06e-16            EscherichiacoliMPWQPLRRCTEPGCNKRVKSGKCEEHRRAAWRAEDARRGHRRARGYSRQWDKYRALYLSKNPLCVRCLAKGIYTPALVVDHIIPINGGGDVLFWPEWNHQALCQTCHNRKTTREDPATKANRKAGMYHEQEERAAHRNDWMYGDDD
complement(3048606..3048908)       PHAGE_Shigel_SfII_NC_021857: head-tail connector protein; RG41_14945; phage(gi526244642)     9.69e-12            EscherichiacoliMSEARITPDEVRAHLRLDDDLSGEGELLKMYTDAALEACQKHIGKRFEDGLEFTPAMRVGCLMYIAFLYENREAVSPVEHSELPMAISALWSVYRDVGVY
complement(3048901..3050115)       PHAGE_Shigel_SfIV_NC_022749: portal protein; RG41_14950; phage(gi557307529)          4.83e-57            EscherichiacoliMWWPFSRKKSEQRNLSIDDFLALSGVPNTGSGEYVSAGTAESLPAVMNAVSVIAEAVATMPCYLYLVRNDKGREAREWLDSHPVDILLNEQPNSCQTPYQFKRTMMRHCLLNGNAYAVIEWGQDGQPKSLHPYAPGCVVPERTGAHKYRYTITEPYTGTVRTYLQEEVLHLRYASDDGFLGRSPVTICREALGLGLAQQRHGASIMKDGMMAAGIITSGEWLDGVKGKQALDALERYKGAKNAGKTPILEGGMDYRQLGMSNQDAEWLASRRFSIEDIARMFNVSPIFLQEYSNSTYSNFSEASRAFLTMTMRPWLANFEQQIKAALLVASPVPGTRYLVEFDSADLLRATPTERYATYEKGIKNGIMNPNEAREREGMPPREGGDEFSQAWKQTVEIKGRKDE
complement(3050117..3050677)       PHAGE_Rhizob_16_3_NC_011103: p004; RG41_14955; phage(gi195546534)                    3.82e-38            EscherichiacoliMKNTDFEIRTSELTASNKKLVGYAVRWNSLSEIIWDEFREQFTPGAFADYLAAGNDVRCLYEHDYTQLLGRTKSGTLVLTEDNTGLRFELTPPDTQLGKDVLTLVERGDITGMSFGFRALCEEWSIAQKPYLRTVTAAELREITITSMPAYPESGVEIAHRSLFAQHPELCPTGNNRHRWSELAGL
complement(3050732..3051901)       PHAGE_Entero_SfI_NC_027339: phage major capsid protein; RG41_14960; phage(gi849250290)     1.34e-64            EscherichiacoliMKKLIELRQQKNALKNQMRSLLEKADSENRSLNAEEGKQFDELRAKADALDTEISRLESVADEERSKPGTGIQKLSSDELRNYIVTGDVRSLSTSTDSGRDGGYTVIPELDREVMRQLQDDSVMRVIATVKTAKSNEFQKLVSTGGATVGRGTEGSARSETNTPKIERVTIKLNPIYAYPKTTQEILDFSEVDILGWLSSEIADTFASTEEDDFVNGNGNGKPKGFMAYTRAATSDKTRAFGTIEKMVAASGTAITADELIDILYKLKAKYRKNAVWVMNSGTAGTLQKLKNENGDYIWRDSLKEGAPDMLLGRPVYCLESMPDIGAGKAPLAVGDFSRGYFIVDHVTGIRTRPDNITEPGFYKVHTDKYLGGGVVDSNAIKILEMKAG
complement(3052176..3052424)       hypothetical protein; RG41_14965                                                     N/A                 EscherichiacoliMPVTFEEVQQHKKFHGIDDLETTTAKKYRRLLSSDALFVVDHHDFLRSSLTGEIFATNREQVEAMIEYLWKIRRRMRDSMKQ
complement(3052442..3052864)       PHAGE_Staphy_tp310_1_NC_009761: ss-DNA-binding protein; RG41_14970; phage(gi156603906)     5.45e-12            EscherichiacoliMTAQIAAYGRLVADPQLKTTSKGTQMAMASMAVPLPCSQADDGTATMWLSVLAFGRQAEALAKHRKGELLSVAGNMQVSQWTGQNGETRQGWQVIADSVISARTARPGGKKGQQGQATDALNRAKQQAGNDDPYGDNIPF
complement(3052861..3053106)       hypothetical protein; RG41_14975                                                     N/A                 EscherichiacoliMTAKHTKKSQSHALDLTEHWLRVSIKIIDRNAGEGYAKAHPELISSFMTTAAANFATLTEREIAEAEQVTTINVKTGEQTA
complement(3053394..3055211)       PHAGE_Entero_P4_NC_001609: DNA primase; RG41_14980; phage(gi9627512)                 2.51e-157           EscherichiacoliMKKAPNLKHQPRDKMTEVIIFAGSDAWAHANQWQEQDGRLAGDNVPPVWLGEQQLAELDNLQIVPDGRYRVRLYQAGLLRPGLVNTIGQKLAAAGVRDADYYPEGMHSQKRENWREYLERERAELAEKKKVVELPVKKKERVKDDNASSLALNQMGASQRGEVLLAHYGGELAIHADSDTVHHYNGVVWEPVQDKELQRAMAQIFIDAEISYSQNAIKSAVDTMKLSLPVMGNTARNLIGFSNGVFDTRTGNFREHNKNDWLLIASELPFSPPAEGETLATHAPNFWKWLRRSVAENDRKADRVLAALFMVLANRYDWQLFIEVTGPGGSGKSVMAEICTMLAGKANTVSASMKALEDARERALVVGFSLIIMPDMTRYAGDGAGIKAITGGDKVAIDPKHKAPYSTRIPAVVLAVNNNAMSFSDRSGGISRRRVIFNFSEVVPENERDPMLAEKIEGELAVVIRHLLTRFADQDEARRLLYEQQKSEEALAIKREGDSLVDFCGYLMASVMCDGLLVGNAEIVPFSPRRYLYHAYLAYMRAHGFGKPVTLTRFGKDMPGAMAEYGREYMKRKTKHGLRSNVTLTEESEDWMPSCVSVTNDDSKN
complement(3055208..3055507)       hypothetical protein; RG41_14985                                                     N/A                 EscherichiacoliMRTYLSGLTASGYAHPKIIPGAIYLDKNGNRVTVKELMFDRVYFIRDGYSFHSSLNVEIFISRFRREIPLSRNNHVSCMDVDKKLQELKNMIAAWREQK
complement(3055514..3055834)       hypothetical protein; RG41_14990                                                     N/A                 EscherichiacoliMPDMSNYHYLINPHFNCEHDIAKKVYSAADGATDNISMGIASIGSLMWHASENEDYDEKDMRIDMGNIGLLLAMLGQFDISLRCTIENATDALNAIKKANTDSNRG
complement(3055827..3056030)       icd-like protein; RG41_14995                                                         N/A                 EscherichiacoliMLNHTTHPQGRDSHNLNKYIWRFIALSTAQPRVITIEATSEQGARQQSPAGCVMVFAARIRQGVCHA
3056176..3056358                   hypothetical protein; RG41_15000                                                     N/A                 EscherichiacoliMQEENVKPKFDRSGSTKKNIRFEDTLLQQINEVAGPGNFSSWVKNACRDKLRKEGIEPKG
complement(3056479..3057162)       PHAGE_Geobac_E3_NC_029073: toxin-antitoxin-like protein; RG41_15005; phage(gi985758520)     1.49e-24            EscherichiacoliMKNINALNGQGFAHPVASKSDISVQLKQLPAVEFHGQRVVTFAMIDTAHQRPKDTAKKTFQRNRTRFIDGEDFFIVGASEVEWGGAGYLDRGIKRPGKKMNSPIRGRVTLITESGYLLLTKPFNDDLAWQVQRQLVKAYFRCPEFVTFHHVDLPSLEELIAMPVTDAQNAVTRADKHSKQFHGSQGSNGMNLRKKELKVLRPAVRLVDAMGQIDFDKYEWEAAHDRA
complement(3057183..3057467)       PHAGE_Entero_P4_NC_001609: transcriptional regulator; RG41_15010; phage(gi9627517)     1.19e-06            EscherichiacoliMQIITFTPPNPEQRRTILEEYGFKFDRRIREDECSEITSLSRSSRWKMEQQGRFPPRCHFGRNSCAWLLSDVLWWVRNPPAVENVNNPYSRKSA
complement(3057563..3058366)       hypothetical protein; RG41_15015                                                     N/A                 EscherichiacoliMRISHKKDLPTWFDLDNYTQFIGMSDKDLFYQLVVRYDMLENFYRYGGSFYNEPLTNNDEVGRILGNGAVANAIDRATYFYDAFKLPIKREGVLSGGVALRPMSIEDFIYMKDAVDNYMENNFTGTAENLVYHSFTSVSSFLEPNDLFIKVDLSRPDDLLIDVFKKNLKNWREELKIKEKEKTYQSWETIKKKVYDYSLFPVVDLMIWAKSRGACVTNGVLAVAVFPNGEYDSTQIAQTVKPNIEKIFSISSIEKIRSELQDRKILP
complement(3058363..3059586)       PHAGE_Entero_HK140_NC_019710: integrase; RG41_15020; phage(gi428781968)              9.86e-58            EscherichiacoliMAGTNKLSDKKLKALLGSEREAPIMIADGEGLSVKVSKQGNVSWVFSYRLGGRGSKLERLTLGRYPDMPLKLAREKREQCRQWLAGGLDPKTEIELSTEETLKPVTVKDALEYWLVNYARRKRSDEALVRAQLRKHIYPRLGRYPITRCETRHWVACFDEINQTKPMTAGRMFQISKQALRFCKVRRYAASDALAFLTIQDVGQPSGQRDRVLSDSELADVWRCTDSDDQQPYYSRLLKMLVLFGARTMEVRLSRWSEWDFTSWIWTVPKEHSKTREKIVRSIPEAIRPWLEELKRETGKTGLLLGEDRTRQAVSLKGRRLFKDFHHNEPWTLHDLRRTFSTGLNNMGIAPHIVELLLGHALPGVMAIYNRSLYLPEKLDALNKWYDRLELLAGNHSNIVILKAGEK
3059610..3059627                   attR                                                                                 N/A                 GTGATACACCGAGTGATA

#### region 4 ####
4298205..4298219                   attL                                                                                 N/A                 TCAACAAAAAGAGTT
complement(4303077..4305182)       PHAGE_Shigel_Sf6_NC_005344: gene 13 protein; RG41_21110; phage(gi41057291)           3.08e-88            EscherichiacoliMAKAWKDVIASPQYQALAPEQKVQAQEQYFNEVVAPQAGESVEQAKQAFYAAYPLPSTNEIDRSQSATQNIQHTSSDNSLASGYAKLATQQREGLERSAEQGASLGTAMRDAITGESRMTPEMERLQNVASAPELNSLSMDALKAGWSQLFGSDASQEKILQGMGATLRQDEKGNTIVSLPSGDYALNKPGLSPQDLTSFLANALAFTPAGRAGTVLGAIGKSAATDLALQGATSKAGGEDIDPLQTVISAGIGGIGKGLENTASAVSRAVRGDMSPEAKAAVDFASERNLPLMTSDMLKDKTFMQSQAQTLGERVPFFGTGKNRLNQQQARENLVRTFSDGLGGISDKQLYESATKGQQKFIEAAGKRYNRIIDAMGDTPVDLSNTVKTIDNQIAVLSRPGKSQDRAAVKVLQQFKDDITSGPNDLRLARENRTDLRKRFIASSDTVDKDTLQKASDIIYKAYTADMKKAVAKNLGADEAINMARVDRSWSKFNDMMGRTRVQKAIASGKATPEDVTKLVFSQSPSERSQLYRLLDDNGRQNARAAIVQNAVDKATDPSGNISVEKFINALHRNRKQSATFFKGVHGKELDGVIKYLNDTRHAAKANVQNLNGQQLYGLLVGGGIINAAVLAGMLKTAAFVVPAAGAVGGAAKAYESPVIRNALLRLANTPKGSTAYDRAISTVTQSLTRVAQASQKEAQ
complement(4305182..4306603)       PHAGE_Bacter_APSE_2_NC_011551: injection gp20; RG41_21115; phage(gi212499730)        9.61e-155           EscherichiacoliMATWQLGGLPSMVPQNDNAPRSSVPQPVQFQQQPNVGLMALQGLRGVAEINQQARQQQRKAEFQKAYAGAFESGDRNKMRSLISEYPEEFESVQKGMGFIDDDQRNSIGHLATSAQIASSLGTGAFGKFIADNEDEMRRLGISPESVAEMHVNDPQEFQRLAGSMALFSLGHEKYFDIKDRMEGRDIERGKLAETIRSNQAGEALQARGQDISRANALTSAYAPTAAMQNYNQYAQMLKADPEGAAAFAAAAGINTNAKKLMSVRENDDGTVTKYYTDGSEEQGKLNQPISGDGFRPIALPTAQKIMEKSPEGAKKAAGFAYRVRDALDSMDTLKGQLSPQRVAIINNALGNGTLANLTLSPTEQQYVVNANDAIMAILRQETGAAIVPAEMSKYYQMYFPQPGDSTKTIDTKRRKMENQFNSLKAASGRAYDALRVISAVDRGTSSSSQTLPQSEQVSQPAASSNFSSLWGD
complement(4306603..4307280)       PHAGE_Bacter_APSE_2_NC_011551: injection gp7; RG41_21120; phage(gi212499729)         9.95e-65            EscherichiacoliMLITHIAHKHLNRAVYEKGGDGGASKAQAKAQQQAIDLQREQWNTVMNNLKPYAEVGLPALQQLQGLMTLEGQNKAANDFFGSGLYKTQADQARYQNLVSAEATGGLGSTATSNQLSSIAPMLYNNWLSGQMQNYGNLLNVGMNAASGQATAGQNYANNTGQLLQGLGAIRAGQAQQPSSLARGIGGAASGALAGAQLGSVVPGIGNVAGAIGGGLIGLVGGLGF
complement(4307274..4307735)       PHAGE_Bacter_APSE_2_NC_011551: hypothetical protein; RG41_21125; phage(gi212499728)     4.93e-31            EscherichiacoliMELKFIDNPVRLQAFLNEPGNTENIVEPGHTYYIKPDAVYLGIYEGLVLAGVHEVRNFWHSIVECHAVYDPGFRGEYALQGHRLFCKWLLENSPFLNSITMVPDTTKYGRAIIRLLGATRVGHLDDAYIRCGKPVGVTLYQLTRQQYKEFSEC
complement(4307752..4307913)       prophage protein; RG41_21130                                                         N/A                 EscherichiacoliMAISVKPVLISEKQMEAIKKIQEEQRKKSGIGVAPTLHEIARGLIDKALAGCM
complement(4308498..4311254)       PHAGE_Entero_P4_NC_001609: DNA primase; RG41_21135; phage(gi9627512)                 2.68e-36            EscherichiacoliMRNIDLIRQVISASENNWPHVLGCLNINVPDSPRRHAPCPACGGKDRFRFDDNGRGSFICNQCGAGDGLDLIKRVNNCDTTEAALLAADVLGIDYRTTETPEATSQKREQLETERQRREQERLKRAEKDEQQRRDTFSRQFDDMRRKAVNGKSDYLVAKGVGDFTFPVLPDGSLLLALVDKSGAVTAAQTITSHGEKRLLIGSAKRGAYHAINAPETTQSILIAEGLATALSAHLMRPEALTVAAIDAGNLLYVAQVLRDKFPSAQIIIAADNDHSEGRQNTGRIAAEKAALSVSGWVALPPTDHKADWNDYHQKHGIKCATEAFNKSMYQPQGNGVKQEPQTIEGSDFKVMDTDPLKPRIESREDGIYWVSPRADSQSGEIINNESWLCSPLSVIGTGRDDKDQYLILRWLSFGSETPTTAAIPLADIGEREGWRTLKAGGVNVTTKNSLRAILADWLQRSGSRELWRVAHATGWQCGAYIMPDGEIIGTPENPVLFSGRSSAAAGYTVSGSAKSWRDNVARLAFGNYSMMTGIGAALAAPLIGLVGADGFGIHFYEQSSAGKTTTANVASSLYGNPDLLRLTWYGTALGLANEAAAHNDGLMPLDEVGQGADPVSISQSAYALFNGVGKLQGAKDGGNRDLKRWRTVAISTGEMDLETFIATSGRKTKAGQLVRLLNIPLSKAVRFHDYQNGKQHADALKDAYQHHHGAAGREWIKWLADHQQQAIKTVRDCESRWRSLIPSDYGEQVHRVAARFAILEAALLLGEVVTGWDAQTCRDAIQHSYNAWLREFGTGNKEHQQIIEQTEAFLNAYGLSRFAPFPYSPADLPIKDLAGYRQRGEHDESPMIFYTFPATFEKEIACGFNAKQFAEVLKKAGMLTPPNSGRGYQRKSPRIQGRQINVYVLNYQPGDYNSSEE
complement(4311241..4311612)       hypothetical protein; RG41_21140                                                     N/A                 EscherichiacoliMRENAEMALSSAIGEQVAKIAGAVWIHNLHSTSEEKMAIQTPEGRTITTSLKPSDVCDLICAFMYPAMRTVHGDKWKLATTAEFDMWLNNDGMLTDYGITKWQMLVSHIANAIDHVGYGDAKH
complement(4311605..4311946)       prophage protein; RG41_21145                                                         N/A                 EscherichiacoliMITKNFRLNALANKYASALYNHITSTSGGDYFMVDADGEAVRVEIVNGVKGVRSLIDSYTLAAMKVFYPQWETVVIELLDRCVTKDGLTDVGREIWQSMVNDMGATVAGGSHA
complement(4311957..4312559)       PHAGE_Entero_phiP27_NC_003356: hypothetical protein; RG41_21150; phage(gi18249870)     3.56e-27            EscherichiacoliMNNFLTFHAEATPDGVNIMHRSNDGMTERVETVSYIDAVNRLDAGDYDDKPDEGMSIHLAIADGGNQGYFDYTSQHNVIMWRWLIATVFMLEMREENGTVSIIDDTGNPSEVAVYSNGIVAMPLYPVAERLAMANNIEGAMIERFGIESGTERAIIFYRAMMDVEQGALTPFGRETLAELHYSFIAELNENGMPAEPVTH
complement(4312552..4312773)       prophage protein; RG41_21155                                                         N/A                 EscherichiacoliMTNIQLIEAQCRIEQVQTVLGFWLEGASPSNRDKLMIGAVMSLLNGVPEAIQEADELLGKYELQNHSGEAKHE
complement(4312770..4313033)       nicotinic acetylcholine receptor subunit beta; RG41_21160                            N/A                 EscherichiacoliMNINLIYRHPCELEIESLLSREELYPDTFTLADRTTERLTRARTGLVHVMNEILPSVGGEQATVINSWLQKVTSLIDISLIDAESAK
complement(4313030..4313224)       prophage protein; RG41_21165                                                         N/A                 EscherichiacoliMNNSINAPRLTSALQLIEQAAAVLVAVSLSAEEMDAADVVDAIKACSSLVNDARAELVILGGEK
complement(4313217..4314284)       PHAGE_Entero_SfV_NC_003444: immunity region; RG41_21170; phage(gi19549024)           2.86e-15            EscherichiacoliMLTVQKKIFSLAGMSPKSSNITAKSGINTADTSKVYHFLVVGADALTMSEITVDSVSVEKVGGCAREFLVVDGFLCSRSDSTKTFVHTRDVNEMSAMYCASGASNSEFSESIKKSLPLCGNTVYGYKAPHKTGAGIGVLVNRMATYDAPSVFFYVVGLTHPFFGRWCIIQRLCQSMVAQAGASSEAPVSIRAGYANPVWATTSEIGVSGGSVTCYRMEAATCWLLPLPKNRNLSGLSPQFAAIARQLPPKFIILLPSLNAMLAVLWCAITSAFLLVVSAWRWHMIKTYDVHMDPLERTSQIITLTEVINDILVSNSPSRDERLKALLAILDLAVRDVHFLLEGGEMPGKTGATNE
complement(4314278..4314460)       prophage protein; RG41_21175                                                         N/A                 EscherichiacoliMFKPTGTPQPQKRYKDAHGALVTVESVSHNRVTFYRDGYQSPCVQPLARFMKEFAEVNKC
complement(4314453..4315286)       PHAGE_Escher_TL_2011c_NC_019442: putative antirepressor; RG41_21180; phage(gi418487055)     2.42e-28            EscherichiacoliMNIEKSRLISEAAPHLNASLGTINGNEFAAIVPVIPGHIGGRETNIVSAKALHKALGVGKDFSTWITDRISEYDFTIGHDYSVHKTISPNLGKSPNGAAYSKIKQSGRPGKDYLLSVGMAKELAMIERNDQGRAIRRYFIQCEEELQRSVPEIAARYRRQLKARISAANNFKPMCDALNMARAEQGKTTQQHHYTNESNMISRIVLGGLTAKQWARINGYSGEPRDHMNAEQLEHLSYLESTNITLIDMGMEYEQRKGELTRLSQRWLAKRLEAVHV
complement(4315299..4315730)       prophage protein; RG41_21185                                                         N/A                 EscherichiacoliMEKKNRPLQAANSDTRVSDVTPLTKSLQAPKRTPKKHRARVYMLRTGIEGWTENDILRYCRLSSGRNYATELERQLGITLERIDEKNPDGIGTHLRYRFSCRGDVLKVITHINHLANINDHNGLSQQEIADILKLYPDAFNAA
complement(4315730..4315933)       regulatory protein; RG41_21190                                                       N/A                 EscherichiacoliMNTDTIPTKGYIRRFRLAELLGVSVSTIDRKVRNGSLPRPVKLGEKITAFDAVEIHNWLAERRGKVA
complement(4316046..4316891)       hypothetical protein; RG41_21195                                                     N/A                 EscherichiacoliMKKLPPNLFFRLNRAANLLEIEVDDLLLMGVSGSIFLSTIIHQYEGVLYQAGNLPPDHGSKDIENLLFGSIISTREDTERYLNEGVFDGMKCLASGLWHVPSDFIASLAGVKRAPDIPLIFSPAYIKKELDITSGRYFFYDFREHDTTVSSVEDLYISRPWVEYIAESMDKNQPIPSILHLGSHDLYEKEFKNIEHGNRARHSLNRENSIMALIYVKRHYPEECKGKNGKETNEAWANATIDHWPHVGGDYDEPSTDYLKKIISDMDRLPEKRTTAGKKKS
complement(4316987..4318246)       PHAGE_Burkho_BcepC6B_NC_005887: putative integrase protein; RG41_21200; phage(gi48697215)     1.14e-51            EscherichiacoliMPKLTDMQIRAWIKAGDRFDGKADGNGLYICYPKNYTVPFWRFRYKLAGKQRAMVIGSYSELSLSKARETAKELSARVALGYDVAAEKQERKAEALAKMEAEKNAMRVSDLAAEYFERQILPRWKHPDILRRRIDKDINPCIGHMKVEDVKPRHIDDMLKGIVDRGAPTIATDVLRWTRRIFDYGIKRHALEINPCSAFEVSDAGGKEVSRDRWLTRDELIRLFEAMRTAKGFSRQNELTFKLLLALCVRKMELCAARWEEFDLDGAVWHLPEERSKNGDPIDIPLPSPAVEWLRELHTFSCNSAWVLPARKMQNRMIPHIQESTLPVALAKVRAEIPDVPNFTIHDFRRTARTHLAALGVDPVVAERCLNHRIKGVEGIYNRHQYFDERKAALAQWADLLVALESGKDHNVTPLRRAN
complement(4318435..4319298)       hypothetical protein; RG41_21205                                                     N/A                 EscherichiacoliMIRSMTAYARREIKGEWGSATWEMRSVNQRYLETYFRLPEQFRSLEPVVRERIRSRLTRGKVECTLRYEPDVSAQGELILNEKLAKQLVTAANWVKMQSDEGEINPVDILRWPGVMAAQEQDLDAIAAEILAALDGTLDDFIIARETEGQALKVLIEQRLEGVTAEVVKVRAHMPEILQWQRERLVAKLEDAQVQLENNRLEQELVLLAQRIDVAEELDRLEAHVKETYNILKKKEAVGRRLDFMMQEFNRESNTLASKSINAEVTNSAIELKVLIEQMREQIQNIE
4319425..4320141                   ribonuclease PH; RG41_21210                                                          N/A                 EscherichiacoliMRPAGRSNNQVRPVTLTRNYTKHAEGSVLVEFGDTKVLCTASIEEGVPRFLKGQGQGWITAEYGMLPRSTHTRNAREAAKGKQGGRTMEIQRLIARALRAAVDLKALGEFTITLDCDVLQADGGTRTASITGACVALADALQKLVENGKLKTNPMKGMVAAVSVGIVNGEAVCDLEYVEDSAAETDMNVVMTEDGRIIEVQGTAEGEPFTHEELLTLLALARGGIESIVATQKAALAN
4320207..4320848                   PHAGE_Prochl_P_SSM2_NC_006883: orotate phosphoribosyltransferase; RG41_21215; phage(gi61805915)     5.86e-05            EscherichiacoliMKPYQRQFIEFALSKQVLKFGEFTLKSGRKSPYFFNAGLFNTGRDLALLGRFYAEALVDSGIEFDLLFGPAYKGIPIATTTAVALAEHHDLDLPYCFNRKEAKDHGEGGNLVGSALQGRIMLVDDVITAGTAIRESMEIIQANGATLAGVLISLDRQERGRGEISAIQEVERDYNCKVISIITLKDLIAYLEEKPEMAEHLAAVKAYREEFGV
4327829..4327843                   attR                                                                                 N/A                 TCAACAAAAAGAGTT

#### region 5 ####
4722873..4722952                   attL                                                                                 N/A                 ATAAAAAAGGCGCTTCCCCATGCCGAGTAGCGCCTTTTTAATCAAGCATTTAGCTAACCTGAATTAGTTCATGCCGTATT
4723128..4724351                   PHAGE_Entero_mEpX1_NC_019709: integrase; RG41_23230; phage(gi428781899)              1.15e-60            EscherichiacoliMAGGTNKLSDTSLRKMLGREIDRDRFYADGDGLSIKVSKIGVLTWYFSFRIGGRESYSQRVKLGNYPDLSLKAAREKREQCRAWLAEGKNPKHRLNVAIHETLKPVTVKEALDYWIREYAVYKRTNVDKHIEQLRKHIFPYIGDYPLSMCETRHWLECFARVRNDAPVAAGYLLQMCKQALKFCRVHRYAVSNALDDLTIDDVGRKQNKRDREHSSQELADIWQECSGKKFKPYYSSLLRLLVVFGCRTQELRLSRITEWDLKDWIWTVPKEHSKGGEKILRPIPVVIRPFIQQLIEQHKNSGLLLGEIKKPEAVSQWGRMIYKRLGHIESWTLHDLRRTFATTLNNMGVAPHVVEQLLGHTLGGVMAIYNRSQYLPEKLDALNKWMERLEVISGQYSNVKILKVAK
4724348..4725085                   hypothetical protein; RG41_23235                                                     N/A                 EscherichiacoliMKKISIPERIYYPLPEAAKKLGCTLNDIYHFAAIGALNISVYFPNCHPCKSLVIPKSLVENIDVDSGGMLQGDRWVITGMKPVGSDFEWVNYFGVASYLAKNFCGFFYLNRESFSELEFIGNDARIVSSFFSTRPESENWECQIMSDGAIEIDRRFLCVMAKDLEHIKDVGTMPGRKMGESPKTLAKKAELIPALIKLIPEMDDIDIGTAPVAKVIAVLEAAAAMKGVFLPATDKNTWAKYLGRT
4725175..4725396                   PHAGE_Entero_P4_NC_001609: transcriptional regulator; RG41_23240; phage(gi9627517)     8.40e-09            EscherichiacoliMSKVKSSQDQFDRIIREEECRRLTGICRTTRYMMERKGVFPARRQLGARSVGWLLSEVNDWLDRQPKAQLKSA
complement(4725746..4726051)       hypothetical protein; RG41_23250                                                     N/A                 EscherichiacoliMATGSKNAKSQSLTARIPHDVIEGMESVKLDGESNAGFIVTAMRGEIVRRQAEGSGENLLVSSLDALAQVEKIGVKAAEEIGQLVTVAREELQRRKVKGQE
4726248..4726451                   icd-like protein; RG41_23255                                                         N/A                 EscherichiacoliMRNHTIHPQGRDSYNLNKYIWRFIALSTAQPRVIHIVATSEQEARQQSPAGCVMVFAARIRQGGGHA
4726444..4726764                   hypothetical protein; RG41_23260                                                     N/A                 EscherichiacoliMPDMSNYQYLINPHFNCEHDIAKKVYSAADGATDNISMAVASIGSLMWHASENEEYDAKAMRIDMGNIGLLLAMLGHFDISLRCTIENATDALNAIKKANTDSNRG
4726771..4727070                   hypothetical protein; RG41_23265                                                     N/A                 EscherichiacoliMRTYLSGLTASGYAHPKIIPGAIYLDKNGNRVTVKELMFDRVYFIRDGYSFHSSLNVEIFICRFRREIPLSRNNHVSCMDVDKKLQELKNMIAAWREQK
4727067..4728884                   PHAGE_Entero_P4_NC_001609: DNA primase; RG41_23270; phage(gi9627512)                 3.78e-157           EscherichiacoliMKKAPNLKHQPRDKMTEVIIFAGSDAWAHAKQWQEQDGRLAGDNVPPVWLGEQQLAELDNLQIVPDGRYRVRLYQAGLLRPGFVNTIGQKLAAAGVRDADYYPEGMHSQKRENWREYLERERGELAEKKKVVELPVKKKERVKDDNASSLALNQMGASQRGEVLLAHYGGELAIHADSDTVHHYNGVVWEPVQDKELQRAMAQIFIDAEISYSQNAIKSAVDTMKLSLPVMGNTARNLIGFSNGVFDTRTGNFREHNKNDWLLIASELPFSPPAEGETLATHAPNFWKWLRRSVAENDRKADRVLAALFMVLANRYDWQLFIEVTGPGGSGKSVMAEICTMLAGKANTVSASMKALEDARERALVVGFSLIIMPDMTRYAGDGAGIKAITGGDKVAIDPKHKAPYSTRIPAVVLAVNNNAMSFSDRSGGISRRRVIFNFSEVVPENERDPMLAEKIEGELAVVIRHLLTRFANQDEARRLLYEQQKSEEALAIKREGDSLVDFCGYLMASVMCDGLLVGNAEIVPFSPRRYLYHAYLAYMRAHGFGKPVTLTRFGKDMPGAMAEYGREYMKRKTKHGLRSNVTLTEESEDWMPSCVSVTNDDSKN
4729269..4729529                   hypothetical protein; RG41_23275                                                     N/A                 EscherichiacoliMPILTHEINNYFDQFGLHDIASMPTTQYRQALANGGLLWIDHHDFIRSTLSDEILATNQEQVNVLIEHLQVLRKKMPLPPNWMSEK
complement(4729564..4730073)       PHAGE_Cronob_ENT39118_NC_019934: hypothetical protein; RG41_23280; phage(gi431811067)     1.07e-06            EscherichiacoliMDLNVMAYPEKFTIAGVEYKGKRNSAEKKVLIPYTEEPDINIGDEITQKLGKGEIILKVIDMSFLPGGTLNVGTNHPHMLTLKVENLTANEHKPKQSSQNTFNIGNISGTQVQLGEHNNLIVNLSITELVEKVSQSQDPQAKNLLKDLLNNSTVASLVGAGASALLTML
complement(4730103..4730729)       hypothetical protein; RG41_23285                                                     N/A                 EscherichiacoliMKVTAYDGTEICYQGLAFKGTPVQIFWNGFIDPYIENNSEIVLEQTSALAIECQFSIEESIEEAKLLLLVMIRRLYYEMAETDKILRGDGFSFPEKKDVSGYIESISQKIIEHAEIEKLKKTHPKQNIFNIATVNSQYVQLGTNNNINIQELSEFFSKIASTGEKEIITLSKNLLKKAYSKNLLSKENYEALISNLNTQTKYTFKSTS
4731127..4732296                   PHAGE_Entero_SfI_NC_027339: phage major capsid protein; RG41_23290; phage(gi849250290)     9.78e-66            EscherichiacoliMKKLIELRQQKNALKNQMRSLLEKADSENRSLNDDEGKQFDELRAKADSLDTDISRLESVADEERSKPGTGIQKVSSDELRNYIVTGDVRSLSTSTDSGRDGGYTVIPELDREVMRQLQDDSVMRVIATVKTAKSNEFQKLVSTGGATVGRGTEGSARSETNTPKIERVTIKLNPIYAYPKTTQEILDFSEVDILGWLSSEIADTFASTEEDDFVNGDGNGKPKGFMAYTRAATSDKTRAFGTIEKMVAASGTAITADELIDILYKLKAKYRKNAVWVMNSGTAGTLQKLKNENGDYIWRDSLKEGAPDMLLGRPVYCLESMPDIGAGKAPLAVGDFSRGYFIVDHVTGIRTRPDNITEPGFYKVHTDKYLGGGVVDSNAIKILEMKAG
4732351..4732911                   PHAGE_Rhizob_16_3_NC_011103: p004; RG41_23295; phage(gi195546534)                    3.82e-38            EscherichiacoliMKNTDFEIRTSELTASNKKLVGYAVRWNSLSEIIWDEFREQFTPGAFADYLAAGNDVRCLYEHDYTQLLGRTKSGTLVLTEDNTGLRFELTPPDTQLGKDVLTLVERGDITGMSFGFRALCEEWSIAQKPYLRTVTAAELREITITSMPAYPESGVEIAHRSLFAQHPELCPTGNNRHRWSELAGL
4732913..4734127                   PHAGE_Shigel_SfIV_NC_022749: portal protein; RG41_23300; phage(gi557307529)          4.83e-57            EscherichiacoliMWWPFSRKKSEQRNLSIDDFLALSGVPNTGSGEYVSAGTAESLPAVMNAVSVIAEAVATMPCYLYLVRNDKGREAREWLDSHPVDILLNEQPNSCQTPYQFKRTMMRHCLLNGNAYAVIEWGQDGQPKSLHPYAPGCVVPERTGAHKYRYTITEPYTGTVRTYLQEEVLHLRYASDDGFLGRSPVTICREALGLGLAQQRHGASIMKDGMMAAGIITSGEWLDGVKGKQALDALERYKGAKNAGKTPILEGGMDYRQLGMSNQDAEWLASRRFSIEDIARMFNVSPIFLQEYSNSTYSNFSEASRAFLTMTMRPWLANFEQQIKAALLVASPVPGTRYLVEFDSADLLRATPTERYATYEKGIKNGIMNPNEAREREGMPPREGGDEFSQAWKQTVEIKGRKDE
4734120..4734422                   PHAGE_Shigel_SfII_NC_021857: head-tail connector protein; RG41_23305; phage(gi526244642)     9.69e-12            EscherichiacoliMSEARITPDEVRAHLRLDDDLSGEGELLKMYTDAALEACQKHIGKRFEDGLEFTPAMRVGCLMYIAFLYENREAVSPVEHSELPMAISALWSVYRDVGVY
4734422..4734862                   PHAGE_Burkho_phiE125_NC_003309: putative class I holin; RG41_23310; phage(gi17975232)     9.06e-16            EscherichiacoliMPWQPLRRCTEPGCNKRVKSGKCEEHRRAAWRAEDARRGHRRARGYSRQWDKYRALYLSKNPLCVRCLAKGIYTPALVVDHIIPINGGGDVLFWPEWNHQALCQTCHNRKTTREDPATKANRKAGMYHEQEERAAHRNDWMYGDDD
4734855..4735031                   hypothetical protein; RG41_23315                                                     N/A                 EscherichiacoliMTEQEQTRLIRGLIRQRDTWKTQETEHKANRTGRTKLTTAKRLTDRDREVMECFRNRW
4735151..4735507                   PHAGE_Psychr_Psymv2_NC_023734: terminase small subunit; RG41_23320; phage(gi593779760)     3.30e-16            EscherichiacoliMARPPKAPAYLDEIAVRQWKEKSRQLSGREDLTPADWSNLELYCVNYSIYRKAVEDLATRGFSIVNSQGSESRNPALSAKADAERIMIKMASLLGFDPVSRRRNPPETEEEDELDRLA
4735491..4737152                   PHAGE_Brucel_BiPBO1_NC_031264: hypothetical protein; RG41_23325; phage(gi100002)     4.24e-104           EscherichiacoliMTAWHEYAEGVKNGKITACKRLKQAVKRYFSDLENSLYTFDPEVVERFIAFSRVCPHVKGAMRGSPIELEPWQQFAFACILGFKVKATGRRKYTSAFIEVPRKNAKSTVAAILANWFLVMENGQQDIYTAAVSRDQARIVFDDARQMCLLSRPLRKRVNIQAHKVIHPKTNSLLKPLAAKAATIEGTNPSLAIVDEYHLHPDNGVYSALELGMGARPEGLLFAITTSGSNVVSACKQHYDYCCQILDGEEMNESMFVLIYELDDESEVDDPAMWIKANPNIDVSVDREKLASTIQKARGIPSQWVEMLTKRFNIWCQGATPWMGNGAWAECAGTFAEADLYGQECYAGLDLSSTSDISSVCYAFPVGKKIMLVSRHYLPEFQLQNPANKNRAIYRQWVKAGWIRTTPGDCIDYDRIRDDIMADAENFNIRLVGFDTWNATHLRTQLQGAGFEVEPFPQTYLRFSPAAKSFEVFVNRKVIVHRGDPVLAWSMSNVVMQSDANANIKPNKKKSSNKIDPSVAALMAFGTFQAEHEEFAFDMSDSHKERLAAFDGV
4738158..4738237                   attR                                                                                 N/A                 ATAAAAAAGGCGCTTCCCCATGCCGAGTAGCGCCTTTTTAATCAAGCATTTAGCTAACCTGAATTAGTTCATGCCGTATT
