BLASTP 2.2.17 [Aug-26-2007] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= domain1 (218 letters) Database: uniref90 70,056,378 sequences; 23,724,371,664 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef90_B8CG97 Uncharacterized protein n=1 Tax=Thalassiosira ps... 432 e-119 UniRef90_UPI00017546FD Cadmium-specific carbonic anhydrase n=1 T... 351 2e-94 UniRef90_Q50EL4 Cadmium-specific carbonic anhydrase (Fragment) n... 347 5e-93 UniRef90_UPI00026BAC49 Cadmium-specific carbonic anhydrase n=1 T... 336 8e-90 UniRef90_K0RDT8 Carbonic anhydrase n=1 Tax=Thalassiosira oceanic... 244 4e-62 UniRef90_C1N5U2 Predicted protein n=1 Tax=Micromonas pusilla (st... 242 1e-61 UniRef90_C1ECX3 Uncharacterized protein n=1 Tax=Micromonas commo... 211 5e-52 UniRef90_A6FY58 Uncharacterized protein n=1 Tax=Plesiocystis pac... 183 1e-43 UniRef90_A0A1F5DLS4 Uncharacterized protein n=1 Tax=Candidatus B... 96 2e-17 UniRef90_A0A0G1QGY3 Uncharacterized protein n=7 Tax=Microgenomat... 71 8e-10 UniRef90_A0A1F7IY24 Uncharacterized protein n=1 Tax=Candidatus R... 70 2e-09 UniRef90_A0A0G1CI10 Uncharacterized protein n=1 Tax=Candidatus G... 69 3e-09 UniRef90_A0A1F5KFU7 Uncharacterized protein n=1 Tax=Candidatus D... 52 4e-04 >UniRef90_B8CG97 Uncharacterized protein n=1 Tax=Thalassiosira pseudonana TaxID=35128 RepID=B8CG97_THAPS Length = 237 Score = 432 bits (1112), Expect = e-119, Method: Composition-based stats. Identities = 218/218 (100%), Positives = 218/218 (100%) Query: 1 GKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRM 60 GKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRM Sbjct: 20 GKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRM 79 Query: 61 AGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTG 120 AGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTG Sbjct: 80 AGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTG 139 Query: 121 EFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI 180 EFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI Sbjct: 140 EFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI 199 Query: 181 VDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVVA 218 VDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVVA Sbjct: 200 VDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVVA 237 >UniRef90_UPI00017546FD Cadmium-specific carbonic anhydrase n=1 Tax=Thalassiosira weissflogii TaxID=67004 RepID=UPI00017546FD Length = 213 Score = 351 bits (901), Expect = 2e-94, Method: Composition-based stats. Identities = 168/208 (80%), Positives = 188/208 (90%) Query: 10 LTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGI 69 LTP IVAALQ RGW+AEI++ S+ +MV+VDP GILKCVDGRGSDNT+ GPKMPGGI Sbjct: 5 LTPDQIVAALQERGWQAEIVTEFSLLNEMVDVDPQGILKCVDGRGSDNTQFCGPKMPGGI 64 Query: 70 YAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYPR 129 YAIAHNRG T+++GLK+ITKEVASKGHVPSVHGDHS+DMLGCGFF+LWVTG FD MGYPR Sbjct: 65 YAIAHNRGVTTLEGLKQITKEVASKGHVPSVHGDHSSDMLGCGFFKLWVTGRFDDMGYPR 124 Query: 130 PEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKF 189 P+FDADQGA AV+ +GGVIEMHHGSH EKVVYINLVENKTLEPDE+DQRFIVDGWAA KF Sbjct: 125 PQFDADQGAKAVENAGGVIEMHHGSHAEKVVYINLVENKTLEPDEDDQRFIVDGWAAGKF 184 Query: 190 NLDVVKFLVAAAATVEMLGGPRIAKIVV 217 LDV KFL+AAAATVEMLGGP+ AKIV+ Sbjct: 185 GLDVPKFLIAAAATVEMLGGPKKAKIVI 212 >UniRef90_Q50EL4 Cadmium-specific carbonic anhydrase (Fragment) n=2 Tax=Thalassiosira weissflogii TaxID=67004 RepID=Q50EL4_THAWE Length = 616 Score = 347 bits (890), Expect = 5e-93, Method: Composition-based stats. Identities = 165/210 (78%), Positives = 189/210 (90%) Query: 8 PPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPG 67 P ++P I ALQ RGW+AEI++ +S++ +V+V P GILKCVDGRGSDNTRM GPKMPG Sbjct: 196 PSISPAQIAEALQGRGWDAEIVTDASMAGQLVDVRPEGILKCVDGRGSDNTRMGGPKMPG 255 Query: 68 GIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGY 127 GIYAIAHNRG TS++GLK+ITKEVASKGH+PSVHGDHS+DMLGCGFF+LWVTG FD MGY Sbjct: 256 GIYAIAHNRGVTSIEGLKQITKEVASKGHLPSVHGDHSSDMLGCGFFKLWVTGRFDDMGY 315 Query: 128 PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAI 187 PRP+FDADQGA AVK++GG+IEMHHGSHTEKVVYINL+ NKTLEP+ENDQRFIVDGWAA Sbjct: 316 PRPQFDADQGANAVKDAGGIIEMHHGSHTEKVVYINLLANKTLEPNENDQRFIVDGWAAD 375 Query: 188 KFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 KF LDV KFL+AAAATVEMLGGP+ AKIVV Sbjct: 376 KFGLDVPKFLIAAAATVEMLGGPKNAKIVV 405 Score = 339 bits (870), Expect = 9e-91, Method: Composition-based stats. Identities = 162/210 (77%), Positives = 186/210 (88%) Query: 8 PPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPG 67 P +TP IV+AL+ RGW+A I+ AS++S ++ VDP GILKCVDGRGSDNT+ GPKMPG Sbjct: 406 PSITPPQIVSALRGRGWKASIVKASTMSSELKRVDPQGILKCVDGRGSDNTQFGGPKMPG 465 Query: 68 GIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGY 127 GIYAIAHNRG T+++GLK+IT+EVASKGHVPSVHGDHS+DMLGCGFF+LW+TG FD MGY Sbjct: 466 GIYAIAHNRGVTTLEGLKDITREVASKGHVPSVHGDHSSDMLGCGFFKLWLTGRFDDMGY 525 Query: 128 PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAI 187 PRPEFDADQGA AV+ +GGVIEMHHGSH EKVVYINLV TLEP+E+DQRFIVDGWAA Sbjct: 526 PRPEFDADQGALAVRAAGGVIEMHHGSHEEKVVYINLVSGMTLEPNEHDQRFIVDGWAAS 585 Query: 188 KFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 KF LDVVKFLVAAAATVEMLGGP+ AKIV+ Sbjct: 586 KFGLDVVKFLVAAAATVEMLGGPKKAKIVI 615 Score = 331 bits (849), Expect = 3e-88, Method: Composition-based stats. Identities = 158/195 (81%), Positives = 178/195 (91%) Query: 23 GWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGIYAIAHNRGTTSVD 82 GW+AEI++ S+ +MV+VDP GILKCVDGRGSDNT+ GPKMPGGIYAIAHNRG T+++ Sbjct: 1 GWQAEIVTEFSLLNEMVDVDPQGILKCVDGRGSDNTQFCGPKMPGGIYAIAHNRGVTTLE 60 Query: 83 GLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYPRPEFDADQGAAAVK 142 GLK+ITKEVASKGHVPSVHGDHS+DMLGCGFF+LWVTG FD MGYPRP+FDADQGA AV+ Sbjct: 61 GLKQITKEVASKGHVPSVHGDHSSDMLGCGFFKLWVTGRFDDMGYPRPQFDADQGAKAVE 120 Query: 143 ESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKFNLDVVKFLVAAAA 202 +GGVIEMHHGSH EKVVYINLVENKTLEPDE+DQRFIVDGWAA KF LDV KFL+AAAA Sbjct: 121 NAGGVIEMHHGSHAEKVVYINLVENKTLEPDEDDQRFIVDGWAAGKFGLDVPKFLIAAAA 180 Query: 203 TVEMLGGPRIAKIVV 217 TVEMLGGP+ AKIV+ Sbjct: 181 TVEMLGGPKKAKIVI 195 >UniRef90_UPI00026BAC49 Cadmium-specific carbonic anhydrase n=1 Tax=Thalassiosira weissflogii TaxID=67004 RepID=UPI00026BAC49 Length = 231 Score = 336 bits (862), Expect = 8e-90, Method: Composition-based stats. Identities = 161/208 (77%), Positives = 185/208 (88%) Query: 10 LTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGI 69 +TP IV+AL+ RGW+A I+ AS++S ++ VDP GILKCVDGRGSDNT+ GPKMPGGI Sbjct: 23 ITPPQIVSALRGRGWKASIVKASTMSSELKRVDPQGILKCVDGRGSDNTQFGGPKMPGGI 82 Query: 70 YAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYPR 129 YAIAHNRG T+++GLK+IT+EVASKGHVPSVHGDHS+DMLGCGFF+LW+TG FD MGYPR Sbjct: 83 YAIAHNRGVTTLEGLKDITREVASKGHVPSVHGDHSSDMLGCGFFKLWLTGRFDDMGYPR 142 Query: 130 PEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKF 189 PEFDADQGA AV+ +GGVIEMHHGSH EKVVYINLV TLEP+E+DQRFIVDGWAA KF Sbjct: 143 PEFDADQGALAVRAAGGVIEMHHGSHEEKVVYINLVSGMTLEPNEHDQRFIVDGWAASKF 202 Query: 190 NLDVVKFLVAAAATVEMLGGPRIAKIVV 217 LDVVKFLVAAAATVEMLGGP+ AKIV+ Sbjct: 203 GLDVVKFLVAAAATVEMLGGPKKAKIVI 230 >UniRef90_K0RDT8 Carbonic anhydrase n=1 Tax=Thalassiosira oceanica TaxID=159749 RepID=K0RDT8_THAOC Length = 276 Score = 244 bits (623), Expect = 4e-62, Method: Composition-based stats. Identities = 125/212 (58%), Positives = 159/212 (75%), Gaps = 7/212 (3%) Query: 10 LTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGI 69 LTP+D+V LQ RGWEA I+ S S D+V V+ +G LKCVDGRG D+T GPKM GG+ Sbjct: 67 LTPEDVVGVLQGRGWEATIVKQSECS-DLVPVESSGYLKCVDGRGVDHTNTRGPKMLGGV 125 Query: 70 YAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSM---- 125 YAIAHNRG + D L++I +EV+ KG++PSVHGD +MLGCG+ +LW+TG+F + Sbjct: 126 YAIAHNRGLKTTDDLQDICREVSEKGYIPSVHGDGDGNMLGCGYCKLWLTGKFADLDPVK 185 Query: 126 GYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWA 185 G P P + AD GAAAVK G V EM GSH EK VYIN VE++T+EP+ +DQ+F+VD WA Sbjct: 186 GAP-PTYSADDGAAAVKAKGQV-EMCKGSHAEKFVYINFVEDQTIEPNHDDQKFVVDAWA 243 Query: 186 AIKFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 A+KF+LDV +LV AAATVE LGGP+IAK+VV Sbjct: 244 AMKFDLDVPSYLVTAAATVERLGGPKIAKLVV 275 >UniRef90_C1N5U2 Predicted protein n=1 Tax=Micromonas pusilla (strain CCMP1545) TaxID=564608 RepID=C1N5U2_MICPC Length = 222 Score = 242 bits (618), Expect = 1e-61, Method: Composition-based stats. Identities = 123/215 (57%), Positives = 155/215 (72%), Gaps = 6/215 (2%) Query: 7 APPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMP 66 AP LTP+D+V LQ RGW AEI+ A+ ++ D+V+V P G LKCVDGR D+ AGPKM Sbjct: 9 APELTPEDVVGVLQDRGWTAEIVKAADVA-DLVDVSPTGYLKCVDGRAVDHNNTAGPKML 67 Query: 67 GGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSM- 125 GG+YAIAHNRG + L+ I EVA GHVPSVHGD +MLGCG+ +LW+TG+F + Sbjct: 68 GGVYAIAHNRGKKTTADLEAICAEVAKAGHVPSVHGDGDGNMLGCGYCKLWLTGKFADLD 127 Query: 126 ---GYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVD 182 G P P + AD+GAAAVK GG +EM G H EK VYIN V +KT+EP+ ++Q+F+VD Sbjct: 128 PVKGAP-PTYSADEGAAAVKSGGGKVEMCKGKHAEKFVYINFVADKTVEPNGDNQKFVVD 186 Query: 183 GWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 W A KF LD+ +LV AAATVE LGGP+IAK+VV Sbjct: 187 AWCAKKFKLDIPSYLVTAAATVERLGGPKIAKLVV 221 >UniRef90_C1ECX3 Uncharacterized protein n=1 Tax=Micromonas commoda (strain RCC299 / NOUM17 / CCMP2709) TaxID=296587 RepID=C1ECX3_MICCC Length = 465 Score = 211 bits (536), Expect = 5e-52, Method: Composition-based stats. Identities = 109/207 (52%), Positives = 140/207 (67%), Gaps = 6/207 (2%) Query: 6 SAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSD--NTRMAGP 63 + P P +IV ALQ RGW AEI + S + +V+V P G LKCVDGRGSD + GP Sbjct: 228 AEPRFGPAEIVGALQGRGWSAEIQTQSRNAYQLVKVSPNGFLKCVDGRGSDAKGDQQRGP 287 Query: 64 KMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFD 123 KM GG+Y IA NRG + L+ I +EV + GHVP+VHGD +LGCGF +LW+ +F Sbjct: 288 KMLGGVYGIAVNRGIKTTKELEAICQEVKAAGHVPTVHGDEGG-ILGCGFCKLWLNDKFA 346 Query: 124 SMGY---PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI 180 G +P+F A+ G+ V+++GGV+E H G HTEKVVY+N ++ TLEP+ +DQRFI Sbjct: 347 DEGMVNESKPKFSAEDGSKTVEKAGGVVENHVGKHTEKVVYLNFIDGMTLEPNADDQRFI 406 Query: 181 VDGWAAIKFNLDVVKFLVAAAATVEML 207 VD WAA KFNLDV K+ V AAATVE L Sbjct: 407 VDAWAAGKFNLDVPKYCVTAAATVEKL 433 Score = 198 bits (504), Expect = 2e-48, Method: Composition-based stats. Identities = 108/209 (51%), Positives = 134/209 (64%), Gaps = 11/209 (5%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSI-----SQDMVEVDPAGILKCVDGRGSD--NTRMA 61 PL+ D+ AL SRGW+A I+ + +V+VDPAG LKCVDGRGSD + Sbjct: 4 PLSYGDLGVALASRGWKASILDDRDFCTLFPKEKLVDVDPAGFLKCVDGRGSDAVGKQQH 63 Query: 62 GPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGE 121 GPKM GG+Y IA NRG + L+ I +EV + GHVP+VHGD +LGCGF +LW+ G+ Sbjct: 64 GPKMLGGVYGIAVNRGIKTTKELEAICQEVKAAGHVPTVHGDEGG-ILGCGFCKLWMNGK 122 Query: 122 FDSMG---YPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQR 178 F G P+F ADQGAA VK +GGV+E H HTEK V +N V KT P+ DQR Sbjct: 123 FTDEGGVATAPPDFTADQGAACVKAAGGVVENHVAKHTEKYVILNFVPGKTFVPNGKDQR 182 Query: 179 FIVDGWAAIKFNLDVVKFLVAAAATVEML 207 FIVD WA KFNLD+ K+ + AAATVE L Sbjct: 183 FIVDCWALGKFNLDITKYALTAAATVEKL 211 >UniRef90_A6FY58 Uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 TaxID=391625 RepID=A6FY58_9DELT Length = 226 Score = 183 bits (464), Expect = 1e-43, Method: Composition-based stats. Identities = 103/226 (45%), Positives = 139/226 (61%), Gaps = 19/226 (8%) Query: 10 LTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGI 69 +TP+DI AAL++RGW A I+ S +S D+V+V G++KCVDGR S + M GPK GG+ Sbjct: 1 MTPQDIKAALEARGWTATIVPRSEVS-DIVDVGGDGLMKCVDGRPSFHPAMNGPKTLGGV 59 Query: 70 YAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADM--LGCGFFRLWVTG------- 120 YAIA R V GL + T++VA+ GHVPSVHGD A+ +GCG+F+LW TG Sbjct: 60 YAIASMRDARDVAGLVQATRDVAAFGHVPSVHGDQHAEPPPMGCGYFKLWKTGKLMNLAP 119 Query: 121 -----EFDSMGYPR----PEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLE 171 EF + P+ P + A++G+ V GGV E G+H E+ V INLV + T E Sbjct: 120 EGKEDEFKASELPKGIVPPNYSAEEGSEIVLSEGGVYETLEGAHEEQEVVINLVTDTTFE 179 Query: 172 PDENDQRFIVDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 P QRF+VD W KFN+D ++L AA TVE+L R A+I+V Sbjct: 180 PSRESQRFVVDAWITDKFNIDAGRYLTVAAKTVELLSDVRKARIIV 225 >UniRef90_A0A1F5DLS4 Uncharacterized protein n=1 Tax=Candidatus Beckwithbacteria bacterium RIFCSPHIGHO2_12_FULL_47_17 TaxID=1797460 RepID=A0A1F5DLS4_9BACT Length = 203 Score = 95.9 bits (237), Expect = 2e-17, Method: Composition-based stats. Identities = 65/206 (31%), Positives = 100/206 (48%), Gaps = 16/206 (7%) Query: 19 LQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNT--RMAGPKMPGGIY---AIA 73 L +GWE + +V V+ G C DGR +T ++ PK+ GG+ A+ Sbjct: 5 LVRQGWEVK----EGNRDKLVPVEADGFGPCGDGRKPKDTQIKLRAPKILGGVLGKAALG 60 Query: 74 HNRGTTSVDGLKEI---TKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYPRP 130 + G +I +++ + G PSVHGD GCGF RLW G+ D++ PR Sbjct: 61 SGKAAAQTIGEYDIRLACRDIKAAGFTPSVHGDTKHGKKGCGFGRLWSEGKLDNV--PRL 118 Query: 131 EFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKFN 190 ++ + V E GG G H E+ V +N + + TLEPD FI+D WAA KF Sbjct: 119 NVSLERVSEIVNEEGGQYIELDGEHEEQRVMVNFIPDMTLEPD--GSCFIIDAWAADKFG 176 Query: 191 LDVVKFLVAAAATVEMLGGPRIAKIV 216 ++ + L A V L GP++ +++ Sbjct: 177 INQERLLQNAVEVVVKLNGPKVIELI 202 >UniRef90_A0A0G1QGY3 Uncharacterized protein n=7 Tax=Microgenomates group TaxID=1794810 RepID=A0A0G1QGY3_9BACT Length = 214 Score = 70.9 bits (172), Expect = 8e-10, Method: Composition-based stats. Identities = 66/218 (30%), Positives = 95/218 (43%), Gaps = 22/218 (10%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDN-----TRMAGP 63 P T + ++ + GWE + S +V V G++ CVDGR D + GP Sbjct: 6 PSTNRTMLERMLGSGWEVKEGDPSL----LVRVVRGGLVHCVDGRKVDQFLVPQKIVRGP 61 Query: 64 KMPGGIYAIA----HNRGTTSVDG--LKEITKEVASKGHVPSVHGDHSADMLGCGFFRLW 117 K+ GG +A +G + VD ++ + + + G VP VH D L CG F L Sbjct: 62 KIQGGAEGVALLLAKAQGVSEVDESWFRKACQVIKNSGFVPGVH---DFDHLHCGHFNLA 118 Query: 118 VTGEFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQ 177 G+F+ M PR A + V E GG G H E V+ +N N TL P N + Sbjct: 119 SQGKFEGM--PRFTITAGDMSRIVGEFGGSQVHLAGQHEEYVMRVNWDPNMTLIP--NKE 174 Query: 178 RFIVDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKI 215 F +D W A ++ L AA TV L R ++ Sbjct: 175 AFNLDAWYANVIGINQETLLDNAAKTVMGLSSVRTVEV 212 >UniRef90_A0A1F7IY24 Uncharacterized protein n=1 Tax=Candidatus Roizmanbacteria bacterium RIFCSPLOWO2_01_FULL_38_12 TaxID=1802061 RepID=A0A1F7IY24_9BACT Length = 226 Score = 69.7 bits (169), Expect = 2e-09, Method: Composition-based stats. Identities = 53/167 (31%), Positives = 79/167 (47%), Gaps = 8/167 (4%) Query: 46 ILKCVDGRGSDNT----RMAGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVH 101 +L C D R + GP + GG IA R +++G++ T ++++ G+ +H Sbjct: 52 VLNCGDDRFKNGEVPEDHRYGPSIFGGAVGIAALRREPTLEGVRRATLDISALGYRAGMH 111 Query: 102 GDHSADMLGCGFFRLWVTGEFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVY 161 GD D LGCGF RL + G F+ + P D + E GG G HT + Sbjct: 112 GDVENDELGCGFNRLLLNGYFNGV-VGTPAIDLKTARQVLDEHGGSYVDLSGIHTAVGLN 170 Query: 162 INLVENKTLEPDENDQRFIVDGWAAIKFN-LDVVKFLVAAAATVEML 207 N V T+ D N+ F VDGW A+ + ++ + L AATVE L Sbjct: 171 FNFVPGTTILSDGNN--FGVDGWFALLIDGVEPDRLLELTAATVEAL 215 >UniRef90_A0A0G1CI10 Uncharacterized protein n=1 Tax=Candidatus Gottesmanbacteria bacterium GW2011_GWB1_43_11 TaxID=1618446 RepID=A0A0G1CI10_9BACT Length = 205 Score = 68.9 bits (167), Expect = 3e-09, Method: Composition-based stats. Identities = 55/210 (26%), Positives = 89/210 (42%), Gaps = 15/210 (7%) Query: 10 LTPK--DIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPG 67 LTP+ + A RGW+ E S +V+V C DGR D GP + G Sbjct: 7 LTPQTTSLKDAFLRRGWQVE--EVGSRQAPLVKVRRGAKFGCGDGRNPD----LGPALFG 60 Query: 68 GIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGY 127 + + G + + G+ P++HGD + CGFF W+ G+ G Sbjct: 61 SFWGVMATLTGGESLGAERAKIAIRDLGYQPTIHGDEHGE-FACGFFEKWMHGKLP--GV 117 Query: 128 PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAI 187 +P F+ ++ + V + H E+ +++N V + T+ PD +RF VD W Sbjct: 118 YQPNFNENELPHILDRVTRV--RYRDKHQERELWLNPVSSTTIRPDT--RRFRVDLWFGE 173 Query: 188 KFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 + + + VE+L R AKI+V Sbjct: 174 ALGIPRESLIDTSIIVVELLSQVRTAKIIV 203 >UniRef90_A0A1F5KFU7 Uncharacterized protein n=1 Tax=Candidatus Daviesbacteria bacterium RIFCSPHIGHO2_02_FULL_43_12 TaxID=1797776 RepID=A0A1F5KFU7_9BACT Length = 220 Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 11/121 (9%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGS--DNTRMAGPKMP 66 PL +DI+ A + W+ EI+ AS+ Q +V P L+C D R + G ++ Sbjct: 7 PLLARDILQA-RKHNWQVEIVKASNTEQG--QVHPGAALECGDVRFDWLEGRTCWGYRIL 63 Query: 67 GGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMG 126 G + A+A + ++ G + EV G P HG C FF LW TG + Sbjct: 64 GQVNAVAALKTGGNIVGFNQANAEVRRCGCTPGTHGP------SCAFFELWTTGRLKEVP 117 Query: 127 Y 127 + Sbjct: 118 F 118 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef90_B8CG97 Uncharacterized protein n=1 Tax=Thalassiosira ps... 378 e-102 UniRef90_UPI00017546FD Cadmium-specific carbonic anhydrase n=1 T... 359 8e-97 UniRef90_Q50EL4 Cadmium-specific carbonic anhydrase (Fragment) n... 356 1e-95 UniRef90_UPI00026BAC49 Cadmium-specific carbonic anhydrase n=1 T... 352 1e-94 UniRef90_K0RDT8 Carbonic anhydrase n=1 Tax=Thalassiosira oceanic... 329 1e-87 UniRef90_C1N5U2 Predicted protein n=1 Tax=Micromonas pusilla (st... 322 2e-85 UniRef90_C1ECX3 Uncharacterized protein n=1 Tax=Micromonas commo... 312 2e-82 UniRef90_A6FY58 Uncharacterized protein n=1 Tax=Plesiocystis pac... 292 1e-76 UniRef90_A0A0G1CI10 Uncharacterized protein n=1 Tax=Candidatus G... 263 6e-68 UniRef90_A0A0G1QGY3 Uncharacterized protein n=7 Tax=Microgenomat... 252 1e-64 UniRef90_A0A1F5DLS4 Uncharacterized protein n=1 Tax=Candidatus B... 250 1e-63 UniRef90_A0A1F7IY24 Uncharacterized protein n=1 Tax=Candidatus R... 215 2e-53 Sequences not found previously or not previously below threshold: UniRef90_A0A1F5ZI62 Uncharacterized protein n=1 Tax=Candidatus G... 107 7e-21 UniRef90_A0A1F5KFU7 Uncharacterized protein n=1 Tax=Candidatus D... 79 3e-12 UniRef90_A0A1G1VSB7 Uncharacterized protein n=1 Tax=Candidatus C... 52 4e-04 >UniRef90_B8CG97 Uncharacterized protein n=1 Tax=Thalassiosira pseudonana TaxID=35128 RepID=B8CG97_THAPS Length = 237 Score = 378 bits (971), Expect = e-102, Method: Composition-based stats. Identities = 218/218 (100%), Positives = 218/218 (100%) Query: 1 GKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRM 60 GKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRM Sbjct: 20 GKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRM 79 Query: 61 AGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTG 120 AGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTG Sbjct: 80 AGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTG 139 Query: 121 EFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI 180 EFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI Sbjct: 140 EFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI 199 Query: 181 VDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVVA 218 VDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVVA Sbjct: 200 VDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVVA 237 >UniRef90_UPI00017546FD Cadmium-specific carbonic anhydrase n=1 Tax=Thalassiosira weissflogii TaxID=67004 RepID=UPI00017546FD Length = 213 Score = 359 bits (923), Expect = 8e-97, Method: Composition-based stats. Identities = 168/209 (80%), Positives = 188/209 (89%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGG 68 LTP IVAALQ RGW+AEI++ S+ +MV+VDP GILKCVDGRGSDNT+ GPKMPGG Sbjct: 4 SLTPDQIVAALQERGWQAEIVTEFSLLNEMVDVDPQGILKCVDGRGSDNTQFCGPKMPGG 63 Query: 69 IYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYP 128 IYAIAHNRG T+++GLK+ITKEVASKGHVPSVHGDHS+DMLGCGFF+LWVTG FD MGYP Sbjct: 64 IYAIAHNRGVTTLEGLKQITKEVASKGHVPSVHGDHSSDMLGCGFFKLWVTGRFDDMGYP 123 Query: 129 RPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIK 188 RP+FDADQGA AV+ +GGVIEMHHGSH EKVVYINLVENKTLEPDE+DQRFIVDGWAA K Sbjct: 124 RPQFDADQGAKAVENAGGVIEMHHGSHAEKVVYINLVENKTLEPDEDDQRFIVDGWAAGK 183 Query: 189 FNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 F LDV KFL+AAAATVEMLGGP+ AKIV+ Sbjct: 184 FGLDVPKFLIAAAATVEMLGGPKKAKIVI 212 >UniRef90_Q50EL4 Cadmium-specific carbonic anhydrase (Fragment) n=2 Tax=Thalassiosira weissflogii TaxID=67004 RepID=Q50EL4_THAWE Length = 616 Score = 356 bits (913), Expect = 1e-95, Method: Composition-based stats. Identities = 165/210 (78%), Positives = 189/210 (90%) Query: 8 PPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPG 67 P ++P I ALQ RGW+AEI++ +S++ +V+V P GILKCVDGRGSDNTRM GPKMPG Sbjct: 196 PSISPAQIAEALQGRGWDAEIVTDASMAGQLVDVRPEGILKCVDGRGSDNTRMGGPKMPG 255 Query: 68 GIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGY 127 GIYAIAHNRG TS++GLK+ITKEVASKGH+PSVHGDHS+DMLGCGFF+LWVTG FD MGY Sbjct: 256 GIYAIAHNRGVTSIEGLKQITKEVASKGHLPSVHGDHSSDMLGCGFFKLWVTGRFDDMGY 315 Query: 128 PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAI 187 PRP+FDADQGA AVK++GG+IEMHHGSHTEKVVYINL+ NKTLEP+ENDQRFIVDGWAA Sbjct: 316 PRPQFDADQGANAVKDAGGIIEMHHGSHTEKVVYINLLANKTLEPNENDQRFIVDGWAAD 375 Query: 188 KFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 KF LDV KFL+AAAATVEMLGGP+ AKIVV Sbjct: 376 KFGLDVPKFLIAAAATVEMLGGPKNAKIVV 405 Score = 355 bits (912), Expect = 1e-95, Method: Composition-based stats. Identities = 162/210 (77%), Positives = 186/210 (88%) Query: 8 PPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPG 67 P +TP IV+AL+ RGW+A I+ AS++S ++ VDP GILKCVDGRGSDNT+ GPKMPG Sbjct: 406 PSITPPQIVSALRGRGWKASIVKASTMSSELKRVDPQGILKCVDGRGSDNTQFGGPKMPG 465 Query: 68 GIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGY 127 GIYAIAHNRG T+++GLK+IT+EVASKGHVPSVHGDHS+DMLGCGFF+LW+TG FD MGY Sbjct: 466 GIYAIAHNRGVTTLEGLKDITREVASKGHVPSVHGDHSSDMLGCGFFKLWLTGRFDDMGY 525 Query: 128 PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAI 187 PRPEFDADQGA AV+ +GGVIEMHHGSH EKVVYINLV TLEP+E+DQRFIVDGWAA Sbjct: 526 PRPEFDADQGALAVRAAGGVIEMHHGSHEEKVVYINLVSGMTLEPNEHDQRFIVDGWAAS 585 Query: 188 KFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 KF LDVVKFLVAAAATVEMLGGP+ AKIV+ Sbjct: 586 KFGLDVVKFLVAAAATVEMLGGPKKAKIVI 615 Score = 337 bits (865), Expect = 4e-90, Method: Composition-based stats. Identities = 158/195 (81%), Positives = 178/195 (91%) Query: 23 GWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGIYAIAHNRGTTSVD 82 GW+AEI++ S+ +MV+VDP GILKCVDGRGSDNT+ GPKMPGGIYAIAHNRG T+++ Sbjct: 1 GWQAEIVTEFSLLNEMVDVDPQGILKCVDGRGSDNTQFCGPKMPGGIYAIAHNRGVTTLE 60 Query: 83 GLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYPRPEFDADQGAAAVK 142 GLK+ITKEVASKGHVPSVHGDHS+DMLGCGFF+LWVTG FD MGYPRP+FDADQGA AV+ Sbjct: 61 GLKQITKEVASKGHVPSVHGDHSSDMLGCGFFKLWVTGRFDDMGYPRPQFDADQGAKAVE 120 Query: 143 ESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKFNLDVVKFLVAAAA 202 +GGVIEMHHGSH EKVVYINLVENKTLEPDE+DQRFIVDGWAA KF LDV KFL+AAAA Sbjct: 121 NAGGVIEMHHGSHAEKVVYINLVENKTLEPDEDDQRFIVDGWAAGKFGLDVPKFLIAAAA 180 Query: 203 TVEMLGGPRIAKIVV 217 TVEMLGGP+ AKIV+ Sbjct: 181 TVEMLGGPKKAKIVI 195 >UniRef90_UPI00026BAC49 Cadmium-specific carbonic anhydrase n=1 Tax=Thalassiosira weissflogii TaxID=67004 RepID=UPI00026BAC49 Length = 231 Score = 352 bits (903), Expect = 1e-94, Method: Composition-based stats. Identities = 161/209 (77%), Positives = 185/209 (88%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGG 68 +TP IV+AL+ RGW+A I+ AS++S ++ VDP GILKCVDGRGSDNT+ GPKMPGG Sbjct: 22 SITPPQIVSALRGRGWKASIVKASTMSSELKRVDPQGILKCVDGRGSDNTQFGGPKMPGG 81 Query: 69 IYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYP 128 IYAIAHNRG T+++GLK+IT+EVASKGHVPSVHGDHS+DMLGCGFF+LW+TG FD MGYP Sbjct: 82 IYAIAHNRGVTTLEGLKDITREVASKGHVPSVHGDHSSDMLGCGFFKLWLTGRFDDMGYP 141 Query: 129 RPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIK 188 RPEFDADQGA AV+ +GGVIEMHHGSH EKVVYINLV TLEP+E+DQRFIVDGWAA K Sbjct: 142 RPEFDADQGALAVRAAGGVIEMHHGSHEEKVVYINLVSGMTLEPNEHDQRFIVDGWAASK 201 Query: 189 FNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 F LDVVKFLVAAAATVEMLGGP+ AKIV+ Sbjct: 202 FGLDVVKFLVAAAATVEMLGGPKKAKIVI 230 >UniRef90_K0RDT8 Carbonic anhydrase n=1 Tax=Thalassiosira oceanica TaxID=159749 RepID=K0RDT8_THAOC Length = 276 Score = 329 bits (843), Expect = 1e-87, Method: Composition-based stats. Identities = 125/212 (58%), Positives = 159/212 (75%), Gaps = 7/212 (3%) Query: 10 LTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGI 69 LTP+D+V LQ RGWEA I+ S S D+V V+ +G LKCVDGRG D+T GPKM GG+ Sbjct: 67 LTPEDVVGVLQGRGWEATIVKQSECS-DLVPVESSGYLKCVDGRGVDHTNTRGPKMLGGV 125 Query: 70 YAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSM---- 125 YAIAHNRG + D L++I +EV+ KG++PSVHGD +MLGCG+ +LW+TG+F + Sbjct: 126 YAIAHNRGLKTTDDLQDICREVSEKGYIPSVHGDGDGNMLGCGYCKLWLTGKFADLDPVK 185 Query: 126 GYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWA 185 G P P + AD GAAAVK G V EM GSH EK VYIN VE++T+EP+ +DQ+F+VD WA Sbjct: 186 GAP-PTYSADDGAAAVKAKGQV-EMCKGSHAEKFVYINFVEDQTIEPNHDDQKFVVDAWA 243 Query: 186 AIKFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 A+KF+LDV +LV AAATVE LGGP+IAK+VV Sbjct: 244 AMKFDLDVPSYLVTAAATVERLGGPKIAKLVV 275 >UniRef90_C1N5U2 Predicted protein n=1 Tax=Micromonas pusilla (strain CCMP1545) TaxID=564608 RepID=C1N5U2_MICPC Length = 222 Score = 322 bits (825), Expect = 2e-85, Method: Composition-based stats. Identities = 123/215 (57%), Positives = 155/215 (72%), Gaps = 6/215 (2%) Query: 7 APPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMP 66 AP LTP+D+V LQ RGW AEI+ A+ ++ D+V+V P G LKCVDGR D+ AGPKM Sbjct: 9 APELTPEDVVGVLQDRGWTAEIVKAADVA-DLVDVSPTGYLKCVDGRAVDHNNTAGPKML 67 Query: 67 GGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSM- 125 GG+YAIAHNRG + L+ I EVA GHVPSVHGD +MLGCG+ +LW+TG+F + Sbjct: 68 GGVYAIAHNRGKKTTADLEAICAEVAKAGHVPSVHGDGDGNMLGCGYCKLWLTGKFADLD 127 Query: 126 ---GYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVD 182 G P P + AD+GAAAVK GG +EM G H EK VYIN V +KT+EP+ ++Q+F+VD Sbjct: 128 PVKGAP-PTYSADEGAAAVKSGGGKVEMCKGKHAEKFVYINFVADKTVEPNGDNQKFVVD 186 Query: 183 GWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 W A KF LD+ +LV AAATVE LGGP+IAK+VV Sbjct: 187 AWCAKKFKLDIPSYLVTAAATVERLGGPKIAKLVV 221 >UniRef90_C1ECX3 Uncharacterized protein n=1 Tax=Micromonas commoda (strain RCC299 / NOUM17 / CCMP2709) TaxID=296587 RepID=C1ECX3_MICCC Length = 465 Score = 312 bits (799), Expect = 2e-82, Method: Composition-based stats. Identities = 112/222 (50%), Positives = 145/222 (65%), Gaps = 11/222 (4%) Query: 6 SAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSD--NTRMAGP 63 + P P +IV ALQ RGW AEI + S + +V+V P G LKCVDGRGSD + GP Sbjct: 228 AEPRFGPAEIVGALQGRGWSAEIQTQSRNAYQLVKVSPNGFLKCVDGRGSDAKGDQQRGP 287 Query: 64 KMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFD 123 KM GG+Y IA NRG + L+ I +EV + GHVP+VHGD +LGCGF +LW+ +F Sbjct: 288 KMLGGVYGIAVNRGIKTTKELEAICQEVKAAGHVPTVHGDE-GGILGCGFCKLWLNDKFA 346 Query: 124 SMGY---PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI 180 G +P+F A+ G+ V+++GGV+E H G HTEKVVY+N ++ TLEP+ +DQRFI Sbjct: 347 DEGMVNESKPKFSAEDGSKTVEKAGGVVENHVGKHTEKVVYLNFIDGMTLEPNADDQRFI 406 Query: 181 VDGWAAIKFNLDVVKFLVAAAATVEMLGG-----PRIAKIVV 217 VD WAA KFNLDV K+ V AAATVE L P A ++V Sbjct: 407 VDAWAAGKFNLDVPKYCVTAAATVEKLNPGQAPCPWKAVLIV 448 Score = 251 bits (641), Expect = 4e-64, Method: Composition-based stats. Identities = 108/210 (51%), Positives = 134/210 (63%), Gaps = 11/210 (5%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSI-----SQDMVEVDPAGILKCVDGRGSD--NTRMA 61 PL+ D+ AL SRGW+A I+ + +V+VDPAG LKCVDGRGSD + Sbjct: 4 PLSYGDLGVALASRGWKASILDDRDFCTLFPKEKLVDVDPAGFLKCVDGRGSDAVGKQQH 63 Query: 62 GPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGE 121 GPKM GG+Y IA NRG + L+ I +EV + GHVP+VHGD +LGCGF +LW+ G+ Sbjct: 64 GPKMLGGVYGIAVNRGIKTTKELEAICQEVKAAGHVPTVHGDE-GGILGCGFCKLWMNGK 122 Query: 122 FDSMG---YPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQR 178 F G P+F ADQGAA VK +GGV+E H HTEK V +N V KT P+ DQR Sbjct: 123 FTDEGGVATAPPDFTADQGAACVKAAGGVVENHVAKHTEKYVILNFVPGKTFVPNGKDQR 182 Query: 179 FIVDGWAAIKFNLDVVKFLVAAAATVEMLG 208 FIVD WA KFNLD+ K+ + AAATVE L Sbjct: 183 FIVDCWALGKFNLDITKYALTAAATVEKLN 212 >UniRef90_A6FY58 Uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 TaxID=391625 RepID=A6FY58_9DELT Length = 226 Score = 292 bits (748), Expect = 1e-76, Method: Composition-based stats. Identities = 102/226 (45%), Positives = 139/226 (61%), Gaps = 19/226 (8%) Query: 10 LTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGI 69 +TP+DI AAL++RGW A I+ S +S D+V+V G++KCVDGR S + M GPK GG+ Sbjct: 1 MTPQDIKAALEARGWTATIVPRSEVS-DIVDVGGDGLMKCVDGRPSFHPAMNGPKTLGGV 59 Query: 70 YAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADM--LGCGFFRLWVTGE------ 121 YAIA R V GL + T++VA+ GHVPSVHGD A+ +GCG+F+LW TG+ Sbjct: 60 YAIASMRDARDVAGLVQATRDVAAFGHVPSVHGDQHAEPPPMGCGYFKLWKTGKLMNLAP 119 Query: 122 ------FDSMGYPR----PEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLE 171 F + P+ P + A++G+ V GGV E G+H E+ V INLV + T E Sbjct: 120 EGKEDEFKASELPKGIVPPNYSAEEGSEIVLSEGGVYETLEGAHEEQEVVINLVTDTTFE 179 Query: 172 PDENDQRFIVDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 P QRF+VD W KFN+D ++L AA TVE+L R A+I+V Sbjct: 180 PSRESQRFVVDAWITDKFNIDAGRYLTVAAKTVELLSDVRKARIIV 225 >UniRef90_A0A0G1CI10 Uncharacterized protein n=1 Tax=Candidatus Gottesmanbacteria bacterium GW2011_GWB1_43_11 TaxID=1618446 RepID=A0A0G1CI10_9BACT Length = 205 Score = 263 bits (674), Expect = 6e-68, Method: Composition-based stats. Identities = 55/210 (26%), Positives = 90/210 (42%), Gaps = 15/210 (7%) Query: 10 LTPKD--IVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPG 67 LTP+ + A RGW+ E + S +V+V C DGR D GP + G Sbjct: 7 LTPQTTSLKDAFLRRGWQVEEV--GSRQAPLVKVRRGAKFGCGDGRNPD----LGPALFG 60 Query: 68 GIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGY 127 + + G + + G+ P++HGD + CGFF W+ G+ G Sbjct: 61 SFWGVMATLTGGESLGAERAKIAIRDLGYQPTIHGDEHGE-FACGFFEKWMHGKLP--GV 117 Query: 128 PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAI 187 +P F+ ++ + V + H E+ +++N V + T+ PD +RF VD W Sbjct: 118 YQPNFNENELPHILDRVTRV--RYRDKHQERELWLNPVSSTTIRPDT--RRFRVDLWFGE 173 Query: 188 KFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 + + + VE+L R AKI+V Sbjct: 174 ALGIPRESLIDTSIIVVELLSQVRTAKIIV 203 >UniRef90_A0A0G1QGY3 Uncharacterized protein n=7 Tax=Microgenomates group TaxID=1794810 RepID=A0A0G1QGY3_9BACT Length = 214 Score = 252 bits (645), Expect = 1e-64, Method: Composition-based stats. Identities = 66/222 (29%), Positives = 96/222 (43%), Gaps = 22/222 (9%) Query: 5 TSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDN-----TR 59 + P T + ++ + GWE + S +V V G++ CVDGR D Sbjct: 2 SPEIPSTNRTMLERMLGSGWEVKEGDPSL----LVRVVRGGLVHCVDGRKVDQFLVPQKI 57 Query: 60 MAGPKMPGGIYAIA----HNRGTTSVDG--LKEITKEVASKGHVPSVHGDHSADMLGCGF 113 + GPK+ GG +A +G + VD ++ + + + G VP VH D L CG Sbjct: 58 VRGPKIQGGAEGVALLLAKAQGVSEVDESWFRKACQVIKNSGFVPGVH---DFDHLHCGH 114 Query: 114 FRLWVTGEFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPD 173 F L G+F+ M PR A + V E GG G H E V+ +N N TL P Sbjct: 115 FNLASQGKFEGM--PRFTITAGDMSRIVGEFGGSQVHLAGQHEEYVMRVNWDPNMTLIP- 171 Query: 174 ENDQRFIVDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKI 215 N + F +D W A ++ L AA TV L R ++ Sbjct: 172 -NKEAFNLDAWYANVIGINQETLLDNAAKTVMGLSSVRTVEV 212 >UniRef90_A0A1F5DLS4 Uncharacterized protein n=1 Tax=Candidatus Beckwithbacteria bacterium RIFCSPHIGHO2_12_FULL_47_17 TaxID=1797460 RepID=A0A1F5DLS4_9BACT Length = 203 Score = 250 bits (638), Expect = 1e-63, Method: Composition-based stats. Identities = 64/210 (30%), Positives = 99/210 (47%), Gaps = 16/210 (7%) Query: 15 IVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTR--MAGPKMPGGIYA- 71 + L +GWE + +V V+ G C DGR +T+ + PK+ GG+ Sbjct: 1 MFDDLVRQGWEVKEG----NRDKLVPVEADGFGPCGDGRKPKDTQIKLRAPKILGGVLGK 56 Query: 72 --IAHNRGTTSVDGLKEI---TKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMG 126 + + G +I +++ + G PSVHGD GCGF RLW G+ D+ Sbjct: 57 AALGSGKAAAQTIGEYDIRLACRDIKAAGFTPSVHGDTKHGKKGCGFGRLWSEGKLDN-- 114 Query: 127 YPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAA 186 PR ++ + V E GG G H E+ V +N + + TLEPD FI+D WAA Sbjct: 115 VPRLNVSLERVSEIVNEEGGQYIELDGEHEEQRVMVNFIPDMTLEPDG--SCFIIDAWAA 172 Query: 187 IKFNLDVVKFLVAAAATVEMLGGPRIAKIV 216 KF ++ + L A V L GP++ +++ Sbjct: 173 DKFGINQERLLQNAVEVVVKLNGPKVIELI 202 >UniRef90_A0A1F7IY24 Uncharacterized protein n=1 Tax=Candidatus Roizmanbacteria bacterium RIFCSPLOWO2_01_FULL_38_12 TaxID=1802061 RepID=A0A1F7IY24_9BACT Length = 226 Score = 215 bits (549), Expect = 2e-53, Method: Composition-based stats. Identities = 57/194 (29%), Positives = 84/194 (43%), Gaps = 12/194 (6%) Query: 19 LQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNT----RMAGPKMPGGIYAIAH 74 RGW + +V +L C D R + GP + GG IA Sbjct: 29 FLERGWNVKHGDNGI----LVGTSFQSVLNCGDDRFKNGEVPEDHRYGPSIFGGAVGIAA 84 Query: 75 NRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYPRPEFDA 134 R +++G++ T ++++ G+ +HGD D LGCGF RL + G F+ + P D Sbjct: 85 LRREPTLEGVRRATLDISALGYRAGMHGDVENDELGCGFNRLLLNGYFNGV-VGTPAIDL 143 Query: 135 DQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKF-NLDV 193 + E GG G HT + N V T+ D N+ F VDGW A+ ++ Sbjct: 144 KTARQVLDEHGGSYVDLSGIHTAVGLNFNFVPGTTILSDGNN--FGVDGWFALLIDGVEP 201 Query: 194 VKFLVAAAATVEML 207 + L AATVE L Sbjct: 202 DRLLELTAATVEAL 215 >UniRef90_A0A1F5ZI62 Uncharacterized protein n=1 Tax=Candidatus Gottesmanbacteria bacterium RBG_13_45_10 TaxID=1798370 RepID=A0A1F5ZI62_9BACT Length = 238 Score = 107 bits (268), Expect = 7e-21, Method: Composition-based stats. Identities = 52/221 (23%), Positives = 85/221 (38%), Gaps = 23/221 (10%) Query: 13 KDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGR----GSDNTRMAGP----K 64 + + GW+ + + +V + C DGR ++ + P Sbjct: 22 RQAAERFRHYGWKVVDVEQKGMVLPLVIGKGPLSVICGDGRYARYFQNHKELN-PQCTIS 80 Query: 65 MPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADM-----LGCGFFRLWVT 119 + GG Y R +++GL+ + + G V HGD + CGF W Sbjct: 81 IFGGAYGAQALRFGGTLEGLRTLAEYANKNGLVFRTHGDEHGEHHEPADFNCGFLGKWAE 140 Query: 120 GEFDSM---GYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDEND 176 + + P+ EF D A A + G ++ G H E+V+ +N T+ P Sbjct: 141 RKLRGVMPLEIPKQEFP-DMLAHA-QTLGFGHDILPGVHEERVLVLNFAPGTTVAPQAT- 197 Query: 177 QRFIVDGWAAIKFNLDVVKFLVAAAATVEML-GGPRIAKIV 216 RF VDGW A + L + + + TVE+L R IV Sbjct: 198 -RFRVDGWVAGSY-LGLTNLVDVSRQTVELLKKDVRAVTIV 236 >UniRef90_A0A1F5KFU7 Uncharacterized protein n=1 Tax=Candidatus Daviesbacteria bacterium RIFCSPHIGHO2_02_FULL_43_12 TaxID=1797776 RepID=A0A1F5KFU7_9BACT Length = 220 Score = 79.1 bits (194), Expect = 3e-12, Method: Composition-based stats. Identities = 57/225 (25%), Positives = 90/225 (40%), Gaps = 28/225 (12%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGS--DNTRMAGPKMP 66 PL +DI+ A + W+ EI+ AS+ Q +V P L+C D R + G ++ Sbjct: 7 PLLARDILQA-RKHNWQVEIVKASNTEQG--QVHPGAALECGDVRFDWLEGRTCWGYRIL 63 Query: 67 GGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSM- 125 G + A+A + ++ G + EV G P HG C FF LW TG + Sbjct: 64 GQVNAVAALKTGGNIVGFNQANAEVRRCGCTPGTHGPS------CAFFELWTTGRLKEVP 117 Query: 126 ---GYP------RPEFDADQGAAAVKESGGVIEMH--HGSHTEKVVYINLVENKTLEPDE 174 P R + ++ +GGV + GSH + + N + T Sbjct: 118 FRYDVPMQRMRDRLTGTGNPIKRKMQLAGGVHFVLEDRGSHA-RHLDFNALVGMTDCSGS 176 Query: 175 NDQRFIVDGWAAIKFNLDVVKFLVAAAATVEMLGGPRI--AKIVV 217 D D A + + + + AA VE L P I A+I++ Sbjct: 177 GDAYRQNDAPLA-QLQIPLRTRMAYAAEVVE-LARPEIIKARIII 219 >UniRef90_A0A1G1VSB7 Uncharacterized protein n=1 Tax=Candidatus Chisholmbacteria bacterium RIFCSPHIGHO2_01_FULL_52_32 TaxID=1797591 RepID=A0A1G1VSB7_9BACT Length = 292 Score = 51.7 bits (123), Expect = 4e-04, Method: Composition-based stats. Identities = 45/216 (20%), Positives = 70/216 (32%), Gaps = 41/216 (18%) Query: 38 MVEVDPAGILKCVDGRGS----------------DNTRMAGPKMPGGIYAIAHNRGTTSV 81 M+ V IL C+D R + A G + AI R Sbjct: 39 MIPVKERIILGCMDERKIVALIDPKTGQKLDYSGFSVGRAAGATLGLVDAI---RNVRVT 95 Query: 82 DGLKEITKEVASKGHVPSVHGDHS---ADMLGCGFFRLWVTGEFDSMGYPRPEFDADQGA 138 ++I K ++ G V + H D + GCG L E + RP D Sbjct: 96 ILREQILKALSENGVVATNHIDTHAKEGEYTGCGHGALRAMAE-SGSLFDRPAVDLVWRM 154 Query: 139 AAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPD---ENDQRFIVDG--------WAAI 187 + +E+G + + G HT + +N + NK L+P + F +D W Sbjct: 155 SGFEETGTLRMVLDGEHTAQGFLVNPLSNKVLDPTSAFASQSFFSLDLGIYREVLRWIQG 214 Query: 188 KFNLDVV-------KFLVAAAATVEMLGGPRIAKIV 216 K A V +L +I + V Sbjct: 215 ALGFGDEVLQSILMKLTRNTLADVFILSNAKITEAV 250 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef90_B8CG97 Uncharacterized protein n=1 Tax=Thalassiosira ps... 374 e-101 UniRef90_UPI00017546FD Cadmium-specific carbonic anhydrase n=1 T... 359 8e-97 UniRef90_Q50EL4 Cadmium-specific carbonic anhydrase (Fragment) n... 357 4e-96 UniRef90_UPI00026BAC49 Cadmium-specific carbonic anhydrase n=1 T... 354 4e-95 UniRef90_K0RDT8 Carbonic anhydrase n=1 Tax=Thalassiosira oceanic... 321 2e-85 UniRef90_C1N5U2 Predicted protein n=1 Tax=Micromonas pusilla (st... 319 2e-84 UniRef90_C1ECX3 Uncharacterized protein n=1 Tax=Micromonas commo... 311 3e-82 UniRef90_A6FY58 Uncharacterized protein n=1 Tax=Plesiocystis pac... 289 1e-75 UniRef90_A0A0G1CI10 Uncharacterized protein n=1 Tax=Candidatus G... 257 7e-66 UniRef90_A0A0G1QGY3 Uncharacterized protein n=7 Tax=Microgenomat... 254 4e-65 UniRef90_A0A1F5DLS4 Uncharacterized protein n=1 Tax=Candidatus B... 247 5e-63 UniRef90_A0A1F5ZI62 Uncharacterized protein n=1 Tax=Candidatus G... 227 6e-57 UniRef90_A0A1F7IY24 Uncharacterized protein n=1 Tax=Candidatus R... 226 8e-57 UniRef90_A0A1F5KFU7 Uncharacterized protein n=1 Tax=Candidatus D... 226 2e-56 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef90_B8CG97 Uncharacterized protein n=1 Tax=Thalassiosira pseudonana TaxID=35128 RepID=B8CG97_THAPS Length = 237 Score = 374 bits (961), Expect = e-101, Method: Composition-based stats. Identities = 218/218 (100%), Positives = 218/218 (100%) Query: 1 GKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRM 60 GKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRM Sbjct: 20 GKDDTSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRM 79 Query: 61 AGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTG 120 AGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTG Sbjct: 80 AGPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTG 139 Query: 121 EFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI 180 EFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI Sbjct: 140 EFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI 199 Query: 181 VDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVVA 218 VDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVVA Sbjct: 200 VDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVVA 237 >UniRef90_UPI00017546FD Cadmium-specific carbonic anhydrase n=1 Tax=Thalassiosira weissflogii TaxID=67004 RepID=UPI00017546FD Length = 213 Score = 359 bits (923), Expect = 8e-97, Method: Composition-based stats. Identities = 168/209 (80%), Positives = 188/209 (89%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGG 68 LTP IVAALQ RGW+AEI++ S+ +MV+VDP GILKCVDGRGSDNT+ GPKMPGG Sbjct: 4 SLTPDQIVAALQERGWQAEIVTEFSLLNEMVDVDPQGILKCVDGRGSDNTQFCGPKMPGG 63 Query: 69 IYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYP 128 IYAIAHNRG T+++GLK+ITKEVASKGHVPSVHGDHS+DMLGCGFF+LWVTG FD MGYP Sbjct: 64 IYAIAHNRGVTTLEGLKQITKEVASKGHVPSVHGDHSSDMLGCGFFKLWVTGRFDDMGYP 123 Query: 129 RPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIK 188 RP+FDADQGA AV+ +GGVIEMHHGSH EKVVYINLVENKTLEPDE+DQRFIVDGWAA K Sbjct: 124 RPQFDADQGAKAVENAGGVIEMHHGSHAEKVVYINLVENKTLEPDEDDQRFIVDGWAAGK 183 Query: 189 FNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 F LDV KFL+AAAATVEMLGGP+ AKIV+ Sbjct: 184 FGLDVPKFLIAAAATVEMLGGPKKAKIVI 212 >UniRef90_Q50EL4 Cadmium-specific carbonic anhydrase (Fragment) n=2 Tax=Thalassiosira weissflogii TaxID=67004 RepID=Q50EL4_THAWE Length = 616 Score = 357 bits (917), Expect = 4e-96, Method: Composition-based stats. Identities = 162/210 (77%), Positives = 186/210 (88%) Query: 8 PPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPG 67 P +TP IV+AL+ RGW+A I+ AS++S ++ VDP GILKCVDGRGSDNT+ GPKMPG Sbjct: 406 PSITPPQIVSALRGRGWKASIVKASTMSSELKRVDPQGILKCVDGRGSDNTQFGGPKMPG 465 Query: 68 GIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGY 127 GIYAIAHNRG T+++GLK+IT+EVASKGHVPSVHGDHS+DMLGCGFF+LW+TG FD MGY Sbjct: 466 GIYAIAHNRGVTTLEGLKDITREVASKGHVPSVHGDHSSDMLGCGFFKLWLTGRFDDMGY 525 Query: 128 PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAI 187 PRPEFDADQGA AV+ +GGVIEMHHGSH EKVVYINLV TLEP+E+DQRFIVDGWAA Sbjct: 526 PRPEFDADQGALAVRAAGGVIEMHHGSHEEKVVYINLVSGMTLEPNEHDQRFIVDGWAAS 585 Query: 188 KFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 KF LDVVKFLVAAAATVEMLGGP+ AKIV+ Sbjct: 586 KFGLDVVKFLVAAAATVEMLGGPKKAKIVI 615 Score = 353 bits (907), Expect = 6e-95, Method: Composition-based stats. Identities = 165/210 (78%), Positives = 189/210 (90%) Query: 8 PPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPG 67 P ++P I ALQ RGW+AEI++ +S++ +V+V P GILKCVDGRGSDNTRM GPKMPG Sbjct: 196 PSISPAQIAEALQGRGWDAEIVTDASMAGQLVDVRPEGILKCVDGRGSDNTRMGGPKMPG 255 Query: 68 GIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGY 127 GIYAIAHNRG TS++GLK+ITKEVASKGH+PSVHGDHS+DMLGCGFF+LWVTG FD MGY Sbjct: 256 GIYAIAHNRGVTSIEGLKQITKEVASKGHLPSVHGDHSSDMLGCGFFKLWVTGRFDDMGY 315 Query: 128 PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAI 187 PRP+FDADQGA AVK++GG+IEMHHGSHTEKVVYINL+ NKTLEP+ENDQRFIVDGWAA Sbjct: 316 PRPQFDADQGANAVKDAGGIIEMHHGSHTEKVVYINLLANKTLEPNENDQRFIVDGWAAD 375 Query: 188 KFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 KF LDV KFL+AAAATVEMLGGP+ AKIVV Sbjct: 376 KFGLDVPKFLIAAAATVEMLGGPKNAKIVV 405 Score = 337 bits (865), Expect = 4e-90, Method: Composition-based stats. Identities = 158/195 (81%), Positives = 178/195 (91%) Query: 23 GWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGIYAIAHNRGTTSVD 82 GW+AEI++ S+ +MV+VDP GILKCVDGRGSDNT+ GPKMPGGIYAIAHNRG T+++ Sbjct: 1 GWQAEIVTEFSLLNEMVDVDPQGILKCVDGRGSDNTQFCGPKMPGGIYAIAHNRGVTTLE 60 Query: 83 GLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYPRPEFDADQGAAAVK 142 GLK+ITKEVASKGHVPSVHGDHS+DMLGCGFF+LWVTG FD MGYPRP+FDADQGA AV+ Sbjct: 61 GLKQITKEVASKGHVPSVHGDHSSDMLGCGFFKLWVTGRFDDMGYPRPQFDADQGAKAVE 120 Query: 143 ESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKFNLDVVKFLVAAAA 202 +GGVIEMHHGSH EKVVYINLVENKTLEPDE+DQRFIVDGWAA KF LDV KFL+AAAA Sbjct: 121 NAGGVIEMHHGSHAEKVVYINLVENKTLEPDEDDQRFIVDGWAAGKFGLDVPKFLIAAAA 180 Query: 203 TVEMLGGPRIAKIVV 217 TVEMLGGP+ AKIV+ Sbjct: 181 TVEMLGGPKKAKIVI 195 >UniRef90_UPI00026BAC49 Cadmium-specific carbonic anhydrase n=1 Tax=Thalassiosira weissflogii TaxID=67004 RepID=UPI00026BAC49 Length = 231 Score = 354 bits (909), Expect = 4e-95, Method: Composition-based stats. Identities = 161/209 (77%), Positives = 185/209 (88%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGG 68 +TP IV+AL+ RGW+A I+ AS++S ++ VDP GILKCVDGRGSDNT+ GPKMPGG Sbjct: 22 SITPPQIVSALRGRGWKASIVKASTMSSELKRVDPQGILKCVDGRGSDNTQFGGPKMPGG 81 Query: 69 IYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYP 128 IYAIAHNRG T+++GLK+IT+EVASKGHVPSVHGDHS+DMLGCGFF+LW+TG FD MGYP Sbjct: 82 IYAIAHNRGVTTLEGLKDITREVASKGHVPSVHGDHSSDMLGCGFFKLWLTGRFDDMGYP 141 Query: 129 RPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIK 188 RPEFDADQGA AV+ +GGVIEMHHGSH EKVVYINLV TLEP+E+DQRFIVDGWAA K Sbjct: 142 RPEFDADQGALAVRAAGGVIEMHHGSHEEKVVYINLVSGMTLEPNEHDQRFIVDGWAASK 201 Query: 189 FNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 F LDVVKFLVAAAATVEMLGGP+ AKIV+ Sbjct: 202 FGLDVVKFLVAAAATVEMLGGPKKAKIVI 230 >UniRef90_K0RDT8 Carbonic anhydrase n=1 Tax=Thalassiosira oceanica TaxID=159749 RepID=K0RDT8_THAOC Length = 276 Score = 321 bits (824), Expect = 2e-85, Method: Composition-based stats. Identities = 125/216 (57%), Positives = 159/216 (73%), Gaps = 7/216 (3%) Query: 6 SAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKM 65 LTP+D+V LQ RGWEA I+ S S D+V V+ +G LKCVDGRG D+T GPKM Sbjct: 63 PIMTLTPEDVVGVLQGRGWEATIVKQSECS-DLVPVESSGYLKCVDGRGVDHTNTRGPKM 121 Query: 66 PGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSM 125 GG+YAIAHNRG + D L++I +EV+ KG++PSVHGD +MLGCG+ +LW+TG+F + Sbjct: 122 LGGVYAIAHNRGLKTTDDLQDICREVSEKGYIPSVHGDGDGNMLGCGYCKLWLTGKFADL 181 Query: 126 ----GYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIV 181 G P P + AD GAAAVK G V EM GSH EK VYIN VE++T+EP+ +DQ+F+V Sbjct: 182 DPVKGAP-PTYSADDGAAAVKAKGQV-EMCKGSHAEKFVYINFVEDQTIEPNHDDQKFVV 239 Query: 182 DGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 D WAA+KF+LDV +LV AAATVE LGGP+IAK+VV Sbjct: 240 DAWAAMKFDLDVPSYLVTAAATVERLGGPKIAKLVV 275 >UniRef90_C1N5U2 Predicted protein n=1 Tax=Micromonas pusilla (strain CCMP1545) TaxID=564608 RepID=C1N5U2_MICPC Length = 222 Score = 319 bits (817), Expect = 2e-84, Method: Composition-based stats. Identities = 123/216 (56%), Positives = 155/216 (71%), Gaps = 6/216 (2%) Query: 6 SAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKM 65 AP LTP+D+V LQ RGW AEI+ A+ ++ D+V+V P G LKCVDGR D+ AGPKM Sbjct: 8 PAPELTPEDVVGVLQDRGWTAEIVKAADVA-DLVDVSPTGYLKCVDGRAVDHNNTAGPKM 66 Query: 66 PGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSM 125 GG+YAIAHNRG + L+ I EVA GHVPSVHGD +MLGCG+ +LW+TG+F + Sbjct: 67 LGGVYAIAHNRGKKTTADLEAICAEVAKAGHVPSVHGDGDGNMLGCGYCKLWLTGKFADL 126 Query: 126 ----GYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIV 181 G P P + AD+GAAAVK GG +EM G H EK VYIN V +KT+EP+ ++Q+F+V Sbjct: 127 DPVKGAP-PTYSADEGAAAVKSGGGKVEMCKGKHAEKFVYINFVADKTVEPNGDNQKFVV 185 Query: 182 DGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 D W A KF LD+ +LV AAATVE LGGP+IAK+VV Sbjct: 186 DAWCAKKFKLDIPSYLVTAAATVERLGGPKIAKLVV 221 >UniRef90_C1ECX3 Uncharacterized protein n=1 Tax=Micromonas commoda (strain RCC299 / NOUM17 / CCMP2709) TaxID=296587 RepID=C1ECX3_MICCC Length = 465 Score = 311 bits (797), Expect = 3e-82, Method: Composition-based stats. Identities = 112/222 (50%), Positives = 145/222 (65%), Gaps = 11/222 (4%) Query: 6 SAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSD--NTRMAGP 63 + P P +IV ALQ RGW AEI + S + +V+V P G LKCVDGRGSD + GP Sbjct: 228 AEPRFGPAEIVGALQGRGWSAEIQTQSRNAYQLVKVSPNGFLKCVDGRGSDAKGDQQRGP 287 Query: 64 KMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFD 123 KM GG+Y IA NRG + L+ I +EV + GHVP+VHGD +LGCGF +LW+ +F Sbjct: 288 KMLGGVYGIAVNRGIKTTKELEAICQEVKAAGHVPTVHGDE-GGILGCGFCKLWLNDKFA 346 Query: 124 SMGY---PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFI 180 G +P+F A+ G+ V+++GGV+E H G HTEKVVY+N ++ TLEP+ +DQRFI Sbjct: 347 DEGMVNESKPKFSAEDGSKTVEKAGGVVENHVGKHTEKVVYLNFIDGMTLEPNADDQRFI 406 Query: 181 VDGWAAIKFNLDVVKFLVAAAATVEMLGG-----PRIAKIVV 217 VD WAA KFNLDV K+ V AAATVE L P A ++V Sbjct: 407 VDAWAAGKFNLDVPKYCVTAAATVEKLNPGQAPCPWKAVLIV 448 Score = 252 bits (645), Expect = 1e-64, Method: Composition-based stats. Identities = 113/224 (50%), Positives = 139/224 (62%), Gaps = 18/224 (8%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSI-----SQDMVEVDPAGILKCVDGRGSD--NTRMA 61 PL+ D+ AL SRGW+A I+ + +V+VDPAG LKCVDGRGSD + Sbjct: 4 PLSYGDLGVALASRGWKASILDDRDFCTLFPKEKLVDVDPAGFLKCVDGRGSDAVGKQQH 63 Query: 62 GPKMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGE 121 GPKM GG+Y IA NRG + L+ I +EV + GHVP+VHGD +LGCGF +LW+ G+ Sbjct: 64 GPKMLGGVYGIAVNRGIKTTKELEAICQEVKAAGHVPTVHGDE-GGILGCGFCKLWMNGK 122 Query: 122 FDSMG----YPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQ 177 F G P P+F ADQGAA VK +GGV+E H HTEK V +N V KT P+ DQ Sbjct: 123 FTDEGGVATAP-PDFTADQGAACVKAAGGVVENHVAKHTEKYVILNFVPGKTFVPNGKDQ 181 Query: 178 RFIVDGWAAIKFNLDVVKFLVAAAATVEMLGG-----PRIAKIV 216 RFIVD WA KFNLD+ K+ + AAATVE L P A IV Sbjct: 182 RFIVDCWALGKFNLDITKYALTAAATVEKLNPGQKPCPWKAYIV 225 >UniRef90_A6FY58 Uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 TaxID=391625 RepID=A6FY58_9DELT Length = 226 Score = 289 bits (741), Expect = 1e-75, Method: Composition-based stats. Identities = 102/226 (45%), Positives = 139/226 (61%), Gaps = 19/226 (8%) Query: 10 LTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGI 69 +TP+DI AAL++RGW A I+ S +S D+V+V G++KCVDGR S + M GPK GG+ Sbjct: 1 MTPQDIKAALEARGWTATIVPRSEVS-DIVDVGGDGLMKCVDGRPSFHPAMNGPKTLGGV 59 Query: 70 YAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADM--LGCGFFRLWVTGE------ 121 YAIA R V GL + T++VA+ GHVPSVHGD A+ +GCG+F+LW TG+ Sbjct: 60 YAIASMRDARDVAGLVQATRDVAAFGHVPSVHGDQHAEPPPMGCGYFKLWKTGKLMNLAP 119 Query: 122 ------FDSMGYPR----PEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLE 171 F + P+ P + A++G+ V GGV E G+H E+ V INLV + T E Sbjct: 120 EGKEDEFKASELPKGIVPPNYSAEEGSEIVLSEGGVYETLEGAHEEQEVVINLVTDTTFE 179 Query: 172 PDENDQRFIVDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 P QRF+VD W KFN+D ++L AA TVE+L R A+I+V Sbjct: 180 PSRESQRFVVDAWITDKFNIDAGRYLTVAAKTVELLSDVRKARIIV 225 >UniRef90_A0A0G1CI10 Uncharacterized protein n=1 Tax=Candidatus Gottesmanbacteria bacterium GW2011_GWB1_43_11 TaxID=1618446 RepID=A0A0G1CI10_9BACT Length = 205 Score = 257 bits (656), Expect = 7e-66, Method: Composition-based stats. Identities = 55/210 (26%), Positives = 90/210 (42%), Gaps = 15/210 (7%) Query: 10 LTPKD--IVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPG 67 LTP+ + A RGW+ E + S +V+V C DGR D GP + G Sbjct: 7 LTPQTTSLKDAFLRRGWQVEEV--GSRQAPLVKVRRGAKFGCGDGRNPD----LGPALFG 60 Query: 68 GIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGY 127 + + G + + G+ P++HGD + CGFF W+ G+ G Sbjct: 61 SFWGVMATLTGGESLGAERAKIAIRDLGYQPTIHGDEHGE-FACGFFEKWMHGKLP--GV 117 Query: 128 PRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAI 187 +P F+ ++ + V + H E+ +++N V + T+ PD +RF VD W Sbjct: 118 YQPNFNENELPHILDRVTRV--RYRDKHQERELWLNPVSSTTIRPDT--RRFRVDLWFGE 173 Query: 188 KFNLDVVKFLVAAAATVEMLGGPRIAKIVV 217 + + + VE+L R AKI+V Sbjct: 174 ALGIPRESLIDTSIIVVELLSQVRTAKIIV 203 >UniRef90_A0A0G1QGY3 Uncharacterized protein n=7 Tax=Microgenomates group TaxID=1794810 RepID=A0A0G1QGY3_9BACT Length = 214 Score = 254 bits (649), Expect = 4e-65, Method: Composition-based stats. Identities = 66/222 (29%), Positives = 96/222 (43%), Gaps = 22/222 (9%) Query: 5 TSAPPLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDN-----TR 59 + P T + ++ + GWE + S +V V G++ CVDGR D Sbjct: 2 SPEIPSTNRTMLERMLGSGWEVKEGDPSL----LVRVVRGGLVHCVDGRKVDQFLVPQKI 57 Query: 60 MAGPKMPGGIYAIA----HNRGTTSVDG--LKEITKEVASKGHVPSVHGDHSADMLGCGF 113 + GPK+ GG +A +G + VD ++ + + + G VP VH D L CG Sbjct: 58 VRGPKIQGGAEGVALLLAKAQGVSEVDESWFRKACQVIKNSGFVPGVH---DFDHLHCGH 114 Query: 114 FRLWVTGEFDSMGYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPD 173 F L G+F+ M PR A + V E GG G H E V+ +N N TL P Sbjct: 115 FNLASQGKFEGM--PRFTITAGDMSRIVGEFGGSQVHLAGQHEEYVMRVNWDPNMTLIP- 171 Query: 174 ENDQRFIVDGWAAIKFNLDVVKFLVAAAATVEMLGGPRIAKI 215 N + F +D W A ++ L AA TV L R ++ Sbjct: 172 -NKEAFNLDAWYANVIGINQETLLDNAAKTVMGLSSVRTVEV 212 >UniRef90_A0A1F5DLS4 Uncharacterized protein n=1 Tax=Candidatus Beckwithbacteria bacterium RIFCSPHIGHO2_12_FULL_47_17 TaxID=1797460 RepID=A0A1F5DLS4_9BACT Length = 203 Score = 247 bits (632), Expect = 5e-63, Method: Composition-based stats. Identities = 65/210 (30%), Positives = 99/210 (47%), Gaps = 16/210 (7%) Query: 15 IVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNTR--MAGPKMPGGIYAI 72 + L +GWE + +V V+ G C DGR +T+ + PK+ GG+ Sbjct: 1 MFDDLVRQGWEVKEG----NRDKLVPVEADGFGPCGDGRKPKDTQIKLRAPKILGGVLGK 56 Query: 73 AHN---RGTTSVDGLKEI---TKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMG 126 A + G +I +++ + G PSVHGD GCGF RLW G+ D+ Sbjct: 57 AALGSGKAAAQTIGEYDIRLACRDIKAAGFTPSVHGDTKHGKKGCGFGRLWSEGKLDN-- 114 Query: 127 YPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAA 186 PR ++ + V E GG G H E+ V +N + + TLEPD FI+D WAA Sbjct: 115 VPRLNVSLERVSEIVNEEGGQYIELDGEHEEQRVMVNFIPDMTLEPDG--SCFIIDAWAA 172 Query: 187 IKFNLDVVKFLVAAAATVEMLGGPRIAKIV 216 KF ++ + L A V L GP++ +++ Sbjct: 173 DKFGINQERLLQNAVEVVVKLNGPKVIELI 202 >UniRef90_A0A1F5ZI62 Uncharacterized protein n=1 Tax=Candidatus Gottesmanbacteria bacterium RBG_13_45_10 TaxID=1798370 RepID=A0A1F5ZI62_9BACT Length = 238 Score = 227 bits (579), Expect = 6e-57, Method: Composition-based stats. Identities = 52/222 (23%), Positives = 85/222 (38%), Gaps = 23/222 (10%) Query: 12 PKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGR----GSDNTRMAGP---- 63 + + GW+ + + +V + C DGR ++ + P Sbjct: 21 ARQAAERFRHYGWKVVDVEQKGMVLPLVIGKGPLSVICGDGRYARYFQNHKELN-PQCTI 79 Query: 64 KMPGGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADM-----LGCGFFRLWV 118 + GG Y R +++GL+ + + G V HGD + CGF W Sbjct: 80 SIFGGAYGAQALRFGGTLEGLRTLAEYANKNGLVFRTHGDEHGEHHEPADFNCGFLGKWA 139 Query: 119 TGEFDSM---GYPRPEFDADQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDEN 175 + + P+ EF D A A + G ++ G H E+V+ +N T+ P Sbjct: 140 ERKLRGVMPLEIPKQEFP-DMLAHA-QTLGFGHDILPGVHEERVLVLNFAPGTTVAPQAT 197 Query: 176 DQRFIVDGWAAIKFNLDVVKFLVAAAATVEML-GGPRIAKIV 216 RF VDGW A + L + + + TVE+L R IV Sbjct: 198 --RFRVDGWVAGSY-LGLTNLVDVSRQTVELLKKDVRAVTIV 236 >UniRef90_A0A1F7IY24 Uncharacterized protein n=1 Tax=Candidatus Roizmanbacteria bacterium RIFCSPLOWO2_01_FULL_38_12 TaxID=1802061 RepID=A0A1F7IY24_9BACT Length = 226 Score = 226 bits (578), Expect = 8e-57, Method: Composition-based stats. Identities = 58/203 (28%), Positives = 86/203 (42%), Gaps = 13/203 (6%) Query: 19 LQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGSDNT----RMAGPKMPGGIYAIAH 74 RGW + +V +L C D R + GP + GG IA Sbjct: 29 FLERGWNVKHGDNGI----LVGTSFQSVLNCGDDRFKNGEVPEDHRYGPSIFGGAVGIAA 84 Query: 75 NRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYPRPEFDA 134 R +++G++ T ++++ G+ +HGD D LGCGF RL + G F+ + P D Sbjct: 85 LRREPTLEGVRRATLDISALGYRAGMHGDVENDELGCGFNRLLLNGYFNGV-VGTPAIDL 143 Query: 135 DQGAAAVKESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKF-NLDV 193 + E GG G HT + N V T+ D N+ F VDGW A+ ++ Sbjct: 144 KTARQVLDEHGGSYVDLSGIHTAVGLNFNFVPGTTILSDGNN--FGVDGWFALLIDGVEP 201 Query: 194 VKFLVAAAATVEMLG-GPRIAKI 215 + L AATVE L + I Sbjct: 202 DRLLELTAATVEALKPDAKNVTI 224 >UniRef90_A0A1F5KFU7 Uncharacterized protein n=1 Tax=Candidatus Daviesbacteria bacterium RIFCSPHIGHO2_02_FULL_43_12 TaxID=1797776 RepID=A0A1F5KFU7_9BACT Length = 220 Score = 226 bits (576), Expect = 2e-56, Method: Composition-based stats. Identities = 57/225 (25%), Positives = 90/225 (40%), Gaps = 28/225 (12%) Query: 9 PLTPKDIVAALQSRGWEAEIISASSISQDMVEVDPAGILKCVDGRGS--DNTRMAGPKMP 66 PL +DI+ A + W+ EI+ AS+ Q +V P L+C D R + G ++ Sbjct: 7 PLLARDILQA-RKHNWQVEIVKASNTEQG--QVHPGAALECGDVRFDWLEGRTCWGYRIL 63 Query: 67 GGIYAIAHNRGTTSVDGLKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSM- 125 G + A+A + ++ G + EV G P HG C FF LW TG + Sbjct: 64 GQVNAVAALKTGGNIVGFNQANAEVRRCGCTPGTHGPS------CAFFELWTTGRLKEVP 117 Query: 126 ---GYP------RPEFDADQGAAAVKESGGVIEMH--HGSHTEKVVYINLVENKTLEPDE 174 P R + ++ +GGV + GSH + + N + T Sbjct: 118 FRYDVPMQRMRDRLTGTGNPIKRKMQLAGGVHFVLEDRGSHA-RHLDFNALVGMTDCSGS 176 Query: 175 NDQRFIVDGWAAIKFNLDVVKFLVAAAATVEMLGGPRI--AKIVV 217 D D A + + + + AA VE L P I A+I++ Sbjct: 177 GDAYRQNDAPLA-QLQIPLRTRMAYAAEVVE-LARPEIIKARIII 219 Database: uniref90 Posted date: Mar 5, 2018 1:12 PM Number of letters in database: 999,999,963 Number of sequences in database: 2,877,805 Database: /home/casp13/uniref/uniref90.01 Posted date: Mar 5, 2018 1:14 PM Number of letters in database: 999,999,867 Number of sequences in database: 2,271,643 Database: /home/casp13/uniref/uniref90.02 Posted date: Mar 5, 2018 1:15 PM Number of letters in database: 999,999,892 Number of sequences in database: 2,337,629 Database: /home/casp13/uniref/uniref90.03 Posted date: Mar 5, 2018 1:16 PM Number of letters in database: 999,999,890 Number of sequences in database: 2,373,365 Database: /home/casp13/uniref/uniref90.04 Posted date: Mar 5, 2018 1:17 PM Number of letters in database: 999,999,958 Number of sequences in database: 2,482,055 Database: /home/casp13/uniref/uniref90.05 Posted date: Mar 5, 2018 1:18 PM Number of letters in database: 999,999,016 Number of sequences in database: 2,691,555 Database: /home/casp13/uniref/uniref90.06 Posted date: Mar 5, 2018 1:20 PM Number of letters in database: 999,999,819 Number of sequences in database: 3,172,423 Database: /home/casp13/uniref/uniref90.07 Posted date: Mar 5, 2018 1:21 PM Number of letters in database: 999,999,879 Number of sequences in database: 3,272,745 Database: /home/casp13/uniref/uniref90.08 Posted date: Mar 5, 2018 1:23 PM Number of letters in database: 999,999,650 Number of sequences in database: 3,282,067 Database: /home/casp13/uniref/uniref90.09 Posted date: Mar 5, 2018 1:24 PM Number of letters in database: 999,999,786 Number of sequences in database: 3,299,491 Database: /home/casp13/uniref/uniref90.10 Posted date: Mar 5, 2018 1:26 PM Number of letters in database: 999,999,996 Number of sequences in database: 3,229,471 Database: /home/casp13/uniref/uniref90.11 Posted date: Mar 5, 2018 1:27 PM Number of letters in database: 999,999,625 Number of sequences in database: 3,282,329 Database: /home/casp13/uniref/uniref90.12 Posted date: Mar 5, 2018 1:29 PM Number of letters in database: 999,999,737 Number of sequences in database: 3,239,830 Database: /home/casp13/uniref/uniref90.13 Posted date: Mar 5, 2018 1:31 PM Number of letters in database: 999,999,688 Number of sequences in database: 3,248,497 Database: /home/casp13/uniref/uniref90.14 Posted date: Mar 5, 2018 1:32 PM Number of letters in database: 999,999,725 Number of sequences in database: 3,191,607 Database: /home/casp13/uniref/uniref90.15 Posted date: Mar 5, 2018 1:33 PM Number of letters in database: 999,999,790 Number of sequences in database: 3,240,857 Database: /home/casp13/uniref/uniref90.16 Posted date: Mar 5, 2018 1:35 PM Number of letters in database: 999,999,892 Number of sequences in database: 3,247,903 Database: /home/casp13/uniref/uniref90.17 Posted date: Mar 5, 2018 1:37 PM Number of letters in database: 999,999,793 Number of sequences in database: 3,514,303 Database: /home/casp13/uniref/uniref90.18 Posted date: Mar 5, 2018 1:39 PM Number of letters in database: 999,999,927 Number of sequences in database: 2,742,274 Database: /home/casp13/uniref/uniref90.19 Posted date: Mar 5, 2018 1:41 PM Number of letters in database: 999,999,903 Number of sequences in database: 2,897,731 Database: /home/casp13/uniref/uniref90.20 Posted date: Mar 5, 2018 1:43 PM Number of letters in database: 999,999,225 Number of sequences in database: 2,744,429 Database: /home/casp13/uniref/uniref90.21 Posted date: Mar 5, 2018 1:44 PM Number of letters in database: 999,999,862 Number of sequences in database: 2,520,923 Database: /home/casp13/uniref/uniref90.22 Posted date: Mar 5, 2018 1:46 PM Number of letters in database: 999,999,379 Number of sequences in database: 2,885,596 Database: /home/casp13/uniref/uniref90.23 Posted date: Mar 5, 2018 1:48 PM Number of letters in database: 724,377,402 Number of sequences in database: 2,009,850 Lambda K H 0.308 0.164 0.485 Lambda K H 0.267 0.0509 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 37,304,106,168 Number of Sequences: 70056378 Number of extensions: 1812182658 Number of successful extensions: 4317003 Number of sequences better than 1.0e-03: 14 Number of HSP's better than 0.0 without gapping: 34 Number of HSP's successfully gapped in prelim test: 8 Number of HSP's that attempted gapping in prelim test: 4316845 Number of HSP's gapped (non-prelim): 52 length of query: 218 length of database: 23,724,371,664 effective HSP length: 143 effective length of query: 75 effective length of database: 22,296,244,202 effective search space: 1672218315150 effective search space used: 1672218315150 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.8 bits) S2: 121 (50.9 bits)