SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|20093372|ref|NP_619447.1| from Methanosarcina acetivorans C2A

Domain architecture


Domain assignment details

Strong hits

Sequence: gi|20093372|ref|NP_619447.1|

Domain  Region     Superfamily                              Superfamily E-value  Family                                   Family E-value
1       2656-2738  PKD domain                               1.7e-31              PKD domain                               0.00023
2       2030-2118  PKD domain                               2.62e-30             PKD domain                               0.00016
3       2565-2651  PKD domain                               4.45e-30             PKD domain                               0.00023
4       2121-2203  PKD domain                               2.22e-28             PKD domain                               0.00029
5       1102-1186  PKD domain                               3.4e-28              PKD domain                               0.00026
6       464-545    PKD domain                               1.83e-27             PKD domain                               0.00026
7       1021-1103  PKD domain                               5.49e-27             PKD domain                               0.00043
8       545-628    PKD domain                               1.16e-26             PKD domain                               0.00029
9       940-1020   PKD domain                               1.83e-26             PKD domain                               0.0003
10      351-462    PKD domain                               2.22e-26             PKD domain                               0.00029
11      1505-1584  PKD domain                               1.12e-24             PKD domain                               0.00025
12      2203-2288  PKD domain                               1.83e-24             PKD domain                               0.00037
13      1589-1668  Invasin/intimin cell-adhesion fragments  0.000000000194       Invasin/intimin cell-adhesion fragments  0.0092
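
The strong hits are listed by E-value, not by position, so the layout of the domains along the 2951-residue sequence is not obvious at a glance. The following is a minimal sketch, not part of the SUPERFAMILY output, that reorders the thirteen strong hits from N-terminus to C-terminus and reports the gap between neighbouring regions. The names `STRONG_HITS`, `order_by_position` and `gaps_between` are hypothetical, and the code assumes the region coordinates are 1-based and inclusive.

```python
# (domain number, start, end), copied from the strong hits listed above.
STRONG_HITS = [
    (1, 2656, 2738), (2, 2030, 2118), (3, 2565, 2651), (4, 2121, 2203),
    (5, 1102, 1186), (6, 464, 545), (7, 1021, 1103), (8, 545, 628),
    (9, 940, 1020), (10, 351, 462), (11, 1505, 1584), (12, 2203, 2288),
    (13, 1589, 1668),
]


def order_by_position(hits):
    """Sort hits by start coordinate, i.e. from N-terminus to C-terminus."""
    return sorted(hits, key=lambda hit: hit[1])


def gaps_between(hits):
    """For each pair of neighbouring domains, return (left, right, gap).

    gap is the number of unassigned residues between the two regions,
    assuming inclusive 1-based coordinates (an assumption of this sketch).
    A negative gap means the two regions overlap by that many residues.
    """
    ordered = order_by_position(hits)
    return [
        (left[0], right[0], right[1] - left[2] - 1)
        for left, right in zip(ordered, ordered[1:])
    ]


if __name__ == "__main__":
    for left, right, gap in gaps_between(STRONG_HITS):
        print(f"domain {left:>2} -> domain {right:>2}: gap {gap}")
```

Under these assumptions, domains 6 and 8 share residue 545, and the longest unassigned stretches (for example residues 629-939 between domains 8 and 9) fall between clusters of adjacent PKD domains.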
 
Weak hits

Sequence: gi|20093372|ref|NP_619447.1|

Domain  Region     Superfamily             Superfamily E-value  Family                  Family E-value
-       2897-2948  Type I dockerin domain  0.00334              Type I dockerin domain  0.0098
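
The split between strong and weak hits can be reproduced from the superfamily-level E-values alone. The sketch below is illustrative only: the cutoff `ASSUMED_CUTOFF = 1e-4` is an assumption chosen to be consistent with the hits on this page, not a value stated here, and `HITS` and `split_hits` are hypothetical names.

```python
from collections import Counter

# (region, superfamily, superfamily E-value), copied from the hits above.
HITS = [
    ("2656-2738", "PKD domain", 1.7e-31),
    ("2030-2118", "PKD domain", 2.62e-30),
    ("2565-2651", "PKD domain", 4.45e-30),
    ("2121-2203", "PKD domain", 2.22e-28),
    ("1102-1186", "PKD domain", 3.4e-28),
    ("464-545", "PKD domain", 1.83e-27),
    ("1021-1103", "PKD domain", 5.49e-27),
    ("545-628", "PKD domain", 1.16e-26),
    ("940-1020", "PKD domain", 1.83e-26),
    ("351-462", "PKD domain", 2.22e-26),
    ("1505-1584", "PKD domain", 1.12e-24),
    ("2203-2288", "PKD domain", 1.83e-24),
    ("1589-1668", "Invasin/intimin cell-adhesion fragments", 0.000000000194),
    ("2897-2948", "Type I dockerin domain", 0.00334),
]

# Assumption of this sketch, not a threshold quoted on this page.
ASSUMED_CUTOFF = 1e-4


def split_hits(hits, cutoff=ASSUMED_CUTOFF):
    """Partition hits into (strong, weak) by superfamily E-value."""
    strong = [hit for hit in hits if hit[2] <= cutoff]
    weak = [hit for hit in hits if hit[2] > cutoff]
    return strong, weak


if __name__ == "__main__":
    strong, weak = split_hits(HITS)
    # Count how many strong hits fall in each superfamily.
    print(Counter(name for _, name, _ in strong))
    print(weak)
```

With this assumed cutoff, the thirteen PKD and Invasin/intimin hits are classed as strong and the Type I dockerin hit at 2897-2948 as weak, matching the grouping shown on the page.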
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s): gi|20093372|ref|NP_619447.1|
Sequence length: 2951
Comment: cell surface protein [Methanosarcina acetivorans C2A]
Sequence
MTKPKVMNISSKYAIYILPLLLILLLACTASALPTVAHDKVQGEMYVSSTANWESKYSTN
NFDVPNGTVVFARYYVGVWASSATTTSISTIFNGNAFATNPSCYSSGMGVTWIPYDVTDY
VVPGEINTATINSASWGDGRQYGTTLVVVLKNENKSQIEYWIADGLDWLHYGDYVGDEVD
NSFTYFNGTVDLADVQSANLYSTHLTGYNYEDFNGYSLSDPADSVSGDYFNYIRWDNVKA
SLVAENQTVNVGRGGDAYCSPVFHALSIAYKIPDLVPVSLTPVTVVPNTVNTMTATIENQ
GNKDSTSFNVSLLVDGIVVDTQTVTSLESENSTNVDFHWTLDGTANSYTLTVNVDPENAV
NEGNESNNTLTALVGTTTAPIPVADFTATPTAGEVPLTVNFTDQSANSPLSWTWDFNNDG
TMDSIMQNPTYTYDTPGNYTVKLTVSNGGGNDEEVKTDYIFVNYKRPIVNFTANLTKGNA
PLTVQFNDTSLNSPTVWYWDFGDNTISTEQNPVHTYTAAGNYTVNLTVTNAGGSNSSIKT
EYITVGSGVPIANFTANTVSGYTPLEIRFFDTSTINPTSWAWDFGDNTTSTEQNPVHTYI
SPGLYTVNFTVSNDYGSDIKTKANYIYAGKCKFSQNISFSAGGQAIYQQDIMIHRTSGTA
YEENAGGLKIWHVYLGDSCREDYGDLRFTDATGTQLTYYLWPDYTSEEARFCVRLERADQ
PGRLTICYGDPGATTTSNGNATYFLFDQFDGTALDTTKWEPVQDNGISVSSGGLHISSGT
GNFAEIFSRTAVPSGIILQFRIQSKTYSTSLGFGNRDYTDNGSSIGLESFSSSYNSAIYS
GEAYSNLRWAPPRTWTSKSTDGDIIPDTPGGYYTEELVISPDEPLKERRDGEAWTNSTRY
VGVSGTKPIQIEHYRKYGKMDLDYILARAYSSTPPSALGLGPIANFTVNMTNGNAPFTVH
FTDSSKNSPATWIWDFGDNTTSTEQNPVHTYVVAGNYTVNLTVTNSYGSDTGVKTDYISV
GSGVPVANFTTNKTGGNAPLTVKFTDTSTINPATWIWDFGDNTTSTEQNPVHTYVVAGNY
TVNLTATNSYGSNTCVKIGYITVGSGVPVANFTANVTGGYAPLTVQFNDTSTVNPASWSW
DFGDGNTSTEQHPVHTYMSAGTYTISLTVTNTYGSGTETKVDHIYAGMYEYNQQISYSAS
DRAVYQQDIVVHRSNGTAYEENADGLKVWHVYLGDNCRDDYGDVRFTDVTGAKLAYYLWP
DYTSEQARFCVRIENTDQPGTLTICYGNPGITTTSNGNATYFLFDQFDGTVLDTTKWELV
QNNGISVSSGGLHISSGTGNFAEIFSRTTVPSGIILQFRIQSKTYSTSVGFGNRDYTDNG
SSIGLENYDSSYNSAIYSGEAYSNLRWAPPRTWSSKSTDGDIIPDTPDGYYTEELVISPD
EPLKERRDGGTWTNSTRYVGVSGTKPIQIAHYRKYGKMDLDYILARAYSTTPLFASAFSG
EQQTAAPPVASFTATPIYGFPHTIQFTDKSFNFPTEWTWDFGDGTTSPEQNPVHTYAADG
NYTITLTATNEYGTDTETKDYEVVTLFAASLTVSPSSVQLNQSETQYFTAVAVDQLGNVI
SNTSINWSSSNETIGTIDSNGLFTANVPGKVNITASAEGVTGLAEVTVMKASPDFIVSVV
TSPMYPISHNTVTATIENHGSEDASEVTANVTIAGNTTTITVPALAIGSSTTISVKDTAR
WHVGDLVPITVVVDPGNEIDEANETNNVYTKNATISATSQRYNGGRFSDGYDKVNNLFYA
EGNVGVAVAISGNYGSQYGVSGVTLTRKFSADDLDIPAGATIKSARLYQGSTWYGDPGFN
LQFNGHETQEADAKYGDCINGQYAFDVTTYLNTTGDNIAVLTSTNSLNKYAYYATVLIVV
YEADSEPYRQIWVNEGSDCLLADYGADLAWGYTMFDNVSTDSLFSARTITVLESDDGDVN
SINFNGESLPTIKTGGSDPTIKYFSVTDALQGGENELGVTGPSYFNFANAVLEVTQVTAS
EANFTANVTSGNAPLNVEFTDTSTGTPTSWTWDFGDGKNSTEQNPTHTYTAEGTYTVKLT
VSNSFGSDSEEKTGYITAGSVVLAPVANFSVDQTTGTAPLSVQFTDESTNTPTSWTWEFG
DGKTSTEQNPTHTYETIGTYTVKLTATNYGGSNFTIKTDYITVTSNVSAPVASFTFDENS
GRVPFTVQFTDTTTGSVSSWNWDFGDGGTSNEQNPTHTYVTEGSYNVTLTATGPGGSNTI
TSTEPVVVSAPLTSDSYNGGIPLTNVQNGTVSGDLWYDSYYAMETSAQKAFTLPSYTDIK
WARLYVDVYDGHMENNYRGNVEISIDADGDSTYELQKNETFNTTYSFPGEGGTGPVWLSD
HLNRVTSDYQMWYDLTGEISGQTVNVQAITSKIDSNFDGRVKAMTLVVAYDDGDSDEVYY
WVNQGHDTVNPLDTEYTGSTSFGTSTLASGWSSANLTAIYLASVDGIYSFQGTTLTSGTP
QGSYYGDNTWDVSSMLTAGEYSIFTYNKQEEKYYKIPLALMSVKYAGSGPTAPTAGFSAN
VTEGEVPLTVLFSDESTGSPTAWVWDFGDNETSSEQSPVHTYSAAGNYTVTLTVTNAAGS
DSEVKTDYIIVSESSMPEEPVAAFNANVTEGEVPLTVQFSDESTGSPTSWFWDFGDGANS
TEQNPSHTYPSAGNYTVNLTVENAAGSDFELKSDYIEVSDASGSTVTLYFDPTSSSVAEN
ESTEISIVASNFPAGLSGYNLTVAIDDPAVAEIIDIEYPTWALITENSTLPGTSIYMKTI
DLEDSVKEGAADVMLATLTVSGKESGSANLSIGVKRLEDDSGDSIEPALLAGTIEVTLLS
PLPDQEYAPKDLDGDGLYEDLTGNGEFSFVDIVAYFHNMDWIEENMPVEYFDFNGNGRID
FDDVVDMFGMI
Identical sequences: Q8THC9 gi|20093372|ref|NP_619447.1| 188937.MA4588 MvR254 NYSGXRC-10384k
