SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|20090613|ref|NP_616688.1| from Methanosarcina acetivorans C2A

Domain architecture


Domain assignment details

Strong hits

Sequence:  gi|20090613|ref|NP_616688.1|
Domain  Region     Superfamily                              Superfamily E-value  Family                                   Family E-value
1       111-350    Quinoprotein alcohol dehydrogenase-like  1.88e-33             Quinoprotein alcohol dehydrogenase-like  0.0038
2       1817-1901  PKD domain                               2.49e-29             PKD domain                               0.00024
3       3020-3101  PKD domain                               7.85e-29             PKD domain                               0.00019
4       3663-3775  PKD domain                               1.44e-28             PKD domain                               0.00018
5       1983-2067  PKD domain                               1.96e-28             PKD domain                               0.00022
6       791-874    PKD domain                               3.14e-28             PKD domain                               0.00022
7       1904-1984  PKD domain                               5.49e-28             PKD domain                               0.0002
8       1181-1262  PKD domain                               4.05e-27             PKD domain                               0.00039
9       1738-1818  PKD domain                               4.58e-27             PKD domain                               0.00028
10      1262-1345  PKD domain                               1.96e-26             PKD domain                               0.00037
11      22-108     PKD domain                               2.22e-26             PKD domain                               0.00014
12      709-792    PKD domain                               7.19e-26             PKD domain                               0.00037
13      2924-3016  PKD domain                               4.05e-25             PKD domain                               0.00026
14      455-541    PKD domain                               5.36e-25             PKD domain                               0.00036
15      2388-2473  PKD domain                               7.06e-25             PKD domain                               0.00033
16      1349-1427  PKD domain                               1.02e-24             PKD domain                               0.00023
17      540-625    PKD domain                               1.2e-23              PKD domain                               0.00042
18      625-709    PKD domain                               2.75e-23             PKD domain                               0.00031
19      3102-3187  PKD domain                               1.07e-22             PKD domain                               0.0005
20      2477-2561  Invasin/intimin cell-adhesion fragments  0.00000000000179     Invasin/intimin cell-adhesion fragments  0.005
21      311-458    YWTD domain                              0.00000000000889     YWTD domain                              0.028
22      3610-3688  Invasin/intimin cell-adhesion fragments  0.000000133          Invasin/intimin cell-adhesion fragments  0.0051
 
Weak hits

Sequence:  gi|20090613|ref|NP_616688.1|
Domain  Region     Superfamily                              Superfamily E-value  Family                                   Family E-value
-       3934-3985  Type I dockerin domain                   0.00275              Type I dockerin domain                   0.0098
-       2331-2393  TRAP-like                                0.0628               PA3696/SPS0176-like                      0.013
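The strong hits above are listed in order of superfamily E-value, not in order along the protein. The following is a minimal Python sketch, not part of the SUPERFAMILY output, that reorders the strong-hit regions by start position and reports which neighbouring regions overlap. The names `Hit`, `STRONG_HITS`, `in_sequence_order` and `overlapping_pairs` are hypothetical, and the sketch assumes that region coordinates are inclusive residue positions.

```python
from collections import Counter
from typing import NamedTuple


class Hit(NamedTuple):
    number: int       # "Domain Number" from the strong hits listing
    start: int        # first residue of the region
    end: int          # last residue of the region (assumed inclusive)
    superfamily: str


PKD = "PKD domain"
INVASIN = "Invasin/intimin cell-adhesion fragments"

# Strong hits transcribed from the listing above (weak hits are left out).
STRONG_HITS = [
    Hit(1, 111, 350, "Quinoprotein alcohol dehydrogenase-like"),
    Hit(2, 1817, 1901, PKD), Hit(3, 3020, 3101, PKD),
    Hit(4, 3663, 3775, PKD), Hit(5, 1983, 2067, PKD),
    Hit(6, 791, 874, PKD), Hit(7, 1904, 1984, PKD),
    Hit(8, 1181, 1262, PKD), Hit(9, 1738, 1818, PKD),
    Hit(10, 1262, 1345, PKD), Hit(11, 22, 108, PKD),
    Hit(12, 709, 792, PKD), Hit(13, 2924, 3016, PKD),
    Hit(14, 455, 541, PKD), Hit(15, 2388, 2473, PKD),
    Hit(16, 1349, 1427, PKD), Hit(17, 540, 625, PKD),
    Hit(18, 625, 709, PKD), Hit(19, 3102, 3187, PKD),
    Hit(20, 2477, 2561, INVASIN),
    Hit(21, 311, 458, "YWTD domain"),
    Hit(22, 3610, 3688, INVASIN),
]


def in_sequence_order(hits):
    """Return the hits sorted by start position along the protein."""
    return sorted(hits, key=lambda h: h.start)


def overlapping_pairs(hits):
    """Return (number_a, number_b, shared_residues) for overlapping regions."""
    ordered = in_sequence_order(hits)
    pairs = []
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            if b.start > a.end:
                break  # sorted by start, so no later hit can overlap a
            shared = min(a.end, b.end) - b.start + 1
            pairs.append((a.number, b.number, shared))
    return pairs


if __name__ == "__main__":
    for hit in in_sequence_order(STRONG_HITS):
        print(f"{hit.start:>4}-{hit.end:<4}  domain {hit.number:<2}  {hit.superfamily}")
    print(Counter(h.superfamily for h in STRONG_HITS))
    for a, b, shared in overlapping_pairs(STRONG_HITS):
        print(f"domains {a} and {b} share {shared} residue(s)")
```

Sorting by start position gives a text approximation of the domain architecture. The overlap report shows where adjacent assignments share boundary residues, for example domains 1 and 21.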
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): gi|20090613|ref|NP_616688.1|
Sequence length: 3988
Comment: cell surface protein [Methanosarcina acetivorans C2A]
Sequence:
MKRIFLILCITVLMCMTGLASADVTTVEPVANFTANATSGTTPLTVQFTDISTNATSWSW
DFENDGTGDSTGQNPAHTYDEAGTYSVKLTVTNTAGSDSEVKTDYITVEGTGTGGIADTP
WPKFQANLNNTGQSPYIGPQINNNIWTYVTGNSIRSSPAIGENRTVYIGSYDGKLYAFNP
DGTLKWSYTTGNQITGSATIGADGTICIGSYDRRLYTINPDGTLKWSYTTGNQIFSSAAI
GEDGTIYVGSRDNKLYALNPDGTLKWSYTTGNQIFSSAAIGEDGTIYIGSLDKKLHALNP
DGTLKWSYTTGNQIYGSPAIGSDGTIYIGSLDSKLHALNPDGTLKWSYTAGNQIYGSPAI
GSDGTIYIGSLDSKLYALNPDGTLKWSCTVGNQILGSAALGSDGTIYIGSYDSKIYAITP
DGTLKWSYATGNRIYGSPAIGSDGTLYIGSFDSSLYAFRDAVPVAGFTVNLRNGEVPLAV
QFTDFSTGNITERSWDFGDNTTSSDRNPIHTYVAAGTYTVNLTVSNIYGSNSSLKADYIT
VGSGIPVGNFTANVTKSIIPPLTVQFNDTSTINPTDWFWDFGDNITSTDRNPIHTYVAAG
TYTVNLTVSNTYGSNSSLKADYITVGSGIPSGNFTANVTSSPAPPLTVQFNDTSTINPTD
WFWNFGDNSTSTEQNPVHTYAADGSYTVNLTVSNSYGSNNCVKADYIMVGSGIPVGNFTV
NVTGGSAPLTVRFNDTSTVNPASWAWDFGDNTASTEQNPVHTYASDGNYTVNLKVSNSYG
SNKCVKTDYITVGSGIPVANFTAKVTSGYVPLIIQFTDTSTVNPTSWAWDFGDNTTSTEQ
NPVHTYASVGNYTVTLKVTNSYGSNTETKAGYIYAGSGFSRNISFTASDQSVYQQEIVIH
RTNGTDYEENTGDLNVWHVYIGDQCREDYGDLRFVDSTGTQLACYLWPDYTAEQALFYVR
LEGADQPGKIQILYGDTGVSTASDADATGYLIDEFSTLNPNWNTSGVASAVINESRLMTT
GNTGTFHSLSSAAVKQSITPIPGAFSAEVNLTYDASEVNCRGELFLVAYTSTDYTAIGYY
DYSSNYYGCFFYSISGTTGDTGKYGSRPGSGSMHLKIIRDASNTISVYEDGTLMGTGTMA
GSITAIGLTNTGYSSSRPGDTAYWDNLIVRSYSTTPPAESGLQPIVNFTANQTGGNAPLT
VQFNDTSRYYPTSWFWEFGDGSNSTEQNPVHTYATEGNYTVNLTVANNFSSNTCVKTNYI
TVGSGIPIANLTANVTGGNAPLTVQFNETSTVNPASWFWDFGDNTSSTEQNPVHNYATEG
NYTVNLTVANSYGSNACVKTDYIMVSSGIPDGNFTSNVTKNYAPFTVQFEDASTVNPTSW
AWDFGDGTTSTEQKPVHTYTTPGNFTVNLTVSNSYGSNSSVKADYIHAGRYIYSQNITYT
AGDQAAYQQELIIHRTNGSPYEENTADGIKVWHVQAGDQCREDYGDVRFTDATGSELAYY
LLPNYTTEQGRFYVRLEEANQPGKLQVLYGDLEVLTTSDADATGYLIDEFSTLNPNWDTS
GVASATIENGQLKTIPHGALISSGQYRTAGVIRSITPIPGEFSIEVDLTYYPYRYSDGDI
FLAVYTSDSYTLIGYYQPTWGTDGGFGTFIDGQYKSFTGDGTRPDAGSMYLKITRDASNK
ISAWENGVLMATGTMAGNITSIGLVSTDLNNAAGGHTAYWDNLAIRSYSTAPPAASGLKP
VGNFSVNTTGGNAPLTVQFNDSSFNSPASWAWDFGDGTTSTEQNPMHTYAADGSYTVNLT
VSNSYGSSTLVKTDYITVGSSIPVANFTASVTGGNAPLTVQFTDTSTINPTSWAWDFGDG
TTSTEQKPVHTYAADGNYTVNLTVSNSYGNNTCVKTDYIIVSSGVPVANFVSSVTGGNAP
LTVQFTDTSTINPTSWAWDFGDGTTSTEQKPVHTYSSDGNYTVNLTVANSYGSNSSVKTD
YITVGSGIPVANFTADVTNGYPRFAVQFTDTSTINPTSWAWDFGDGTTSTEQNPRHTYST
SGTYPVSLTVTNGYGSNTATKTGYIYAGRNEYYQKIDFSAGSQSVYQQVVVIHRSTGTAY
EEDSNGMKVWHLYVGDRCREDYGDLRFTDSTGSQLAYYLQPDYTAAQARFYVRLEGVDQP
GALNVLYGDAGLTTTSNTDVTENAGDESGYLFDAFSSLSTDWDISGVASATIDSDRLKTA
PNTNTLGAYSTAAVKRAITPISGAFTVDVDLTYVPTSTSYPYRSRGELYLVLYNSPASYA
GAGYYDNYYGSSEEYNGSFFYTITEANADTGQGTRPASGSMHLRISRDAANTVSVYEDGV
LQATGTMPGDITAIGLTNTRYSTNSYYYTACWDNLVVRADTVSLPSASQFSGEVRTAAPP
VASFTGTPVYGLPHAVQFTDTSANYPTAWSWDFGDGTTSTEQNPRHTYSADGNYTVTLTA
TNEYGSDTVTISDYKVETMIITSVTVSPSSVRLNETETRQFTAVLLDQKGNVMTNTSVSW
SSSDKAVGSIDASGLFTARAAGKTNVTASAGGLNSSAEVTVIGLAPDLTVLSVSSTTSPV
SNTVSATIRNVGTSDPGVFRTSFSVNGKVTGINVTGLAAGNTTDVSFTDLTRRKTGDIVS
ISATVDPENLVAELNETNNAYAAEVTVGTTGNYYYGGRYYSGIDLETGVYVEGNVALIYS
QGSSGYQTGGGWYSTTVQYTSTDLPIPENAVVKEARLYQSYTWCNGDPGFTLQFNGNTVN
QSAFYADPLKDGDGAEYGFNGQAVYNVTRYFNPEGNTAIIAASRPGGGLYGAVLAVVYED
PGEPHRMVWLDEGCDSLYGGSSDEYVGYAVFNNVTTSRVTSASATTVLPSGGDNGQWTIL
FNKQSVSLTGGAGSDPGYKYYDVTSSLQEGTNELGVRCDGYMNLAAAFLEVTLETASEAN
FTASTTSGNAPLVVQFTDTSTGTPTSWLWDFGDGSTSTEQNPVHTYSTANTSYTVALTVS
TSLGADTETKTDYINVGALVLAPAADFSANVTGGEAPLSVRFTDASANTPTSWEWDFGDG
STSTEQNPVHTYEAKGTYNVTLTASNYGGNNTLIKTDYISVTSDVSAPVANFTIDADTGQ
VPFTVHFTDTSTGSVSNWKWDFGDGSTSTEQNPVHTYLTPGINTVTLAATGVGGISTTTG
IVTATAPLTSDSYNGGIPLTTVRNGTVSGGLWYDSYPGFETSAQETFTLPDYTEIKWARL
YVDVYCGHMQNNYRGNVTIDINADGDNTYELQEREIFNTSYSFPGDGGAGPVWLSDHMNR
VTSDYLMWYDLTDVISSGEVSVRASSGKIDFSFDGRIKAMTLVIAYDDGDSDKVYYWVNQ
GHDTVNPGDDSTGYTGSTEFETALLAGGWDSANLSAVYYASVNGVYTFKGTALSSGTPEG
SYFGADEWDINSMLTAGQNSTLTYDRDGSNYYKIPLALLSVRYENLSLPWQDNCDSLDDW
TCSNCGLVSTTVYEGSYSIGCDASQQSASAERTIRIPSGAKTLRFDAATTSTMYAYNEYV
KFYLDGNEIFSIPVTAAKNWHLYEFDLSGIEPGKHTFKVQADWNGYSWDNIGFYIDNIWV
IADEEVLSVINVTPSEAELSVGENLTFAASAYTQYCESLPDTTFTWSSSSEAVGSINATT
GFFTALTGGTTNVTASAEGMNGSVQITVKASSTAAPVANFSTNVTGGYVPLDVQFTDLST
GEGITGWLWDFGDGANSTEQNPVHTYESAGNYTVNLTVENAAGRDFELKTDYIEVSEASG
STVTLYFDPENSSVAENESTEISIVASNFPAGLSGYNLTVAIDDPAIAEIVDIEYPSWAL
ITQNSTLPGTSIYMKTVDLEDAVKEGAADVVLATLTVSGKEKGSANLSIGVKRLEEDSGD
SIEPALLAGTIEVTLLSPLPDQEYVPRDPDGDGLYEDLTGNGEFSFVDIVAYFHNMDWIE
ENMPVEYFDFNGNGRIDFDDVVDMFAMI
Identical sequences Q8TPZ1
MvR120 NYSGXRC-10384q 188937.MA1762 gi|20090613|ref|NP_616688.1|
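Regions from the domain assignment details can be mapped back onto the protein sequence. The sketch below is illustrative only: it assumes the region coordinates are 1-based and inclusive, uses a hypothetical helper named `region`, and works on the first 120 residues of the sequence (its first two lines), which is enough to cover the PKD domain assigned to region 22-108.

```python
# First 120 residues of the protein sequence (its first two 60-residue lines).
N_TERMINUS = (
    "MKRIFLILCITVLMCMTGLASADVTTVEPVANFTANATSGTTPLTVQFTDISTNATSWSW"
    "DFENDGTGDSTGQNPAHTYDEAGTYSVKLTVTNTAGSDSEVKTDYITVEGTGTGGIADTP"
)


def region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]


# Domain Number 11 (PKD domain) is assigned to region 22-108.
domain_11 = region(N_TERMINUS, 22, 108)
print(len(domain_11))  # 87
print(domain_11)
```

The same helper applies to any other region once the full 3988-residue sequence is joined into a single string.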
