SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for 9606.ENSP00000420820 from STRING v9.0.5

Domain architecture


Domain assignment details

Strong hits

Sequence:  9606.ENSP00000420820
Domain Number 1 Region: 372-534
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  2.53e-32
Family                MAM domain                              0.0093
 
Domain Number 2 Region: 39-202
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  5.38e-31
Family                MAM domain                              0.011
 
Domain Number 3 Region: 208-365
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  3.64e-20
Family                MAM domain                              0.0028
 
Domain Number 4 Region: 1421-1481
Classification Level  Classification                          E-value
Superfamily           Serine protease inhibitors              1.42e-09
Family                BSTI                                    0.021
 
Domain Number 5 Region: 1808-1868
Classification Level  Classification                          E-value
Superfamily           Serine protease inhibitors              3.11e-08
Family                BSTI                                    0.034
 
Domain Number 6 Region: 2578-2624
Classification Level  Classification                          E-value
Superfamily           Serine protease inhibitors              2.29e-06
Family                BSTI                                    0.036
 
Domain Number 7 Region: 1043-1095
Classification Level  Classification                          E-value
Superfamily           Serine protease inhibitors              4.41e-06
Family                ATI-like                                0.054
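
The strong hits above appear to be numbered by superfamily E-value rather than by position along the sequence. The following Python sketch reorders them from N-terminus to C-terminus and tallies how much of the 2791-residue sequence they cover. The regions are copied from the listing above; the names DOMAINS, architecture and covered_residues are hypothetical, and treating each region as 1-based and inclusive is an assumption, not something the page states.

```python
# Strong-hit regions copied from the assignment details above:
# (domain number, start, end, superfamily)
DOMAINS = [
    (1, 372, 534, "Concanavalin A-like lectins/glucanases"),
    (2, 39, 202, "Concanavalin A-like lectins/glucanases"),
    (3, 208, 365, "Concanavalin A-like lectins/glucanases"),
    (4, 1421, 1481, "Serine protease inhibitors"),
    (5, 1808, 1868, "Serine protease inhibitors"),
    (6, 2578, 2624, "Serine protease inhibitors"),
    (7, 1043, 1095, "Serine protease inhibitors"),
]
SEQUENCE_LENGTH = 2791  # "Sequence length" reported on this page


def architecture(domains):
    """Return the domains sorted by start position (N- to C-terminus)."""
    return sorted(domains, key=lambda d: d[1])


def covered_residues(domains):
    """Count residues inside any region, assuming 1-based inclusive
    coordinates and non-overlapping regions (both are assumptions)."""
    return sum(end - start + 1 for _, start, end, _ in domains)


ordered = architecture(DOMAINS)
order = [number for number, _, _, _ in ordered]
coverage = covered_residues(DOMAINS) / SEQUENCE_LENGTH

for number, start, end, superfamily in ordered:
    print(f"{start:>5}-{end:<5} domain {number}: {superfamily}")
print(f"covered fraction: {coverage:.3f}")
```

Under these assumptions, the three Concanavalin A-like lectin/glucanase domains sit at the N-terminus (domains 2, 3, 1), followed by the four serine protease inhibitor domains (7, 4, 5, 6).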
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) 9606.ENSP00000420820
Sequence length 2791
Comment (Homo sapiens)
Sequence
MVPPVWTLLLLVGAALFRKEKPPDQKLVVRSSRDNYVLTQCDFEDDAKPLCDWSQVSADD
EDWVRASGPSPTGSTGAPGGYPNGEGSYLHMESNSFHRGGVARLLSPDLWEQGPLCVHFA
HHMFGLSWGAQLRLLLLSGEEGRRPDVLWKHWNTQRPSWMLTTVTVPAGFTLPTRLMFEG
TRGSTAYLDIALDALSIRRGSCNRVCMMQTCSFDIPNDLCDWTWIPTASGAKWTQKKGSS
GKPGVGPDGDFSSPGSGCYMLLDPKNARPGQKAVLLSPVSLSSGCLSFSFHYILRGQSPG
AALHIYASVLGSIRKHTLFSGQPGPNWQAVSVNYTAVGRIQFAVVGVFGKTPEPAVAVDA
TSIAPCGEGFPQCDFEDNAHPFCDWVQTSGDGGHWALGHKNGPVHGMGPAGGFPNAGGHY
IYLEADEFSQAGQSVRLVSRPFCAPGDICVEFAYHMYGLGEGTMLELLLGSPAGSPPIPL
WKRVGSQRPYWQNTSVTVPSGHQQPMQLIFKGIQGSNTASVVAMGFILINPGTCPVKVLP
ELPPVSPVSSTGPSETTGLTENPTISTKKPTVSIEKPSVTTEKPTVPKEKPTIPTEKPTI
STEKPTIPSEKPNMPSEKPTIPSEKPTILTEKPTIPSEKPTIPSEKPTISTEKPTVPTEE
PTTPTEETTTSMEEPVIPTEKPSIPTEKPSIPTEKPTISMEETIISTEKPTISPEKPTIP
TEKPTIPTEKSTISPEKPTTPTEKPTIPTEKPTISPEKPTTPTEKPTISPEKLTIPTEKP
TIPTEKPTIPTEKPTISTEEPTTPTEETTISTEKPSIPMEKPTLPTEETTTSVEETTIST
EKLTIPMEKPTISTEKPTIPTEKPTISPEKLTIPTEKLTIPTEKPTIPIEETTISTEKLT
IPTEKPTISPEKPTISTEKPTIPTEKPTIPTEETTISTEKLTIPTEKPTISPEKLTIPTE
KPTISTEKPTIPTEKLTIPTEKPTIPTEKPTIPTEKLTALRPPHPSPTATGLAALVMSPH
APSTPMTSVILGTTTTSRSSTERCPPNARYESCACPASCKSPRPSCGPLCREGCVCNPGF
LFSDNHCIQASSCNCFYNNDYYEPGAEWFSPNCTEHCRCWPGSRVECQISQCGTHTVCQL
KNGQYGCHPYAGTATCLVYGDPHYVTFDGRHFGFMGKCTYILAQPCGNSTDPFFRVTAKN
EEQGQEGVSCLSKVYVTLPESTVTLLKGRRTLVGGQQVTLPAIPSKGVFLGASGRFVELQ
TEFGLRVRWDGDQQLYVTVSSTYSGKLCGLCGNYDGNSDNDHLKLDGSPAGDKEELGNSW
QTDQDEDQECQKYQVVNSPSCDSSLQSSMSGPGFCGRLVDTHGPFETCLLHVKAASFFDS
CMLDMCGFQGLQHLLCTHMSTMTTTCQDAGHAVKPWREPHFCPMACPPNSKYSLCAKPCP
DTCHSGFSGMFCSDRCVEACECNPGFVLSGLECIPRSQCGCLHPAGSYFKVGERWYKPGC
KELCVCESNNRIRCQPWRCRAQEFCGQQDGIYGCHAQGAATCTASGDPHYLTFDGALHHF
MGTCTYVLTRPCWSRSQDSYFVVSATNENRGGILEVSYIKAVHVTVFDLSISLLRGCKVM
LNGHRVALPVWLAQGRVTIRLSSNLVLLYTNFGLQVRYDGSHLVEVTVPSSYGGQLCGLC
GNYNNNSLDDNLRPDRKLAGDSMQLGAAWKLPESSEPGCFLVGGKPSSCQENSMADAWNK
NCAILINPQGPFSQCHQVVPPQSSFASCVHGQCGTKGDTTALCRSLQAYASLCAQAGQAP
AWRNRTFCPMRCPPGSSYSPCSSPCPDTCSSINNPRDCPKALPCAESCECQKGHILSGTS
CVPLGQCGCTDPAGSYHPVGERWYTENTCTRLCTCSVHNNITCFQSTCKPNQICWALDGL
LRFGPQVWECVSSQGSPTTALMVVTILSRTPALLSWKCATPPWPCPSSRSVPSMRRRKVE
LRLSAFMRSTLTSTMPRSPCRRATVCSTANRSPSPPSPRSLGSVSSPAASTALLTSRSGC
KSSLTGIISRLKSPQPTMERSAACVGTSMMRKRTNCPAMKQIVTVNLTVGKIRTLTQVVR
VSWMSSRFQRNSRRTRVETAGRPTSAGRGKSARQRSGLLCGPSAPPATSRPSWWTVQTPS
VSSEVSTRPSARLCKPSGPPARARGSSPHSGETAASALWNALPTAATPTAFPPAHPPAGT
WMAGVRAPKSPLPALRAAFVSPAMCVKTSVSPEVSVAARMPMVAPSLWARAGSPAVARRS
VSAREEPFSAGTSDAPLGPTASSLPTTATAIVSQTSLNNAQSMATPVTSHLTASATACKA
APMFSRLWTYCLRGWSPSSWKDATRWIRPGAPSSCRKLPPSTAIKCSSKLVWSLWSTTRR
WPSPTGQMNTCGSPCGANGSTWSPTLSWSSALVEGKMQSPYPACTRGLVACAETTTRTAR
MTCCPVAPPRTSTPLATAGRRPRTHSCASPGLYQRRRRDKGRSWASARASKCPNVARSSW
RATAPRPVGCWQTPRAPLLPVTRRWPQSPSKSTACWICALLRTQESKRSCVARSSVAMEC
PAGTIYQSCMTPCPASCANLADPGDCEGPCVEGCASIPGYAYSGTQSLPWLTVAAPAMAS
TTSWAAAFLRTALSGAPVPAHGSCCVSPSAAERGRSAPWGTTPKAAFQKARVCRTPVRMT
GSVGSREPPSPASVKLVTGEACVWSLEMRHLPESQHLTWWASYWDCWCLWWSYYWPPESA
FTERGGRERKRRRETDWPGWWTQILFWTVPV
Identical sequences 9606.ENSP00000420820
