SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000012572 from Equus caballus 76_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSECAP00000012572
Domain Number 1 Region: 3877-4066
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.05e-39
Family Laminin G-like module 0.00046
Further Details:      
 
Domain Number 2 Region: 4130-4331
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 8.03e-39
Family Laminin G-like module 0.00066
Further Details:      
 
Domain Number 3 Region: 1618-1634,3633-3813
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.58e-33
Family Laminin G-like module 0.0013
Further Details:      
 
Domain Number 4 Region: 3541-3628
Classification Level Classification E-value
Superfamily Immunoglobulin 7.01e-20
Family I set domains 0.011
Further Details:      
 
Domain Number 5 Region: 3179-3272
Classification Level Classification E-value
Superfamily Immunoglobulin 2.26e-19
Family I set domains 0.0088
Further Details:      
 
Domain Number 6 Region: 1932-2020
Classification Level Classification E-value
Superfamily Immunoglobulin 3.24e-19
Family I set domains 0.025
Further Details:      
 
Domain Number 7 Region: 3260-3353
Classification Level Classification E-value
Superfamily Immunoglobulin 5.84e-18
Family I set domains 0.016
Further Details:      
 
Domain Number 8 Region: 1652-1747
Classification Level Classification E-value
Superfamily Immunoglobulin 7.3e-18
Family I set domains 0.024
Further Details:      
 
Domain Number 9 Region: 2599-2686
Classification Level Classification E-value
Superfamily Immunoglobulin 1.31e-17
Family I set domains 0.015
Further Details:      
 
Domain Number 10 Region: 3463-3550
Classification Level Classification E-value
Superfamily Immunoglobulin 1.46e-17
Family I set domains 0.014
Further Details:      
 
Domain Number 11 Region: 2508-2591
Classification Level Classification E-value
Superfamily Immunoglobulin 1.53e-16
Family I set domains 0.011
Further Details:      
 
Domain Number 12 Region: 2407-2497
Classification Level Classification E-value
Superfamily Immunoglobulin 2.11e-16
Family I set domains 0.04
Further Details:      
 
Domain Number 13 Region: 2991-3086
Classification Level Classification E-value
Superfamily Immunoglobulin 2.94e-16
Family I set domains 0.028
Further Details:      
 
Domain Number 14 Region: 1749-1835
Classification Level Classification E-value
Superfamily Immunoglobulin 4.56e-16
Family I set domains 0.0000305
Further Details:      
 
Domain Number 15 Region: 3077-3170
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000117
Family I set domains 0.028
Further Details:      
 
Domain Number 16 Region: 2701-2785
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000181
Family I set domains 0.044
Further Details:      
 
Domain Number 17 Region: 1841-1928
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000412
Family I set domains 0.011
Further Details:      
 
Domain Number 18 Region: 2022-2104
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000126
Family I set domains 0.054
Further Details:      
 
Domain Number 19 Region: 383-465
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000132
Family I set domains 0.01
Further Details:      
 
Domain Number 20 Region: 2796-2882
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000181
Family I set domains 0.027
Further Details:      
 
Domain Number 21 Region: 2123-2204
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000181
Family I set domains 0.052
Further Details:      
 
Domain Number 22 Region: 2893-2978
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000182
Family I set domains 0.042
Further Details:      
 
Domain Number 23 Region: 3368-3459
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000762
Family I set domains 0.013
Further Details:      
 
Domain Number 24 Region: 2314-2398
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000000474
Family I set domains 0.089
Further Details:      
 
Domain Number 25 Region: 262-299
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000314
Family LDL receptor-like module 0.001
Further Details:      
 
Domain Number 26 Region: 1540-1592
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000753
Family Laminin-type module 0.0072
Further Details:      
 
Domain Number 27 Region: 347-390
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000119
Family LDL receptor-like module 0.0014
Further Details:      
 
Domain Number 28 Region: 304-340
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000128
Family LDL receptor-like module 0.001
Further Details:      
 
Domain Number 29 Region: 741-793
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000148
Family Laminin-type module 0.0052
Further Details:      
 
Domain Number 30 Region: 1136-1188
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000363
Family Laminin-type module 0.015
Further Details:      
 
Domain Number 31 Region: 801-849
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000828
Family Laminin-type module 0.02
Further Details:      
 
Domain Number 32 Region: 3809-3850
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000321
Family EGF-type module 0.0083
Further Details:      
 
Domain Number 33 Region: 174-214
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000681
Family LDL receptor-like module 0.0013
Further Details:      
 
Domain Number 34 Region: 1599-1648
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000209
Family Laminin-type module 0.051
Further Details:      
 
Domain Number 35 Region: 4072-4115
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000101
Family EGF-type module 0.011
Further Details:      
 
Domain Number 36 Region: 1196-1241
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000226
Family Laminin-type module 0.023
Further Details:      
 
Domain Number 37 Region: 2248-2295
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000045
Family I set domains 0.024
Further Details:      
 
Weak hits

Sequence:  ENSECAP00000012572
Domain Number - Region: 1252-1299
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00046
Family Laminin-type module 0.0062
Further Details:      
 
Domain Number - Region: 864-903
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000838
Family Laminin-type module 0.029
Further Details:      
 
Domain Number - Region: 1099-1138
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00255
Family Laminin-type module 0.078
Further Details:      
 
Domain Number - Region: 1505-1542
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0168
Family Integrin beta EGF-like domains 0.087
Further Details:      
 
Domain Number - Region: 703-743
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0177
Family EGF-type module 0.048
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000012572   Gene: ENSECAG00000012961   Transcript: ENSECAT00000015624
Sequence length 4357
Comment pep:known_by_projection chromosome:EquCab2:2:32685592:32750712:1 gene:ENSECAG00000012961 transcript:ENSECAT00000015624 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
VTHGLRAYDGLSLPEDAETVTAGRAGWSYSDLSDDEDLLADDASEDDGLGSGDVGSGDFQ
MVYFRALVNFTRSIDYSTRLENASSEEFREVSEAVVDTLESEYLKIPGDQVVSVVFIKEL
DGWVFVELDVGSEGNADGAQIQEVLHTVISSGSIASYITSPQGFQFRRLGTVPQVARVCT
EAEFACHSHNECVALEYRCDRRPDCRDMSDELNCEDLVPELSSLPPPLMETPLPPHYPEV
ATIRQLLVTPAPQPLFPEPHRPVPCGPHEAACHSGHCIPKDYVCDGQEDCKDGSDELDCG
PTPPCEPNEFPCGNGHCALKLWRCDGDFDCEDRTDEADCPAKRPEDVCGPTQFRCVSTNT
CIPASFHCDEESDCPDRSDEFGCMPPQVVTPPQESIQASRGQTVTFTCVAGVPTPIINWR
LNWHIPSHPRVTVTSEGRGTLTIRDVKETEQGPYPCEAMNARGMVFGPDGVLELIPQRGG
PCPDGHFYLEHSASCLPCFCFGVTSVCQSSQRYRDQIRLRFDQPNDFKGVNVTMPAQPGM
PPLSSTQLQIDTALQEFQLVDLSRRFLVHDSFWALPEQFLGNKVDSYGGSLRYKVRYELA
RGTLEPVQRPDVVLVGAGYRLLSRGHTPTQPGALNQRQVQFSEEHWVHESGQPVQRAELL
QVLQSLEAVLIQTVYNTKMASVGLSDITMDTTVTHATSHGRAHSVEECRCPIGYSGLSCE
SCDAHFTRVPGGPYLGTCSGCNCNGHASSCDPVYGHCLNCQHNTEGPQCNKCKAGFFGDA
TKATATACRPCPCPYIDASRRFSDTCFLDTDGQATCDACAPGYTGRRCESCAPGYEGNPI
QPGGKCRPTNQEIVRCDERGSLGTSGQTCRCKNNVVGRLCNECVAGSFHLSAHNPDGCLK
CFCMGVSRQCTSSTWNRAQVHGASEEPTQFSLTNTAGTHTTSEGISSPVPGELVFSSFHN
LLSGPYFWSLPSRFRGDKVTSYGGELRFTVTQQPQPGSTPLHRQPLVVLQGNGIVLEHHV
SREPSPGQPSTFTVPFREQAWQRPDGQPATREHLLMALAGLDTLLIQASYTQRPAESRIS
GISMDVAVPEDTGQDPALEVEQCTCPPGYRGPSCQDCDTGYTRTPSGLYLGTCERCSCHG
HTEVCEPETGACQGCQHHTEGPRCEQCQPGYYGDAQRGTPEDCQPCPCHGAPAAGQATHT
CFLDTDGHATCDACSPGHSGRHCERCAPGYYGNPSQGQPCRRDGQMPEPIGCGCDPQGSI
SNQCDAAGQCQCKAQVEGLTCSHCRPHHFHLSARNPEGCLPCFCMGVTQQCASSTYTRHL
ISTRFAPGDFQGFALVNPQRNSRLTGGFTVEPVPEGAQLSFGNFAQLGRESFYWQLPEAY
QGDKVAAYGGKLRYTLSYTAGAQGSPLSDPDVQITGNNIMLVASQPVLQGPERKSYEIIF
REEFWRRPDGQPATREHLLMALADLDELLVRATFSSMPQAASISAVSLEVAQPGPSEGPR
ALEVEECRCPPAYVGSSCQDCAPGYTRTGSGLYLGHCELCECNGHSDVCHPETGACSQCQ
HNAAGEFCELCAPGFYGDATAGTPEDCQPCACPLTNPENMFSRTCESLGAGGYRCTACEP
GYTGQYCEQCAPGYVGNPNVRGGRCLPQNQAPLVVQVHPARSIVPQGGPYSLRCQVSGSP
PHYFYWSREDGRPVPSSTQQRHQGSELHFPSVQPSDAGVYICTCRNLHHANSSRAELLVT
EAPSKPITVTVEEQRSRSVHPGADVTFICTAKSKSPAYTLVWTRLHNGKLPARAMDFNGI
LTIRNVQPSDAGTYVCTGSNMFAMDQGTATLHVQASGTPSAPVVSIHPPQLTVQPGQVAE
FRCSATGSPTPTLEWTGGPSGQLPQKAQIHGGILRLPAIEPSDQGQYLCRASSSAGQQVA
RAVLHVHGGNRPRVQVSPERTQVHEGRTVRLYCRAAGVPSATITWRKEGGSLPPHARSER
TDIATLLIPAITTADAGFYLCVATSPAGTAQARIQVVVLPVRIESSSPSVTEGQTLDLSC
VVAGLAYSQVTWYKRGGSLPPHAQVHGSRLRLAEVSPADSGEYVCRVESESGAKEASIII
SVLHSTHSGPSYTPAPGSTQPIRIESSSSHVAEGQTLDLKCVVPGQAHAQVMWYRRGGSL
PARHQTHGSLLRLHQVSPADSGEYVCRVILSSGPLETSVLVSIEASGSSADSIPGDPGRL
HFSEKPTVFPSLLSLATPPLGVAQLRTSRWTRGGSLPARHQVRGSRLYIFQASPADAGEY
VCRASNGVEASITVTVTGTQGANFAYPPGGSQLIRIESSSSHVAEGQTLDLNCVVPGQTH
AQITWHKRGGSLPARHQTHGSLLRLHQVSPADSGEYVCRVGGGSVPLEASVLVTIEPASS
MPALGVTPPVRIESSSSHVAEGQTLDLNCLVAGQAHAQITWHKRGGSLPARHQVHGSRLR
LPQVTPADSGEYVCRVVSSSGTQEASVLVTIQQRLSPSHTQGVVYPVRIESSSSSLANGH
TLDLNCLVTSQAPHTITWYKRGGSLPSRHQIVGSRLRIPQVTPADSGEYVCHVSNGAGSQ
ETSLIVTIQGSGSSHVPSVSPPIRIESSSPTVVEGQTLDLNCVVAGQPQATITWYKRGGS
LPARHQAHGSRLRLHQMSVADSGEYVCRANNNIDAQEASIMVSVSPSAGSPSVPGGSVPI
RIESSSSHVAEGQTLDLNCVVPGQAHAQVTWHRRGGSLPPHHQAHGSRLRLHQVSPADSG
EYVCRVVSSSGPLEASVLVTIEASGSSAVPVPGEVPPIRIETSSSHVAEGQTLDLKCVVP
GQAHAQVTWHRRGGSLPARHQVHGPLLRLNQVSPADSGEYSCQVTGSSGTLEASVLVTIE
ASSPRPIPAPGLAQPIYIEASSSHVAEGQTLDLNCVVPRQPHAQVTWHKRGGSLPARHQT
HGSRLRLHHVSPADSGEYVCRVVGGSGPEQEASFTVTVPPSAGSSYRLRSPVISIEPPSS
TVQQGQDASFKCLIHDGAAPISLEWKTRNQELEDNVHISPNGSIITIVGTRPSNHGAYRC
VASNAYGVAQSVVNLSVHGPPTVSVLPEGPVWVKVGKAVALECVSAGEPRSSARWTRIGT
PAKVEQQTYGPVDSHTVLQISSAKPSDAGTYVCLAQNALGTAQKRVEVIVDMGTVAPGAP
QVQVEEAELTVEAGHTATLRCSATGSPTPTIHWSKLRSPLPWQHQLEGNTLIIPRVAQQD
SGQYICNATSPAGHAEATIALHVESPPYATTVPEHASVQAGETVQLQCLAHGTPPLTFQW
SRVGGSLSGRATARNEMLHFEPAAPEDSGRYRCRVTNRVGSAEAFAHVVIQGPSGSLPAT
AVPAGSTPTVQVTPQLETKSIGASVEFHCAVPSDRGTQLRWIKEGGHLPPGHSVQDGVLR
IQNLDQSCQGTYICQAYGPWGQAQASAQLVVQALPSVLINIRTSVQTVVVGHAVEFECLA
LGDPKPQVTWSKVGGRLRPGIVQSGGIVRIAHVELADAGQYRCTATNAAGTTQSHVLLLV
QALPQISTPPEVHVPAGSTAVFPCMASGYPTPDITWSKLDGNLPPDSRLENNMLVLPSVR
PQDAGTYVCTATNRQGKVKAFAQLRVPERVVPYFTQTPHSFLPLPTIKDAYRKFEIRITF
RPDSADGMLLYNGQKQSPGSPASLAHRQPDFISFGLVGGRPEFRFDAGSGMATIRHPTPL
ALGQFHTVTLLRSLTQGSLIVGSLAPVNGTSQGKFQGLDLNEELYLGGYPDYSAIPKAGL
SSGFIGCVRELRIQGEEIVFHDLNLTAHGISHCPTCRDRPCQNGGQCQDSESSSYVCVCL
PGFTGSRCEHSQALHCHPEACGPDATCVNRPDGRGYTCRCHLGRSGIRCEEGVTVTTPSM
SGTGSYLALPALTNTHHELRLDVEFKPLAPDGVLLFSGGKSGPVEDFVSLAMVGGHLEFR
YELGSGLAVLRSAEPLALGRWHHVSAERLNKDGSLRVNGGRPVLRSSPGKSQGLNLDTLL
YLGGVEPSVSLPPANASAPFRGCVGEVSVNGKRLDLTYSFLGSRGIGQCYDSSPCEPQPC
QHGATCMPAGEYEFQCLCQDGFKGDLCEHEENPCQLREPCLHGGTCRGTHCLCPPGFSGP
RCQQGSGHGTAESDWHLEGSGGNDAPGQYAAYFHDDGFLALPGHVLSRSLPEVPETIELE
VRTSTASGLLLWQGVVGEASRGKDFISLGLQDGHLVFSYQLGSGEARLVSEDPINDGEWH
RVTALREGRRGSIQVDGEELVSGQSPGPNVAVNTKGSIYIGGAPDVATLTGGRFSSGITG
CIKNLVLHSARPGAPPPQPLDLQHRAQAGANTRPCPS
Download sequence
Identical sequences F7DMY7
ENSECAP00000012572

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]