SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSECAP00000012479 from Equus caballus 69_2


Domain assignment details

Strong hits

Sequence:  ENSECAP00000012479
Domain Number 1 Region: 3889-4078
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.05e-39
Family Laminin G-like module 0.00046
 
Domain Number 2 Region: 4142-4344
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.8e-38
Family Laminin G-like module 0.00069
 
Domain Number 3 Region: 1618-1634,3645-3825
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.58e-33
Family Laminin G-like module 0.0013
 
Domain Number 4 Region: 3553-3640
Classification Level Classification E-value
Superfamily Immunoglobulin 7.01e-20
Family I set domains 0.011
 
Domain Number 5 Region: 3191-3284
Classification Level Classification E-value
Superfamily Immunoglobulin 2.26e-19
Family I set domains 0.0088
 
Domain Number 6 Region: 1933-2021
Classification Level Classification E-value
Superfamily Immunoglobulin 3.39e-19
Family I set domains 0.025
 
Domain Number 7 Region: 3272-3365
Classification Level Classification E-value
Superfamily Immunoglobulin 5.84e-18
Family I set domains 0.016
 
Domain Number 8 Region: 1653-1748
Classification Level Classification E-value
Superfamily Immunoglobulin 7.3e-18
Family I set domains 0.024
 
Domain Number 9 Region: 2609-2696
Classification Level Classification E-value
Superfamily Immunoglobulin 1.31e-17
Family I set domains 0.015
 
Domain Number 10 Region: 3475-3562
Classification Level Classification E-value
Superfamily Immunoglobulin 1.46e-17
Family I set domains 0.014
 
Domain Number 11 Region: 2518-2601
Classification Level Classification E-value
Superfamily Immunoglobulin 1.53e-16
Family I set domains 0.011
 
Domain Number 12 Region: 2417-2507
Classification Level Classification E-value
Superfamily Immunoglobulin 2.11e-16
Family I set domains 0.04
 
Domain Number 13 Region: 3003-3098
Classification Level Classification E-value
Superfamily Immunoglobulin 2.94e-16
Family I set domains 0.028
 
Domain Number 14 Region: 1750-1836
Classification Level Classification E-value
Superfamily Immunoglobulin 4.56e-16
Family I set domains 0.0000305
 
Domain Number 15 Region: 3089-3182
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000117
Family I set domains 0.028
 
Domain Number 16 Region: 2027-2113
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000014
Family I set domains 0.031
 
Domain Number 17 Region: 2711-2796
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000181
Family I set domains 0.049
 
Domain Number 18 Region: 1842-1929
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000426
Family I set domains 0.011
 
Domain Number 19 Region: 383-465
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000134
Family I set domains 0.01
 
Domain Number 20 Region: 2132-2213
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000181
Family I set domains 0.052
 
Domain Number 21 Region: 2905-2990
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000182
Family I set domains 0.042
 
Domain Number 22 Region: 2808-2894
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000223
Family I set domains 0.027
 
Domain Number 23 Region: 3380-3471
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000778
Family I set domains 0.013
 
Domain Number 24 Region: 2324-2408
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000000474
Family I set domains 0.089
 
Domain Number 25 Region: 262-299
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000314
Family LDL receptor-like module 0.001
 
Domain Number 26 Region: 1540-1592
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000753
Family Laminin-type module 0.0072
 
Domain Number 27 Region: 347-390
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000012
Family LDL receptor-like module 0.0014
 
Domain Number 28 Region: 304-340
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000128
Family LDL receptor-like module 0.001
 
Domain Number 29 Region: 741-793
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000151
Family Laminin-type module 0.0052
 
Domain Number 30 Region: 1136-1188
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000363
Family Laminin-type module 0.015
 
Domain Number 31 Region: 801-849
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000828
Family Laminin-type module 0.02
 
Domain Number 32 Region: 3821-3862
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000321
Family EGF-type module 0.0083
 
Domain Number 33 Region: 174-214
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000681
Family LDL receptor-like module 0.0013
 
Domain Number 34 Region: 1599-1647
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000022
Family Laminin-type module 0.051
 
Domain Number 35 Region: 4084-4127
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000101
Family EGF-type module 0.011
 
Domain Number 36 Region: 1196-1241
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000226
Family Laminin-type module 0.023
 
Domain Number 37 Region: 2260-2305
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000123
Family I set domains 0.024
 
Weak hits

Sequence:  ENSECAP00000012479
Domain Number - Region: 1252-1299
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00046
Family Laminin-type module 0.0062
 
Domain Number - Region: 864-903
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000838
Family Laminin-type module 0.029
 
Domain Number - Region: 1099-1138
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00255
Family Laminin-type module 0.078
 
Domain Number - Region: 1505-1542
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0168
Family Integrin beta EGF-like domains 0.087
 
Domain Number - Region: 703-743
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0177
Family EGF-type module 0.048
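
The Region values in the hits above are either a single residue range (for example 3889-4078) or a comma-separated list of ranges for a discontinuous domain (Domain 3: 1618-1634,3645-3825). The following is a minimal Python sketch for reading such strings. The function names `parse_region` and `region_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
from typing import List, Tuple


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '1618-1634,3645-3825' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Count residues covered by a region.

    Assumption: coordinates are 1-based and inclusive (not stated on the page).
    """
    return sum(end - start + 1 for start, end in parse_region(region))


# Domain 3 is split into two segments; Domain 1 is a single segment.
print(parse_region("1618-1634,3645-3825"))
print(region_length("3889-4078"))

# E-values appear in both scientific and plain decimal notation;
# float() accepts either form, so they can be compared directly.
print(float("3.05e-39") < float("0.00000000000000117"))
```

Under the inclusive-coordinate assumption, Domain 1 spans 190 residues and the two segments of Domain 3 together span 198.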
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000012479   Gene: ENSECAG00000012961   Transcript: ENSECAT00000015518
Sequence length 4370
Comment pep:known chromosome:EquCab2:2:32685592:32750712:1 gene:ENSECAG00000012961 transcript:ENSECAT00000015518 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
VTHGLRAYDGLSLPEDAETVTAGRAGWSYSDLSDDEDLLADDASEDDGLGSGDVGSGDFQ
MVYFRALVNFTRSIDYSTRLENASSEEFREVSEAVVDTLESEYLKIPGDQVVSVVFIKEL
DGWVFVELDVGSEGNADGAQIQEVLHTVISSGSIASYITSPQGFQFRRLGTVPQVARVCT
EAEFACHSHNECVALEYRCDRRPDCRDMSDELNCEDLVPELSSLPPPLMETPLPPHYPEV
ATIRQLLVTPAPQPLFPEPHRPVPCGPHEAACHSGHCIPKDYVCDGQEDCKDGSDELDCG
PTPPCEPNEFPCGNGHCALKLWRCDGDFDCEDRTDEADCPAKRPEDVCGPTQFRCVSTNT
CIPASFHCDEESDCPDRSDEFGCMPPQVVTPPQESIQASRGQTVTFTCVAGVPTPIINWR
LNWHIPSHPRVTVTSEGRGTLTIRDVKETEQGPYPCEAMNARGMVFGPDGVLELIPQRGG
PCPDGHFYLEHSASCLPCFCFGVTSVCQSSQRYRDQIRLRFDQPNDFKGVNVTMPAQPGM
PPLSSTQLQIDTALQEFQLVDLSRRFLVHDSFWALPEQFLGNKVDSYGGSLRYKVRYELA
RGTLEPVQRPDVVLVGAGYRLLSRGHTPTQPGALNQRQVQFSEEHWVHESGQPVQRAELL
QVLQSLEAVLIQTVYNTKMASVGLSDITMDTTVTHATSHGRAHSVEECRCPIGYSGLSCE
SCDAHFTRVPGGPYLGTCSGCNCNGHASSCDPVYGHCLNCQHNTEGPQCNKCKAGFFGDA
TKATATACRPCPCPYIDASRRFSDTCFLDTDGQATCDACAPGYTGRRCESCAPGYEGNPI
QPGGKCRPTNQEIVRCDERGSLGTSGQTCRCKNNVVGRLCNECVAGSFHLSAHNPDGCLK
CFCMGVSRQCTSSTWNRAQVHGASEEPTQFSLTNTAGTHTTSEGISSPVPGELVFSSFHN
LLSGPYFWSLPSRFRGDKVTSYGGELRFTVTQQPQPGSTPLHRQPLVVLQGNGIVLEHHV
SREPSPGQPSTFTVPFREQAWQRPDGQPATREHLLMALAGLDTLLIQASYTQRPAESRIS
GISMDVAVPEDTGQDPALEVEQCTCPPGYRGPSCQDCDTGYTRTPSGLYLGTCERCSCHG
HTEVCEPETGACQGCQHHTEGPRCEQCQPGYYGDAQRGTPEDCQPCPCHGAPAAGQATHT
CFLDTDGHATCDACSPGHSGRHCERCAPGYYGNPSQGQPCRRDGQMPEPIGCGCDPQGSI
SNQCDAAGQCQCKAQVEGLTCSHCRPHHFHLSARNPEGCLPCFCMGVTQQCASSTYTRHL
ISTRFAPGDFQGFALVNPQRNSRLTGGFTVEPVPEGAQLSFGNFAQLGRESFYWQLPEAY
QGDKVAAYGGKLRYTLSYTAGAQGSPLSDPDVQITGNNIMLVASQPVLQGPERKSYEIIF
REEFWRRPDGQPATREHLLMALADLDELLVRATFSSMPQAASISAVSLEVAQPGPSEGPR
ALEVEECRCPPAYVGSSCQDCAPGYTRTGSGLYLGHCELCECNGHSDVCHPETGACSQCQ
HNAAGEFCELCAPGFYGDATAGTPEDCQPCACPLTNPENMFSRTCESLGAGGYRCTACEP
GYTGQYCEQCAPGYVGNPNVRGGRCLPQTDQAPLVVQVHPARSIVPQGGPYSLRCQVSGS
PPHYFYWSREDGRPVPSSTQQRHQGSELHFPSVQPSDAGVYICTCRNLHHANSSRAELLV
TEAPSKPITVTVEEQRSRSVHPGADVTFICTAKSKSPAYTLVWTRLHNGKLPARAMDFNG
ILTIRNVQPSDAGTYVCTGSNMFAMDQGTATLHVQASGTPSAPVVSIHPPQLTVQPGQVA
EFRCSATGSPTPTLEWTGGPSGQLPQKAQIHGGILRLPAIEPSDQGQYLCRASSSAGQQV
ARAVLHVHGGNRPRVQVSPERTQVHEGRTVRLYCRAAGVPSATITWRKEGGSLPPHARSE
RTDIATLLIPAITTADAGFYLCVATSPAGTAQARIQVVVLPASGAVSPPVRIESSSPSVT
EGQTLDLSCVVAGLAYSQVTWYKRGGSLPPHAQVHGSRLRLAEVSPADSGEYVCRVESES
GAKEASIIISVLHSTHSGPSYTPAPGSTQPIRIESSSSHVAEGQTLDLKCVVPGQAHAQV
MWYRRGGSLPARHQTHGSLLRLHQVSPADSGEYVCRVILSSGPLETSVLVSIEASGSSAD
SIPGDPPLGVAQLRTSSASHQTPEHVGLVSEDEAELAPGWTRGGSLPARHQVRGSRLYIF
QASPADAGEYVCRASNGVEASITVTVTGTQGANFAYPPGGSQLIRIESSSSHVAEGQTLD
LNCVVPGQTHAQITWHKRGGSLPARHQTHGSLLRLHQVSPADSGEYVCRVGGGSVPLEAS
VLVTIEPASSMPALGVTPPVRIESSSSHVAEGQTLDLNCLVAGQAHAQITWHKRGGSLPA
RHQVHGSRLRLPQVTPADSGEYVCRVVSSSGTQEASVLVTIQQRLSPSHTQGVVYPVRIE
SSSSSLANGHTLDLNCLVTSQAPHTITWYKRGGSLPSRHQIVGSRLRIPQVTPADSGEYV
CHVSNGAGSQETSLIVTIQGSGSSHVPSVSPPIRIESSSPTVVEGQTLDLNCVVAGQPQA
TITWYKRGGSLPARHQAHGSRLRLHQMSVADSGEYVCRANNNIDAQEASIMVSVSPSAGS
PSVPGGSVPIRIESSSSHVAEGQTLDLNCVVPGQAHAQVTWHRRGGSLPPHHQAHGSRLR
LHQVSPADSGEYVCRVVSSSGPLEASVLVTIEASGSSAVPVPAPGGVPPIRIETSSSHVA
EGQTLDLKCVVPGQAHAQVTWHRRGGSLPARHQVHGPLLRLNQVSPADSGEYSCQVTGSS
GTLEASVLVTIEASSPRPIPAPGLAQPIYIEASSSHVAEGQTLDLNCVVPRQPHAQVTWH
KRGGSLPARHQTHGSRLRLHHVSPADSGEYVCRVVGGSGPEQEASFTVTVPPSAGSSYRL
RSPVISIEPPSSTVQQGQDASFKCLIHDGAAPISLEWKTRNQELEDNVHISPNGSIITIV
GTRPSNHGAYRCVASNAYGVAQSVVNLSVHGPPTVSVLPEGPVWVKVGKAVALECVSAGE
PRSSARWTRIGTPAKVEQQTYGPVDSHTVLQISSAKPSDAGTYVCLAQNALGTAQKRVEV
IVDMGTVAPGAPQVQVEEAELTVEAGHTATLRCSATGSPTPTIHWSKLRSPLPWQHQLEG
NTLIIPRVAQQDSGQYICNATSPAGHAEATIALHVESPPYATTVPEHASVQAGETVQLQC
LAHGTPPLTFQWSRVGGSLSGRATARNEMLHFEPAAPEDSGRYRCRVTNRVGSAEAFAHV
VIQGPSGSLPATAVPAGSTPTVQVTPQLETKSIGASVEFHCAVPSDRGTQLRWIKEGGHL
PPGHSVQDGVLRIQNLDQSCQGTYICQAYGPWGQAQASAQLVVQALPSVLINIRTSVQTV
VVGHAVEFECLALGDPKPQVTWSKVGGRLRPGIVQSGGIVRIAHVELADAGQYRCTATNA
AGTTQSHVLLLVQALPQISTPPEVHVPAGSTAVFPCMASGYPTPDITWSKLDGNLPPDSR
LENNMLVLPSVRPQDAGTYVCTATNRQGKVKAFAQLRVPERVVPYFTQTPHSFLPLPTIK
DAYRKFEIRITFRPDSADGMLLYNGQKQSPGSPASLAHRQPDFISFGLVGGRPEFRFDAG
SGMATIRHPTPLALGQFHTVTLLRSLTQGSLIVGSLAPVNGTSQGKFQGLDLNEELYLGG
YPDYSAIPKAGLSSGFIGCVRELRIQGEEIVFHDLNLTAHGISHCPTCRDRPCQNGGQCQ
DSESSSYVCVCLPGFTGSRCEHSQALHCHPEACGPDATCVNRPDGRGYTCRCHLGRSGIR
CEEGVTVTTPSMSGTGSYLALPALTNTHHELRLDVEFKPLAPDGVLLFSGGKSGPVEDFV
SLAMVGGHLEFRYELGSGLAVLRSAEPLALGRWHHVSAERLNKDGSLRVNGGRPVLRSSP
GKSQGLNLDTLLYLGGVEPSVSLPPANASAPFRGCVGEVSVNGKRLDLTYSFLGSRGIGQ
CYDSSPCEPQPCQHGATCMPAGEYEFQCLCQDGFKGDLCEHEENPCQLREPCLHGGTCRG
THCLCPPGFSGPRCQQGSGHGTAESDWHLEGSGGNDAPGQYAAYFHDDGFLALPGHVLSR
SLPEVPETIELEVRTSTASGLLLWQGVEVGEASRGKDFISLGLQDGHLVFSYQLGSGEAR
LVSEDPINDGEWHRVTALREGRRGSIQVDGEELVSGQSPGPNVAVNTKGSIYIGGAPDVA
TLTGGRFSSGITGCIKNLVLHSARPGAPPPQPLDLQHRAQAGANTRPCPS
Identical sequences F7C0I7
ENSECAP00000012479 9796.ENSECAP00000012479 ENSECAP00000012479
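
The Comment line in the Protein sequence section is a run of space-separated `key:value` tokens. A minimal Python sketch for splitting it into fields follows. The helper names `parse_comment` and `parse_location` are hypothetical, and reading the `chromosome` value as assembly, sequence name, start, end, and strand is an assumption based on the usual layout of Ensembl FASTA headers rather than anything stated on this page.

```python
from typing import Dict

COMMENT = (
    "pep:known chromosome:EquCab2:2:32685592:32750712:1 "
    "gene:ENSECAG00000012961 transcript:ENSECAT00000015518 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> Dict[str, str]:
    """Map each token's key (text before the first ':') to the rest of the token."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> Dict[str, object]:
    """Split a location value.

    Assumed field order: assembly, sequence name, start, end, strand.
    """
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["chromosome"])
print(fields["gene"])
print(location)
```

Using `partition` rather than `split` keeps the remaining colons of the location token inside its value, so that token can be decoded separately.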
