SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOCUP00000018458 from Oryctolagus cuniculus 76_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSOCUP00000018458
Domain Number 1 Region: 3910-4103
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.54e-40
Family Laminin G-like module 0.0007
Further Details:      
 
Domain Number 2 Region: 4164-4364
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.43e-37
Family Laminin G-like module 0.00069
Further Details:      
 
Domain Number 3 Region: 3639-3848
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.1e-33
Family Laminin G-like module 0.0012
Further Details:      
 
Domain Number 4 Region: 403-488
Classification Level Classification E-value
Superfamily Immunoglobulin 3.24e-21
Family I set domains 0.0091
Further Details:      
 
Domain Number 5 Region: 3212-3306
Classification Level Classification E-value
Superfamily Immunoglobulin 1.43e-19
Family I set domains 0.013
Further Details:      
 
Domain Number 6 Region: 1954-2042
Classification Level Classification E-value
Superfamily Immunoglobulin 1.83e-19
Family I set domains 0.015
Further Details:      
 
Domain Number 7 Region: 3574-3661
Classification Level Classification E-value
Superfamily Immunoglobulin 3.22e-19
Family I set domains 0.015
Further Details:      
 
Domain Number 8 Region: 2630-2720
Classification Level Classification E-value
Superfamily Immunoglobulin 1.52e-18
Family I set domains 0.023
Further Details:      
 
Domain Number 9 Region: 3110-3203
Classification Level Classification E-value
Superfamily Immunoglobulin 2.42e-18
Family I set domains 0.013
Further Details:      
 
Domain Number 10 Region: 1675-1770
Classification Level Classification E-value
Superfamily Immunoglobulin 6.18e-18
Family I set domains 0.025
Further Details:      
 
Domain Number 11 Region: 2050-2135
Classification Level Classification E-value
Superfamily Immunoglobulin 8.81e-18
Family I set domains 0.036
Further Details:      
 
Domain Number 12 Region: 3292-3384
Classification Level Classification E-value
Superfamily Immunoglobulin 1.17e-16
Family I set domains 0.013
Further Details:      
 
Domain Number 13 Region: 2438-2525
Classification Level Classification E-value
Superfamily Immunoglobulin 1.53e-16
Family I set domains 0.046
Further Details:      
 
Domain Number 14 Region: 3502-3583
Classification Level Classification E-value
Superfamily Immunoglobulin 1.81e-16
Family I set domains 0.022
Further Details:      
 
Domain Number 15 Region: 1771-1860
Classification Level Classification E-value
Superfamily Immunoglobulin 1.97e-16
Family I set domains 0.0000271
Further Details:      
 
Domain Number 16 Region: 3024-3119
Classification Level Classification E-value
Superfamily Immunoglobulin 6.83e-16
Family I set domains 0.029
Further Details:      
 
Domain Number 17 Region: 1864-1951
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000198
Family I set domains 0.051
Further Details:      
 
Domain Number 18 Region: 2828-2914
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000252
Family I set domains 0.027
Further Details:      
 
Domain Number 19 Region: 2248-2340
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000305
Family I set domains 0.044
Further Details:      
 
Domain Number 20 Region: 2539-2626
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000039
Family I set domains 0.024
Further Details:      
 
Domain Number 21 Region: 3399-3485
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000041
Family I set domains 0.02
Further Details:      
 
Domain Number 22 Region: 2732-2814
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000718
Family I set domains 0.073
Further Details:      
 
Domain Number 23 Region: 2927-3011
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000878
Family I set domains 0.08
Further Details:      
 
Domain Number 24 Region: 2345-2428
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000000334
Family I set domains 0.081
Further Details:      
 
Domain Number 25 Region: 2155-2237
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000167
Family I set domains 0.075
Further Details:      
 
Domain Number 26 Region: 1562-1614
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000586
Family Laminin-type module 0.009
Further Details:      
 
Domain Number 27 Region: 322-358
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000615
Family LDL receptor-like module 0.001
Further Details:      
 
Domain Number 28 Region: 365-408
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000641
Family LDL receptor-like module 0.0014
Further Details:      
 
Domain Number 29 Region: 281-317
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000144
Family LDL receptor-like module 0.0011
Further Details:      
 
Domain Number 30 Region: 763-815
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000017
Family Laminin-type module 0.0052
Further Details:      
 
Domain Number 31 Region: 1158-1210
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000865
Family Laminin-type module 0.02
Further Details:      
 
Domain Number 32 Region: 823-870
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000105
Family Laminin-type module 0.02
Further Details:      
 
Domain Number 33 Region: 3842-3883
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000288
Family EGF-type module 0.015
Further Details:      
 
Domain Number 34 Region: 194-234
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000314
Family LDL receptor-like module 0.0013
Further Details:      
 
Domain Number 35 Region: 1621-1669
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000046
Family Laminin-type module 0.059
Further Details:      
 
Domain Number 36 Region: 4106-4149
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000124
Family EGF-type module 0.01
Further Details:      
 
Domain Number 37 Region: 1218-1263
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000206
Family Laminin-type module 0.024
Further Details:      
 
Weak hits

Sequence:  ENSOCUP00000018458
Domain Number - Region: 1274-1321
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000205
Family Laminin-type module 0.0051
Further Details:      
 
Domain Number - Region: 887-925
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000804
Family Laminin-type module 0.025
Further Details:      
 
Domain Number - Region: 1122-1160
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00412
Family EGF-type module 0.087
Further Details:      
 
Domain Number - Region: 1527-1564
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00436
Family Integrin beta EGF-like domains 0.089
Further Details:      
 
Domain Number - Region: 725-765
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00805
Family EGF-type module 0.04
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSOCUP00000018458   Gene: ENSOCUG00000010955   Transcript: ENSOCUT00000024390
Sequence length 4392
Comment pep:known_by_projection chromosome:OryCun2.0:13:131388583:131492798:1 gene:ENSOCUG00000010955 transcript:ENSOCUT00000024390 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGRRAAGTLLLALLLHGRLLAVAHGLRAYEGLSLPEDTETVTEGRAGWSYSYLSDDEDLL
ADDASGDGLGSGDLGSGDFQMVYFRALVNFTHSIEYSPRLEDAGSREFREVSDAVVDKLE
MEYAKIPGDQVVSVVFIKELDGWVFVELDVGSEGNADGAQIQDVLHRVVSGGAIASYVTS
PQGFQFRRLGTVPQIPRVCTEAEFACHSHNECVALEYRCDRRPDCRDMSDELNCEEPIPA
LSTTKPFLETPSFPPQPEVATQRPPDMAIPQPLFPSLGRPEPCGPQEAACHSGHCIPKDY
VCDGQEDCTDGSDELDCGPTPPCEPNEFPCGNGHCALKLWRCDGDFDCEDRTDEANCPAK
RPEEVCGPTQFRCVSTSTCIPASFHCDEESDCPDRSDEFGCMPPQVVTPPQESIQASRGQ
TVTFTCVAIGVPTPIINWRLNWGHIPSHPRVTMTSEGGRGTLIIRDVKESDQGAYTCEAM
NARGMVFGIPDGVLELIPQRAGPCPDGHFYLEPSASCLPCFCFGVTNVCQSSRRFRHQIH
LRFDQPGDFKGVNVTMPAQPGMPPLSSTQLHIDTALQEFQLVDLSRRFLVHDSFWALPEQ
FLGNKVDSYGGSLRYKVRYDLARGMLEPVQRPDVVLVGAGYRLLSRGHTPTQPGALNQRQ
VQLSEEHWVHESGRPVQRAEMLQVLQSLEAVLIQTVYNAKMASVGLSDISMDTTVTYATS
HGRAHSVEECRCPVGYSGLSCESCAAHFTRVPGGPYLGTCSGCNCNGHASSCDPVYGHCL
NCQHNTEGPQCNKCKAGFFGDATKATATACRPCPCPYIDASRRFSDTCFLDTDGQATCDA
CAPGYTGRRCESCAPGYEGNPIQPGGKCRPTTQQLVRCDERGSLGTAGEACQCKNNVVGR
LCNECADGSFHLSTQNPDGCLRCFCMGVSRQCSSSSWSRAQVLGASEEPAQFSLTNAAGT
HTTSEGIASPTAGELLFSSFHSLLQGPYFWSLPPRFRGDKVTSYGGELRFTVTQQPQPGS
PPLHGQPLVVLQGNSITLEHRTSQEPVPGQPSSFTIPFREQAWQRPDGQPATREHLLMAL
AGIDALLIRASYSQRPAESRVSGISMDVAVPEGTGQDPALEVEQCTCPPGYRGPSCQDCD
TGYTRMPSGLYLGTCERCSCHGHSEACEPETGACQGCQHHTEGPRCQQCQPGYYGDAQQG
TPQDCQPCPCYGAPAAGQAAHTCFLDTDGHPTCDACSPGHSGRLCERCAPGYHGNPSQGQ
PCQRDGQVPEPIGCGCDPQGSVSSQCDATGQCQCKAQVEGLTCSHCRPHHFHLSASNPDG
CLPCFCMGVTQQCASSSYTRHLISSRFAPGDFQGFALVNPQRNSRLTGGFTVEPAPEGAQ
LSFGNFAHLGQEPFYWQLPDAYQGDKVAAYGGKLRYTLSYTAGPQGSPLSDPDIQITGNN
IMLVASQPPPQGAERRSYEIVFREEFWRRPDGQPATREHLLMALADLDEILVRATFSSVP
LAASISAVSLEVAQPGPSDGPRALEVEECRCPPGYVGLSCQDCAPGYTRTGSGLYLGHCE
LCECNGHSDVCHPETGACSQCQHNAAGEFCELCAPGYYGDATAGTPEDCQPCACPLTNPE
NMFSRTCESLGGNGYRCTACEPGYTGQYCEQCAPGYVGNPNVQGGRCLPQADQAPLVVQV
HPARSVVPQGGPYSLRCQVSGSPPHYFYWSREDGRPVPSSTQQRHQGSELHFPSVQPSDA
GVYICTCRNLHHANSSRAELLVTEAPSKPITVTVEEQRSQSVRPGADVTFICTAKSKSPA
YTLVWTRLHNGKLPSRAMDFNGILTIRGVQPSDAGTYVCTGSNMFAMDQGTATLHVQASP
TSSAPTVSIHPPQLTVQPGQMAEFRCSATGNPMPNLEWIEGPGGQLPQKAEVRGGILRLP
AVEPSDQAPYLCRAHNSAGQHVARAVLHVHGGSGPRVQVSPERTQVHEGHTVRLYCRAAG
VPSATITWRKEGGSLPPQARSERTDIATLLIPAITAADAGFYLCVATSPAGTAQARIQVV
VLTASGDSSPPPVRIESSSPSVTEGQTLDLNCVVAGSAHTQVTWYRRGASLPPRSQVHGS
RLRLPQVSPADSGEYVCRVENGSGPKEATITISVLRGTHTGPSHAPAPGSTQPIRIESSS
SHVAEGQTLDLNCVVPGQAHAQVTWHRRGGSLPARHQTHGSLLRLHQVSPADSGEYVCRV
LLGSEHLETSVLVSIEASGSMPAPGPIPPVRIESSSSTVAEGQTLDLSCVVAGQAHAQVT
WYKRGGSLPARHQVRGSRLYIFQTSPADAGEYVCRASNGAEASITVTVTRTQGANFAYPP
GTQPIRIESSSSHVAEGQTLDLNCVVPGQAHAQVTWHKRGGSLPARHQTHGSLLRLHQVS
PADSGEYVCRVLGGSVPLESSVLVTIEPADSVPAHGVTPPVRIESSSSHVAEGQTLDLNC
VVPGQAHAQVTWHKRGGSLPARHQVHGSRLRLPQVTPADSGEYVCRVVSSSGTQEASVLV
TIQRRLGPSHPQGVVYPVRIESSSASLANGNTLDLNCLVASQAPHTITWYKRGGSLPSRH
QIVGSRLRIPQVTPADSGEYVCHVSNGAASQETSLIVTIQGSGPPHVPSVSPPIRIESSS
PTVVEGQTLDLNCVVAGQPQATITWYKRGGSLPSRHQTHGSHLRLHHMSVADSGEYVCRA
NNNIDAQEASIMVSVSPSPSNPSAPGAATPIRIESSSSHVAEGQTLDLNCVVPGQAHAQV
MWYKRGGSLPTHHQTHGSHLRLYQVSPADSGQYVCRVLGSSGPLEASVLVTIEASDANPV
RIPAPGGAPPIRIETSSSHVAEGQTLDLNCVVPGQAHAQVTWHRRGGSLPAGHQVHGHIL
RLNQVSPADSGEYSCRVTGSSGTLEASVLVSIEPSSPSPIPAPGLAQPIHIEASSSHVAE
GQTLDLNCVVPGQAHAQVTWYKRGGSLPARHQTHGSRLRLHHVSPADSGEYVCRVAGGSG
PEQEASFTVTVPPSEGSSYRLRSPVISIDPPSSTVQQGQDASFKCLIHDGATPISLEWKT
RNQELEDNVHISPNGSVITIVGTRPSNHGAYRCVASNAYGVAQSVVNLSVHGPPTVSVLP
EGSVRVKMGKDVTLECVSSGEPRSSARWTRVGYPARLEPRTYGMVDSHAVLKISSVKPSD
AGTYVCLAQNALGTARKEVEVIVDTGTSAPGAPQVQVEEVELTVEAGHTATLRCSASGSP
TPTIQWSKLRSPLPWQHRLEGNTLLIPRVAQQDSGQYICNATSPLGHAEATVILHVESPP
YATVVPEHASVRAGERVQLQCLAHGTPPLTFQWSRMDGSLPGRAAASKELLHFEPAAPED
SGRYRCQVSNKVGLAEAFAQVLVQGPSDTLPDTATPAGAPPTVQVTPQQETRSIGASVEF
HCAVPSDRGTQLRWFKEGGQLPPGHSVQDGVLRIQNLDQSCQGTYVCQAHGPWGQAQASA
QLVVQACRSVWCRARGPEDVTSTGHAVEFECMARGDPKPQVTWSKVGGRLRPGIVQSGSV
IRIPHVELADAGQYRCTATNAAGTTQSHVLLLVQALPQISTPPEVRVPAGSAAVFPCMAS
GYPTPDITWSKLDGSLPPDSRLENNMLVLPSVRPQDAGTYICTATNRQGKVKAFAHLQVP
ERVVPYFTQTPYSFLPLPTIKDAYRKFEIKITFRPDSADGMLLYNGQKRTPGSPTNLANR
QPDFISFGLVGGRPEFRFDAGSGMATIRHPTPLALGQFHTVTLLRSLTQGSLIVGNLAPV
NGTSQGKFQGLDLNEELYLGGYPDYGAIPKAGLTSGFVGCVRELRIQGEEIIFHDLNLTA
HGISHCPTCRDRPCQNGGQCHDSESSSYVCVCPAGFTGSRCEHSQALHCHPEACGPDATC
VNRPDGRGYTCRCHLGRSGPRCEEGVTVTTPSMSGAGSYLALPALTNTHHELRLDVEFKP
LAPDGILLFSGGKSGPVEDFVSLAMAGGHLEFRYELGSGLAVLRSPEPLALGRWHRVSAE
RLNKDGSLRVNGGRPVLRSSPGKSQGLNLHTLLYLGGVEPSVPLSPAANVSAHFRGCVGE
VSVNGKRLDLTYSFLGSQGVGQCYDSSPCERQPCQNGATCMPAGEYEFQCLCRDGFKGDL
CEQEENPCQLHEPCLHGGTCQGTRCLCLPGFSGPRCQQGPGHGLAESDWHLEGSGGSDAP
GQYSAYFHDGGFLALPGHVFSRSLPEVPETIELEVRTSTASGLLLWQGVEMGEASRGKDF
IGLGLQDGHLVFSYQLGSGEARLVSEDPINDGEWHRVTALREGRRGSIQVDGEELVSGQS
PGRNVAVNTKGSVYVGGAPDVAALTGGRFSSGITGCIKNLVLHSARPGGPPPQPLDLQHR
AQAGANTRPCPS
Download sequence
Identical sequences G1TN89
ENSOCUP00000018458

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]