SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSMUSP00000030547 from Mus musculus 76_38

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSMUSP00000030547
Domain Number 1 Region: 3894-4087
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.29e-40
Family Laminin G-like module 0.0008
 
Domain Number 2 Region: 4148-4347
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.05e-37
Family Laminin G-like module 0.00066
 
Domain Number 3 Region: 1641-1657,3653-3830
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.29e-34
Family Laminin G-like module 0.0014
 
Domain Number 4 Region: 405-490
Classification Level Classification E-value
Superfamily Immunoglobulin 2.82e-21
Family I set domains 0.0083
 
Domain Number 5 Region: 3199-3293
Classification Level Classification E-value
Superfamily Immunoglobulin 3.89e-21
Family I set domains 0.0082
 
Domain Number 6 Region: 1954-2045
Classification Level Classification E-value
Superfamily Immunoglobulin 5.22e-19
Family I set domains 0.016
 
Domain Number 7 Region: 3561-3648
Classification Level Classification E-value
Superfamily Immunoglobulin 5.89e-19
Family I set domains 0.02
 
Domain Number 8 Region: 3097-3190
Classification Level Classification E-value
Superfamily Immunoglobulin 2.12e-18
Family I set domains 0.019
 
Domain Number 9 Region: 3466-3564
Classification Level Classification E-value
Superfamily Immunoglobulin 2.34e-17
Family I set domains 0.014
 
Domain Number 10 Region: 2621-2708
Classification Level Classification E-value
Superfamily Immunoglobulin 2.98e-17
Family I set domains 0.021
 
Domain Number 11 Region: 1678-1770
Classification Level Classification E-value
Superfamily Immunoglobulin 3.98e-17
Family I set domains 0.022
 
Domain Number 12 Region: 3283-3371
Classification Level Classification E-value
Superfamily Immunoglobulin 9.97e-17
Family I set domains 0.013
 
Domain Number 13 Region: 3011-3106
Classification Level Classification E-value
Superfamily Immunoglobulin 1.36e-16
Family I set domains 0.034
 
Domain Number 14 Region: 1771-1858
Classification Level Classification E-value
Superfamily Immunoglobulin 1.83e-16
Family I set domains 0.0000226
 
Domain Number 15 Region: 2429-2518
Classification Level Classification E-value
Superfamily Immunoglobulin 6.08e-16
Family V set domains (antibody variable domain-like) 0.081
 
Domain Number 16 Region: 2816-2901
Classification Level Classification E-value
Superfamily Immunoglobulin 7.27e-16
Family I set domains 0.028
 
Domain Number 17 Region: 2529-2612
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000000107
Family I set domains 0.045
 
Domain Number 18 Region: 2047-2133
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000036
Family I set domains 0.086
 
Domain Number 19 Region: 2718-2803
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000167
Family I set domains 0.057
 
Domain Number 20 Region: 2238-2319
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000308
Family I set domains 0.046
 
Domain Number 21 Region: 1863-1960
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000313
Family I set domains 0.025
 
Domain Number 22 Region: 3388-3479
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000422
Family I set domains 0.014
 
Domain Number 23 Region: 2337-2430
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000000113
Family V set domains (antibody variable domain-like) 0.056
 
Domain Number 24 Region: 2913-2998
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000000154
Family I set domains 0.084
 
Domain Number 25 Region: 2144-2232
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000000166
Family I set domains 0.049
 
Domain Number 26 Region: 764-816
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000162
Family Laminin-type module 0.0047
 
Domain Number 27 Region: 1563-1615
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000502
Family Laminin-type module 0.0066
 
Domain Number 28 Region: 367-411
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000668
Family LDL receptor-like module 0.0014
 
Domain Number 29 Region: 324-361
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000117
Family LDL receptor-like module 0.00098
 
Domain Number 30 Region: 283-319
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000122
Family LDL receptor-like module 0.0011
 
Domain Number 31 Region: 1159-1211
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000156
Family Laminin-type module 0.015
 
Domain Number 32 Region: 824-871
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000093
Family Laminin-type module 0.02
 
Domain Number 33 Region: 3826-3867
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000318
Family EGF-type module 0.012
 
Domain Number 34 Region: 194-234
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000537
Family LDL receptor-like module 0.0013
 
Domain Number 35 Region: 4090-4133
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000873
Family EGF-type module 0.01
 
Domain Number 36 Region: 1622-1670
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000159
Family Laminin-type module 0.044
 
Domain Number 37 Region: 1219-1263
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000276
Family Laminin-type module 0.024
 
Weak hits

Sequence:  ENSMUSP00000030547
Domain Number - Region: 1275-1322
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00022
Family Laminin-type module 0.006
 
Domain Number - Region: 887-926
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0014
Family Laminin-type module 0.028
 
Domain Number - Region: 1119-1161
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00142
Family Laminin-type module 0.078
 
Domain Number - Region: 1528-1565
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00609
Family Integrin beta EGF-like domains 0.089
 
Domain Number - Region: 724-766
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0137
Family EGF-type module 0.053
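
The Region fields in the hit lists above are not always a single range: Domain Number 3 is reported as two segments (1641-1657,3653-3830). The E-values are also printed in two styles, exponent notation and plain decimals. The following is a minimal sketch of how such fields could be read in a script. The function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
from typing import List, Tuple

Segment = Tuple[int, int]


def parse_region(region: str) -> List[Segment]:
    """Split a region string such as '1641-1657,3653-3830' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(segments: List[Segment]) -> int:
    """Count residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


# Region string copied from Domain Number 3 in the strong hits.
domain_3 = parse_region("1641-1657,3653-3830")
print(domain_3)                 # two segments
print(region_length(domain_3))  # residues covered by both segments together

# float() accepts both E-value styles used in the listing, so the plain
# decimals can be re-printed in exponent notation for easier comparison.
print(f"{float('0.00000000000000107'):.2e}")
```

Under the inclusive-coordinate assumption, the two segments of Domain Number 3 together cover 195 residues, which is close to the lengths of the other two Laminin G-like assignments.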
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSMUSP00000030547   Gene: ENSMUSG00000028763   Transcript: ENSMUST00000030547
Sequence length 4375
Comment pep:known chromosome:GRCm38:4:137468769:137570630:1 gene:ENSMUSG00000028763 transcript:ENSMUST00000030547 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGQRAVGSLLLGLLLHARLLAVTHGLRAYDGLSLPEDTETVTASRYGWTYSYLSDDEDLL
ADDASGDGLGSGDVGSGDFQMVYFRALVNFTRSIEYSPQLEDASAKEFREVSEAVVEKLE
PEYRKIPGDQIVSVVFIKELDGWVFVELDVGSEGNADGSQIQEVLHTVVSSGSIGPYVTS
PWGFKFRRLGTVPQFPRVCTETEFACHSYNECVALEYRCDRRPDCRDMSDELNCEEPVPE
LSSSTPAVGKVSPLPLWPEAATTPPPPVTHGPQFLLPSVPGPSACGPQEASCHSGHCIPR
DYLCDGQEDCRDGSDELGCASPPPCEPNEFACENGHCALKLWRCDGDFDCEDRTDEANCS
VKQPGEVCGPTHFQCVSTNRCIPASFHCDEESDCPDRSDEFGCMPPQVVTPPQQSIQASR
GQTVTFTCVATGVPTPIINWRLNWGHIPAHPRVTMTSEGGRGTLIIRDVKEADQGAYTCE
AMNSRGMVFGIPDGVLELVPQRGPCPDGHFYLEDSASCLPCFCFGVTNVCQSSLRFRDQI
RLSFDQPNDFKGVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSRRFLVHDAFWALPK
QFLGNKVDSYGGFLRYKVRYELARGMLEPVQKPDVILVGAGYRLHSRGHTPTHPGTLNQR
QVQLSEEHWVHESGRPVQRAEMLQALASLEAVLLQTVYNTKMASVGLSDIVMDTTVTHTT
IHGRAHSVEECRCPIGYSGLSCESCDAHFTRVPGGPYLGTCSGCNCNGHASSCDPVYGHC
LNCQHNTEGPQCDKCKPGFFGDATKATATACRPCPCPYIDASRRFSDTCFLDTDGQATCD
ACAPGYTGRRCESCAPGYEGNPIQPGGKCRPTTQEIVRCDERGSLGTSGETCRCKNNVVG
RLCNECSDGSFHLSKQNPDGCLKCFCMGVSRQCSSSSWSRAQVLGASEQPSQFSLSNAAG
THTTSEGVSSPAPGELSFSSFHNLLSEPYFWSLPASFRGDKVTSYGGELRFTVTQRPRPS
SAPLHRQPLVVLQGNNIVLEHHASRDPSPGQPSNFIVPFQEQAWQRPDGQPATREHLLMA
LAGIDALLIQASYTQQPAESRVSGISMDVAVPENTGQDSAREVEQCTCPPGYRGPSCQDC
DTGYTRVPSGLYLGTCERCNCHGHSETCEPETGACQSCQHHTEGASCEQCQPGYYGDAQR
GTPQDCQPCPCYGAPAAGQAAHTCFLDTDGHPTCDSCSPGHSGRHCERCAPGYYGNPSQG
QPCHRDGQVPEVLGCGCDPHGSISSQCDAAGQCQCKAQVEGRTCSHCRPHHFHLSASNPE
GCLPCFCMGVTQQCASSSYSRQLISTHFAPGDFQGFALVNPQRNSQLTGGFTVEPVHDGA
RLSFSNFAHLGQESFYWQLPEIYQGDKVAAYGGKLRYTLSYTAGPQGSPLLDPDIQITGN
NIMLVASQPALQGPERRSYEIIFREEFWRRPDGQPATREHLLMALADLDELLVRATFSSV
PRAASISAVSLEVAQPGPSSGPRALEVEECRCPPGYVGLSCQDCAPGYTRTGSGLYLGQC
ELCECNGHSDLCHPETGACSRCQHNTAGEFCELCATGYYGDATAGTPEDCQPCACPLTNP
ENMFSRTCESLGAGGYRCTACEPGYTGQYCEQCAPGYEGDPNVQGGRCQPLTKESLEVQI
HPSRSVVPQGGPHSLRCQVSGSPPHYFYWSREDGRPLPSSAQQRHQGSELHFPSVQPSDA
GVYICTCRNLIHTSNSRAELLVAEAPSKPITVTVEEQRSQSVRPGADVTFICTAKSKSPA
YTLVWTRLHNGKLPSRAMDFNGILTIRNVQPSDAGTYVCTGSNMFAMDQGTATLHVQVSG
TSTAPVASIHPPQLTVQPGQQAEFRCSATGNPTPMLEWIGGPSGQLPAKAQIHNGILRLP
AIEPSDQGQYLCRALSSAGQHVARAMLQVHGGSGPRVQVSPERTQVHEGRTVRLYCRAAG
VPSASITWRKEGGSLPPQARSENTDIPTLLIPAITAADAGFYLCVATSPTGTAQARIQVV
VLSVPVRIESSSPSVTEGQTLDLNCAVMGLTYTQVTWYKRGGSLPPHAQVHGSRLRLPQV
SPADSGDYVCRVESDVGPKEASIVVSVLHSPHSGPSYTPATSITPPIRIESSSSHVAEGQ
TLDLNCVVPGQAQVTWRKRGGSLPARHQTHGSLLRLHQVSPADSGEYVCHVVLGSEHTET
SVLVTIEPAESIPAPGPAPPVRIEASSSTVTEGHMLDLNCVVAGQAHAQVTWYKRGGSLP
ARHQVRGSRLYILQASPADAGEYVCRAGNGQEATITVTVTRNHGANLAYPPGSTSPIRIE
SSSSHVAEGQTLDLNCVVQGQAHAQVTWHKRGGSLPARHQTHGSLLRLHQVSPVDSGEYV
CRVEGGAVPLESSVLVTIEPAGTAPGVIPPVRIESSSSHVSEGQSLDLNCLVSGQTHPQI
SWHKRGGSLPARHQVHGSRLRLLQVTPTDSGEYVCRVVSGSGTQEASILVTIQQTLSPSH
SQSVVHPVRIESSSPSLANGHTLDLNCLVASLTPHTITWYKRGGSLPSRHQIVGSRLRIP
QVTPADSGEYVCHVSNGAGSQETSLIVTIESRGPSHVPSVSPPMRIETSSPTVTEGQTLD
LNCVVVGRPQATITWYKRGGSLPFRHQAHGSRLRLHHMSVADSGEYVCRANNNIDAQETS
IMISVSPSTNSPPAPASPAPIRIESSSSRVAEGQTLDLNCVVPGHAHAQVTWHKRGGSLP
THHQTHGSRLRLYQVSSADSGEYVCSVLSSSGPLEASVLVSITPAAANVHIPGVVPPIRI
ETSSSRVAEGQTLDLSCVVPGQAHAQVTWHKRGGSLPAGHQVHGHMLRLNRVSPADSGEY
SCQVTGSSGTLEASVLVTIEASEPSPIPAPGLAQPVYIESSSSHLTEGQTVDLKCVVPGQ
AHAQVTWHKRGSSLPARHQTHGSLLRLYQLSPADSGEYVCQVAGSSHPEHEASFKLTVPS
SQNSSFRLRSPVISIEPPSSTVQQGQDASFKCLIHEGATPIKVEWKIRDQELEDNVHISP
NGSIITIVGTRPSNHGAYRCVASNVYGMAQSVVNLSVHGPPTVSVLPEGPVHVKMGKDIT
LECISSGEPRSSPRWTRLGIPVKLEPRMFGLMNSHAMLKIASVKPSDAGTYVCQAQNALG
TAQKQVELIVDTGTVAPGAPQVQVEESELTLEAGHTATLHCSATGNPPPTIHWSKLRAPL
PWQHRIEGNTLVIPRVAQQDSGQYICNATNSAGHTEATVVLHVESPPYATIIPEHTSAQP
GNLVQLQCLAHGTPPLTYQWSLVGGVLPEKAVARNQVLRLEPTVPEDSGRYRCQVSNRVG
SAEAFAQVLVQGSSSNLPDTSIPGGSTPTVQVTPQLETRNIGASVEFHCAVPNERGTHLR
WLKEGGQLPPGHSVQDGVLRIQNLDQSCQGTYVCQAHGPWGQAQATAQLIVQALPSVLIN
VRTSVHSVVVGHSVEFECLALGDPKPQVTWSKVGGHLRPGIVQSGSIIRIAHVELADAGQ
YRCAATNAAGTTQSHVLLLVQALPQISTPPEIRVPAGSAAVFPCMASGYPTPAITWSKVD
GDLPPDSRLENNMLMLPSVRPEDAGTYVCTATNRQGKVKAFAYLQVPERVIPYFTQTPYS
FLPLPTIKDAYRKFEIKITFRPDSADGMLLYNGQKRSPTNLANRQPDFISFGLVGGRPEF
RFDAGSGMATIRHPTPLALGQFHTVTLLRSLTQGSLIVGNLAPVNGTSQGKFQGLDLNEE
LYLGGYPDYGAIPKAGLSSGFVGCVRELRIQGEEVVFHDVNLTTHGISHCPTCQDRPCQN
GGQCQDSESSSYTCVCPAGFTGSRCEHSQALHCHPEACGPDATCVNRPDGRGYTCRCHLG
RSGVRCEEGVTVTTPSMSGAGSYLALPALTNMHHELRLDVEFKPLEPNGILLFSGGKSGP
VEDFVSLAMVGGHLEFRYELGSGLAVLRSHEPLTLGRWHRVSAERLNKDGSLRVDGGRPV
LRSSPGKSQGLNLHTLLYLGGVEPSVQLSPATNMSAHFHGCVGEVSVNGKRLDLTYSFLG
SQGVGQCYDSSPCERQPCQNGATCMPAGEYEFQCLCQDGFKGDLCEHEENPCQLHEPCLN
GGTCRGARCLCLPGFSGPRCQQGAGYGVVESDWHPEGSGGNDAPGQYGAYFYDNGFLGLP
GNSFSRSLPEVPETIEFEVRTSTADGLLLWQGVVREASRSKDFISLGLQDGHLVFSYQLG
SGEARLVSEDPINDGEWHRITALREGQRGSIQVDGEDLVTGRSPGPNVAVNTKDIIYIGG
APDVATLTRGKFSSGITGCIKNLVLHTARPGAPPPQPLDLQHRAQAGANTRPCPS
Identical sequences B1B0C7
10090.ENSMUSP00000030547 NYSGRC-IgSF-Q05793 ENSMUSP00000030547
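
The Comment field of this record packs several values into space-separated key:value tokens. The sketch below shows one way to unpack it. The function names are hypothetical, and reading the chromosome token as assembly, sequence name, start, end and strand is an assumption based on the usual Ensembl FASTA header layout, not something this page states.

```python
# Comment string copied from the record above.
COMMENT = (
    "pep:known chromosome:GRCm38:4:137468769:137570630:1 "
    "gene:ENSMUSG00000028763 transcript:ENSMUST00000030547 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> dict:
    """Map each space-separated token to key -> value, splitting on the first ':'."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> dict:
    """Unpack 'assembly:name:start:end:strand' (assumed field order)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["chromosome"])
print(fields["gene"], fields["transcript"])
# Genomic span of the transcript, assuming inclusive coordinates.
print(location["end"] - location["start"] + 1)
```

Splitting on the first colon only keeps the multi-part chromosome value intact, so it can be unpacked in a second step.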
