SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for 10090.ENSMUSP00000046199 from STRING v9.0.5


Domain assignment details

Strong hits

Sequence:  10090.ENSMUSP00000046199
Domain Number 1 Region: 2208-2301
Classification Level Classification E-value
Superfamily C-type lectin-like 3.3e-32
Family Link domain 0.0001
 
Domain Number 2 Region: 508-647
Classification Level Classification E-value
Superfamily FAS1 domain 3.14e-30
Family FAS1 domain 0.0017
 
Domain Number 3 Region: 1713-1869
Classification Level Classification E-value
Superfamily FAS1 domain 1.1e-28
Family FAS1 domain 0.0052
 
Domain Number 4 Region: 1473-1521,1599-1713
Classification Level Classification E-value
Superfamily FAS1 domain 3.01e-28
Family FAS1 domain 0.002
 
Domain Number 5 Region: 965-1122
Classification Level Classification E-value
Superfamily FAS1 domain 1.05e-21
Family FAS1 domain 0.0066
 
Domain Number 6 Region: 386-502
Classification Level Classification E-value
Superfamily FAS1 domain 2.75e-19
Family FAS1 domain 0.0059
 
Domain Number 7 Region: 2299-2460
Classification Level Classification E-value
Superfamily FAS1 domain 1.57e-16
Family FAS1 domain 0.0075
 
Domain Number 8 Region: 1131-1245
Classification Level Classification E-value
Superfamily FAS1 domain 0.00000288
Family FAS1 domain 0.0085
 
Domain Number 9 Region: 1545-1582
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000885
Family Merozoite surface protein 1 (MSP-1) 0.047
 
Weak hits

Sequence:  10090.ENSMUSP00000046199
Domain Number - Region: 903-961
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000628
Family EGF-type module 0.028
 
Domain Number - Region: 2095-2131
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00124
Family EGF-like domain of nidogen-1 0.042
 
Domain Number - Region: 199-232
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00144
Family Merozoite surface protein 1 (MSP-1) 0.034
 
Domain Number - Region: 231-284
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00193
Family EGF-type module 0.033
 
Domain Number - Region: 2009-2136
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.00204
Family Growth factor receptor domain 0.016
 
Domain Number - Region: 1371-1498
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.00235
Family Growth factor receptor domain 0.019
 
Domain Number - Region: 873-911
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00471
Family EGF-type module 0.049
 
Domain Number - Region: 2143-2175
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00492
Family EGF-type module 0.061
 
Domain Number - Region: 1977-2009
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00642
Family EGF-type module 0.039
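
The Region values above are either a single range (for example 2208-2301) or a comma-separated list of ranges for a discontinuous domain (for example 1473-1521,1599-1713). The following is a minimal Python sketch of how such strings could be parsed and measured. The function names parse_region and region_length are hypothetical and not part of SUPERFAMILY, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
def parse_region(region):
    """Split a region string such as '1473-1521,1599-1713' into (start, end) pairs.

    Assumption: each comma-separated part has the form 'start-end'.
    """
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Total number of residues covered by a region string.

    Assumption: coordinates are 1-based and inclusive, so a segment
    from start to end covers end - start + 1 residues.
    """
    return sum(end - start + 1 for start, end in parse_region(region))
```

Under the inclusive-coordinate assumption, region_length("2208-2301") gives 94 residues, and the two-segment region of domain 4 gives 49 + 115 = 164 residues.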
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s): 10090.ENSMUSP00000046199
Sequence length: 2571
Comment: (Mus musculus)
Sequence:
MAEPRTLLLLCVLVLCLSDSSFIRGQTVRSKRCDIHTKFVTHTPCTACAAIRRQLCPWGW
SRNFPEKILLDCRYELQLRGAAISLSGCSQECWKDVVQKACCPGYWGSQCFECPGGPATP
CSGHGTCLDGIEGNGTCVCQENFSGSVCQECRDPNRFGPDCQSVCNCVHGVCSHGPRGDG
SCRCFAGYTGPHCDQELPVCQSLKCPQNSQCSAEAPTCKCLPGYTQQDNVCLAPDPCQPS
ACSPLARCSVTPQGQAQCQCPENYHGDGKVCLPRDPCLTNFGGCPSNSTFCLYRGPGKAT
CMCRPGMTSINNNASEGCHVSCKPHSCDRSATCQVTPDRKTSCVCKNDEVGDGHACYGHL
LHEVRRANQNGLVFLRLRAAIAMLEQGCQEILTTSGPFTVLVPSMFSVSSVSSNMNATLA
QQLCRQHVIAGEHMLENAGPPSTRRWWTLAGQEVTITFKNMRYAYKYEDQPQQFSIHKAN
YIAANGVFHTVTALRWQLPPPLPGDSKKTVGQILASTEVFTRFETILENCGLPSILDGPG
PFTVFAPSNEAVDSLRDGRLIYLFTAGLSKLQELVRYHIYNHGQLTVEKLISKGRVLTMA
NQVLTVNISEEGRILLGPEGIPVRRVDVPAANGVIHMLEGILLPPTILPILPKHCDEEQH
QTVLGSCVDCQALNTSVCPPNSVKMDIFPKECVYIHDPNGLNVLKKGCADYCNQTITKRG
CCKGFFGPDCTQCPGGFSNPCYGKGNCSDGVRGNGACLCFPDYKGIACHICSDPKKHGEQ
CQEDCGCVHGLCDNRPGSGGVCQQGTCAPGFQGRFCNESMGNCGSTGLAQPCHSDAHCVI
QEGVARCVCHDGFEGNGFSCKRSNPCSRPDRGGCSENAECVPGDLGTHHCICHKGWSGDG
RICVAIDECGLDTRGGCHADALCSYVGPGQSRCTCKLGFAGNGYECSPIDPCRVGNGGCH
GLATCKAVGGGQRVCTCPPHFGGDGFSCYGDIIQELEANAHFSAFSQWFKNSSITLPADS
RVTALVPSESAIRRLSLEDQAFWLQPKMLPELARAHFLQGAFSEEELARLNGQQVATLSA
TTRWQIHNISGKVWVQNATVDVPDLLATNGILHIVSQVLLPPRGDMQTGPGLLQQLDSVP
AFRLFGEQLKHHKLVAQIEAAKAYTIFVPTNHSLETQGNNSVLGIDTVRHHVILGEALSV
EVLRKGGHRNSLLGPAHWLVFYNHSGQPEVNHMPLEGPLLEAPGSSLFGLSGILAVGSSR
CLHSHAEALREKCINCTRKFRCTQGFQLQDTPRKSCVYRSGLSFSRGCSYTCAKKIQVPD
CCPGFFGTLCEPCPGGLGGVCSGHGQCQDRFLGNGECRCQEGFHGTACEMCELGRYGPTC
SGVCDCDHGLCQEGLRGNGSCVCHAGWQGLRCDQKITDHQCPKKCDPNANCIQDSAGIPA
CVCAAGYSGNGSYCSEVDPCASGHGGCSPYANCTKVAPGQRTCTCQDGYTGDGELCQEIN
SCLVHNGGCHVHAECIPTGPQQVSCSCREGYSGDGIQTCKLLDPCSQNNGGCSPYAVCKS
TGDGQRTCSCDATHTVGDGITCHGRVGLELLRNKYASFFSLHLLEYKELKGDGPFTVFVP
HADLISNMSQDELARIRAHRQLVFRYHVVGCRKLWSQEMLDQGYITTLSGHTLRVSEREG
SIYLNDFARVVSSDLEVVNGVLHFIDHVLLPPDVLHWESGAIPIPQRNVTAAAESFGYKI
FSRLLTVAGLLPMLQDASHRPFTMLWPTDSALQALPPDRKNWLFHEDHRDKLAAILRGHM
IRNIEALASDLPNLGQLRTMHGNTISFSCGLTRPGELIVGEDEAHIVQRHLTFEGGLAYG
IDQLLEPPDLGARCDRFEPQPLQMKTCSICGLEPPCPRGSREQGSPETCWRHYSKFWTTP
LHSISMRGAYWIPSSFWNRNHMSRGCHRNCVTTVWKPSCCPGHYGINCHACPGGPRSPCS
DHGVCLDGIRGSGQCNCHPGFAGTACELCAPGAFGPQCQACRCTQHGRCDEGLGGSGSCF
CDEGWTGARCEVQLELQPVCTPPCAPQAVCRLGNSCECSLGYEGDGRVCTVADLCQKGHG
GCSKHANCSQVGTVVTCTCLPDYEGDGWSCRARDPCLDGHRGGCSEHADCLNTGPNTRRC
ECHVGYVGDGLQCLEELEPPVDRCLGGSSPCHTDALCTDLHFQEKQAGVFHIQATSGPYG
LTFSEAKEACEGQGAVLASLPQLSAAQQLGFHVCFVGWLANGSAAHPVVTPAADCGNNRV
GVVSLGVRKNLSELWDAYCYRVQDVACQCRAGFVGDGISTCNGKLLDVLAATANFSTFYG
MLLGYANATQRGLEFMDFLEDELTYKTLFVPVNKGFVDNMTLSGPDLELHASNATFLSIN
ASRGTLLPAHSGLSLFISDTGPDNTSLVPLAPGAVVVSHVIVWDIMAFNGIIHALASPLL
MPPQTRAVLGSEPPPVALSLGVVVTSGTLLGLVAGALYLRARGKPPGFSFSAFQAEDNAD
DDFSPWQEGTSPTLVSVPNPVFGSSDIFCEPFDDSVLEEDFPDTQRVLKVK
Identical sequences: G3X973 ENSMUSP00000125239 ENSMUSP00000046199 10090.ENSMUSP00000046199 ENSMUSP00000046199 NP_619613.2.92730
