SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for gi|126012571|ref|NP_005520.4| from Homo sapiens

Domain assignment details

Strong hits

Sequence:  gi|126012571|ref|NP_005520.4|
Domain  Region     Superfamily                             Superfamily E-value   Family                    Family E-value
1       3909-4100  Concanavalin A-like lectins/glucanases  1.27e-39              Laminin G-like module     0.00064
2       4163-4365  Concanavalin A-like lectins/glucanases  2.15e-37              Laminin G-like module     0.00065
3       3638-3847  Concanavalin A-like lectins/glucanases  5.33e-33              Laminin G-like module     0.00097
4       405-490    Immunoglobulin                          3.81e-21              I set domains             0.008
5       3298-3387  Immunoglobulin                          2.8e-19               I set domains             0.013
6       3573-3660  Immunoglobulin                          3.99e-19              I set domains             0.011
7       3211-3303  Immunoglobulin                          5.89e-19              I set domains             0.0076
8       1955-2043  Immunoglobulin                          8.04e-19              I set domains             0.02
9       2050-2136  Immunoglobulin                          1.14e-18              I set domains             0.037
10      1676-1771  Immunoglobulin                          5.92e-18              I set domains             0.015
11      3495-3578  Immunoglobulin                          6.39e-17              I set domains             0.014
12      3108-3202  Immunoglobulin                          9.06e-17              I set domains             0.035
13      1772-1859  Immunoglobulin                          1.69e-16              I set domains             0.0000262
14      3022-3117  Immunoglobulin                          1.81e-16              I set domains             0.031
15      2435-2522  Immunoglobulin                          8.08e-16              I set domains             0.05
16      2537-2621  Immunoglobulin                          8.81e-16              I set domains             0.019
17      2628-2715  Immunoglobulin                          9.79e-16              I set domains             0.019
18      2730-2816  Immunoglobulin                          0.0000000000000011    I set domains             0.04
19      2826-2912  Immunoglobulin                          0.00000000000000182   I set domains             0.034
20      2245-2326  Immunoglobulin                          0.0000000000000025    I set domains             0.057
21      1864-1955  Immunoglobulin                          0.00000000000000366   I set domains             0.04
22      2155-2238  Immunoglobulin                          0.00000000000000667   I set domains             0.055
23      2924-3009  Immunoglobulin                          0.0000000000000322    I set domains             0.061
24      3400-3491  Immunoglobulin                          0.0000000000000573    I set domains             0.023
25      2343-2429  Immunoglobulin                          0.000000000000335     I set domains             0.052
26      322-360    LDL receptor-like module                0.000000000537        LDL receptor-like module  0.00094
27      282-319    LDL receptor-like module                0.000000000576        LDL receptor-like module  0.0012
28      367-410    LDL receptor-like module                0.000000000602        LDL receptor-like module  0.0016
29      1563-1615  EGF/Laminin                             0.000000000642        Laminin-type module       0.0087
30      764-816    EGF/Laminin                             0.000000000977        Laminin-type module       0.0046
31      1159-1211  EGF/Laminin                             0.00000000949         Laminin-type module       0.015
32      824-872    EGF/Laminin                             0.00000000991         Laminin-type module       0.02
33      3841-3882  EGF/Laminin                             0.0000000288          EGF-type module           0.015
34      198-236    LDL receptor-like module                0.000000103           LDL receptor-like module  0.0013
35      1219-1264  EGF/Laminin                             0.00000109            Laminin-type module       0.022
36      4105-4148  EGF/Laminin                             0.00000157            EGF-type module           0.012
37      1622-1670  EGF/Laminin                             0.00000444            Laminin-type module       0.046
 
Weak hits

Sequence:  gi|126012571|ref|NP_005520.4|
Domain  Region     Superfamily                             Superfamily E-value   Family                    Family E-value
-       1275-1322  EGF/Laminin                             0.000289              Laminin-type module       0.0064
-       887-926    EGF/Laminin                             0.000452              Laminin-type module       0.029
-       1122-1161  EGF/Laminin                             0.00204               Laminin-type module       0.092
-       1528-1565  EGF/Laminin                             0.00449               EGF-type module           0.091
-       726-766    EGF/Laminin                             0.0191                EGF-type module           0.048
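
The hits are listed in order of superfamily E-value rather than by position, so the order of domains along the protein is not obvious from the list. As a rough sketch (not part of the SUPERFAMILY server; the `Hit` tuple and both function names are made up for illustration), the following Python sorts a few hits transcribed from this page by start residue and reports where neighbouring regions overlap.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    """One domain assignment: 1-based inclusive region plus superfamily E-value."""
    start: int
    end: int
    superfamily: str
    evalue: float


# Five strong hits copied from the listing above (domains 4, 26, 27, 28, 34).
hits = [
    Hit(405, 490, "Immunoglobulin", 3.81e-21),
    Hit(322, 360, "LDL receptor-like module", 0.000000000537),
    Hit(282, 319, "LDL receptor-like module", 0.000000000576),
    Hit(367, 410, "LDL receptor-like module", 0.000000000602),
    Hit(198, 236, "LDL receptor-like module", 0.000000103),
]


def architecture(hits: List[Hit]) -> List[Hit]:
    """Return the hits ordered from N-terminus to C-terminus."""
    return sorted(hits, key=lambda h: h.start)


def overlaps(ordered: List[Hit]) -> List[Tuple[Hit, Hit, int]]:
    """Return (left, right, shared_residues) for adjacent hits that overlap."""
    found = []
    for left, right in zip(ordered, ordered[1:]):
        shared = left.end - right.start + 1  # inclusive coordinates
        if shared > 0:
            found.append((left, right, shared))
    return found


ordered = architecture(hits)
for h in ordered:
    print(f"{h.start}-{h.end}\t{h.superfamily}\t{h.evalue:g}")
for left, right, shared in overlaps(ordered):
    print(f"{left.start}-{left.end} and {right.start}-{right.end} share {shared} residues")
```

On these five hits, the LDL receptor-like region 367-410 and the Immunoglobulin region 405-490 share six residues (405-410); the other neighbours do not overlap.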
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) gi|126012571|ref|NP_005520.4|
Sequence length 4391
Comment basement membrane-specific heparan sulfate proteoglycan core protein precursor [Homo sapiens]
Sequence
MGWRAAGALLLALLLHGRLLAVTHGLRAYDGLSLPEDIETVTASQMRWTHSYLSDDEDML
ADSISGDDLGSGDLGSGDFQMVYFRALVNFTRSIEYSPQLEDAGSREFREVSEAVVDTLE
SEYLKIPGDQVVSVVFIKELDGWVFVELDVGSEGNADGAQIQEMLLRVISSGSVASYVTS
PQGFQFRRLGTVPQFPRACTEAEFACHSYNECVALEYRCDRRPDCRDMSDELNCEEPVLG
ISPTFSLLVETTSLPPRPETTIMRQPPVTHAPQPLLPGSVRPLPCGPQEAACRNGHCIPR
DYLCDGQEDCEDGSDELDCGPPPPCEPNEFPCGNGHCALKLWRCDGDFDCEDRTDEANCP
TKRPEEVCGPTQFRCVSTNMCIPASFHCDEESDCPDRSDEFGCMPPQVVTPPRESIQASR
GQTVTFTCVAIGVPTPIINWRLNWGHIPSHPRVTVTSEGGRGTLIIRDVKESDQGAYTCE
AMNARGMVFGIPDGVLELVPQRGPCPDGHFYLEHSAACLPCFCFGITSVCQSTRRFRDQI
RLRFDQPDDFKGVNVTMPAQPGTPPLSSTQLQIDPSLHEFQLVDLSRRFLVHDSFWALPE
QFLGNKVDSYGGSLRYNVRYELARGMLEPVQRPDVVLMGAGYRLLSRGHTPTQPGALNQR
QVQFSEEHWVHESGRPVQRAELLQVLQSLEAVLIQTVYNTKMASVGLSDIAMDTTVTHAT
SHGRAHSVEECRCPIGYSGLSCESCDAHFTRVPGGPYLGTCSGCNCNGHASSCDPVYGHC
LNCQHNTEGPQCNKCKAGFFGDAMKATATSCRPCPCPYIDASRRFSDTCFLDTDGQATCD
ACAPGYTGRRCESCAPGYEGNPIQPGGKCRPVNQEIVRCDERGSMGTSGEACRCKNNVVG
RLCNECADGSFHLSTRNPDGCLKCFCMGVSRHCTSSSWSRAQLHGASEEPGHFSLTNAAS
THTTNEGIFSPTPGELGFSSFHRLLSGPYFWSLPSRFLGDKVTSYGGELRFTVTQRSQPG
STPLHGQPLVVLQGNNIILEHHVAQEPSPGQPSTFIVPFREQAWQRPDGQPATREHLLMA
LAGIDTLLIRASYAQQPAESRVSGISMDVAVPEETGQDPALEVEQCSCPPGYRGPSCQDC
DTGYTRTPSGLYLGTCERCSCHGHSEACEPETGACQGCQHHTEGPRCEQCQPGYYGDAQR
GTPQDCQLCPCYGDPAAGQAAHTCFLDTDGHPTCDACSPGHSGRHCERCAPGYYGNPSQG
QPCQRDSQVPGPIGCNCDPQGSVSSQCDAAGQCQCKAQVEGLTCSHCRPHHFHLSASNPD
GCLPCFCMGITQQCASSAYTRHLISTHFAPGDFQGFALVNPQRNSRLTGEFTVEPVPEGA
QLSFGNFAQLGHESFYWQLPETYQGDKVAAYGGKLRYTLSYTAGPQGSPLSDPDVQITGN
NIMLVASQPALQGPERRSYEIMFREEFWRRPDGQPATREHLLMALADLDELLIRATFSSV
PLAASISAVSLEVAQPGPSNRPRALEVEECRCPPGYIGLSCQDCAPGYTRTGSGLYLGHC
ELCECNGHSDLCHPETGACSQCQHNAAGEFCELCAPGYYGDATAGTPEDCQPCACPLTNP
ENMFSRTCESLGAGGYRCTACEPGYTGQYCEQCGPGYVGNPSVQGGQCLPETNQAPLVVE
VHPARSIVPQGGSHSLRCQVSGSPPHYFYWSREDGRPVPSGTQQRHQGSELHFPSVQPSD
AGVYICTCRNLHQSNTSRAELLVTEAPSKPITVTVEEQRSQSVRPGADVTFICTAKSKSP
AYTLVWTRLHNGKLPTRAMDFNGILTIRNVQLSDAGTYVCTGSNMFAMDQGTATLHVQAS
GTLSAPVVSIHPPQLTVQPGQLAEFRCSATGSPTPTLEWTGGPGGQLPAKAQIHGGILRL
PAVEPTDQAQYLCRAHSSAGQQVARAVLHVHGGGGPRVQVSPERTQVHAGRTVRLYCRAA
GVPSATITWRKEGGSLPPQARSERTDIATLLIPAITTADAGFYLCVATSPAGTAQARIQV
VVLSASDASPPPVKIESSSPSVTEGQTLDLNCVVAGSAHAQVTWYRRGGSLPPHTQVHGS
RLRLPQVSPADSGEYVCRVENGSGPKEASITVSVLHGTHSGPSYTPVPGSTRPIRIEPSS
SHVAEGQTLDLNCVVPGQAHAQVTWHKRGGSLPARHQTHGSLLRLHQVTPADSGEYVCHV
VGTSGPLEASVLVTIEASVIPGPIPPVRIESSSSTVAEGQTLDLSCVVAGQAHAQVTWYK
RGGSLPARHQVRGSRLYIFQASPADAGQYVCRASNGMEASITVTVTGTQGANLAYPAGST
QPIRIEPSSSQVAEGQTLDLNCVVPGQSHAQVTWHKRGGSLPVRHQTHGSLLRLYQASPA
DSGEYVCRVLGSSVPLEASVLVTIEPAGSVPALGVTPTVRIESSSSQVAEGQTLDLNCLV
AGQAHAQVTWHKRGGSLPARHQVHGSRLRLLQVTPADSGEYVCRVVGSSGTQEASVLVTI
QQRLSGSHSQGVAYPVRIESSSASLANGHTLDLNCLVASQAPHTITWYKRGGSLPSRHQI
VGSRLRIPQVTPADSGEYVCHVSNGAGSRETSLIVTIQGSGSSHVPSVSPPIRIESSSPT
VVEGQTLDLNCVVARQPQAIITWYKRGGSLPSRHQTHGSHLRLHQMSVADSGEYVCRANN
NIDALEASIVISVSPSAGSPSAPGSSMPIRIESSSSHVAEGETLDLNCVVPGQAHAQVTW
HKRGGSLPSHHQTRGSRLRLHHVSPADSGEYVCRVMGSSGPLEASVLVTIEASGSSAVHV
PAPGGAPPIRIEPSSSRVAEGQTLDLKCVVPGQAHAQVTWHKRGGNLPARHQVHGPLLRL
NQVSPADSGEYSCQVTGSSGTLEASVLVTIEPSSPGPIPAPGLAQPIYIEASSSHVTEGQ
TLDLNCVVPGQAHAQVTWYKRGGSLPARHQTHGSQLRLHLVSPADSGEYVCRAASGPGPE
QEASFTVTVPPSEGSSYRLRSPVISIDPPSSTVQQGQDASFKCLIHDGAAPISLEWKTRN
QELEDNVHISPNGSIITIVGTRPSNHGTYRCVASNAYGVAQSVVNLSVHGPPTVSVLPEG
PVWVKVGKAVTLECVSAGEPRSSARWTRISSTPAKLEQRTYGLMDSHAVLQISSAKPSDA
GTYVCLAQNALGTAQKQVEVIVDTGAMAPGAPQVQAEEAELTVEAGHTATLRCSATGSPA
PTIHWSKLRSPLPWQHRLEGDTLIIPRVAQQDSGQYICNATSPAGHAEATIILHVESPPY
ATTVPEHASVQAGETVQLQCLAHGTPPLTFQWSRVGSSLPGRATARNELLHFERAAPEDS
GRYRCRVTNKVGSAEAFAQLLVQGPPGSLPATSIPAGSTPTVQVTPQLETKSIGASVEFH
CAVPSDRGTQLRWFKEGGQLPPGHSVQDGVLRIQNLDQSCQGTYICQAHGPWGKAQASAQ
LVIQALPSVLINIRTSVQTVVVGHAVEFECLALGDPKPQVTWSKVGGHLRPGIVQSGGVV
RIAHVELADAGQYRCTATNAAGTTQSHVLLLVQALPQISMPQEVRVPAGSAAVFPCIASG
YPTPDISWSKLDGSLPPDSRLENNMLMLPSVRPQDAGTYVCTATNRQGKVKAFAHLQVPE
RVVPYFTQTPYSFLPLPTIKDAYRKFEIKITFRPDSADGMLLYNGQKRVPGSPTNLANRQ
PDFISFGLVGGRPEFRFDAGSGMATIRHPTPLALGHFHTVTLLRSLTQGSLIVGDLAPVN
GTSQGKFQGLDLNEELYLGGYPDYGAIPKAGLSSGFIGCVRELRIQGEEIVFHDLNLTAH
GISHCPTCRDRPCQNGGQCHDSESSSYVCVCPAGFTGSRCEHSQALHCHPEACGPDATCV
NRPDGRGYTCRCHLGRSGLRCEEGVTVTTPSLSGAGSYLALPALTNTHHELRLDVEFKPL
APDGVLLFSGGKSGPVEDFVSLAMVGGHLEFRYELGSGLAVLRSAEPLALGRWHRVSAER
LNKDGSLRVNGGRPVLRSSPGKSQGLNLHTLLYLGGVEPSVPLSPATNMSAHFRGCVGEV
SVNGKRLDLTYSFLGSQGIGQCYDSSPCERQPCQHGATCMPAGEYEFQCLCRDGFKGDLC
EHEENPCQLREPCLHGGTCQGTRCLCLPGFSGPRCQQGSGHGIAESDWHLEGSGGNDAPG
QYGAYFHDDGFLAFPGHVFSRSLPEVPETIELEVRTSTASGLLLWQGVEVGEAGQGKDFI
SLGLQDGHLVFRYQLGSGEARLVSEDPINDGEWHRVTALREGRRGSIQVDGEELVSGRSP
GPNVAVNAKGSVYIGGAPDVATLTGGRFSSGITGCVKNLVLHSARPGAPPPQPLDLQHRA
QAGANTRPCPS
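
The stated sequence length can be cross-checked against the listing itself. The sketch below is an illustration only: `sequence_length` is a hypothetical helper, and the figure of 73 full-width lines is an assumption read off the listing above, with each of those lines assumed to hold as many residues as the first one.

```python
from typing import Iterable


def sequence_length(lines: Iterable[str]) -> int:
    """Count residues in a line-wrapped sequence, ignoring any whitespace."""
    return sum(len("".join(line.split())) for line in lines)


# First and last lines of the sequence as shown on this page.
first_line = "MGWRAAGALLLALLLHGRLLAVTHGLRAYDGLSLPEDIETVTASQMRWTHSYLSDDEDML"
last_line = "QAGANTRPCPS"

# Assumption: 73 full lines of equal width, followed by the short last line.
FULL_LINES = 73
line_width = sequence_length([first_line])
expected_total = FULL_LINES * line_width + sequence_length([last_line])

print(line_width, sequence_length([last_line]), expected_total)
```

Under that assumption the total agrees with the reported sequence length of 4391, and every assigned region (the largest end coordinate is 4365) falls inside the sequence.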
Identical sequences P98160
gi|126012571|ref|NP_005520.4| NP_005520.4.87134 NP_005520.4.92137 ENSP00000363805 ENSP00000363827 9606.ENSP00000363827 ENSP00000363827