SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPANP00000012150 from Papio anubis 76

Domain assignment details

Strong hits

Sequence:  ENSPANP00000012150
Domain Number 1 Region: 2362-2429,2541-2617,2715-2740,2774-2825
Classification Level Classification E-value
Superfamily Sialidases 0.00000000000589
Family Sialidases (neuraminidases) 0.047
 
Domain Number 2 Region: 1681-1797,1877-1985,2028-2089,2246-2278
Classification Level Classification E-value
Superfamily Sialidases 0.0000000149
Family Sialidases (neuraminidases) 0.02
 
Domain Number 3 Region: 2580-2665,2773-2807,2858-3101
Classification Level Classification E-value
Superfamily Sialidases 0.000000333
Family Sialidases (neuraminidases) 0.022
 
Domain Number 4 Region: 586-639,783-858,941-1005
Classification Level Classification E-value
Superfamily Sialidases 0.000000343
Family Sialidases (neuraminidases) 0.034
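Several of the domains above are discontinuous: a single domain is reported as a comma-separated list of residue ranges. The following is a minimal sketch of how such a region string could be parsed and summarised. It assumes the coordinates are 1-based and inclusive, and that the segments within one domain do not overlap each other. The function names are hypothetical and are not part of any SUPERFAMILY tooling.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '586-639,783-858' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(segments: list[tuple[int, int]]) -> int:
    """Count residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


def shared_residues(a: list[tuple[int, int]], b: list[tuple[int, int]]) -> int:
    """Count residues present in both domains (segments within a domain must be disjoint)."""
    return sum(
        max(0, min(e1, e2) - max(s1, s2) + 1)
        for s1, e1 in a
        for s2, e2 in b
    )


# Region strings copied from the strong hits listed above.
domain_1 = parse_region("2362-2429,2541-2617,2715-2740,2774-2825")
domain_3 = parse_region("2580-2665,2773-2807,2858-3101")
domain_4 = parse_region("586-639,783-858,941-1005")

print(covered_residues(domain_4))           # 195
print(shared_residues(domain_1, domain_3))  # 72
```

The second figure shows that the reported regions for domains 1 and 3 share some residues, which is worth keeping in mind when summing domain coverage over the whole protein.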
 
Weak hits

Sequence:  ENSPANP00000012150
Domain Number - Region: 2484-2508
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000785
Family Integrin beta EGF-like domains 0.023
 
Domain Number - Region: 677-702
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00136
Family EGF-type module 0.023
 
Domain Number - Region: 1035-1060
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00199
Family EGF-type module 0.069
 
Domain Number - Region: 2132-2163
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00628
Family EGF-type module 0.051
 
Domain Number - Region: 1153-1222,1312-1361
Classification Level Classification E-value
Superfamily Sialidases 0.0255
Family Sialidases (neuraminidases) 0.041
 
Domain Number - Region: 3233-3260
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0911
Family Integrin beta EGF-like domains 0.042
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPANP00000012150   Gene: ENSPANG00000009099   Transcript: ENSPANT00000013028
Sequence length 3458
Comment pep:known_by_projection chromosome:PapAnu2.0:3:125622763:126131890:-1 gene:ENSPANG00000009099 transcript:ENSPANT00000013028 gene_biotype:protein_coding transcript_biotype:protein_coding
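The comment line above follows the layout of an Ensembl peptide FASTA header: space-separated `key:value` tokens, with the location token carrying the assembly, sequence region, start, end and strand. A minimal sketch of splitting it into fields is shown below. The function names are hypothetical, and the field order inside the location token is assumed from the usual Ensembl convention rather than stated on this page.

```python
def parse_comment(comment: str) -> dict[str, str]:
    """Split an Ensembl-style FASTA comment into a key -> value mapping."""
    fields = {}
    for token in comment.split():
        # Split on the first colon only; the location value contains more colons.
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> dict[str, object]:
    """Unpack 'assembly:region:start:end:strand' (assumed field order)."""
    assembly, region, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


comment = (
    "pep:known_by_projection "
    "chromosome:PapAnu2.0:3:125622763:126131890:-1 "
    "gene:ENSPANG00000009099 transcript:ENSPANT00000013028 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

fields = parse_comment(comment)
location = parse_location(fields["chromosome"])
print(fields["gene"])                            # ENSPANG00000009099
print(location["end"] - location["start"] + 1)   # 509128
```

Under these assumptions the gene lies on the reverse strand of chromosome 3 in the PapAnu2.0 assembly, and the transcript's locus spans 509128 bases.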
Sequence
MERSGWAWRTFLLALLLGATQRARAAAGYYPRFSPFFFLCTHHGELEGDGEQGEVLISLH
IAGNPTYYVPGQEYHVTISTSTFFDGLLVTGLYTSTSVQASQSIGGSSAFGFGIMSDHQF
GNQFMCSVVASHVSHLPTTNLSFIWIAPPAGTGCVNFMATATHRGQVIFKDALAQQLCEQ
GAPTEATMHPHLAEIHSDSIILRDDFDSYHQLQLNPNIWVECNNCETGEQCGAIMHGNAV
TFCEPYGPRELITTGLNTTTASVLQFSIGSGSCRFSYSDPSIIVLYAKNNSADWIQLEKI
RAPSNVSTIIHILYLPEDAKGENVQFQWKQENLRVGEVYEACWALDNILIINSAHRQVIL
EDSLDPVDTGNWLFFPGATVKHSCQSDGNSIYFHGNEGSEFNFATTRDVDVSTEDIQEQW
SEEFESQPTGWDVLGAVIGTECGTIESGLSMVFLKDGERKLCTPYMDTTGYGNLRFYFVM
GGICDPGNSHENDIILYAKIEGRKEHIALDTLSYSSYKVPSLVSVVINPELQTPATKFCL
RQKNHQGHNRNVWAVDFFHVLPVLPSTMSHMIQFSINLGCGTHQPGNSVSLEFSTNHGRS
WSLLHTECLPEICAGPHLPHSTVYSSENYSGWNRITIPLPNAALTRNTRIRWRQAGPILG
NMWAIDNVYIGPSCLKFCSGRGQCTRHGCKCDPGFSGPACEMASQTFPMFISESFGSSRL
SSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTLRLGSKS
VLSTCRAPDQPGEGVLLHYSYDNGITWKLLEHYSYLSYHEPRIISVELPDDARQFGIQFR
WWQPYHSSQGEDVWAIDEIIMTSVLFNSISLDFTNLVEVTQSLGFYLGNVQPYCGHDWTL
CFTGDSKLASSMRYVETQSMQIGACYMIQFSLVMGCGQKYTPHMDNQVKLEYSTNHGLTW
HLVQEECLPSMPSCQEFTSASIYHASEFTQWRRVIVLLPQKTWSSATRFRWSQSYYTVQD
EWALDSIYIGQQCPNMCSGHGSCDHGVCRCDQGYQGTECHPEAALPSTIMSDFENQNGWE
SDWQEVIGGEIVKPEQGCGVISSGSSLYFSKAGKRQLVSWDLDTSWVDFVQFYIQIGGES
ASCNKPDSREEGVLLQYSNNGGIQWHLLAEMYFSDFSKPRFVYLELPAAAKTPCTRFRWW
QPVFSGEDYDQWAVDDIIILSEKQKQIIPVINPTLPQNFYEKPAFDYPMNQMSVWLMLAN
EGMVKNETFCAATPSAMIFGKSDGDRFAVTRDLTLKPGYVLQFKLNIGCANQFSNTAPVL
LQYSHDAGMSWFLVKEGCYPASAGKGCEGNSRELSEPTMYHAGDFEEWTRITIVIPRSLA
SSKTRFRWIQESSSQKNVPPFGLDGVYISEPCPSYCSGHGDCISGVCFCDLGYTAAQGTC
VSNVPNHNEMFDRFEGKLSPLWYKITGAQVGTGCGTLNDGKSLYFNGPGKREARTVPLDT
RNIRLVQFYIQIGSKTSGITCIKPRTRNEGLIVQYSNDNGILWHLLRELDFMSFLEPQII
SIDLPQEAKTPATAFRWWQPQHGKHSAQWALDDVLIGMNDSSQTGFQDKFDGSIDLQANW
YRIQGGQVDIDCLSMDTALIFTENIGKPRYAETWDFHVSASTFLQFEMSMGCSKPFSSSH
SVQLQYSLNNGKDWHLVTEECVPPTIGCLHYTESSIYTSERFQNWKRITVYLPLSTISPR
TRFRWIQANYTVGADSWAIDNVVLASGCPWMCSGRGICDAGRCVCDRGFGGPYCVPVVPL
PSILKDDFNGNLHPDLWPEVYGAERGNLNGETIKSGTCLIFKGEGIRMLISRDLDCTNTM
YVQFSLRFIAKGTPERSHSILLQFSINGGITWHLMDEFYFPQTTNILFINVPLPYTAQTN
ATRFRLWQPYNNGKKEEIWIVDDFIIDGNNLNNPMMLLDTFDFGPREDNWFFYPGGNIGL
YCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLNVNENTIIQFEINVGCSTDSSSADPVR
LEFSRDFGATWHLLLPLCYHSSSHVSSLCSTEHHPSSTYYAGTMQGWRREVVHFGKLHLC
GSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCNGQGSCINGTKCICDPGYSGPTCK
ISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLMTRDLDL
SHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRYIALEI
PLKARSGSTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFTTLDSRKWLLHPG
GTKMPVCGSTGDALVFIEKASTRYVVSTDIAVNEDSFLQIDFAASCSVTDSCYAIELEYS
VDLGLSWHPLVRDCLPTNVECSRYHLQRILVSDTFNKWTRITLPLPPYTRSQATRFRWHQ
PAPFDKQQTWAIDNVYIGDGCIDMCSGHGRCIQGNCVCDEQWGGLYCDDPETSLPTQLKD
NFNRAPSNQNWLTVNGGKLSTVCGAVASGMALHFSGGCSRLLVTVDLNLTNAEFIQFYFM
YGCLITPNNRNQGVLLEYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEIATRFRW
WQPRHDGLDQNDWAIDNVLISGSADQRTVMLDTFSSAPVPQHERSPADAGPVGRIAFDMF
MEDKTSVNEHWLFHDDCTVERFCESPDGVMLCGSHDGREVYAVTHDLTPTEGWIMQFKIS
VGCKVSEKIAQNQIHVQYSTDFGVSWNYLVPQCLPADPKCSGTVSQPSVFFPTKGWKRIT
YPLPESLVGNPVRFRFYQKYSDMQWAIDNFYLGPGCLDNCRGHGDCLREQCICDPGYSGP
NCYLTHTLKTFLKERFDSEEIKPDLWMSLEGGSTCTECGILAEDTALYFGGSTVRQAITQ
DLDLRGAKFLQYWGRIGSENNMTSCHRPICRKEGVLLDYSTDGGITWTLLHEMDYQKYIS
VRHDYILLPEDALTNTTRLRWWQPFVISNGIVVSGVERAQWALDNILIGGAEINPSQLVD
TFDDEGTSHEENWSFYPNAVRTAGFCGNPSFHLYWPNKKKDKTHNALSSRELIIQPGYMM
QFKIVVGCEATSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQFHEATIY
NSVNSSSWKRITIQLPDHVSSSATQFRWIQKGEETEKQSWAIDHVYIGEACPKLCSGHGY
CTTGAICICDESFQGDDCSVFSHDLPSYIKDNFESARVTEANWETIQGGVIGSGCGQLAP
YAHGDSLYFNGCQIRQAATKPLDLTRASKIMFVLQIGSTSQTDSCNSDLSGPHTVDKAVL
LQYSVNNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQPRHNGTGHDQWAL
DHVEVVLTRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP
Identical sequences ENSPANP00000012150
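The sequence above is wrapped across many lines, so a quick consistency check against the declared sequence length can catch truncation when the record is copied. The sketch below joins wrapped lines and compares the result with a declared length. The helper names are hypothetical, and the example is run only on the final line of the sequence, not on the full protein.

```python
def join_wrapped(lines: list[str]) -> str:
    """Concatenate wrapped sequence lines, ignoring blank lines and whitespace."""
    return "".join(line.strip() for line in lines if line.strip())


def length_matches(sequence: str, declared_length: int) -> bool:
    """Return True when the joined sequence has the declared number of residues."""
    return len(sequence) == declared_length


# Only the last wrapped line of the record is used here, for brevity.
last_line = ["DHVEVVLTRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP", ""]
tail = join_wrapped(last_line)
print(len(tail))                 # 38
print(length_matches(tail, 38))  # True
```

To check the whole record, pass every sequence line to `join_wrapped` and compare against the declared value of 3458.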
