SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOPRP00000005472 from Ochotona princeps 76

Domain assignment details

Strong hits

Sequence:  ENSOPRP00000005472
Domain Number 1 Region: 2529-2617,2714-2739,2773-2823
Classification Level Classification E-value
Superfamily Sialidases 0.00000000036
Family Sialidases (neuraminidases) 0.044
 
Domain Number 2 Region: 792-817,1260-1379
Classification Level Classification E-value
Superfamily Sialidases 0.0000216
Family Sialidases (neuraminidases) 0.056
 
Weak hits

Sequence:  ENSOPRP00000005472
Domain Number - Region: 2484-2509
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000492
Family Integrin beta EGF-like domains 0.027
 
Domain Number - Region: 678-703
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00121
Family EGF-type module 0.023
 
Domain Number - Region: 798-824,1100-1165
Classification Level Classification E-value
Superfamily Sialidases 0.00196
Family Sialidases (neuraminidases) 0.02
 
Domain Number - Region: 1036-1061
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00214
Family EGF-type module 0.065
 
Domain Number - Region: 1770-1794
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00544
Family Integrin beta EGF-like domains 0.043
 
Domain Number - Region: 2132-2163
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00828
Family EGF-type module 0.042
 
Domain Number - Region: 1525-1566,1683-1733
Classification Level Classification E-value
Superfamily Sialidases 0.0114
Family Sialidases (neuraminidases) 0.047
 
Domain Number - Region: 1681-1797,1877-1985,2202-2267
Classification Level Classification E-value
Superfamily Sialidases 0.0167
Family Sialidases (neuraminidases) 0.022
 
Domain Number - Region: 3230-3257
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0754
Family Integrin beta EGF-like domains 0.051
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSOPRP00000005472   Gene: ENSOPRG00000005912   Transcript: ENSOPRT00000005959
Sequence length 3423
Comment pep:known_by_projection genescaffold:pika:GeneScaffold_3556:191417:684238:-1 gene:ENSOPRG00000005912 transcript:ENSOPRT00000005959 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MERGRWAPRTLLVVLLLLGATLRGRAAVGYSPRFSPFFFLCTHHGELEGDGEQGEVLISL
HIAGSPTYYVPGQEYHVTISTSTFFDGLLVTGLYTSTSAQSSQSIGGANAFGFGIMSDHQ
FGNQFMCSVVASHVSHLPTTNLSFVWIAPPAGTGCVNFMATATHRGQVIFKDALAQQLCE
QGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAACNNCELGEQCGAIMQGSA
VTFCEPYGSRELMTTGLNTTTASVLQFSIGSGSCRFSYSDPSILVSYAKNNTADWIQLEK
IRAPSNVSTIIHILYLPEEAKGENVQFQWKQENLHVGEVYEACWALDNILVINSAHRQVV
LEDNLDPVDTGNWLFFPGATIKHSCQSDGNSIYFHGNEGSEFNFATTRDVDLSTEDIQEQ
WSEEFESQPTGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXGICDPGDSHEDDVILYAKIEGRKEHTALDTLSYSSYKXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVSLEFSTNHGR
SWSLLHTECLPEICAGPHLPHSTVYPSENYSGGNPLTIPLPNAALTRDTRIRWRQTGPIL
GNMWAIDNVYIGPSCLKFCSGRGQCTRHGCKCDPGFSGPACEMASQTFPMFISESFGNSR
LSSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTLRLGSK
SVLSTCKAPDQPGEGVLLHYSYDNGITWKLLEHYSYLNYHEPRIISVELPDDARQFGIQF
RWWQPYHSSQGEDVWAIDEVIMTSVLFNSISLDFTNLVEVTQSLGFYLGNVQPYCGHDWT
LXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXECLPSMPSCQEFTSASIYHASEFTQWRRIIALLPQKTWSSATRFRWSQSYYTAQ
DEWALDDIYIGQQCPSMCSGHGSCDHGMCRCDQGYHGTECHEATLPSTIMSDFENQNGWE
ADWQEVIGGEIVKPEQGCGVISSGSSLYFSKAGKRQLVSWDLDTSWVDFVQFYIQIGGES
TACNKPDSREEGILLQYSNNGGIQWHLLAEMYFSDFSKPRFVYLELPTAAKTPCTRFRWW
QPVFSGEDYDQWAIDDIIILSEKQKQVIPAVNPTFPQNFYEKPAFDYPMNQMSVWLMLAN
EGLAKNETFCAATPSAMVFGKSDGDRFAVTRDLTLKPGYVLQFKLNIACANQFSTTAPVL
LQYSHDAGMSWFLVKEGCYPASAGKSCEGSSRELSEPTVYHTGDFEEWTRITIVIPRSLA
SSKTRFRWIQESSSQKNVPPFGLDGVYISEPCPSYCSGHGDCISGVCFCDLGYTAAQGTC
VSNIPNYSEMFDRFEGKLSPLWYKITGGQVGTGCGTLNDGKSLYFSGPGKREARTVPLDT
RTIRLVQFYVQIGSKTSGITCSKPRARNEGLVVQYSNDNGIVWHLLRELDFSSFLEPRII
SIDLPREAKTSATAFRWWQPQHGKHSAQWALDDVLIGMNDSSQTGFQDKFDGSLDLQANW
YRIQGGQVDIDCLSMDTALIFTENIGKPRYAETWDFHVSASTFLQFEMSMGCSKPFSDSH
SIQLQYSLNSGRDWHLVTEECVPPTIGCLHYTQSSIYTSERFQNWKQITVYLPLSTISPR
TRFRWIQTNYTVGADSWALDNVVLASGCPWMCSGRGVCDAGRCVCDRGFGGPSCVPVVPL
PSILKDDFNGNLHPDLWPEVYGAERGDLNGEPIKSGTSLIFKGEGLRMLISRDLDCTNTM
YVQFSLRFIAKGTPERSHCILLQFSVTGGITWHLMDEFYFPQTTNILFINVPLPYTAQTN
ATRFRLWQPYNNGKKEEIWIVDDFVIDGNNLHNPVMLLDTFDFGPREDNWFFYPGGNIGL
YCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLSVNENTIIQFEXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEGMCNGHGSCINGTKCICDPGYSGPTCK
ISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLMTRDLDL
SHARFVQFFMRLGCGKGVPDPRSQPVLLQYSRNGGLSWSLLQEFLFSNASNVGRYIALEI
PLKARSGSTRLRWWQPSESGHFYGPWVIDQILIGGNISGNTVLEDDFTGLDSRKWLLHPG
GTKMPVCGSSGDALVFIEKASTRYVVTTDIAVNEDSFLQMDFAASCSVTDSCYXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSQATRFRWHQ
PAPFDKQQTWAIDNVYIGDGCVDMCSGHGRCVQGNCVCDEQWGGLYCDEPEAPLPAQLKD
NFNRAPSNQNWLTVNGGKLSTVCGAVASGMALHFSGGCSRLLVTVDLNLTNAEFIQFYFM
YGCLITPNNRNQGVLLEYSVNGGTTWNLLMEIFYDQYTKPGFVNILLPPDAKEVATRFRW
WQPKHDGLDQNDWAIDNVLISGSADQRTVMLDTFSTAPVPQHERSPADAGPVGRIAFDMF
MEDKTEVNEHWLHDDCMVERFCHSPDGVMICGSHDGREVYAVTHDLTPTEGWIMQFKISV
GCKVPEKIAQNQIHVQYSTDFGVSWNYLVPQCLPADPKCSGSVSQPSVFFPTKGWKRITY
PLPESLVGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXTFLKERFDSEEIKPDLWMSLEGGNTCTDCGVLAEDTALYFGGSTVRQAITQD
LDLRGAKFLQYWGRIGSENNMTSCHRPVCRKEGVLLDYSTDGXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXPTNTIACHWWQHFVINHGIGVFWVELAQGADNLSGGADINPSQLVDTFD
DEGTSHEENWSFYPNAVRTAGFCGNPSFHLYWPSEKKDKTHNALSSRELIIQPGYMMQFK
IVVGCEDTSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQFHEATIYNAV
NSSSWKRITIQLPDHVSSSATQFRWIQKGEEAEKQSWAIDHVYIGEACPKLCSGHGYCTT
GAVCICDASFQGDDCSVFTHDLPSSIKDNFESARVTEANWETIQGGSIGSGCGQLAPYAH
GDSLYFNGCQVRQAATKPLDLTRASKIMFVLQIGSTSQTDSCNSDLSSPHAVDKAVLLQY
SVNNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQPRHNGTGHDQWALDHV
EVV
Identical sequences ENSOPRP00000005472
