SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000019567 from Equus caballus 69_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSECAP00000019567
Domain Number 1 Region: 331-508,543-709
Classification Level Classification E-value
Superfamily (Trans)glycosidases 4.91e-84
Family YicI catalytic domain-like 0.00087
Further Details:      
 
Domain Number 2 Region: 1206-1398,1443-1604
Classification Level Classification E-value
Superfamily (Trans)glycosidases 2.47e-73
Family YicI catalytic domain-like 0.00026
Further Details:      
 
Domain Number 3 Region: 87-330
Classification Level Classification E-value
Superfamily Galactose mutarotase-like 4.71e-50
Family YicI N-terminal domain-like 0.0086
Further Details:      
 
Domain Number 4 Region: 955-1202
Classification Level Classification E-value
Superfamily Galactose mutarotase-like 6.28e-48
Family YicI N-terminal domain-like 0.01
Further Details:      
 
Domain Number 5 Region: 1607-1687
Classification Level Classification E-value
Superfamily Glycosyl hydrolase domain 2.52e-17
Family Putative glucosidase YicI, domain 3 0.024
Further Details:      
 
Domain Number 6 Region: 712-792
Classification Level Classification E-value
Superfamily Glycosyl hydrolase domain 3.31e-16
Family Putative glucosidase YicI, domain 3 0.021
Further Details:      
 
Domain Number 7 Region: 927-978
Classification Level Classification E-value
Superfamily Trefoil 0.00000000199
Family Trefoil 0.0079
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000019567   Gene: ENSECAG00000021256   Transcript: ENSECAT00000023613
Sequence length 1827
Comment pep:known chromosome:EquCab2:19:6057109:6141141:-1 gene:ENSECAG00000021256 transcript:ENSECAT00000023613 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MAKKKFTPLEIILIVLFVIVTIIAIALIIVLATKTPAVEGLKFSTSTPATSTTTAYPGSE
NCPSELNDAINERINCIPEQFPTQALCASRGCCWRPWNDSVIPWCFFVDNHGYNVEEMTT
NNTGLEARLNRIPSPTLFGDDINSVLLTTQSQTPNRFRFKITDPNNRRYEVPHQFVKEPT
GTTDSETLYNVQVTENPFSIKVIRKSNNRTLFDTSIGPLVYSDQYLQISTRLPSEYIYGI
GEHIHKRFRHDLYWKKWPLFTRDQLPGDNNNNLYGHQTFFMCIEDTSGKSFGVFLMNSNA
MEIFIQPTPIVTYRVIGGILDFYIFLGDTPEQVVQQYQELIGLPAMPSYWSLGFQLSRWN
YKSLDVVKEVVRRNREAGIPFDTQVTDIDYMEDKKDFTYDKVTFSGLPEFVQDLHDHGQK
YVIILDPAISIDRRADGTAYEAYERGNAQKVWVNESDGTTAIIGEVWPGLTVYPDFTNPS
CIDWWANECSIFHQEVPYDGIWIDMNEVSSFVQGSLKGCDVNKLNYPPFTPDILDKLLYS
KTICMDAVQYWGKQYDVHSLYGYSMAIATEQAVQKVFPNKRSFILTRSTFAGSGSYAAHW
LGDNTASWEQMEWSIAGMLEFGLFGMPLVGADICGFVADTTEELCRRWMQLGAFYPFSRN
HNADGYVEQDPAFFGQDSLLVRSSKYYLNIRYSLLPFLYTLFYKAHKFGETVARPILHEF
YEDTNSWIEDTQFLWGPALLITPVLKEGADTVSAYIPDATWYDYETGAKRPWRKQRVNMY
LPADKIGLHLRGGYIIPTQEPAVTTNASRQNPLGLIVPLDENNTAKGDFFWDDGETKGKS
QKHGNYILYTFSVSDNKLGIICTHSSYQEGTTLAFETIKILGLTETVTYVLVGEENRPTQ
AHSNFTYYPSNQSLLIYNLNFNLGRNFTVQWDQSFPENEKFTCYPDADVATEEKCRQRGC
LWEPSSFGSRAPDCYFPREDNPYLVSSIQYSSMGVTADLQLNTAKARINLPSEPISTLRV
EVKYHKNDMLQFKIYDAQNKRYEVPVPLNIPDTPTSTYENRLYDVEIKENPFGIQIRRRS
TGTVIWDSQLPGFAFNDQFIQISTRLPSEYIYGFGEVEHTAFKRDLNWHTWGMFTRDQPP
GYKLNSYGFHPYYMALEDESNAHGVFLLNSNGMDVTFQPTPALTYRIIGGILDFYMFLGP
HPEVATKQYHEVIGQPVMPPYWSLGFQLCRYGYRNTSQVQQVYEEMVAARIPYDVQYTDI
NYMERQLDFTIGEAFSDLPQFVDRIRQEGMRYIIILDPAISGNETQPYPAFERGQEKDVF
VKWPNTDEICWAKVWPDLPNITIDESLTEDEAVNASRAHAAFPDFFRNSTAQWWAKEILD
FYNNKMKFDGLWIDMNEPSSFVNGTTTNQCRNEGLNYPPYFPELTKRTDGLHFRTLCMET
EQILSDGSSVLHYDVHNLYGWSQVKPTYDALQRTTGKRGIVISRSTYPTAGRWGGHWLGD
NYARWDNMDKSIIGMMEFSLFGISYTGADICGFFNDTEYQLCARWMQLGAFYPYSRNHNI
ANTRRQDPASWNETFAAMSRDILNVRYTLLPYFYTQLHEVHVQGGTVIRPLLHEFFNEKP
TWDIFKQFLWGPAFMVTPVMEPNVDVVQGYVPNARWFDYHTGEDIGFRGNFHVFDAPLDK
INLHVRGGHILPCQEPAQNTFYSRQNYMRLIVAADDNHTAQGSLFWDDGDTINTYERDLY
FLIQFNYNHTTLTSTVLKNGYINRSEMRLGIINIWGKGKTPVQQVHLTYDENTYSLQFTQ
EADKEILNIDLRPNNFTLDEPIEIKWS
Download sequence
Identical sequences F6X9C4
ENSECAP00000019567 9796.ENSECAP00000019567 ENSECAP00000019567

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]