SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000022098 from Equus caballus 76_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSECAP00000022098
Domain Number 1 Region: 30-787
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 5.83e-278
Family Motor proteins 0.000000000141
Further Details:      
 
Domain Number 2 Region: 1950-2066
Classification Level Classification E-value
Superfamily Second domain of FERM 1.09e-22
Family Second domain of FERM 0.0024
Further Details:      
 
Domain Number 3 Region: 1247-1350
Classification Level Classification E-value
Superfamily Ubiquitin-like 1.11e-20
Family First domain of FERM 0.033
Further Details:      
 
Domain Number 4 Region: 2067-2157
Classification Level Classification E-value
Superfamily PH domain-like 2.34e-20
Family Third domain of FERM 0.015
Further Details:      
 
Domain Number 5 Region: 1850-1958
Classification Level Classification E-value
Superfamily Ubiquitin-like 0.00000000000000336
Family First domain of FERM 0.061
Further Details:      
 
Domain Number 6 Region: 1342-1436
Classification Level Classification E-value
Superfamily Second domain of FERM 0.000000000000109
Family Second domain of FERM 0.0073
Further Details:      
 
Domain Number 7 Region: 1563-1633
Classification Level Classification E-value
Superfamily SH3-domain 0.0000000152
Family SH3-domain 0.0042
Further Details:      
 
Domain Number 8 Region: 790-858
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 0.0000443
Family Motor proteins 0.045
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000022098   Gene: ENSECAG00000024143   Transcript: ENSECAT00000026486
Sequence length 2171
Comment pep:known_by_projection chromosome:EquCab2:7:67131282:67196394:-1 gene:ENSECAG00000024143 transcript:ENSECAT00000026486 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDLRSGQEFDVPIGAVVKLCDSGQIQVVDDEGNEHWISPQNATHIKPMHPTSVHGVEDMI
RLGDLNEAGILRNLLIRYRDHLIYTNCGGRTYTGSILVAVNPYQLLSIYSPEHIRQYTNK
KIGEMPPHIFAIADNCYFNMKRNSRDQCCIISGESGAGKTESTKLILQFLAAISGQHSWI
EQQVLEATPILEAFGNAKTIRNDNSSRFGKYIDIHFNKRGAIEGAKIEQYLLEKSRVCRQ
APDERNYHVFYCMLEGMSEEQKRKLGLGGASDYNYLAMGNCIACEGREDSQEYANIRSAM
KVLMFTDTENWEISKLLAAILHLGNLQYEARTFENLDACEVLFSPCLATAASLLEVNPPD
LMTCLTSRTLITRGETVSTPLSREQALDVRDAFVKGIYGRLFVWIVDKINAAIYKPPSQE
VKNSRRSIGLLDIFGFENFAVNSFEQLCINFANEHLQQFFVRHVFKLEQEEYDLESIDWL
HIEFTDNQDALDMIANKPMNIISLIDEESKFPKGTDTTMLHKLNSQHKLNSNYVPPKNNH
ETQFGIIHFAGVVYYESQGFLEKNRDTLHGDIIQLVHSSRNKFIKQLFQADVAMGAETRK
RSPTLSSQFKRSLELLMRTLGACQPFFVRCIKPNEFKKPMLFDRHLCVRQLRYSGMMETI
RIRRAGYPIRYSFVEFVERYRVLLPGVKPAYKQDDLRGTCQRMAEAVLGTHDDWQIGRTK
IFLKDHHDMLLEVERDKAITDRVILLQKVIRGFKDRSNFLKLKNAATLIQRHWRGHNCRR
NYELMRLGFLRLQALHRARKLHQQYRLARRHIIEFQARCRAYLVRRAFRHRLWAVLTVQA
YARGLIARRLYRRLRAEYLRRLEAEKMRLAEEEKLRKEMSAKKAKEEAERKHQERLAQLA
REDAERELKEKEEARRKKELLEQMERARHEPINHSDMVDKMFGFLGTSGGLPGQEGQAPS
GFEDLERGRREMVEEDLDAALPLPDEDEEDLSEYKFAKFAATYFQGTTMHTYTRRPLKQP
LLYHDDEGDQLAALAVWITILRFMGDLPEPKYHTAMSDGSEKIPVMTKIYETLGKKTYKR
ELQALQGEGEAQLPEGQKKSSVRHKLVHLTLKKKSKLTEEVTKRLHDGESTVQGNSMLED
RPTSNLEKLHFIIGNGILRPALRDEIYCQISKQLTHNPSKSSYARGWILVSLCVGCFAPS
EKFVKYLRNFIHGGPPGYAPYCEERLRRTFVNGTRTQPPSWLELQATKSKKPIMLPVTFM
DGTTKTLLTDSATTAKELCNALADKISLKDRFGFSLYIALFDKVSSLGSGSDHVMDAISQ
CEQYAKEQGAQERNAPWRLFFRKEVFTPWHNPSEDNVATNLIYQQVVRGVKFGEYRCEKE
DDLAELASQQYFVDYGSEMILERLLSLVPTYIPDREITPLKTLEKWAQLAIAAHKKGIYA
QRRTEAQKVKEDVVNYARFKWPLLFSRFYEAYKFSGPNLPKNDVIVAVNWTGVYFVDEQE
QVLLELSFPEIMAVSSSRGAKLTAPSFTLATIKGDEYTFTSSNSEDIRDLVVTFLEGLRK
RSKYVVALQDNPNPAGEESGFLSFAKGDLIILDHDTGEQVMNSGWANGINERTKQRGDFP
TDCVYVMPTVTMPPREIVALVTLTPDQRQDVIRLLQQRTAEPEPRAKPYTLEEFSYDYFR
PPPKHTLSRVMVSKARGKDRLWSHTREPLKQALLKKILGSEELSQEACMAFIDIHTLGDG
PHGGMRSVNELTDQIFEGALKAEPLKDEVYVQILKQLTDNHIRSRTDRGWLQKLWLNTLA
GWEPGSVSQVPSVLHSPWPHEPLAPDFATPHVFFRNGSRKYPPHLVEVEAIQHKTTQIFH
KVYFPDDTDEAFEVESSTKAKDFCQSIATRLLLKSSDGFSLFVKIADKVISVPENDFFFD
FVRHLTDWIKKARPIKDGIVPSLTYQVFFMKKLWTTTVPGKDPMADSIFHYYQELPKYLR
GYHKCTREEVLQLGALIYRVKFEEDKSYFPSIPKLLRELVPQDLIRQISPDDWKRSIVAY
FNKHAGKSKEEAKLAFLKLIFKWPTFGSAFFEVKQTTEPNFPEILLIAINKYGVSLIDPR
TKDILTTHPFTKISNWSSGNTYFHITIGNLVRGSKLLCETSLGYKMDDLLTSYISQMLTA
MSKQRGSRGGK
Download sequence
Identical sequences F6S607
ENSECAP00000022098

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]