SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for 9796.ENSECAP00000021245 from STRING v9.0.5

Domain assignment details

Strong hits

Sequence: 9796.ENSECAP00000021245

Domain Number 1 Region: 2128-2202,2229-2278,2344-2369,2407-2605
Classification Level  Classification                    E-value
Superfamily           Hect, E3 ligase catalytic domain  1.01e-85
Family                Hect, E3 ligase catalytic domain  0.0000473

Domain Number 2 Region: 363-485
Classification Level  Classification  E-value
Superfamily           Ankyrin repeat  1.87e-25
Family                Ankyrin repeat  0.00076

Domain Number 3 Region: 1271-1339
Classification Level  Classification         E-value
Superfamily           Mib/herc2 domain-like  2.22e-22
Family                Mib/herc2 domain       0.0000087

Domain Number 4 Region: 11-215,512-638,891-976
Classification Level  Classification    E-value
Superfamily           ARM repeat        8e-20
Family                Armadillo repeat  0.022

Domain Number 5 Region: 1091-1243
Classification Level  Classification                                                         E-value
Superfamily           Galactose-binding domain-like                                          2.01e-12
Family                Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)     0.03
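
The Region fields above list one or more residue ranges per domain, with commas separating the segments of a discontinuous domain. A minimal Python sketch of how these strings might be parsed and summed is shown here. The helper names `parse_regions` and `covered_residues` are hypothetical and not part of any SUPERFAMILY tool, and the code assumes the coordinates are 1-based and inclusive at both ends.

```python
# Hypothetical helpers for reading the "Region" strings shown in this record.
# Assumption: coordinates are 1-based and inclusive at both ends.

def parse_regions(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '11-215,512-638' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(region: str) -> int:
    """Count the residues covered by all segments of one domain."""
    return sum(end - start + 1 for start, end in parse_regions(region))


# Region strings copied from the five strong hits in this record,
# keyed by domain number.
REGIONS = {
    1: "2128-2202,2229-2278,2344-2369,2407-2605",  # Hect, E3 ligase catalytic domain
    2: "363-485",                                  # Ankyrin repeat
    3: "1271-1339",                                # Mib/herc2 domain-like
    4: "11-215,512-638,891-976",                   # ARM repeat
    5: "1091-1243",                                # Galactose-binding domain-like
}

SEQUENCE_LENGTH = 2613  # "Sequence length" reported in this record

if __name__ == "__main__":
    total = 0
    for number, region in REGIONS.items():
        n = covered_residues(region)
        total += n
        print(f"Domain {number}: {n} residues in {len(parse_regions(region))} segment(s)")
    print(f"Assigned residues: {total} of {SEQUENCE_LENGTH}")
```

Under these assumptions the five domains cover 350, 123, 69, 418 and 153 residues respectively, or 1113 of the 2613 residues in the sequence.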
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s): 9796.ENSECAP00000021245
Sequence length: 2613
Comment: (Equus caballus)
Sequence:
MADVDPDTLLEWLQMGQGDERDMQLIALEQLCMLLLMSDNVDRCFETCPPRTFLPALCKI
FLDESAPDNVLEVTARAITYYLDVSAECTRRIVGVDGAIKALCNRLVVVELNNRTSRDLA
EQCVKVLELICTRESGAVFEAGGLNCVLTFIRDSGHLVHKDTLHSAMAVVSRLCGKMEPQ
DSSLEICVESLSSLLKHEDHQVSDGALRCFASLADRFTRRGVDPAPLAKHGLTEELLSRM
AAAGGTVSGPSSACKPGRGTTGAPSTAADSKLSNQVSTIVSLLSTLCRGSPVVTHDLLRS
ELPDSIESALQGDERCVLDTMRLVDLLLVLLFEGRKALQKSSAGSTGRIPGLRRLDSSGE
RSHRQLIDCIRSKDTDALIDAIDTGAFEVNFMDDVGQTLLNWASAFGTQEMVEFLCERGA
DVNRGQRSSSLHYAACFGRPQVAKTLLRHGANPDLRDEDGKTPLDKARERGHSEVVAILQ
SPGDWMCPVNKGDDKKKKDTNKDEEECNEPKGDPEMAPIYLKRLLPVFAQTFQQTMLPSI
RKASLALIRKMIHFCSEALLKEVCDSDVGHNLPTILVEITATVLDQEDDDDGHLLALQII
RDLVDKGGDIFLDQLARLGVISKVSTLAGPSSDDENEEESKPEKEDEPQEDAKELQQGKP
YHWRDWSIIRGRDCLYIWSDAAALELSNGSNGWFRFILDGKLATMYSSGSPEGGSDSSES
RSEFLEKLQRARGQVKPSTSSQPILSAPGPTKLTVGNWSLTCLKEGEIAIHNSDGQQATI
LKEDLPGFVFESNRGTKHSFTAETSLGSEFVTGWTGKRGRKLKSKLEKTKQKVRTMARDL
YDDHFKAVESMPRGVVVTLRNIATQLESSWELHTNRQCIESENTWRDLMKTALENLIVLL
KDENTISPYEMCSSGLVQALLTVLNNVSLCNSTEQSIYNEFRKFIINVFKTAFSENEDEE
SRPAVALIRKLIAVLESIERLPLHLYDTPGSTYNLQILTRRLRFRLERAPGETALIDRTG
RMLKMEPLATVESLEQYLLKMVAKQWYDFDRSSFVFVRKLREGQNFIFRHQHDFDENGII
YWIGTNAKTAYEWVNPAAYGLVVVTSSEGRNLPYGRLEDILSRDNSALNCHSNDDKNAWF
AIDLGLWVIPSAYTLRHARGYGRSALRNWVFQVSKDGQNWTPLYTHVDDCSLNEPGSTAT
WPLDPPKDEKQGWRHVRIKQMGKNASGQTHYLSLSGFELYGTVNGVCEDQLGKAAKEAEA
NLRRQRRLVRSQVLKYMVPGARVIRGLDWKWRDQDGSPQGEGTVTGELHNGWIDVTWDAG
GSNSYRMGAEGKFDLKLAPGYDPDTVASPKPVSSTVSGTTQSWSSLVKNNCPDKTSAAAG
SSSRKGSSSSVCSVASSSDISLGSTKTERRSEIVMEHSIVSGADVHEPIVVLSSAENVPQ
TEVGSSSSASTSTLTAETGSENAERKLGPDSSVRTPGESSAISMGIVSVSSPDVSSVSEL
TNKEAASQRPLSSSASNRLSVSSLLAAGAPMSSSASVPNLSSRETSSLESFVRRVANIAR
TNATNNMNLSRSSSDNNTNTLGRNVMSTATSPLMGAQSFPNLTTPGTTSTVTMSTSSVTS
SSNVATATTVLSVGQSLSNTLTTSLTSTSSESDTGQEAEYSLYDFLDSCRASTLLAELDD
DEDLPEPDEEDDENEDDNQEDQEYEEVMILRRPSLQRRAGSRSDVTHHAVTSQLPQVPAG
AGSRPIGEQEEEEYETKGGRRRTWDDDYVLKRQFSALVPAFDPRPGRTNVQQTTDLEIPP
PGTPHSELLEEVECTPSPRLALTLKVTGLGTTREVELPLTNFRSTIFYYVQKLLQLSCNG
NVKSDKLRRIWEPTYTIMYREMKDSDKEKENGKMGCWSIEHVEQYLGTDELPKNDLITYL
QKNADAAFLRHWKLTGTNKSIRKNRNCSQLIAAYKDFCEHGTKSGLNQGAISTLQSSDIL
SLTKEQPQAKAGNGQNSCGVEDVLQLLRILYIVASDPYSRISQEEGDEQPQFTFPPDEFT
SKKITTKILQQIEEPLALASGALPDWCEQLTSKCPFLIPFETRQLYFTCTAFGASRAIVW
LQNRREATVERTRTTSSVRRDDPGEFRVGRLKHERVKVPRGESLMEWAENVMQIHADRKS
VLEVEFLGEEGTGLGPTLEFYALVAAEFQRTDLGAWLCDDNFPDDESRHVDLGGGLKPPG
YYVQRSCGLFTAPFPQDSDELERITKLFHFLGIFLAKCIQDNRLVDLPISKPFFKLMCMG
DIKSNMSKLIYESRGDRDLHCTESQSEASTEEGHDSLSVGSFEEDSKSEFILDPPKPKPP
AWFNGILTWEDFELVNPHRARFLKEIKDLAIKRRQILSNKGLSEDEKNTKLQELVLKNPS
GSGPPLSIEDLGLNFQFCPSSRIYGFTAVDLKPSGEDEMITMDNAEEYVDLMFDFCMHTG
IQKQMEAFRDGFNKVFPMEKLSSFSHEEVQMILCGNQSPSWAAEDIINYTEPKLGYTRDS
PGFLRFVRVLCGMSSDERKAFLQFTTGCSTLPPGGLANLHPRLTVVRKVDATDASYPSVN
TCVHYLKLPEYSSEEIMRERLLAATMEKGFHLN
Identical sequences: F6YN41, ENSECAP00000021245, 9796.ENSECAP00000021245
