SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSONIP00000020566 from Oreochromis niloticus 76_1.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSONIP00000020566
Domain Number 1 Region: 1805-1963
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 7.27e-53
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00000369
Further Details:      
 
Domain Number 2 Region: 352-535
Classification Level Classification E-value
Superfamily Cupredoxins 9.75e-49
Family Multidomain cupredoxins 0.0000111
Further Details:      
 
Domain Number 3 Region: 1650-1804
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 2.21e-46
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0000369
Further Details:      
 
Domain Number 4 Region: 1323-1501
Classification Level Classification E-value
Superfamily Cupredoxins 1.2e-43
Family Multidomain cupredoxins 0.0000378
Further Details:      
 
Domain Number 5 Region: 31-201
Classification Level Classification E-value
Superfamily Cupredoxins 1.62e-43
Family Multidomain cupredoxins 0.0000158
Further Details:      
 
Domain Number 6 Region: 1508-1650
Classification Level Classification E-value
Superfamily Cupredoxins 1.65e-34
Family Multidomain cupredoxins 0.0000301
Further Details:      
 
Domain Number 7 Region: 542-679
Classification Level Classification E-value
Superfamily Cupredoxins 4.44e-32
Family Multidomain cupredoxins 0.00019
Further Details:      
 
Domain Number 8 Region: 205-333
Classification Level Classification E-value
Superfamily Cupredoxins 5.21e-26
Family Multidomain cupredoxins 0.000055
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSONIP00000020566   Gene: ENSONIG00000016333   Transcript: ENSONIT00000020584
Sequence length 1964
Comment pep:known_by_projection scaffold:Orenil1.0:GL831210.1:2728799:2745437:1 gene:ENSONIG00000016333 transcript:ENSONIT00000020584 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MRLRVRAGARWLLPVLVLLTAHQVKANQPQPKERHYYIAAVEIDWKYSGNDTDGSGPTYK
KVVFREYEKGFRQAKTHPSWLGLLGPTLRAEEGETIVVTFRNMATSPYSISPHGVAYGKQ
SEGAYYFDNTSQKEKEDDKVLPNSEHVYYWEVTSDVAPQKNDPTCLTYTYISHQNVVKDY
NSGLIGTLLICKQGSLDESGQQAGIHHEYVFLFGVFDENKSKYEPSGYTSDSHVKYTING
YTNGSLPDVSMCAYAPVRLHLVGMSSEPEVFSVHMNGQVLEQAGHKVSSVGLITGSSVTA
SMVAVHTGRWLLSSHTVKHIEASMHAFVDVINCEGFEKPHRWVTIAQKRQSREWKYYIAA
EEIVWDYAPTKQDHIDEDFKRQYLSQSPTRIGGKYKKAVYTQYKDETFTERSEHLQRKNE
LGILGPVIRAQIRDIITIVFKNMASRPYSIYPHGLTIEKSQEGVNYPKGGNHSHGVQPGE
THTYVWKVVEEDEPLEGDSRCLTRLYHSAVNTPRDIASGLIGPMLICKSQSLNVRNVQLK
ADKEQHAMFAVFDENKSWYLEDNIRQYCERSKVNREDPEFYKSNIKHTVNGYIFKNDPPL
GFCSGEVATWHVSSIGAQDYIQTATFYGHTFELNDRTEDFLSLYPMTGETITMNMLNTGV
WLLASLNSHETTNGMRVKFRDVECFRDYQYEYEDSVPKNEFTVWKPPSLDDIKKEEEKAD
PVKNPSLEPDVYTEMFAEVLNLRSHKNQSADSDMEKLDLSFLDYDVVDVLEKDMNSTLNF
TEIKSKPETSTTNPDALTEMWFLILKVLNITLGNLQNQSTSENTTHILNSSTPLTQSVLD
NSTLYQIQNLTGLNSHNVSQKDSSTTRNGSLSSETTRPNVALQEPTNLTAAPSGNSSNSL
ERDRINVSLTTDNHTSIEEVVQDSEEKYTRGDVFSYSVPLSKPSINNFNSSLNSNLSADT
LLENTKREDENNTAVKYGNTSAERTNHSLSVLEVDVEEVSSNIKRHNLTILELTESTDYK
NSDNNTFLSSGELEYGLQINLEVVTTPTNSSYKNVTQILLENEQNVTGDASRSALSVMSS
MGREENISSLFDKLTNTSLESLSNQTTVSVNQSHSSEELGFSESSEEVIIFLKENNTEAI
KTSLVKIQGHNWTYEGTYQMIPEELPDQLKKHFEKETPQTTLPPKKKVRVVNRRKRPEKG
HGMKTRKRKEYKPQARSGLPFSPRGFNPGMTPRGSRPHSPKPVSTEDHVIDMPVVIGVPR
PDFSDYELYIPGGEPDHLRLEEQDFKADEYEYVMYKDPYSDVDDIKNLDLDETTKYYLKM
SGPNVKTYFIAAEEVEWDYAGYGHRRQEKHELLSQETKFTKVVFRGYLDSSFTTPDIRGE
IDEHLGILGPVIKAEVGQSIMVVFRNNANRPYSIHPNGVSYTKRTEGLSYEDDSHYWFKY
DNEVNPNNTFTYLWKVGHTVGPKPEESDCRTWAYYSGVNPERDIHSGLIGPLLVCREGTL
DTKLTDRREFTLLFMTFDESRSWYYEKNSEIMQRKRRRRIMDHNFKENLKFHSINGIIHS
LKGLRMYTNQLVTWHLINLGSPNDIHSVHFHGQTFIHKKTTSYRQAVYPLLPGGFATLEM
YPSKPGLWQLETEVGLSQQKGMQTLFLVLDNDCCRPLGLESGSVKDGEITAINTRGYWEP
HLARLNNQGKYNAWSTDKNSSFIQVDFQRPVVISQVATQGAKQMFYSQYVVKYLISYSND
RRKWIFYKGDSKGFRKVFTGNQEAYETKTNIFFPPVIGRFIRLHPIEWYNMATVRMEYFG
CELDGCSVPLGMESEAIGDIYITASSTATSWYAGPWKPSLARLNRQGAVNAWRAKYNNMD
QWLQVELPQVKKITGIITQGAKSLGKEMYVMSYILQYSDNGIEWKEYTDSEDEPARIFMG
NTNNNDHARNYIYPPIFSRFIRVIPKSWMTSITMRIELLGCDFE
Download sequence
Identical sequences I3KHH1
ENSONIP00000020566 ENSONIP00000020566 XP_005475290.1.78416 XP_019208521.1.78416

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]