SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000010440 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000010440
Domain Number 1 Region: 2011-2169
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 4.68e-56
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0000000338
Further Details:      
 
Domain Number 2 Region: 1850-2010
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 3.02e-51
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.000000224
Further Details:      
 
Domain Number 3 Region: 1524-1706
Classification Level Classification E-value
Superfamily Cupredoxins 2.89e-50
Family Multidomain cupredoxins 0.0000259
Further Details:      
 
Domain Number 4 Region: 349-534
Classification Level Classification E-value
Superfamily Cupredoxins 6.19e-46
Family Multidomain cupredoxins 0.0000573
Further Details:      
 
Domain Number 5 Region: 31-204
Classification Level Classification E-value
Superfamily Cupredoxins 3.81e-45
Family Multidomain cupredoxins 0.000000104
Further Details:      
 
Domain Number 6 Region: 541-667
Classification Level Classification E-value
Superfamily Cupredoxins 1.89e-36
Family Multidomain cupredoxins 0.00024
Further Details:      
 
Domain Number 7 Region: 1713-1850
Classification Level Classification E-value
Superfamily Cupredoxins 2.01e-32
Family Multidomain cupredoxins 0.00000158
Further Details:      
 
Domain Number 8 Region: 207-329
Classification Level Classification E-value
Superfamily Cupredoxins 2.38e-31
Family Multidomain cupredoxins 0.0000014
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000010440   Gene: ENSGGOG00000010691   Transcript: ENSGGOT00000010747
Sequence length 2170
Comment pep:known_by_projection chromosome:gorGor3.1:1:148716295:148791478:-1 gene:ENSGGOG00000010691 transcript:ENSGGOT00000010747 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MFPGCPRLWVLVVLGTSWVGWGSQGTEAAQLRQFYVAAQGISWSYRPEPTNSSLNLSVTS
FKKIVYREYEPYFKKEKPQSTISGLLGPTLYAEVGDIIKVHFKNKADKPLSIHPQGIRYS
KLSEGASYLDHTFPAEKMDDAVAPGREYTYEWSISEDSGPTHDDPPCLTHIYYSHENLIG
DFNSGLIGPLLICKKGTLTEDGTQKTFDKQIVLLFAVFDESKSWSQSSSLMYTVNGYVNG
TMPDVTVCAHDHISWHLLGMSSGPELFSIHFNGQVLEQNHHKVSAITLVSATSTTANMTV
GPEGKWIISSLTPKHLQAGMQAYIDIKNCPKKTRNLKKITREQRRHMKRWEYFIAAEEVI
WDYAPVIPANMDKKYRSQHLDNFSNQIGKHYKKVMYTQYEDESFTKHTVNPNMKEDGILG
PIIRAQVRDTLKIVFKNMASRPYSIYPHGVTFSPYEDEVNSSFTSGRNNTMIRAVQPGET
YTYKWNILEFDEPTENDAQCLTRPYYSDVDIVRDIASGLIGLLLICKSRSLDRRGIQRAA
DIEQQAVFAVFDENKSWYLEDNINKFCENPDEVKRDDPKFYESNIMSTINGYVPESITTL
GFCFDDTVQWHFCSVGTQNEILTIHFTGHSFIYGKRHEDTLTLFPMRGESVTVTMDNVGT
WMLTSMNSSPRSKKLRLKFRDVKCIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEES
DADYDYQNRLAAALGIRSFRNSSLNQEEEEFNLTALALENGTEFVSSNTDIIVGSNYSSP
SNISKFTVNSLAEPQKAPSHQQATTAGSPLRHLIGKNSVLNSSTAEHSSPYSEDPIEDPL
QPDVTGIRLLSLGAGEFKSQEHAKHKGPKVERDQAAKHRFSWMKLLAHKVGRHLSQDTGS
PSGMKPWEDLPSQDTGSPSRMRPWEDPPSDLLLLKQNNPSKILVGRWHLASEKGSYEIIQ
DTDEDTAVNNWLISPQNASRAWGESTPLANKPGKQSGHPKFPRVRHKSLQVRQDGGKSRL
KKSQFLIKTRKKKKEKHTHHAPLSPRTFHPLRSEAYNTFPERRLKHSLVLHKSNETSLPT
DLNQTLPCMDFGWIASLPDHNQNSSNDTGQTSCPPGLYQTVPPEEHYQTFPIQDPDQMHS
TSDPSHISSSPELSEMLEYDRSHKSFPTDISQMSPSSEHEVWQTITSPDLSQVTLSPELS
QTNPSPDLSHTTLSPELIQTNLSPALGQMPISPDLSHTTLSPDLSHTTLSPDLSHTTLSP
DLSHTTLSPDLSHTTLSLDFSQTNLSPELSQTNISPALGQMPLSPDPSHTTLSLDHSQTN
LSPELSQTNLSPDLSEMPLFADLSQIPLTPDLDQMTLSPDLGETDLSPNFGQMSLSPDLS
QVTLSPDISDTTFLPDLSQISPPPDLDQIFYPSESSQSLLLQEFNESFPYPDLGQMPSPS
SPTLNDTFLSKEFNPLVIVGLSKDGTDYIEIIPKEEVQSSEDDYAEIDYVPYDDPYKTDV
RTNINSSRDPDNIAAWYLRSNNGNRRNYYIAAEEISWDYSEFVQRETDIEDSDDIPEDTT
YKKVVFRKYVDSTFTKRDPRGEYEEHLGILGPIIRAEVDDVIQVRFKNLASRPYSLHAHG
LSYEKSSEGKTYEDDSPEWFKEDNAVQPNSTYTYVWHATERSGPESPGSACRAWAYYSAV
NPEKDIHSGLIGPLLICQKGILHKDSNMPVDMREFVLLFMTFDEKKSWYYEKKSRSSWRL
TSSEMKKSHEFHAINGMIYSLPGLRMYEQEWVRLHLLNIGGSQDIHVVHFHGQTLLENGN
KQHQLGVWPLLPGSFKTLEMKASKPGWWLLNTEVGENQRAGMQTPFLIMDRDCRMPMGLS
TGIISDLQMKASEFLGYWEPRLARLNNGGSYNAWSVEKLAAEFASKPWIQVDMQKEVIIT
GIQTQGAKHYLKSCYTTEFYVAYSSNQINWQIFKGNSTRNVMYFNGNSDASTIKENWFDP
PIVARYIRISPTRAYNRPTLRLELQGCEVNGCSTPLGMENGKIENKQITASSFKKSWWGD
YWEPFRARLNAQGRVNAWQAKANNNKQWLEIDLLKIKKITAIITQGCKSLSSEMYVKSYT
IHYSDQGVEWKPYRLKSSMVDKIFEGNTNTKGHVKNFFNPPIISRFIRVIPKTWNQSIAL
RLELFGCDIY
Download sequence
Identical sequences ENSGGOP00000010440 ENSGGOP00000010440

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]