SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSPTRP00000039644 from Pan troglodytes 76_2.1.4

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSPTRP00000039644
Domain Number 1 Region: 1867-2025
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 4.25e-56
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0000000338
Further Details:      
 
Domain Number 2 Region: 1706-1866
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.62e-52
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.000000192
Further Details:      
 
Domain Number 3 Region: 1380-1562
Classification Level Classification E-value
Superfamily Cupredoxins 5.6e-50
Family Multidomain cupredoxins 0.0000264
Further Details:      
 
Domain Number 4 Region: 349-534
Classification Level Classification E-value
Superfamily Cupredoxins 5.96e-46
Family Multidomain cupredoxins 0.0000573
Further Details:      
 
Domain Number 5 Region: 31-201
Classification Level Classification E-value
Superfamily Cupredoxins 1.75e-45
Family Multidomain cupredoxins 0.000000125
Further Details:      
 
Domain Number 6 Region: 1569-1706
Classification Level Classification E-value
Superfamily Cupredoxins 1.84e-32
Family Multidomain cupredoxins 0.00000158
Further Details:      
 
Domain Number 7 Region: 207-329
Classification Level Classification E-value
Superfamily Cupredoxins 4.58e-32
Family Multidomain cupredoxins 0.00000138
Further Details:      
 
Domain Number 8 Region: 542-589
Classification Level Classification E-value
Superfamily Cupredoxins 0.00000000162
Family Multidomain cupredoxins 0.0013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSPTRP00000039644   Gene: ENSPTRG00000001659   Transcript: ENSPTRT00000046325
Sequence length 2026
Comment pep:known_by_projection chromosome:CHIMP2.1.4:1:147905991:147984568:-1 gene:ENSPTRG00000001659 transcript:ENSPTRT00000046325 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MFPGCPRLWVLVVLGTSWVGWGSQGTEAAQLRQFYVAAQGISWSYRPEPTNSSLNLSVTS
FKKIVYREYEPYFKKEKPQSTISGLLGPTLYAEVGDIIKVHFKNKADKPLSIHPQGIRYS
KLSEGASYLDHTFPAEKMDDAVAPGREYTYEWSISEDSGPTHDDPPCLTHIYYSHENLIE
DFNSGLIGPLLICKKGTLTEGGTQKTFDKQIVLLFAVFDESKSWSQSSSLMYTVNGYVNG
TMPDITVCAHDHISWHLLGMSSGPELFSIHFNGQVLEQNHHKVSAITLVSATSTTANMTV
GPEGKWIISSLTPKHLQAGMQAYIDIKNCPKKTRNLKKITREQRRHMKRWEYFIAAEEVI
WDYAPVIPANMDKKYRSQHLDNFSNQIGKHYKKVMYTQYEDESFTKHTVNPNMKEDGILG
PIIRAQVRDTLKIVFKNMASRPYSIYPHGVTFSPYEDEVNSSFTSGRNNTMIRAVQPGET
YTYKWNILEFDEPTENDAQCLTRPYYSDVDIMRDIASGLIGLLLICKSRSLDRRGIQRAA
DIEQQAVFAVFDENKSWYLEDNINKFCENPDEVKRDDPKFYESNIMSRTWMLTSMNSSPR
SKKLRLKFRDVKCIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEESDADYDYQNRLA
AALGIRSFRNSSLNQEEEEFNLTALALENGTEFVSSNTDIIVGSNYSSPSNISKFTVNNL
AEPQKAPSHQQATTAGSPLRHLIGKNSVLNSSTAEHSSPYSEDPIEDPLQPDVTGIRLLS
LGAGEFKSQEHAKHKGPKVERDQAAKHRFSWMKLLAHKVGRHLSQDTGSPSRMKPWEDLP
SQDTGSPSRMRPWEDPPSDLLLLKQNNPSKILVGRWHLASEKGSYEIIQDTDEDTAVNNW
LISPQNASRAWGESTPLANKLGKQSGHPKFPRVRHKSLQVRQDGGKSRLKKSQFLIKTRK
KKEKHTHHAPLSPRTFHPLRSEAYNTFSERRLKHSLMLHKSNETSLPTDLNQTLPSMDFG
WIASLPDHNQNSSNDTGQTSCPPGLYQTVPPEEHYQTFPIQDPDQMHSTSDPSHISSSPE
LSEMLEYDRSHKSFPTDISQMSPSSEHEVWQAVTSPDLSQVTLSPDLSQTNPSPDLSHTT
LSPELSQTNISPALGQMPLSPDPSHTTLSLDHSQTNLSPELSQTNLSPDLSEMPLFADLS
QIPLTPDLDQMTLSPDLGETDLSPNFGRMSLSPDLSQVTLSPDISDTTLLPDLSQISPPP
DLDQIFYPSESSQSLLLQEFNESFPYPDLGQMPSPSSPTLNDTFLSKEFNPLVIVGLSKD
GTDYIEIIPKEEVQSSEDDYAEIDYVPYDDPYKTDVRTNINSSRDPDNIAAWYLRSNNGN
RRNYYIAAEEISWDYSEFVQRETDIEDSDDIPEDTTYKKVVFRKYLDSTFTKRDPRGEYE
EHLGILGPIIRAEVDDVIQVRFKNLASRPYSLHAHGLSYEKSSEGKTYEDDSPEWFKEDN
AVQPNSSYTYVWHATERSGPESPGSACRAWAYYSAVNPEKDIHSGLIGPLLICQKGILHK
DSNMPVDMREFVLLFMTFDEKKSWYYEKKSRSSWRLTSSEMKKSHEFHAINGMIYSLPGL
RMYEQEWVRLHLLNIGGSQDIHVVHFHGQTLLENGNKQHQLGVWPLLPGSFKTLEMKASK
PGWWLLNTEVGENQRAGMQTPFLIMDRDCRTPMGLSTGIISDSQIKASEFLGYWEPRLAR
LNNGGSYNAWSVEKLAAEFASKPWIQVDMQKEVIITGIQTQGAKHYLKSCYTTEFYVAYS
SNQINWQIFKGNSTRNVMYFNGNSDASAIKENQFDPPIVARYIRISPTRAYNRPTLRLEL
QGCEVNGCSTPLGMENGKIENKQITASSFKKSWWGDYWEPFRARLNAQGRVNAWQAKANN
NKQWLEIDLLKIKKITAIITQGCKSLSSEMYVKSYTIHYSDQGVEWKPYRLKSSMVDKIF
EGNTNTKGHVKNFFNPPIISRFIRVIPKTWNQSIALRLELFGCDIY
Download sequence
Identical sequences ENSPTRP00000039644 ENSPTRP00000039644

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]