SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000005495 from Gorilla gorilla 69_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000005495
Domain Number 1 Region: 2224-2725
Classification Level Classification E-value
Superfamily alpha/beta-Hydrolases 2.32e-95
Family Acetylcholinesterase-like 0.0000323
Further Details:      
 
Domain Number 2 Region: 720-782,896-926
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 1.92e-19
Family Thyroglobulin type-1 domain 0.0064
Further Details:      
 
Domain Number 3 Region: 984-1078
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 2.49e-19
Family Thyroglobulin type-1 domain 0.0016
Further Details:      
 
Domain Number 4 Region: 295-360
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 2.88e-18
Family Thyroglobulin type-1 domain 0.0013
Further Details:      
 
Domain Number 5 Region: 92-165
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 1.18e-17
Family Thyroglobulin type-1 domain 0.002
Further Details:      
 
Domain Number 6 Region: 1142-1214
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 1.19e-16
Family Thyroglobulin type-1 domain 0.0014
Further Details:      
 
Domain Number 7 Region: 23-92
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 0.00000000000000102
Family Thyroglobulin type-1 domain 0.0029
Further Details:      
 
Domain Number 8 Region: 612-662
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 0.00000000000000615
Family Thyroglobulin type-1 domain 0.0011
Further Details:      
 
Domain Number 9 Region: 656-726
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 0.0000000000000392
Family Thyroglobulin type-1 domain 0.0016
Further Details:      
 
Domain Number 10 Region: 1089-1148
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 0.000000112
Family Thyroglobulin type-1 domain 0.0041
Further Details:      
 
Domain Number 11 Region: 163-195,230-248
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 0.000000536
Family Thyroglobulin type-1 domain 0.0073
Further Details:      
 
Domain Number 12 Region: 1513-1559
Classification Level Classification E-value
Superfamily Thyroglobulin type-1 domain 0.00000101
Family Thyroglobulin type-1 domain 0.0053
Further Details:      
 
Domain Number 13 Region: 1441-1527
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.00000286
Family Growth factor receptor domain 0.0089
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000005495   Gene: ENSGGOG00000005600   Transcript: ENSGGOT00000005639
Sequence length 2771
Comment pep:novel chromosome:gorGor3.1:8:132748867:133017590:1 gene:ENSGGOG00000005600 transcript:ENSGGOT00000005639 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MALVLEIFTLLASICWVSANIFEYQVDAQPLRPCELQRETAFLKQADYVPQCAEDGSFQT
VQCQNDGHSCWCVGADGSEVLGSRQPGRPVACLSFCQLQKQQILLSGYINSTDTSYLPQC
QDSGDYTPVQCDVQQVQCWCVDAEGMEVYGTRQLGRPKRCPRSCEIRNRRLLHGVGDKSP
PQCSAEGEFMPVQCKFVNTTDMMIFDLVHSYNRFPDAFVTFSSFQRRFPEVSGYCHCADS
QGRELAETGLELLLDEIYDTIFAGLDLPSTFTETTLYRILQRRFLAVQSVISGRFRCPTK
CEVERFTATSFGHPYVPSCRRNGDYQAVQCQTEGPCWCVDAQGKEMHGTRQQGEPPSCAE
GQSCAAKRQQALSRLYFGTSGYFSQHDLFSSPEKRWASPRVARFATSCPPTIKELFVDSG
LLRPMVEGQSQQFSVSENLLKEAIRAIFPSRGLARLALQFTTNPKRLQQNLFGGKFLVNV
GQFNLSGALGTRGTFNFSQFFQQLGLASFLNGGRQEDLAKPFSVGLDSNSSTGTPEAAKK
DGTMNKPTVGSFGFEINLQENQNALKFLASLLELPEFLLFLQLAISVPEDVARDLGDVME
TVFSSQTCEQTPERLFVPSCTTEGSYEDVQCFAGECWCVNSWGKQLPGSRVRGGQPRCPT
DCEKQRARMQSLMGSQPAGSTLFVPACTSEGHFLPVQCFNSECYCVDAEGQAIPGTRSAI
GKPKKCPTPCQLQAEQAFLRTVQALLSNSSMLPTLSDTYIPQCSTDGQWRQVQCDGPPEQ
VFELYQRWEAQNKGQDLTPAKLLVKIMSYREAASGNFSLFIQSLYEAGQQDIFPVLSQYP
SLQDVPLAALEGKRPQPRENILLEPYLFWQILNGQLSQYPGSYSDFSTPLAHFDLRNCWC
VDEAGQELEGTRAEPSKLPTCPGSCEEAKLRVLQFIRETEEIVSASNSSRFPLGESFLVA
KGIRLRNEDLGLPPLFPPREAFAEQFLRGSDYAIRLAAQSTLSFYQRRRFPPDDSAGASA
LLRLGPYVPQCDVFGSWEPVQCHAGTGHCWCVDEKGGLIPASLTARSLQIPQCPTTCEKS
RTSGLLSSWKQARSQENPSPKDLFVPACLETGEYARLQASGAGTWCVDPASGEELRPGLN
SSAQCPSLCNVLKSGVLSRRVSPGYVPACRAEDGGFSPVQCDQAQGSCWCVMDSGEEVPG
TRVAGGQPACESPRCPLPFNASEVVGGTILCETTSGPTGSAIQQCQLLCRQGSWSVFPPG
PLICSLESGRWESQPPQPWACQRPQLWQTIQTQGHFQLQLPPGKMCSADYTGLLQTFQVF
ILDELTARGFCQIQVKTFGTLVSIPVCNNSSVQVGCLTRERLGVNVTWKSRLEDIPVASL
PDLHDIERALVGKDLLGRFTDLIQSGSFQLHLDSKTFPAETTIRFLQGDHFGTSPRTWFG
CSEGFYQVLTSEASQDGLGCVKCPEGSYSQDEECIPCPVGFYQEQAGSLACVPCPVGRMT
ISAGAFSQTHCVTDCQRNEAGLQCDQNGQYRASQKDRGSGKAFCVDGEGRRLPWWETEAP
LEDSQCLMMQKFEKVPESKVIFDANAPVAVRSKVPDSEFPVMKCLTAMSQWTSKNYFLIP
VAFPRLSAGQYKISQDNNAASFHLQKQDALGNSKATSFGSLRCQVKVRSRGQDSPAVYLK
KGQGSTTTLQKSFEPTGFQNMLSGLYNPIVFSASGANLTDAHLFCLLACDRDLCCDGFVL
TQVQGGAIICGLLSSPSVLLCNVKDWMDPSEAWANATCPGVTYDQESHQVILHLGGQEFI
KSLTPLEGTQDTFTNFQQVYLWKDSDMGSRPESVGCRKDTVPRPASPTEAGLTTELFSPV
DLNQVIVNGNQSLSSQKHWLFKHLFSAQQANLWCLSRCVQEHSFCQLAEITESASLYFTC
TLYPEAQVCDDIMESNAQGCRLVLPQMPKALFRKKVILEDKVKNFYTRLPFQKLMGISIR
NKVSMSEKSISNGFFECERRCDADPCCTGFGFLNVSQLKGPGGEVTCLTLNSLGIQMCSE
ENGGAWRILDCGSPDIEVHTYPFGWYQKPIAQNNAPSFCPLVVLPSLTEKVSLDSWQSLA
LSSVVVDPSIRHFDVAHVSTAATSNFSAVRDLCLSECSQHEACLITTLQTQPGAVRCMFY
ADTQSCTHSLQGQDCRLLLREEATHIYRKPGISLLSYEASVPSVPISTHGRLLGRSQAIQ
VGTSWKQVDQFLGVPYAAPPLAERRFQEPEPLNWTGSWDASKPRASCWQPGTRTSTSPGV
SEDCLYLNVFIPQNVAPNASVLVFFHNTMDGEESEGWPAIDGSFLAAVGNLIVVTASYRV
GVFGFLSSGSGEVSGNWGLLDQVAALTWVQTHIRGFGGDPRRVSLAADHGGADVASIHLL
TARATNSQLFRRAVLMGGSALSPAAVISHERAQQQAIALAKEVSCPTSSSQEVVSCLRQK
PASVLNDAQTKLLAVSGPFHYWGPVIDGQFLREPPARALKRSLRVEVDLLIGSSQEDGLI
NRAKAVKQFEESQGRTSSKTAFYQALQNSLGGEDSDARVEAAATWYYSLEHSTDDYASFS
RALENATRDYFIICPIIDMASAWAKRARGNVFMYHVPESYGHGSLELLADVQFAFGLPFY
PAYEGQFSLEEKSLSLKIMQYFSHFIRSGNPNYPYEFSRKVPTFATPWPDFVPRAGGENY
KEFSALLPNRQGLKKADCSFWSKYISSLKASADGAKGGQSAESEEEELTAGSGLREDLLS
LQEPGSKSYSK
Download sequence
Identical sequences ENSGGOP00000005495 ENSGGOP00000024036

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]