SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSGGOP00000011070 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000011070
Domain Number 1 Region: 1172-1502
Classification Level Classification E-value
Superfamily NHL repeat 3.4e-26
Family NHL repeat 0.0016
 
Domain Number 2 Region: 1399-1645
Classification Level Classification E-value
Superfamily 3-carboxy-cis,cis-muconate lactonizing enzyme 0.0000000000863
Family 3-carboxy-cis,cis-muconate lactonizing enzyme 0.015
 
Domain Number 3 Region: 867-942
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 0.000000667
Family Pre-dockerin domain 0.073
 
Weak hits

Sequence:  ENSGGOP00000011070
Domain Number - Region: 715-740
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000481
Family EGF-type module 0.042
 
Domain Number - Region: 685-709
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000534
Family Integrin beta EGF-like domains 0.096
 
Domain Number - Region: 619-643
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000659
Family EGF-type module 0.04
 
Domain Number - Region: 551-578
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00209
Family Integrin beta EGF-like domains 0.029
 
Domain Number - Region: 742-781
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00209
Family EGF-type module 0.06
 
Domain Number - Region: 586-610
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00461
Family Integrin beta EGF-like domains 0.049
 
Domain Number - Region: 521-545
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00858
Family EGF-type module 0.031
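
Two of the strong-hit regions listed above (1172-1502 and 1399-1645) share part of their range. The following is a minimal sketch, not part of the server output, of how overlaps between assigned regions could be checked. It assumes the coordinates are 1-based and inclusive; the labels and the helper names `shared_residues` and `overlapping_pairs` are hypothetical and chosen only for illustration.

```python
from itertools import combinations

# (label, start, end) copied from the strong and weak hits listed above.
# Assumption: region coordinates are 1-based and inclusive.
REGIONS = [
    ("strong 1", 1172, 1502),
    ("strong 2", 1399, 1645),
    ("strong 3", 867, 942),
    ("weak 715-740", 715, 740),
    ("weak 685-709", 685, 709),
    ("weak 619-643", 619, 643),
    ("weak 551-578", 551, 578),
    ("weak 742-781", 742, 781),
    ("weak 586-610", 586, 610),
    ("weak 521-545", 521, 545),
]


def shared_residues(a, b):
    """Return how many positions two inclusive (label, start, end) regions share."""
    start = max(a[1], b[1])
    end = min(a[2], b[2])
    return max(0, end - start + 1)


def overlapping_pairs(regions):
    """Return (label_a, label_b, shared) for every pair of regions that overlap."""
    pairs = []
    for a, b in combinations(regions, 2):
        n = shared_residues(a, b)
        if n > 0:
            pairs.append((a[0], b[0], n))
    return pairs


if __name__ == "__main__":
    for first, second, n in overlapping_pairs(REGIONS):
        print(f"{first} and {second} share {n} residues")
```

Under these assumptions, only the first two strong hits overlap; none of the weak EGF/Laminin regions intersect one another.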
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000011070   Gene: ENSGGOG00000011344   Transcript: ENSGGOT00000011397
Sequence length 2699
Comment pep:known_by_projection chromosome:gorGor3.1:4:193087793:193589794:1 gene:ENSGGOG00000011344 transcript:ENSGGOT00000011397 gene_biotype:protein_coding transcript_biotype:protein_coding
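
The comment line packs several `key:value` fields into one string. As a rough sketch, it could be split as shown below. The interpretation of the location field as `assembly:name:start:end:strand` is an assumption based on the usual Ensembl FASTA header layout, and the function names `parse_comment` and `parse_location` are hypothetical.

```python
# Comment string copied from the record above.
COMMENT = (
    "pep:known_by_projection chromosome:gorGor3.1:4:193087793:193589794:1 "
    "gene:ENSGGOG00000011344 transcript:ENSGGOT00000011397 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split whitespace-separated tokens into a dict keyed by the text before the first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Split a location value, assumed to be assembly:name:start:end:strand."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    location = parse_location(fields["chromosome"])
    span = location["end"] - location["start"] + 1  # inclusive genomic span
    print(fields["gene"], location["assembly"], location["name"], span)
```

The span computed here is the genomic extent of the locus, not the length of the protein sequence.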
Sequence
MDVKERRPYCSLTKSRREKERRYTNSSADNEECRVPTQKSYSSSETLKAFDHDSSRLLYG
NRVKDLVHREADEFTRQGQNFTLRQLGVCEPATRRGLAFCAEMGLPHRGYSISAGSDADT
ENEAVMSPEHAMRLWGRGVKSGRSSCLSSRSNSALTLTDTEHENKSDSENETLGDHTSPS
TALPLPPSHQQHSAQHHPSITSLNRNSLTNRRNQSPAPPAALPAELQTTPESVQLQDSWV
LGSNVPLESRHFLFKTGTGTTPLFSTATPGYTMASGSVYSPPTRPLPRNTLSRSAFKFKK
SSKYCSWKCTALCAVGVSVLLAILLSYFIAMHLFGLNWQLQQTENDTFENGKVNSDTMPT
NTVSLPSGDNGKLGGFTQENNTIDSGELDIGRRAIQEIPPGIFWRSQLFIDQPQFLKFNI
SLQKDALIGVYGRKGLPPSHTQYDFVELLDGSRLIAREQRSLLETERAGRRARSVSLHEA
GFIQYLDSGIWHLAFYNDGKNAEQVSFNTIVIESVVECPRNCHGNGECVSGTCHCFPGFL
GPDCSRAACPVLCSGNGQYSKGRCLCFSGWKGTECDVPTTQCIDPQCGGRGICIMGSCAC
NSGYKGENCEEADCLDPGCSNHGVCIHGECHCSPGWGGSNCEILKTMCPDQCSGHGTYLQ
ESGSCTCDPNWTGPDCSNEICSVDCGSHGVCMGGTCRCEEGWTGPACDQRACHPRCAEHG
TCKDGKCECSQGWNGEHCTIEGCPGLCNSNGRCTLDQNGWHCVCQPGWRGAGCDVAMETL
CTDSKDNEGDGLIDCMDPDCCLQSSCQNQPYCRGLPDPQDIISQNLQSPSQQAAKSFYDR
ISFLIGSDSTHVIPGESPFNKSLASVIRGQVLTADGTPLIGVNVSFFRYPEYGYTITRQD
GMFDLVANGGASLTLVFERSPFLTQYHTVWIPWNVFYVMDTLVMKKEENDIPSCDLSGFV
RPNPIIVSSPLSTFFRSSPEDSPIIPETQVLHEETTIPGTDLKLSYLSSRAAGYKSVLKI
TMTQSIIPFNLMKVHLMVAVVGRLFQKWFPASPNLAYTFIWDKTDAYNQKVYGLSEAVVS
VGYEYESCLDLTLWEKRTAILQGYELDASNMGGWTLDKHHVLDVQNGILYKGNGENQFIS
QQPPVVSSIMGNGRRRSISCPSCNGQADGNKLLAPVALACGIDGSLYVGDFNYVRRIFPS
GNVTSVLELSSNPAHRYYLATDPVTGDLYVSDTNTRRIYRPKSLTGAKDLTKNAEVVAGT
GEQCLPFDEARCGDGGKAVEATLMSPKGMAVDKNGLIYFVDGTMIRKVDQNGIISTLLGS
NDLTSARPLTCDTSMHISQVRLEWPTDLAINPMDNSIYVLDNNVVLQITENRQVRIAAGR
PMHCQVPGVEYPVGKHAVQTTLESATAIAVSYSGVLYITETDEKKINRIRQVTTDGEISL
VAGIPSECDCKNDANCDCYQSGDGYAKDAKLSAPSSLAASPDGTLYIADLGNIRIRAVSK
NKPLLNSMNFYEVASPTDQELYIFDINGTHQYTVSLVTGDYLYNFSYSNDNDITAVTDSN
GNTLRIRRDPNRMPVRVVSPDNQVIWLTIGTNGCLKSMTAQGLELVLFTYHGNSGLLATK
SDETGWTTFFDYDSEGRLTNVTFPTGVVTNLHGDMDKAITVDIESSSREEDVSITSNLSS
IDSFYTMVQDQLRNSYQIGYDGSLRIIYASGLDSHYQTEPHILAGTANPTVAKRNMTLPG
ENGQNLVEWRFRKEQAQGKVNVFGRKLRVNGRNLLSVDFDRTTKTEKIYDDHRKFLLRIA
YDTSGHPTLWLPSSKLMAVNVTYSSTGQIASIQRGTTSEKVDYDGQGRIVSRVFADGKTW
SYTYLEKSMVLLLHSQRQYIFEYDMWDRLSAITMPSVARHTMQTIRSIGYYRNIYNPPES
NASIITDYNEEGLLLQTAFLGTSRRVLFKYRRQTRLSEILYDSTRVSFTYDETAGVLKTV
NLQSDGFICTIRYRQIGPLIDRQIFRFSEDGMVNARFDYSYDNSFRVTSMQGVINETPLP
IDLYQFDDISGKVEQFGKFGVIYYDINQIISTAVMTYTKHFDAHGRIKEIQYEIFRSLMY
WITIQYDNMGRVTKREIKIGPFANTTKYAYEYDVDGQLQTVYLNEKIMWRYNYDLNGNLH
LLNPSNSARLTPLRYDLRDRITRLGDVQYRLDEDGFLRQRGTEIFEYSSKGLLTRVYSKG
SGWTVIYRYDGLGRRVSSKTSLGQHLQFFYADLTYPTRITHVYNHSSSEITSLYYDLQGH
LFAMEISSGDEFYIASDNTGTPLAVFSSNGLMLKQIQYTAYGEIYFDSNIDFQLVIGFHG
GLYDPLTKLIHFGERDYDILAGRWTTPDIEIWKRIGKDPAPFNLYMFRNNNPASKIHDVK
DYITDVNSWLVTFGFHLHNAIPGFPVPKFDLTEPSYELVKSQQWDDTPPIFGVQQQVARQ
AKAFLSLGKMAEVQVSRRRAGGAQSWLWFATVKSLIGKGVMLAVSQGRVQTNVLNIANED
CIKVAAVLNNAFYLENLHFTIEGKDTHYFIKTTTPESDLGTLRLTSGRKALENGINVTVS
QSTTVVNGRTRRFADVEMQFGALALHVRYGMTLDEEKARILEQARQRALARAWAREQQRV
RDGEEGARLWTEGEKRQLLSAGKVQGYDGYYVLSVEQYPELADSANNIQFLRQSEIGRR
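
When copying a wrapped sequence like the one above, a simple consistency check is to join the lines and compare the total against the reported sequence length. The sketch below uses a short toy sequence rather than the full record, and the helper names `joined_length` and `matches_reported` are hypothetical.

```python
def joined_length(lines):
    """Return the residue count after stripping whitespace and joining wrapped lines."""
    return len("".join(line.strip() for line in lines))


def matches_reported(lines, reported_length):
    """Return True if the joined sequence has exactly the reported length."""
    return joined_length(lines) == reported_length


if __name__ == "__main__":
    # Toy example only: two wrapped lines of 10 and 4 residues.
    toy = ["MDVKERRPYC\n", "SLTK\n"]
    print(joined_length(toy), matches_reported(toy, 14))
```

For this record, the value to compare against would be the reported sequence length of 2699.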
Identical sequences: ENSGGOP00000011070
