SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000015984 from Gasterosteus aculeatus 69_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000015984
Domain Number 1 Region: 1291-1623
Classification Level Classification E-value
Superfamily NHL repeat 1.05e-24
Family NHL repeat 0.0021
Further Details:      
 
Domain Number 2 Region: 986-1063
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 0.000000961
Family Pre-dockerin domain 0.078
Further Details:      
 
Domain Number 3 Region: 1592-1794
Classification Level Classification E-value
Superfamily Calcium-dependent phosphotriesterase 0.0000347
Family SGL-like 0.0097
Further Details:      
 
Weak hits

Sequence:  ENSGACP00000015984
Domain Number - Region: 704-729
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000385
Family Integrin beta EGF-like domains 0.075
Further Details:      
 
Domain Number - Region: 834-859
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00105
Family EGF-type module 0.047
Further Details:      
 
Domain Number - Region: 804-828
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00502
Family Integrin beta EGF-like domains 0.083
Further Details:      
 
Domain Number - Region: 737-762
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00795
Family Integrin beta EGF-like domains 0.086
Further Details:      
 
Domain Number - Region: 670-697
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0193
Family Integrin beta EGF-like domains 0.057
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000015984   Gene: ENSGACG00000012069   Transcript: ENSGACT00000016015
Sequence length 2824
Comment pep:novel group:BROADS1:groupI:14585614:14684838:1 gene:ENSGACG00000012069 transcript:ENSGACT00000016015 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MEVKERRPYRSLTARQDTERRYTSSSADSEDGKPNPKSYSSSETLKAFDHDSRMAYGSRV
KEMVHHEVDEFNRQGADFSLRNLGFGEALPSHVATYRTDLGMPHREYSVSVGSDADTETD
GIMSPEHAVRLWGRSNTKSGRSSCLSSRANSNLTLTDTEHENTENGPPLHCSSASTSPVE
QLPYPPHSITANESQGGLLGNCAAQPAQDSDSEDEFGPNSFLVKTGSGNLYAPATATADD
GAFQNHSRLRTPPLPLSHSHSPNHQHHAASINSLNRGNYTPRSNPSPAPTDGSVPPEGPP
SGQDPGSTQDNWLLNSNIPLETRNIAKQAFLETLQDNLIEMDILASARQDASYNDGHFLF
KPGGTSPMYCTTSPGYPLTSSTVYSPPPRPLPRNTFSRPAFSLKKPYKHCNWKCAALSAI
LLSVTLLFLLAYFIAMHLFGLNWHLQPMQRQMYQLSEDNTSGVPFSTDLSLPPLGNTGLE
IPDRRGKDEGKLDSLFPDDSYIDMGEIDVGRKVAQQIPPGIFWRAQVFIDHPMYLKFNVS
LSKDALVGIYGRRGLPPSHTQFDFVELLDGRRLLAQDIHSLEGPQTMQRSLAPIVTHDTG
FIQYLDSGIWHLAIYNDGRDAETVSYLTTAIDSVDDCPSNCLGKGDCVAGPCHSFLGFKG
SACGGAACPVLCSGNGQYLKGRCMCHSGWKGSECDVPTNQCIDVTCSSHGTCIVGTCICN
PGYKGENCEEVDCLDPTCSGRGVCVQGECHCFVGWGGPGCESPRASCMDQCSGHGAFLAD
TGTCSCDPNWTGQDCSTEICAADCGGHGICVSGTCRCDDGWMGTGCDQRACHPRCNEHGT
CRDGKCECSPGWNGEHCTIEGCPGLCNGNGRCTLGNNGWYCVCQLGWRGTGCDTSMETAC
SDVKDNDGDGLVDCMDPDCCLQATCHTTSLCVGSPDPLDIIQETQMSSAQSNLQTFYDRV
YFLVGRDSTHVIPGANPFDGNHACVIRGQVVTSDGTPLVGVNISFINNPSYGYTITRQDG
SFDLVTNGGIAIALHFERAPFITQEHTLWLPWGRFFVMDTIVMRHEENDIPSCDLSSFTR
PGPVVSPAPLTAFAGSCSERRTVVPEIQALQEEVPIPGTDMKLGYLSSRTPGYKSILRVT
LTHSTIPFNLMKVHLMVAVEGRLFRKWFPAVPNLSYDFVWDKADVYSQKVYGLSEAFVSV
GFEYESCPDFILWEKRTAVLQGHETTASKLGGWSIDKHHALNIQSGILHMGNGENVFISQ
QPPVIGSVMGNGRRRSISCPSCNGLADGNKLLAPVALACGSDGSLYVGDFNYVRRIFTTG
NVTSVLELSNSPTHKYYLATSPVNGAVYLSDTSSRKVFKVKSMNVVKDVAKNLELVAGTG
DQCLPYDDARCGDGGKGVEATLTNPRGITVDKYGVIFFVDGTMIRRIDQNGIISTLLGFN
DLTSARPLSCDAVMDISQVRLEWPTDLAVSPMDNSLYVLDNNVVLQISENHQVRIVAGRP
MHCQVPGIDHFLMSKVAIHATLESANALAVSHNGVLYIAESDEKKINRVRQVSTNGEISL
VAGAPSGCDCKNDANCDCYFGDEGYAKDAKLNAPSSLAVCPDGELYIADLGNIRIRFVRK
NKPYLNPLSMYEVSSPIHDELYLFDSNGSHIFTQSLTTGDHLYNLTYTGAGDLTSITDKN
KNVVNIRRDTTGMPLWLMVPDGQTFWFTIGTNNALKTVAAQGQELAVMTYHGSSGLLATK
TNENGWTTFFEYDSYGRLTNVTYPTGRVSSYRTDGDSSVRIQTEGSNKEDITITTNLSAS
GAFYTLMQEQVRNSYFLGLDGSLRLVLANGMEVSLHTEPHSLAGTVNPTVSKRNVTLAID
NGLNLVEWRQRKEQARGQVTVYGRRLRVHNRNLLSLDFDRITRTEKVYDDHRKFTLRIHY
DHAGRPTLWAPSSRLNGVNVTYSPGGHVAGIQRGTMSVRMEYDQNGRISSQMFADGKSWS
YTYLEKSMVLLLHTQRQYIFEFDKNDRLSSVTMPNVARQTLETSRSIGYYRNTYRPPEGN
ASVLQDYSEEGQLLQTTYLGEGRRVIYKYGKLSKLIEILYDTTRIGFSYDEVAGMLKTVN
LQSEGFTCTIRYRQIGPLIDRQIFRFGEEGLVNARFDYVYDNSFRVTSMQAVINETPLPI
DLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRVKEVQYEIFRSLMYW
MMVQYDNMGRVVAKELKVGPYANTTRYTYEYDADGQLQVVSINDKALWRYSYDLNGNLHL
LSPGNTARLTPLRYDIRDRITRLGDVQYRMDEDGFLKQRGNDYFDYNSAGLLVKVYNKVS
GWSIRYRYDGLGRRVSSRSSTGHHLQFFYADLSSPTRVTHMYNHSSSEITSLYYDLQGHL
FAMELSSGDEFYVACDNIGTPLAVFSGSGLMIKQILHTAFGEVYLDTNPSLQLIIGYQGG
LYEPLSRLLHMGRRDYDVLAGRWTTPNHDVWKRLNSNHIVPFNLYMFKNNNPLSNNEEIK
CYMTDVNSWLVTFGFQLYNVIPGYRKPSTQAMEPSYELVRTQIKTQEWDSTKSLLGVQCE
VQRQLKAFVKLERFGQIYRSKSAGCPQTEDKKVFASGGSIFGKGVKFAIREGRISTDIIS
LANEDGRRMAAVLNDAFYLEDLHFTIAGMDTHNFVKLGSVEGDLSLIGMTVGRRTLETGV
NVTVSQVNTVLNGRTRRITDIQLQYGALCLNTRYGSSVDEEKARVLELARQRSVAQAWAR
ERQRLRDGEEGSRTWTEGEKQQLLGSGKVQGYDGYYVVSVDQYPQLADSVNNIHFMRQSE
MGRR
Download sequence
Identical sequences G3PEG0
69293.ENSGACP00000015984 ENSGACP00000015984 ENSGACP00000015984

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]