SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSTNIP00000003386 from Tetraodon nigroviridis 76_8

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSTNIP00000003386
Domain Number 1 Region: 1245-1317,1360-1524
Classification Level Classification E-value
Superfamily NHL repeat 0.00000000000759
Family NHL repeat 0.0098
Further Details:      
 
Domain Number 2 Region: 662-687
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000597
Family EGF-type module 0.052
Further Details:      
 
Weak hits

Sequence:  ENSTNIP00000003386
Domain Number - Region: 792-817
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00023
Family EGF-type module 0.049
Further Details:      
 
Domain Number - Region: 762-786
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00115
Family EGF-type module 0.049
Further Details:      
 
Domain Number - Region: 695-720
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00167
Family Integrin beta EGF-like domains 0.086
Further Details:      
 
Domain Number - Region: 628-655
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00471
Family Integrin beta EGF-like domains 0.057
Further Details:      
 
Domain Number - Region: 821-854
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0474
Family Integrin beta EGF-like domains 0.057
Further Details:      
 
Domain Number - Region: 729-755
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0963
Family Integrin beta EGF-like domains 0.034
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSTNIP00000003386   Gene: ENSTNIG00000019040   Transcript: ENSTNIT00000004189
Sequence length 2772
Comment pep:known_by_projection chromosome:TETRAODON8:16:5682465:5751760:-1 gene:ENSTNIG00000019040 transcript:ENSTNIT00000004189 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MEVKERRPYRSLTARQDTEHRYTSSSADSEDGKANPKSYSSSETLKAFDHDSRMAYGSRV
KEMVHHEVDEFSRQGVDFSLRNLGFGEALPSHVATYRSDMGLPHRDYSVSAGSDPDTETD
GIMSPEHAVRLWGRSNTKSGRSSCLSSRANSNLTLTDTEHENTENGRPGPPLHCSSASSS
PVEQLPYPPPSIAANESQGGLLGNSAAQPAQDSDSEDEFGPNSFLVKTGSGNLYTPATAA
AEEGAFQNHSRLRTPPLPLSHSHSPSHQHHAASINSLNRSNYTQRSNPSPAPTDSSVPPE
GPASQDSVSVQDNWLLNSNIPLETRQFLFKPGGTSPMYCTTSPGYPLTSSTVYSPPPRPL
PRNTFSRPAFSLKKPYKHCNWKCAALSAILISVMLLFLLAYFIAMHLFGLNWHLQPVQRQ
MYQLSEDNTSGLPFPTDLSVSPVGNTGLVIPERRDDSYIDMGEIDVGRKVTQQIPPGVFW
RSQVFIDHPMYVKFNVSLSKDALVGIYGRRGLPPSHTQQFDFVELLDGRRLLVQDIRGVE
GPAAMQRGLIPITTHDTGFIQYMDSGIWHLAIYNDGKETETVSFLTTATDSIDDCPSNCF
MNGDCIAGKCHCFLGFKGPDCGRAACPVLCSGNGQYLKGRCMCHSGWKGSECDVPTNQCI
DIACSNHGTCIVGTCICNPGYKGENCEEVDCMDPTCSGRGVCVQGECHCFVGWGGSGCES
PRASCMDQCSGHGAFLADTGTCSCDPNWTGHDCSTEICAADCGGHGVCVSGSCRCDDGWM
GSGCDQRACHPRCNEHGTCKDGKCECSPGWNGEHCTIEGCPGLCNGNGRCTLGNNGWYCV
CQLGWRGTGCDTSMETACTDVKDNDGDGLVDCMDPDCCLQATCHTTALCVGSPDPLDIIQ
ETQLSSTQSKLQTFYERVRFLVGRDSTHVIPGVNPFDGKVSSAVRQIVREDDSPSLKRFS
FLMFPFSLSLFIFNHFFDLVTNGGIAVALHFERAPFITQEHTLWLPWGRFFVMDTIVMRH
EENDIPSCDVSSFSRPSPLVSPASLTAFAGSCSERRSVVPEIQEEESANTQESDMKFSYL
SSRTAGYKSLLRVTLTHSTIPFNLMKVHLMVAVEGRLFRKWFPAAPNLSYDFVWDKADVY
SQKVYGLSEVFVSVGFEYESCPDFILWEKRAAVLQGYETTASKLGGWTVDKHHALNIQSG
ILHMGNGENVFISQQPPVIGSVMGNGRRRSISCPSCNGLADGNKLLAPVALACGSDGSLY
VGDFNYVRRIFTTGNVTSVLELRNKDFRPIGKKHSIYLNKYRKSVYCAETNTLELYTVLA
ELYSQADSGGELEEIVIGFAHPCLLLNTWWCQWRTPVDCLCDATAGITVDKYGVIFFVDG
TMIRRIDQNGIISTLLGFNDLTSARPLSCDAVMDISQVRLEWPTDLAVSPLDNSLYVLDN
NVVLQISENHQVRIVAGRPIHCQVPGIDHFIMSKVAIHATLESANALAVSHTGILYIAES
DEKKINRVRQVSTNGEIFLVAGAPSGCDCGQELAVMTYHGSSGLLATKTNENGWTTFYES
PSPVFELEIIVRVSLVRSVEQVINYYYELHLFVVPAKKGQQNEKHIITRKYQTFYMFKDS
RAKCLAYSGIPACKPGKCSSELPAPLVTPLSNENQLIGRNIGSQKYLRVFCAEEIIRFFF
SSGSFVMCVSFNISLMLTSCYKVYDSYGRLTNVTYPTGQVSSYRTDADSSVRIQTEGSNK
ENITVTTNLSASGTFYTLMQDQVRNSYFIGLDGSLRLVLANGMEVSLHTEPHLLAGTIIP
TVSKRNITLAIDNGLNLVEWRQRKEQARGQVTVYGRRLRVHNRNLLSLDFDRITRTEKVY
DDHRKFTLRIHYDHAGRPTLWAPSSRLNGVNVTYSPGGNVAGIQRGTMSVRMEYDQTGRI
TSQIFADGKSWTYTYLEKSMVLLLYSQRQYIFEFDKNDRLSSVTMPNVARHTLETSRSIG
YYRTTYQPPEGNASVLQDYSEEGQLLQTTYLGTGRRVTYKYGKIAKLLEMLYDTTRIGFY
YDELTGMLKTVNLQSEGFTCTVRYRQIGPLIDRQIFRFSEEGMVNARLDYVYDNSFRVTS
MQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRVKE
VQYEIFRSLMYWMMVQYDNMGRVVAKELKVGPYANTTRYTYEYDADGQLQVVSINDKPLW
RYSYDLNGNLHLLSPGNSARLTPLRYDNRDRITRLGDVQYRMDEDGFLKQRGNDYFEYNS
AGLLVRVYNKVSGWSIQYRYDGLGRRVSSRSSIGHHLQFFYADLSSPTRITHMYNHSSSE
IASLYYDLQGHLFAMELSSGDEFYVACDNIGTPLAVFSGAGIMIKQILHTAFGEVYLDTN
PSFQLIIGYQGGLYEPLSRLVHMGKRDYDVLAGRWTTPNLEIWKRLNSKHIAPFNMYMFK
NNNPLSNNEEIKCYMTDVNSWLVTFGFQLYNVIPGYHKPNTESMEPSYELVRTQIPENAE
SLLGVQCEVQRQLKAFVKLERFGQIYGAKSAGCPQTETKHIFATMGSIFGKGVKFAIREG
RVSTDIISLANEDGRRMAAVLNDAFYLEHLHFTVAGMDTHYFVKMGPVEGDLSLIGMTVG
QRTLETGVNVTVSQVNAVLNGRTRRITDIQLQYGTLYLNTRYGSSVDEEKARILEMARQR
AVTQAWARERQRLRDGEEGSRTWTEGEKQQLLGSGKVQGYDGYYVVSVDQYPELADSVNN
IHFMRQTEMGRR
Download sequence
Identical sequences H3C567
ENSTNIP00000003386

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]