SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for 99883.ENSTNIP00000011201 from STRING v9.0.5

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  99883.ENSTNIP00000011201
Domain Number 1 Region: 2040-2233
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.87e-35
Family Laminin G-like module 0.0038
Further Details:      
 
Domain Number 2 Region: 1785-2013
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.34e-34
Family Laminin G-like module 0.0067
Further Details:      
 
Domain Number 3 Region: 1287-1422
Classification Level Classification E-value
Superfamily Cadherin-like 5.42e-28
Family Cadherin 0.0023
Further Details:      
 
Domain Number 4 Region: 746-860
Classification Level Classification E-value
Superfamily Cadherin-like 1.7e-25
Family Cadherin 0.00067
Further Details:      
 
Domain Number 5 Region: 1071-1183
Classification Level Classification E-value
Superfamily Cadherin-like 5.37e-25
Family Cadherin 0.00034
Further Details:      
 
Domain Number 6 Region: 211-315
Classification Level Classification E-value
Superfamily Cadherin-like 9.85e-25
Family Cadherin 0.0017
Further Details:      
 
Domain Number 7 Region: 965-1078
Classification Level Classification E-value
Superfamily Cadherin-like 1.83e-24
Family Cadherin 0.00093
Further Details:      
 
Domain Number 8 Region: 425-533
Classification Level Classification E-value
Superfamily Cadherin-like 1.86e-24
Family Cadherin 0.00082
Further Details:      
 
Domain Number 9 Region: 861-962
Classification Level Classification E-value
Superfamily Cadherin-like 2e-24
Family Cadherin 0.00074
Further Details:      
 
Domain Number 10 Region: 318-424
Classification Level Classification E-value
Superfamily Cadherin-like 5.71e-20
Family Cadherin 0.001
Further Details:      
 
Domain Number 11 Region: 529-641
Classification Level Classification E-value
Superfamily Cadherin-like 2.22e-19
Family Cadherin 0.0045
Further Details:      
 
Domain Number 12 Region: 1179-1304
Classification Level Classification E-value
Superfamily Cadherin-like 2e-16
Family Cadherin 0.0019
Further Details:      
 
Domain Number 13 Region: 10-103
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000000228
Family Cadherin 0.0034
Further Details:      
 
Domain Number 14 Region: 119-223
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000000249
Family Cadherin 0.0042
Further Details:      
 
Domain Number 15 Region: 1398-1509
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000671
Family Cadherin 0.0082
Further Details:      
 
Domain Number 16 Region: 638-746
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000000428
Family Cadherin 0.0083
Further Details:      
 
Domain Number 17 Region: 1505-1605
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000929
Family Cadherin 0.022
Further Details:      
 
Domain Number 18 Region: 2278-2320
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000948
Family EGF-type module 0.014
Further Details:      
 
Weak hits

Sequence:  99883.ENSTNIP00000011201
Domain Number - Region: 2236-2268
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0188
Family EGF-type module 0.038
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) 99883.ENSTNIP00000011201
Sequence length 2421
Comment (Tetraodon nigroviridis)
Sequence
DWYLNKVRLKITDVNDNIPEWDMKPYPYLAVVSHEAPAGTFVYQLQAHDEDEGRSGEVEY
FLSDGGDGCFSVEKKTGQVVTTGLALQRDREYLLSVVALDGLGGRSTPAMLSVVAGARAP
QFTNGSYAISIPENTPEEQPFLVVYTISFQKQPISYSLLINPSSLFSIRPETGEISLTRT
LDYESDQRRYLLMVRASEEPGILSAAAEVQVLITDENDCVPEFLQSIYSVDGVPETVTTA
TSLLQVLATDCDSGSNAELTYYILSPDFSISPHGTIFPASRLDYERPNHLYEFVVMATDG
GIEPHSGTATVRVRMANINDEAPEFSQPVYRTFVSEDAGPNTLVATVLAKDPDGDGIIYS
ITSGNEEGNFVIDSQKGLIRLRSNPLPKLQGLEYVLNVTATDDNASGGPQPLFSTARVIV
GVDDVNNNKPVFKECQQYREQASVLENQPTGTFVLQVHAVDADEGANGKVKYGLMHRDSA
MPAFRIHPDTGAIVTAQRFDRERQREYSVTVTATDWAEEPLIGICQLTVQILDQNDNSPK
FENLRYEYFLREDTLVGTSFLRVAAHDDDFGTNAAITYSMSPEQPEYLQVNPVTGWVYVN
QPISQRTYITRDIVATDGGNRSSSVELAVTITNVKNQPPQWEEENYSVVIPENTARDTPI
VTIKATSQLGDPRVTYNLEDGMVPETNMPVRFYLSPNREDGSASILVSEPLDYETTSFFS
LRVRAQNVAAVPLAAFTTVYVNITDVNDNVPFFTSSIYEASVTEGAQIGTSVLQVSAHDK
DLGLNGQITYTLLSDSSGDHSLFRIDPELGIIYTEAVFDREARSSYLLEVQSEDGQESAR
PGKNKQPNSDTAYVRVFISDVNDNKPVFAQRLYEVGVDEDADVGLAVVTVSATDEDEAGA
NAKLRYQITSGNKGGVFDIEPEVGTIFVAQPLDYEQQKRYKLLVLASDGRWEDYAAIVVT
VVNKNDEAPVFSMNEYYGSVTEELDGSPVFVLQVTATDPDKDADQGAIRYSIHGQGAESH
FVINDITGEMYAQRTMDREERTVWRFVVMATDEEGEGLTGFTDVIINVWDINDNAPTFTC
APDNCHSSIAENSPPGTLVVEMTAVDLDDAAIGQNAILTYRITKNVRNTNKEDFFAIDSS
TGTISVAAEGLDREFADTHRVVVEASDGGGMTGTATVTIAVTDINDHAPKFLEDWCGAHV
SENTDKDASVLELRAVDPDIGTYGQLTFSVLAGDLDQRFYMMSNRERQMGILMLKEKLDF
EKPAEQGFNLTIKVEDSDFSSIIHCLIQIDDENDNAPEFASSSHLLPPLPEDVPVGTSII
QVVATDTDSGLNGEILYSILPQSDPHGHFAVSRAGLVAVARPLDRETVAGYELVVMATDR
GHPLTASVTVHLLLLDTNDNGPELETAYAPVLWENSPAPQVVWLNQSSTLLHVVDRDSPE
NGPPYYLSLPSLYSVDFHLQDHGNGSAMLTALRMFDRERQSEFHLPVIMVDSGDPPMTAT
NTLTITIGDQNDNAHQAGEKDVYVHSRKGRVGNAALGKVYAPDPDDWDNKTYTLETSAAK
YFSLNQSSGELTIRQNAPAGSYWLRVGVSDGVWPDVISGIRVHVRDLEEKAVLSSASLRL
TGLTARDFIDSRIEGKSRLETFWDFLSETLSVRTGSINIFSVADREQKTVDVHFYVLAEN
GYLHPEKLHAVLAAHKRKLQSLLRVNVSQVEVDECVHADCRASGGCFTQLSVSNSPSLLD
SGALSLVSVKVTPVAVCGCAAREMTYRICASYPGNPCLNGGTCMDTKNGYRCHCPPQFDG
PDCQQTRLSFLGNGYAWLPPIRPCFDSHLSLEFMTDEDDGLLLYAGPLATLLPGDTEDYM
AIELIGGTPSLKMNHGSGTLVLQLNHNIGVANGRWHRLDVRSNSKEVHFTLDRCSSAVIM
ETEGVDSWVMTEDRSSCEIRGVTPKRDKSSRYLNGSHVLQLGGVNENLSYEYPQLQYKHF
TGCIRNLLVDSKLYDLGSPAEASNTVPGCTLIDGHCTDGEPRPPCGLRGHCHSHWGSYSC
LCQPGFTGPQCDRAAPEFSFDGRSHIQFQLSGSLPARQTRVQVGVRTRAAAGVILSLLSQ
EQNEYLRLEVIQGLLAVFYNLGDGDYNLTLPHQHLSDGEWHELELDRYGREFTLRLDGGG
GRREVTASRGQGQEIVVDPTTVMLGNSFPSGHNHSFLGCMRDLRFNGHSIPLDPEQTSDR
LQVVAVQGVSPGCSSEACRQHQCSPPLVCVDLWRHHECRCPPGHITRETTQGKVCMYTLC
ASRPCRHGTCVAHSPSRYSCHCSEGYRGRHCEVTLAMFHNEDNNSLSLSSMFAISICVMA
FLVLMLGLFLYSCWRRHKGLKEGVYHVSAHHGEWEDIRENVLNYDEEGGGEQDQNAFNMV
ELQRSLQPSPAQSLRYSYPHS
Download sequence
Identical sequences H3CSG7
ENSTNIP00000011201 99883.ENSTNIP00000011201 ENSTNIP00000011201

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]