SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for PTSG_06730T0 from Proterospongia sp. ATCC 50818

Domain assignment details

Strong hits

Sequence: PTSG_06730T0

Domain | Region | Superfamily | Superfamily E-value | Family | Family E-value
1 | 2234-2336 | Cadherin-like | 0.000000000000134 | Cadherin | 0.0031
2 | 961-1063 | Cadherin-like | 0.000000000228 | Cadherin | 0.0042
3 | 4315-4416 | Cadherin-like | 0.000000000714 | Cadherin | 0.0066
4 | 4094-4227 | Cadherin-like | 0.000000000742 | Cadherin | 0.0093
5 | 4619-4708,4745-4776 | L domain-like | 0.000000000742 | L domain | 0.037
6 | 4424-4494,4553-4605 | Growth factor receptor domain | 0.000000000832 | Growth factor receptor domain | 0.01
7 | 613-653 | EGF/Laminin | 0.00000000294 | EGF-type module | 0.008
8 | 1188-1288 | Cadherin-like | 0.00000000514 | Cadherin | 0.0029
9 | 3316-3441 | Cadherin-like | 0.0000000286 | Cadherin | 0.0033
10 | 3779-3890 | Cadherin-like | 0.0000000357 | Cadherin | 0.0054
11 | 4763-4857 | L domain-like | 0.0000000544 | L domain | 0.06
12 | 2880-3004 | Cadherin-like | 0.00000014 | Cadherin | 0.0095
13 | 1617-1715 | Cadherin-like | 0.000000257 | Cadherin | 0.0092
14 | 3440-3540 | Cadherin-like | 0.000000271 | Cadherin | 0.0078
15 | 4878-4926 | TSP type-3 repeat | 0.000000275 | TSP type-3 repeat | 0.0013
16 | 4918-4959 | EGF/Laminin | 0.000000607 | EGF-type module | 0.017
17 | 3553-3658 | Cadherin-like | 0.000000928 | Cadherin | 0.0041
18 | 3896-4000 | Cadherin-like | 0.00000134 | Cadherin | 0.0059
19 | 98-131,187-246 | Growth factor receptor domain | 0.00000204 | Growth factor receptor domain | 0.012
20 | 2575-2677 | Cadherin-like | 0.00000243 | Cadherin | 0.0036
21 | 3114-3217 | Cadherin-like | 0.00000286 | Cadherin | 0.0093
22 | 2049-2163 | Cadherin-like | 0.00000286 | Cadherin | 0.01
23 | 3230-3330 | Cadherin-like | 0.00000371 | Cadherin | 0.0072
24 | 1073-1177 | Cadherin-like | 0.0000127 | Cadherin | 0.0055
25 | 1938-2047 | Cadherin-like | 0.0000214 | Cadherin | 0.0077
26 | 4204-4314 | Cadherin-like | 0.0000328 | Cadherin | 0.013
27 | 3667-3775 | Cadherin-like | 0.0000628 | Cadherin | 0.006
28 | 1737-1817 | Cadherin-like | 0.0000728 | Cadherin | 0.0075
29 | 6245-6312 | Type I dockerin domain | 0.0000824 | Type I dockerin domain | 0.0041
 
Weak hits

Sequence: PTSG_06730T0

Domain | Region | Superfamily | Superfamily E-value | Family | Family E-value
- | 6079-6165 | Carbohydrate-binding domain | 0.00012 | Cellulose-binding domain family III | 0.014
- | 419-472,540-585 | Growth factor receptor domain | 0.000173 | Growth factor receptor domain | 0.0067
- | 2459-2565 | Cadherin-like | 0.000414 | Cadherin | 0.01
- | 653-766 | Cadherin-like | 0.000614 | Cadherin | 0.013
- | 475-557 | Complement control module/SCR domain | 0.00125 | Complement control module/SCR domain | 0.0052
- | 2341-2404 | Cadherin-like | 0.00128 | Cadherin | 0.01
- | 873-957 | Cadherin-like | 0.00657 | Cadherin | 0.0098
- | 1390-1468 | Cadherin-like | 0.0141 | Cadherin | 0.013
- | 4012-4098 | Cadherin-like | 0.0186 | Cadherin | 0.013
- | 765-840 | Cadherin-like | 0.022 | Cadherin | 0.0087
- | 1289-1396 | Cadherin-like | 0.0297 | Cadherin | 0.012
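
The region strings and E-values listed for the strong and weak hits can be handled programmatically. The following is a minimal Python sketch, not part of the SUPERFAMILY server: the function names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the 1e-4 cutoff is an assumption chosen because it separates the strong hits listed here (largest superfamily E-value 0.0000824) from the weak ones (smallest 0.00012). This page does not state the threshold itself.

```python
from typing import List, Tuple

# Assumed cutoff between strong and weak hits; not stated on this page.
STRONG_CUTOFF = 1e-4


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '4619-4708,4745-4776' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


def is_strong(evalue: float, cutoff: float = STRONG_CUTOFF) -> bool:
    """Return True when the superfamily E-value falls below the assumed cutoff."""
    return evalue < cutoff


# Three hits copied from the assignments above: (region, superfamily, E-value).
hits = [
    ("2234-2336", "Cadherin-like", 0.000000000000134),
    ("4619-4708,4745-4776", "L domain-like", 0.000000000742),
    ("6079-6165", "Carbohydrate-binding domain", 0.00012),
]

for region, superfamily, evalue in hits:
    label = "strong" if is_strong(evalue) else "weak"
    print(f"{superfamily}\t{region}\t{region_length(region)} aa\t{label}")
```

Discontinuous regions, such as the L domain-like hit, are returned as several segments, so their lengths are summed rather than taken from the outermost coordinates.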
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Protein sequence

External link(s) PTSG_06730T0
Sequence length 6376
Comment | PTSG_06730 | Salpingoeca rosetta notch-1 (6377 aa)
Sequence
MGSAQVRLSSPAIANYTLAQVTALSYGAQTVPDPQSVISLVTAPSSNPDTEDPYAAITLL
RGDVSANPSGAALSLSFADGVDITSAQLLRVYEAEDGCASAPCVHGTCIDTPAAYACNCT
GTGFTGPQCQHLITCNVTQDTTPQFSAQLFPEPPANSIAYDERAVFQCLSGFRSSASEDG
PALVFRTCTASGRLGAPSATCVDIDECASVALNECEQQCINTNGSYACACTAPDSMLSSD
NRTCTYAVQLFPDTIDTETALDVVDVLVVAQDRDLSTLPIDNITIGNLLITETAVIQSSI
RVRLQRADLRSTAAIPVSPSPILRQATLTALSSIPNTASVWQTTGVTSTTDERCVHALAS
EQVPGNSSVLAEVEYRVATSIFTIVHPTSATAPQATARRDSLTYDVTTALPAGTRIKAFG
CACPTGFEGDTCADNIDDCSPNPCGPGTCTDLVADFTCSCEGTGFQGDLCAQRVQCPAPA
TADAATITSRSPGVDVHFEDSVVYSCHSGNVVYNTTSGSTLTTLLPPDSTPAYMTSFTRT
CRADQTYTNTELACHDVNECDPDAQDQPRTYQQQALYTCDDGYTTTGDAQATATYTATCT
ATGNYTFQGTCVEVDECAAQPCMNGGVCTDLVNAFECACVGEWGGPTCSDDTNRPPERIS
LSTTRVPENTFPLPLVTVTVEDPDTHQSHTLQVLSSTPPGVIVVATTNGGSSSGGSGGAG
AASPTLEITRALDFEVDGSAIEAVVRATDNGTISQLSADFLVRITVDQDAGGTFTYALVS
PQGSTAAAAFQIMGDMLLVKSAVPLNHEANPVVTTSISSIDNNGAVVTSTFNISVVDINE
PPTDVVVHAINDTSQQPLQQFLVPEQPAHATTAAQPWLVGAVRVLDPDTADATADPQPFV
FDVSPAGVAYIENQQLFVDRSALDFESQSSFVINVTARDARDSSLAVSHQFTLLVQDLNE
SPTSVVPNGVPSISEHALPGTVAGSLTVQDPDSIANDTHTFLLTDASGKFALVPGSDRRT
VQLVLAAGATLNYEVRSVFVVAVTATDQGGLSATSQILVFVQDRNDAPTTPTITSNRLPE
NTTALPFVVGTLQATDEDASQMLQFAIDSIDPPHVAPVFALDGAALVYDNTDSPDVVVSH
ETTPTITFTITVQDSGGPGDNDPPLSTTTDIVVRVVDSNDAPQGLVLINATAAVPENAVA
GTFVGTLAVTDEDMDETITFEILLGGEFLAVATPQPTACSGGAGLCTADVVTTTPLDFET
TPTFTLLARATDSVGSSTTQFFTIAVQDVNDMPRVLGLTNTVVSEAPSSPLPLTIATVMV
DDEDGDVVSCSVLAPADAAFTIESAAPASGETQPQQQLVLVDAQPIDFEASHTFDVALAC
SDGNGGATDATFTLTVAEDVPVRSTVASIIVVDQDAGDTHNVTVASEPAGVFEATQDSVA
VIIRAEDSAHNVLQTTITVSVLDANDPPSAVTLQPSSLNEHTAAAAASSDAIDFEVSSVV
EVDVQAEDSEGARGPVTTLFIDVVDVPEAPRQLSLQGDVVEENAVATAVGVVSVLDPECK
HVALGPTTCVDDAATGGTRCTATLSTTSAAANAPNYEETPFVDVVVVATDLLYTPPYLLE
LSTSSVAENTPPGIPIATIVAFDEDRSGDEHTCQLLSGGDSALLSLLPPNTLTVAAGMTV
NFEERASLTFAVWCSDVNDATVFATANLTLDVVDVQEPPTAVVFAASGPVPEVRSQAIVL
GTLSTIDEDAQDSTIFVLSDNDGGRLTIAQTNEVVILADAPIDFETQPTRTFNVTAIDLS
GIPVSRQFTYEVADVNEPPTAPTITPTLGNGGDNSSDVTIIPVPETTAVGTVIATLTSTD
PEGTPVSFVVGGQDSASIVLATPATPVNFEGGSSSSSDARLFFWARASDGVNDSPATHAG
IAIGDVNEAPSSIVVSPTRVPENALPGAVTLSVTITDPDEEDSVVITGIATDPANIDITI
QTQPSCVSVPPREGNNGTLAAGYTECSGAVLSLNQPFNYEQLPSFVLTLHAADAAGAQTT
ATGSVEVANLNDPPTSVAPATVQLPENSAANTTVAVLTIADEDAGPAGTYTCALRHPSSA
SWPFVVSGGRRVLVDPTATASELKAALNYEDTPVLLVEVTCTDGSTATTPLAINHTLTVQ
LTNVPEAPTDIVIVPGSVPENSPAGTRSLPLVVARGAVLDHEATPQLSVTLGAFDPTFRY
VIRTIAIPIVDVNEPPTMPMYEGALTLPENAAPGTVIGEVSAMDQDDGQTITFSIGDGGD
AVGASSFTISGTSVVVGPAPTLDFETQPSVSFEVVATDSDAASPLSASTIVTISLTNAND
APSTIAFEGSFTLPENAPAGTPIGVVIATDEDASDTLNITVYESSSSSRISGVRAGVTAC
EPGEGGVGVVCRAQVLSTHTFNYEELLASSATMRPIVVEVTDTMGGLASASTSFTVLDVN
DEPTDLVAPALPLLLPENAVPLPYTVSTLTLEDEDQVQVSPVSCTVDGAYADSFVATATQ
GTQRIAVQATRTFDYEEEPDTFDIAVTCQDGAFSITRTFPVRVLDEPEAPTAITPMQISV
AENSPVGALVGEITAVDPDTFDRHSFAFVDTANLPLTVVDDNTADRSAHVIVGTPAPGQA
ALDFEDESVRETTLLVRVTDSTGLFADVNVTLIITDVDEPVPSFTVAPAAPIPERTTSPT
LAATLTAVDPEGDLVTFTVTDESQAVATGARACLAVIPPVMPAAATMPSDTITTISTTTT
TTTTPAPAFTGSGDGTVVSSGDGDAGADEIVDTTATPVSTTDATGTVPTLAADPSQTECR
TSMTVTSSDSVVLMIVPGADGLDFERWPVFNVSVHASGMMDTPSGITRDVPVYVLDEPEP
PRDVQLTATAVSESTAVGTIVARVQAEDDDAGSQLTFAVHDPTDTFVLAAPMNASDVQCE
TSAADGVGAVCSAPLALTRPLNYEQQPSHVITVVVSDGQHTTTANLTASVTDANDAPTAL
SLDPSSVREDVRWRRVPVARIVVEDEDLLHNPAEQHTCVVVEPASAFIIVANDTTGQIIN
EVFATSAAAIDFERASERTITCMDNGVPRASLMQDVVLTIEDVNEAPVQVLLQPPSVPEN
AAVGTVIARIALDDPDLYTFTSPDSAEPAELAMRRGFGNANVTLVTEPDLHPFTQRGQEL
VLTAPLDYERVDSYTISFLVQDGPHTATFSVDMQVLDRNDAPAPAALDNNVLLENATAPA
VVGLLLLFDQDEGQELRAAVAAGVGNSSLFTIDAQNQLVFVGAPSFVLDHEQAPVLTVVV
IVSDALNATSTTVLEVMVGDANDAPIVRAPQPSPVQVTESAPVGTSVATLTAVDVDVRDT
QLVFVLSDPTNTFALGNVTCMPASSASQLTGVTCTTDLRTRLPLDYESTPTYVAVIEAVD
AQGARGGRAITVDVTDANDAPVIAGLLPDASVPETLAVGDTAAVVATTDQDANHAVMCTL
LPPTDSDTPAPPFALVGGNRLQLTDPLNFEVQAEYTLRIECVDTHTPVAGRTTASLALYV
ADRNDAPTRVWLNTTATGPLPEDAAPGTVVGQLHVDDEDAADAGMHRLRVVGGGSSAPYS
SRLAISADGRQLVVAQPPSPLAANEFLFDYETAPVVLVEVEATDPSNASVVSQLTIAIAD
ANDAPSSPLLDGSSTVGSIPETAAPGFVVGSLHAMDADGDAVTFFIASDGGAAGADNGLF
DVVPAAGNSTALLVFAGPTSALDFEGGNTMLAVSVGASDGRGGEAFTDLTIAVTDANEAP
INLVVSSPLAVAENMAVGTAVGTLTATDPDSNMEALQFDFIPADVLSVVGVTCDHAQPTG
NSTAIATTCTGVVVTAQVFNYEQTPSVSLVARVTDAQGLAITRSVQLQVADANDRPAAVQ
LLQAEPLPENAQAGTRVGVLIVSDEDTTGQQHSCAVTDTSATGGALFSVTTDKNELVLRS
DAAGVLDFETSPTVEVEVTCTDTTPQRLNVTQGVTVRIGDVNEPPTGITFDGELREDAAA
GMLLGELVVADPDVNDTHTIAGVGMDARLQLRGNLLYVRDDAATLFDYETEPLLQLHLVA
IDSVGQFVQQLVNITVEDVNEPPVDVTLTPTQPTAVENTVPTTLIGRLSALDPEGSGPVT
FRSGVPAVLAVTSDGNVTLVAPLNFEDSSGGADGVQWFSVVAVDATGLETEARIPIRVTD
ANDAPVFTEPAQANAFITAPENQPVPVDVAVVDEDLSDDLTLFLVEDPEATAVQITPCAA
SSSSSMANGSEVVLRAGAEPLNFEEAQTASVTIRCVDSGVPQRETTTTFTLTVTNANDPP
TRIALSPTAVSEFASVGSTVAHVWAEDEDADAARFHRFVALSAGDPPFFVFGTSLVLSRR
LDYETQRSHNITIRVEDTAVSPPATLTQTLTVRVIDEVDCDATRCFNGGDCVDIPGEGFE
CMCPPGYDGQRCEINVDECAQQAEPCGPGVCVDGINEYTCNCTGTGFRGDVCDEPVMCTA
VPPPNAALVVNGSLVDGDDASQQLAFMEQVSVACLQGYTTNGLVDGPTTFTEVCQDNGMV
SSTASCIDADDCLAEPCLNGGLCEDRVAGFYCQCPPGFSGERCQFARDCFLDQAGATATV
TTYEDLALLAECRVFHSNVTISITAAPANATADDTANATTAHLASLSNLQRVAGVLALRN
NDHLVSLRGLANLTWVGGLVLENNAQLRTVSGLSALTTIGGELVVAGAGPSTLQGLEPST
CGSVIIRHSSLVSLEAVNLPMWLQGSFVLENNDYLLRVNGLAAVLQIAGDVIVRNNLRLM
TLDEPSNLARVDGDILVRDNTVLLDVDALASLQSYGGSWVIANNSNLCFIGPFLLENIAS
LEVNSDSEEFPDVMLARDDCPAERADTDGDGVFDDKDNCPRVFNPQQRDSDSNGIGDVCD
CNEGSDAFCANGGTCVQDTQARHYMCACPVEFEGPTCNISSVYPVPQMQVEHSALFQQQP
VRVSSVAFTQPYLLLPGTNPEVITDSATTDTVTTRLSAAWASLQAEEDVDVPRQPAHDAV
ATLLSASTVWFDAPHVRVHLQARDASFNMRTAATTVSITLSARVDTDGDGTKEDVSVQAW
CETSAASSGCIAWTSVPQSWFPLENSDGTPAAMRAVTVAYGIGASLSASAFMQELPGQLQ
LASRPAIAARVRNTVAVLVPFAPVYREDTTQLAVYAQADRGVSAVVLRMDMIDSDALRFA
RVVAAPGWSARTSTSGNTTLAISLQRQVQQQSGGGGSGSYANPEVLCTVEMEVMPQAVEG
RFSELQATAVGFVDLQGRNLQLEGFYPLPSPALHLHRNRGVLASKSIGYVPVAANTLVAL
LPHLAQAELINTARLDGNTVSSALALWSADARGTIAPVSPSTTALACESGKQSVVRVAAD
CTAVVLLGTEIEGQSRVSVTVRVSGTAVSASVDVRVWFPLASTTTASLSRPTTRPLAGLR
DHNCQPVYEDVFLDVSIALALGSPSSQTAPPTVRVDATRLLLSGITSSDPATASLDPSSP
GRGTAVRVRAHQPGIVLFLVQTASGSFIGAVALEVQSEAAQLQVFARPIVDLTLASSSSS
SSSPTAVAASVPEELEPTRRFATATYGNRVTRFPQNVNILAVGIASENANTTASSSMVDP
EPTDLSGVLGVVTELYDASQDVVSLVDGSTLLVEGLTSPSGPVANVSVLSLCDGAVAGTV
PLHVRTTPAVPTQVEVLLGAPRLAVSGNAAAASDVGIPTRTRVRVILHFADGTQQDATFA
PGVDVQVPAGSVLLTGNDVLTLATTSNAAAGSFEIAVTLTNFDNMEQRAPVTFVRAAALL
LDAELSPGGSAVPDASSSTSRVHIGPLQALAADRSLRVSVQLLLTDGTAVDVSASSFTTL
RAVVGGTTAPSNTVELVLEAGVFVVTQSSGLQGDVDVFASFADEAVTSSRLLLSVSSTAL
PVETIDVATVGNNVEPGLDHGGNGGNGGDEVVVTPLCRVTGTAVPLTFALGFADGTTAAG
AEDLAMDAAAFAFASSAPDVVEVTADGSAVLLQSSLTPVQLHVALSSATTINTTVHVLPN
ALPAVGDVDLGELCGPPLVKSGTTVAVPVWVNTGARAAALVDLHVTYNPSALRAVSATAG
ADFSGQLVQRLNDPPGVVAVGGISDPFSGPRAQVAVIAFEVLDDTLPLDLGGVVQTLATL
RGADIGGATPRAIVAGRLASGDAGDVSAPVTTTATPATPAAAAAAGRTRREATPCSNPPC
NCSEIVLGDTNADCVFSIKDVSYLQAILADLALDEEGTRAMLLDAQLREMDVDRNGVIDA
RDSSLLLRINFGLLPFVGKAATSFDENECGLSLSLPLPALPATSAANTYVFFAVDMQERA
SCFVHRRQPPRVVVGL
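
To look at the residues behind a given domain assignment, the wrapped sequence lines can be joined and sliced by region. This is a minimal Python sketch under stated assumptions: the helper names are hypothetical, region coordinates are assumed to be 1-based and inclusive, and the region used in the demonstration ("1-5,11-15") is made up for illustration and is not one of the assignments on this page.

```python
from typing import Iterable


def join_sequence(lines: Iterable[str]) -> str:
    """Join wrapped sequence lines into one string, dropping surrounding whitespace."""
    return "".join(line.strip() for line in lines)


def extract_region(sequence: str, region: str) -> str:
    """Return the residues covered by a region string such as '98-131,187-246'.

    Coordinates are assumed to be 1-based and inclusive, so each segment
    maps to the Python slice [start - 1:end].
    """
    pieces = []
    for part in region.split(","):
        start, end = (int(value) for value in part.split("-"))
        pieces.append(sequence[start - 1:end])
    return "".join(pieces)


# First line of the sequence above, used as a short stand-in for the full protein.
first_line = ["MGSAQVRLSSPAIANYTLAQVTALSYGAQTVPDPQSVISLVTAPSSNPDTEDPYAAITLL"]
sequence = join_sequence(first_line)

# Hypothetical two-segment region, for illustration only.
print(extract_region(sequence, "1-5,11-15"))
```

Passing all sequence lines to join_sequence gives the full protein, after which any region from the assignment details can be extracted the same way.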
Identical sequences F2UEM4
PTSG_06730T0 XP_004992127.1.12839
