SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|427732558|ref|YP_007078795.1| from Nostoc sp. PCC 7524

Domain architecture


Domain assignment details

Strong hits

Sequence:  gi|427732558|ref|YP_007078795.1|
Domain Number 1 Region: 3865-3952
Classification Level Classification E-value
Superfamily Cadherin-like 5.81e-16
Family Cadherin 0.02
 
Domain Number 2 Region: 3310-3397
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000131
Family Cadherin 0.015
 
Domain Number 3 Region: 3494-3581
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000157
Family Dystroglycan, N-terminal domain 0.078
 
Domain Number 4 Region: 67-167
Classification Level Classification E-value
Superfamily beta-Roll 0.0000000000000379
Family Serralysin-like metalloprotease, C-terminal domain 0.00086
 
Domain Number 5 Region: 3403-3493
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000000235
Family Cadherin 0.03
 
Domain Number 6 Region: 3773-3860
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000000471
Family Cadherin 0.063
 
Domain Number 7 Region: 3683-3771
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000643
Family Cadherin 0.047
 
Domain Number 8 Region: 3587-3672
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000133
Family Cadherin 0.031
 
Domain Number 9 Region: 902-989
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000235
Family Cadherin 0.029
 
Domain Number 10 Region: 995-1077
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000297
Family Cadherin 0.016
 
Domain Number 11 Region: 3957-4042
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000157
Family Cadherin 0.02
 
Domain Number 12 Region: 3227-3303
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000366
Family Cadherin 0.02
 
Domain Number 13 Region: 412-491
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000825
Family Cadherin 0.03
 
Domain Number 14 Region: 4370-4594
Classification Level Classification E-value
Superfamily Tricorn protease N-terminal domain 0.00000575
Family Tricorn protease N-terminal domain 0.02
 
Weak hits

Sequence:  gi|427732558|ref|YP_007078795.1|
Domain Number - Region: 1143-1195,2991-3049
Classification Level Classification E-value
Superfamily vWA-like 0.000258
Family Integrin A (or I) domain 0.025
 
Domain Number - Region: 1388-1456,1490-1538,1605-1626,1680-1742,1793-1826
Classification Level Classification E-value
Superfamily Quinoprotein alcohol dehydrogenase-like 0.011
Family Quinoprotein alcohol dehydrogenase-like 0.031
 
Domain Number - Region: 4606-4825
Classification Level Classification E-value
Superfamily Tricorn protease N-terminal domain 0.0314
Family Tricorn protease N-terminal domain 0.016
 
Domain Number - Region: 4965-5157
Classification Level Classification E-value
Superfamily Tricorn protease N-terminal domain 0.0745
Family Tricorn protease N-terminal domain 0.029
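Several of the Region fields above list more than one segment (for example the vWA-like weak hit at 1143-1195,2991-3049). The following is a minimal sketch of how such a field could be split into segments and measured. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the function names parse_region and region_length are hypothetical helpers, not part of any SUPERFAMILY tool.

```python
from typing import List, Tuple


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a Region field such as "1143-1195,2991-3049" into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total residues covered, ASSUMING 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Region strings copied from the hits listed above.
vwa_like = "1143-1195,2991-3049"
cadherin_1 = "3865-3952"

print(parse_region(vwa_like))     # [(1143, 1195), (2991, 3049)]
print(region_length(vwa_like))    # 112
print(region_length(cadherin_1))  # 88
```

Under that assumption, the first Cadherin-like domain spans 88 residues, and the split vWA-like hit covers 112 residues across its two segments.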
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor



Protein sequence

External link(s) gi|427732558|ref|YP_007078795.1|
Sequence length 5642
Comment RHS repeat-associated core domain-containing protein [Nostoc sp. PCC 7524]
Sequence
MYTEINVFDLAPSPVLAPSPLASPEQLGRFDGMEELNIPVITPTPLSPDFSIAIKSQTLL
VEPSVNEIIGTGGRDILTGTALSDRIIGGFGADIITGGSSADIFVYQSIRDAGDIIKDFE
LRQDVIDLSVVLTSVGYQGFNPIADGYIQFGTYTGGTIILLDSDGKGSLAARPYIYVENV
TPNDLSSYPNHFIPNPGEPPQIEASLVNDTGVSPSDRLTFDPTINGKVTATSNLVSLKAG
LNNQPVTVDIFDTLNSDGGFSLTSAQLRQINGGSLNDGAYTLKLQAQDIKGNISNVYSFD
FVLDTTAPILDLQLDPNFDSAPVGDLQTTFEKVNLLGTTEANLTVSLQPTGVSSTTNDLG
EFTFANISLSQGSNLFTVVATDLAGNRGEFSQTFQRLTPQVNAAPTNLNLTPASSAENVP
DNSIIGTFTTTDPDAGDIHTYTLVTGEGDTDNTAFTIVDNELRINESPDFETKSVYSIRV
KTTDAGGLFFEKIFNIDITNVNEAPVIALPQAQFVAENTDLIITGISISDVDAGGGELQV
TVTANQGVLTLSQTTGLTFTAGDGNADSNVTFTGTLTAINQAIAGLTYRGNPNFSGNDTI
TFTVNDLGNSGSGGALTTNNNLQVIVNSSGIVLREDNFFLRNYEQIITIPDTTSVLSFTY
SDLNFDTTDPDSIKDAFEVALVDTQGNSLVHTIQANRDAFLNITESQGVALAAGATIEGQ
TVKVNLAGLAPGSAKVIFRLVNNDSDTNTTVRLSNIQISPAEGITPVIATPELSLAAINA
SINFQLLSDVTSSFSPEYQRTSFNEDTKLLYADIAVRNAGTYIVDAPLVVAIKSLSDPSV
QVVGADGLTPEGLPFFNLSNLVADGSLDPNEQTLARTISFFNPQQTQFDYELVFLGQLNI
APEFVSEPDVEALAGKAYVYEAKAEDANNDTLTYSLLVAPEGMVIDGATGKISWTPAANA
VGNYTVTVQVEDGRGGVDQQNFVLGVIAPPPNRSPLITSTPIVDAKVNHEYQYQAVATDP
DGDTLTYSLLSAPVGMTVDSNTGLIRWQVQNNQLGLQDVQLQVADGRGGTALQIYQILSL
AEVGNRNPIFVSTPVTNYNLPGISNSPSGNVNLNGIDLTLLLGETATQTVALTLPTGGQS
TGSADIVFVVDESGSMAGEHDWLAGMVQDLEAALQAKGISSNRYALAGFGGAGSREPGHL
FNIGGNFNLSLSRFSNQLFASSNFGTVVQPLTVQLAHDGSYIIVINSSATAGSVNYSFQV
KDTSSAPVAATGWGRIESGTIAAGEQVTLNLTAPAGLPVYFDSQDVDNDQIQVELRDSNN
TLIFTTNASSDRGIFTLPTSGNYTLTIRGTNASSTGDYSFQLLDLTANTTDLNTNQQINE
TIEAFATKIYRFNGTPGQKLYYDALENDQDRVNIRVIIPSGNNIFSSNADNDQAILTLTE
TGTYYLFIENNEASNRDYNFQLLDAATANTLVLDTTIIGSLEPGRQTQLYQLQVNGGQRL
YFDDLGSTTGASWQIYNANNQQITSGAINSDREFVIANTGTYILALQGNSNTPVNYNFRL
VDASTSTTTLSLGTVINGSIAKPGEQDEFTFTGTVGQRLYYDALGNLAGITAQLISPSGT
NVFSTSANSDTSLFTLTEAGTYRLVLDGNSASTGDYNFQLLDAATANSLVLDTAITGSLD
SGRNTQLYQLQVNGGQRLYFDDLGSTTGASWRIYGAANQQISTGPVNSDREFVIANAGTY
ILALQGNSNTPVNYNFSLVDASTSTTALSLGTVINGSIAKLGEQDEFTFTGTVGQRLYYD
ALGNLAGITAQLISPSGTSVFSTSANSDTNLFTLTEAGNYRLVLDGNSANTGDYSFQLLD
AATANSLVLDTSIIGSLDIGRNTQLYQFSVNGGQRLYFDGLGSTTGASWRIYGGANQQIT
SGSINSDNEFVIANTGTYILALQGNSNTPVNYNFSLVDAATSTTALSLGTVINGSIAKPG
EQDEFTFTGTVGQRLYYDALGNVGGITAQLISPSGANVFSGNANSDSSLITLTEAGTYRL
VLDGNSGNTGDYNFQLLDAATANSLVLDTNIIGGLDPGRQTQLYRFEGIFGQRLLFDSLA
SVSGSNWILYGLGNQAIANSSLSVDLQVLLPVSGTYLLAVQGNSTTPVNYSFQVKDTSSA
PVAATGWGRIESGTIAAGEQVTLNLTAPAGLPVYFDSQDVDNDQIQVELRDSNNTLIFTT
NASNDRGIFTLPTSGNYTLTIRGTNASSTGDYSFQLLDLTANTTDLNTNQQINETIEAFA
TKIYRFNGTPGQKLYYDALENDQDRVNIRVIIPSGNNIFSSNADNDQAILTLTETGTYYL
FIENNEASNRDYNFQLLDAATANTLVLDTTIIGSLEPGRQTQLYQLQVNGGQRLYFDDLG
STTGASWQIYNANNQQITSGAINSDREFVIANTGTYILALQGNSNTPVNYNFRLVDASTS
TTTLSLGTVINGSIAKPGEQDEFTFTGTVGQRLYYDALGNLAGITAQLISPSGTNVFSTS
ANSDTSLFTLTEAGTYRLVLDGNSASTGDYNFQLLDAATANSLVLDTAITGSLDSGRNTQ
LYQLQVNGGQRLYFDDLGSTTGASWRIYGAANQQISTGPVNSDREFVIANAGTYILALQG
NSNTPVNYNFSLVDASTSTTALSLGTVINGSIAKLGEQDEFTFTGTVGQRLYYDALGNLA
GITAQLISPSGTSVFSTSANSDTNLFTLTEAGNYRLVLDGNSANTGDYSFQLLDAATANS
LVLDTSIIGSLDIGRNTQLYQFSVNGGQRLYFDGLGSTTGASWRIYGGANQQITSGSINS
DNEFVIANTGTYILALQGNSNTPVNYNFSLVDAATSTTALSLGTVINGSIAKPGEQDEFT
FTGTVGQQIYYDSLGAISGNLNTKLISPSGNTIFDIRTQDDRGPFYLTETGTYRLVVDGI
GAATADYKFQVLDVGAAPTLSTDIITTGSLDSPQAVSLYRLPGIAGQRLSFAPTSDFFFA
DAATFAESTTILQTSGGTEDGYDGIDAALNGLSFRPGAAVNFILVTDEDRDNTDPSLTFS
SILNALSNQQALLNAVINGNFRNTNNQTVLGVDSASQAYLANGTGGYTITAIGSVVGDGN
TKPDYIDLALATGGAAWDLNQLRSGGLTATSFTQAFVDIKAREILEQLPITVIASDPTVS
FENLTGAISGIGAGQTATFNTKLTGDGIARSFDLLFVRPESGTILGSIPVTINNNYFYLA
QAVDPDNDTLTYSLRQGPTGATIDANTGRIHWQPAQGGDYQFSIEVDDGRGGRSTQDYIV
TVKTGQPNTAPTITSTAVTTTAIGRPYTYAVQATDPDDDTLAYYLSEAPEGLTIDRTTGV
VTWTPTQAQLGNQSVKLRVLDGRGGEANQSFTLAVTPDVNNQAPVIQTTPVTQVIAGEVY
RYNVNATDGNGDPITFDLPLKPEGMTIDATTGSILWQPTAAQVGNHTIVLRARDGYGGLD
LQAFDITVVSLNNAPTITSSPVLEAVAGLPYQYQIRAQDADGDAIAFRLDTAITGLNIDS
NSGVITWTPLNSQIGQQTVQITASDGKGGEATQTFDIQVVASATNAAPEITSTPRTTIPL
GSNYLYSVAVSDPNGDPLTFSLSNAPAGMTIDQQGLISWQPQPNQLGVNPVKMTVSDGRG
GVATQEFAIAVVGQFTPQTNQSPQIISTPTLTATANQSYEYNLSGSDPDGDLLVWDLATK
PEGMSIDATTGRIRWQPQLSQIGQHEVVVQLVDSFGGLATQTFTLAVRGINTPPVITSTP
ITTAAVNQLYTYTLQAQDATGDPLNFELLAAPQGMVIDRQRGLLQWIPTPGQIGQQTVTL
AVSDSQGAVTTQQYQIVVAAATINNPPKITSTPTFAAPSGQVYRYAVTASDPDGDRLTYQ
LLKAPTGMTIDAETGLLTWTPTNAQVGINAIQIAAVDPTGAAALQSYSLAVQINNSPVIT
SRPVESVTAGATYRYDLKANDPDRHPLTFTLTESPEGMTIDNFGRITWSAPPNAQGTYRV
QVNVSDNFGASTTQAYDLAVVRDEQAPLVNLTLSSTPVRVGQPLTVLVAASDNVGVTGLN
LTVNGTAIALDAQGRGTITLNQVGEFAAIAQASDASGNLGSANVSFLVIDPNDREAPQLS
LTGITNGQEITVPTAIKGAIADNNLLSYTLAIAPVSGGTFREIARGTQPAADGTLGILDP
TILANDSYILRLSATDAGGNRSTLDTTVDVAGQLKLGNFQLSFTDLEIPVSGIPITLTRT
YDTLTTNTKNDFGYGWRMTFRDADVRTNLRPDEIYREIGYRTVGFEIGTQVYITLPNGQR
EKFLFRPTPFLLPGLYRPAFVATDPGVTSTLTVDSTILVGASGQFYGFYGGAYHPANFGG
YYRLTTKEGIVYEINAETGKIDTITNRNGDQLTFTQAGITSSSGQKVTFGRDAQGRITTV
TDPTGNQIRYEYDRIGNLISVTDGTGDKTRFEYNNQQVHYLEKVIDPLNRPIARTEYNAD
GRLAQILNFNGESFRFTYDPENLVQTIRDALGNPTILEYDSRGNVITQINALGGVTRKSF
DANDNILSETNPEGETTTYTYDRNGNKLTETDALGNTIRYTYNNNSRPLTVVDAVGNSTT
YTYNAQGNLTTTQDGNGAITRYSYDARGRIISSTDNAGNIIEYGYDNFGRLTSQINALGH
TTTYTYDANGNLLTETKTQTTPTGTRNLVTTNTYDAAGRLIAETNPENQVTRYEYDSFGN
FVAMIDPLGRRTEYRYDAQGRLIETIYPDSTPADLSDNPRMLSAYDALGREVARTDRAGR
TTYYVYDALGRLIETIYPDGTPDDLSDNPRTKSEYDKAGREIARINELGDRTEFTYDDAG
RRVQIKDALGNLQRYVYDAAGRKTAEIDALGRTTRYVYDALGRHIQTIYANATQSTFTYD
AVGNLIAIKDPAGNTTRYEYDALKRTTAVINALDQRTSYTYDEIGNLTQIKDAKNQLTRL
EYDGIGQKIATVLPLGQRETFVYDAVGNLQRQTTFNGETITYTYDTNNWLIRKNLGETTT
VNYTYTPTGEVSSITDGRGTTTYSYDPRNRLVQLTETDGRTLIYTYDSVGNRTSIQTPSG
KTTYTYDALHRLQTVTDAGGNFTQYTYDAVGNLTRTILPNNLVETRQYDQLNRLINLTQT
RPDNTVIASYNYTLDAVGNRVSVTEGNGRRVNYSYDALYRLTREAITDTIPGDGVPPTVG
DRTISYTYDAVGNRLSRNDSLEGNTTYTYDGNNRLLQAALGNQTTLYTYDNNGNLLRQTQ
GTNQTRYTWSPENRLLSAEIINANGTSQVQYKYNDQGIRVATIVNGQETRYLIDVTQPYA
QVIEEYTPDGVVAKSYVYGRDLISQLQNGQQFVHLGDGLGSTRMLTDASGNVTDRYIYDA
YGQIISQIGSTDNTYLFAGEQRDSNLDLDYLRARYYDFRSGRFISADPFEGFLNDPMSLH
KYQYAHANPVNFTDPSGLVTLQDGILVHAILGRHFVLGDPANRVTDISIAKISQETGSYI
RGATGNRWRPDLTDFGVRQIYEIKPDGRFSEGVAQLNRYLTLGAGLIGQGWTVGTAANYM
PLPMFVIPPIKIVRVDPPVQGVIIYHLTDYTGVIIAATVLVLAIRFVPLPSFSFGFGLAL
AF
Identical sequences K9R0F2
gi|427732558|ref|YP_007078795.1| WP_015141599.1.5551
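
The sequence above is wrapped across many lines, while the domain regions are given as residue positions. The sketch below shows one way to join wrapped lines and slice out a region. It assumes 1-based inclusive coordinates (not stated on this page) and uses a short toy fragment rather than the full 5642-residue sequence; join_sequence and extract_region are hypothetical helper names.

```python
from typing import Iterable


def join_sequence(lines: Iterable[str]) -> str:
    """Concatenate wrapped sequence lines, ignoring blank lines and whitespace."""
    return "".join(line.strip() for line in lines if line.strip())


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, ASSUMING 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# Toy input: the first 20 residues of the sequence above, split over two lines.
wrapped = ["MYTEINVFDL", "APSPVLAPSP"]
sequence = join_sequence(wrapped)

print(len(sequence))                    # 20
print(extract_region(sequence, 3, 6))   # TEIN
```

With the full listing joined the same way, comparing len(sequence) against the reported sequence length of 5642 is a quick check that no line was lost in copying.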
