SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for gi|428218848|ref|YP_007103313.1| from Pseudanabaena sp. PCC 7367


Domain assignment details

Strong hits

Sequence:  gi|428218848|ref|YP_007103313.1|
Domain  Region     Superfamily                         Superfamily E-value  Family                              Family E-value
1       1149-1236  Cadherin-like                       0.0000000000000023   Cadherin                            0.01
2       2385-2469  Cadherin-like                       0.0000000000000117   Cadherin                            0.0086
3       1239-1326  Cadherin-like                       0.000000000000314    Cadherin                            0.01
4       2292-2379  Cadherin-like                       0.000000000000385    Cadherin                            0.031
5       2199-2290  Cadherin-like                       0.00000000000628     Cadherin                            0.049
6       1801-1872  Hypothetical protein PA1324         0.0000000000536      Hypothetical protein PA1324         0.01
7       1910-1998  Cadherin-like                       0.00000000471        Cadherin                            0.021
8       1333-1419  Cadherin-like                       0.0000000518         Cadherin                            0.033
9       2117-2191  Cadherin-like                       0.0000000832         Cadherin                            0.024
10      2475-2562  Cadherin-like                       0.0000275            Cadherin                            0.038
11      2906-3192  Tricorn protease N-terminal domain  0.0000418            Tricorn protease N-terminal domain  0.024
 
Weak hits

Sequence:  gi|428218848|ref|YP_007103313.1|
Domain  Region     Superfamily                              Superfamily E-value  Family               Family E-value
-       809-865    Carboxypeptidase regulatory domain-like  0.00314              Pre-dockerin domain  0.047
-       2011-2098  Cadherin-like                            0.00518              Cadherin             0.084
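The hits above are listed in order of E-value, not in order of position along the protein. The following minimal sketch re-orders them by start coordinate to read off the domain architecture, and tallies how many residues the strong hits cover. The `Hit` type, the function names, and the assumption that region coordinates are 1-based and inclusive are illustrative choices, not part of the SUPERFAMILY output; the regions, superfamily names, and sequence length are copied from this record.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    start: int        # first residue of the region (assumed 1-based, inclusive)
    end: int          # last residue of the region (assumed inclusive)
    superfamily: str
    strong: bool      # True for "Strong hits", False for "Weak hits"


SEQUENCE_LENGTH = 4259  # "Sequence length" reported for this protein

# Regions and classifications copied from the assignment details above.
HITS = [
    Hit(1149, 1236, "Cadherin-like", True),
    Hit(2385, 2469, "Cadherin-like", True),
    Hit(1239, 1326, "Cadherin-like", True),
    Hit(2292, 2379, "Cadherin-like", True),
    Hit(2199, 2290, "Cadherin-like", True),
    Hit(1801, 1872, "Hypothetical protein PA1324", True),
    Hit(1910, 1998, "Cadherin-like", True),
    Hit(1333, 1419, "Cadherin-like", True),
    Hit(2117, 2191, "Cadherin-like", True),
    Hit(2475, 2562, "Cadherin-like", True),
    Hit(2906, 3192, "Tricorn protease N-terminal domain", True),
    Hit(809, 865, "Carboxypeptidase regulatory domain-like", False),
    Hit(2011, 2098, "Cadherin-like", False),
]


def architecture(hits: List[Hit]) -> List[str]:
    """Return superfamily names ordered from N-terminus to C-terminus."""
    return [h.superfamily for h in sorted(hits, key=lambda h: h.start)]


def covered_residues(hits: List[Hit]) -> int:
    """Sum region lengths; valid as a coverage count only if no regions overlap."""
    return sum(h.end - h.start + 1 for h in hits)


def neighbour_overlaps(hits: List[Hit]) -> List[Tuple[Hit, Hit]]:
    """Return position-adjacent pairs whose regions overlap (neighbours only)."""
    ordered = sorted(hits, key=lambda h: h.start)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b.start <= a.end]


strong = [h for h in HITS if h.strong]
print(" > ".join(architecture(strong)))
print(f"{covered_residues(strong)} of {SEQUENCE_LENGTH} residues in strong-hit regions")
```

Read this way, the strong hits form a run of Cadherin-like domains interrupted once by the PA1324-like domain, with the Tricorn protease N-terminal domain as the most C-terminal assignment.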
 

Protein sequence

External link(s): gi|428218848|ref|YP_007103313.1|
Sequence length: 4259
Comment: RHS repeat-associated core domain-containing protein [Pseudanabaena sp. PCC 7367]
Sequence:
MQISNEALLANTPSLLDAATLGNPLLDASLDAPLKYPSRYKSGSQRNPLQPNSPYLDLGN
GSNINGLAGITDDLLVPAQSRLTNPTSGNGFGDRIPTFNLDDEQIAPISAGSDLSLQVQE
SLSSSPESSELSSPPLSSSPLAASAVTSIPEFAIKAEGRITINNGGDFDGAPLDTSDDAL
VYGKEGFTINKSATLPVERDSQGNPVLDSEGKPILVPFAVAVSEGYNVANAPNNPYSNLL
PPQIVDPQTVDVPGFNTVKDQELADRIPAGITPIVFDRANQFNNTNNWNQNFPAGGTADN
PTVVEITNGGLNIPNGVTIENTVIIVKRGNINFNGNNHALNNVAIVANNGGVNLANTQVN
NSAILASRSINLNNGARFSGESLIAAGSQNIIFNGATTTTDEADFLTVIAQGDIIFNGAS
ATRGEFLTAKNFISNNGSELIGSIGAKGNAIFNNNATVTAIVSDQDSPIITGQLANDTGV
SDGITSNPTISGTVTDDSTITELTAGFGSIPVADYVDILSELQSDGSFTLSQTMLEQING
APLANGDYVLNLQAVDQFGNTSNFTIDFSLDTTAPNLDIGLDPASDTDPVGDGQTTAEIV
TLTGQSDPGAQIELIQTGQTTTADGTGAYAFTNVALALGDNSFDVKATDIAGNETTISPT
FTRLPVDADPPVITGQLANDTAIGGTNGDGITSDPTITGTVTDASNIDEFIAGFGDIPVG
EYTGLIPELLPDGSFNLDQADLETINGAPLVDGDYTLNLQATDEFGNASGNVDIAFTLDT
TPPAAPTLELDPVFDTEPLGDGRTIAEIVTLTGQGDPNTQVRLVQTNQVTATDSTGQYTF
TDVPLILGENTFDVIATDIAGNEVTTTQTFTRLEEGTLILQEGQSFQVDLTESLDILEQP
AILTFTYDDDFDLTDTAGINDALEVAIVDANGDPLVQPFATGRDAFFNLTEGEGAAFSID
ASLVDKTVTLNLPAFTLGQTTNLIFELINNDSDTQTTFTISDIAILPGGTGTSTPVTPTS
ETPVSADIDFSSITEVTPSLEAKYGRTSYVNKDKVLSTEVSLENIGQYDINGPLLVAIDN
LSDPTVQVQNFDGVTPDGIPYFDFTNLINEQGLAIGSTSDQRTLSFFNPNESQFTYNLRV
LASTNRSPDITSEPGVEAFVDRPYTYQVQATDPDNDQLTYELLAAPDGMAIDASTGAISW
TPIIDNVGNYTVTVQVSDGNGGITEQSYNLGVLDGNRNRPPLFTSTPIVQAAINAEYTYQ
ATASDPDQDGFIFSVVDAPDGLAIDADTGLVTWTPTGEQAGTFDVTLQVTDVRGGEALQT
YKIFTAAEEGNNAPIIISEPQTDGFTATGYVQRIEAIDPDDDALSFTLTEAPTGMAIDPE
TGWISWTTKPENAGAHDISVMVDDGRGGFDNLSFTLDLSSEQPGQIWGRVFYDSFEPPED
LLNQPQVFKPRTNNRLTPPELNSGNVEIVDISYQDPTFGIDSFAYDDTTGQLLATLVNPP
GLRVGLGDIVALEQDGSLTQIVSATPEPGIILGRDGFAVVPEDFIGDFNPGDKFVTRDFL
VQTGLQKITTGESGFEFEPDFADIGGPPFSFTDDPNTTSQEIQRILFDETGLFGGDLITS
SWYSKAGTAGFNLAINRVNSEGEVSNIANVEISTSVVSDIVPNDLSYGPLAGKILATYGT
TLYTIDPSGVIEEVPYIVTSLIPGDLEFKQAGFRLVEPNQNLVANHFAVPSIGLNPGVSA
IGADYFQPYIGDLMALRDGGGGSRSRLMYWDGENVRIVQVPAKFPPLDGGLANGTFSEDR
TFAPVAIGDIPAVEPLSLANEVVFIDENDNNLRDSGEIWTTTDDNGIYNFNLAPGDYKVV
QEVQEGFEVTSPESDNYNLTLASGEVLTGNNFGNVRNFIVPPAPDENEAPEFITNGPNLA
QVDERILYRPQAIDLDGDLITFDLVSGPEGLTFDIDRNIMAWRATADQVGTETVIIRATD
ARGAVTLQEFEIEVVPQNLPPRFYTVPPATSQAAIGQNVQFLVEARTPESDPLTFILEPD
GTSATVVDEDPRLPRFNTFEDVLFKWQPNTTGIFNFSITADDGEGGTAATSFQIEVVDNL
PNDAPTLNIEGLPRITTQAIPYVGTVIASDPDNDPLEYRLTEAPTGMTIEPNGNIFWAPQ
PDQVGTHIVTVEVDDGRGGIVSDTYELTVASTGTIPNTPPFFVSDPRVNATANLDYEYQI
EVNDTDIVNDNLLLTLDTAPDGMFLDPATSRLLWTPGLNDIGTNNVVIRVTDSQGAFELQ
EFTVTVNAVNIPPIITSAPITEAAQDKAYSYLVQAEDSDNGQLSYELINAPTGMIIDPGT
GLIEWTPTATDLGNATIEVRVDDGQGGIATQTYDLLISDTAANLAPAFTSQPIYAAAIDE
PYTYQATASDPEDGTLTFSLGNSPTDMTIDANTGLIEWTPTTDQTGDFSIEVIVTDPDGA
TATQSFVVSTIVNSAPVILSTPATGIQSGDTYLYNLRVEEPDGDPLTYTLTEAPEGMTID
NLGRLRWDVPLGLNVLQPVTITVADNRGASVQQSFDIAVSGDDQAPVINLSAPFNVNINE
PANIVVTATDNIRVTELLLIIDGESITLDANGVAEYTPTQSGVFTAQAIATDAAGNMATV
EQEFGVIDPTMNNAPDISLDVIEGTEFTAPEDIIGSVLDDDLTSYSLSVAPLDSDNFTEI
ASGSDTVNGAALGEFDPTVLQNDTYTLRLEATDVAGNIAIVDRQVNVVGELKLGNFRLSF
TDLQIPVTGIPITVTRTYDSLTSKNSDDFGFGWRMEFRDTDLRTSLGRDEVFEQLGIRSE
AYQEGTKVFITLPGGDREVYTFKPERDPLSNFFPPVVDGEDTTIYRPAFESEAGSYNTLT
VKDTRLTRSNGKFIGLGGQLYNPADGFFGGTFVLTTKEGIVYEIDGNTGDLLTVTDRNGN
QLTFTDAGVESSTGKQVVFERNAQGRITALIDPAGNRITYDYDANGDLIAVTDRENNQTE
FQYNEPTRDHFLTDIIDPLDRPASRTEYDDKGRLKQILDVNGESVEMTYDPDNSLQIVKD
KRGFDTVYEYDNRGNVVKETDPVGKVTERTYDADNNILTEKVITDESGSDGFVSKFTYDS
RGNVLTETDPLDRTNRYTYNSFNQLSTIINPLGHTTSYEYDSRGNLTAETDATGYTRSYI
YDPVGRVKLIVEDAGRDITEFNYDSFGNLSSLIDALGHETTLTYDENGNPLTETRTQTTP
EGERTLTTTVTYDNDNRVTSVLDAEDNLTRFEYDGNGNRTVVIDPLGRRTEQVYDDLNRP
SETIFPDGTPGDLLDNLRLRNEYDVVGNRTAIVDPSGKTTNYVYNPINLVTETISPDDTP
GDLSDNQRILSDYNQRGLATGLTDEDGDPVELIYDAAGQLVGSSNTLNDSVTTVYDAAGR
SIATTDPLGRTTLFDYDELDRVTRVETPDGESIAVVYDQFGNVTSLADQAGRSSQYEYDV
LDRLVAVTDENGERTEYSYDELGNLIQQKDANDRLTKYEYDRLSRRIAVERPLGQREEMS
YDEVGRVDRITNFNGDVIEFEYDELDRLRAKNYVNESRLFEYTYYDSGQLATYADDRGIT
TYNYDDRNRLASRLEPDGTEISYTYVSGGAIESITTPTGTTSYTYDEVNRIDTISKDGEV
TDYEFDDAGNLVQIILANGVTETMSYDDLNRLVGVTNTDANGNILSSYIYTLDELGNRTR
VEESSGRIIEFTYDDLYRLTQETITDPVNGDRTITYTYDEVGNRLSRDDSIEGLTTYTYD
DNDRLLTSFLNGVETTYAYDDNGNLLSANNPDRQVVYDWDAMNQLVGADITDVNGIKEID
YKYDASGIRVASIVDGEETRFLIDTTRPYPEVLEEYAPNGDPIASYVHGLDLISQERNGE
SLFYLSDAHSGVRQLSDDLGGVVSTYDYDAYGNLLNSTGTATNNYLYRGEQFDPNLDLQY
LRARYYDPNLGRFPSVDPFEGDLENPMSKHRYQYGFNNPISYIDPTGAFNINELVASQTI
QSILEGSDLNVAAETAIWAGLALSGLTLSLVQKDASYYERLKNNNLVYWEGEVFATGLNT
LAPIISSVRSIFNFKEFIEKLANPVSATSSQFSASFINAKSDPFFLPVSSKFRGIPVNAV
SRNITVTSNLVGSLPSSNALSVLSVGGFTGTSPVDVTNLPNFVSLPNSDGGVARSFAGGF
LGYTGLTGTYLTGTFSAGGFQSGYVEADSSGTGFIVGLGIGLSFDFSAGFTFNVAGSVS
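To inspect the residues behind one of the assigned regions, the sequence can be joined into a single string and sliced. The sketch below is a hedged illustration: the helper names are hypothetical, and it assumes that region coordinates such as 1149-1236 are 1-based and inclusive, which this page does not state explicitly. The toy input is only the first 20 residues of the sequence above, wrapped over two lines.

```python
def read_fasta_sequence(text: str) -> str:
    """Join the sequence lines of a single-record FASTA string, skipping the header."""
    lines = (line.strip() for line in text.splitlines())
    return "".join(line for line in lines if line and not line.startswith(">"))


def domain_subsequence(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating coordinates as 1-based and inclusive."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# Toy record: the first 20 residues of the protein above.
toy_fasta = ">toy\nMQISNEALLA\nNTPSLLDAAT\n"
toy_sequence = read_fasta_sequence(toy_fasta)

print(len(toy_sequence))                       # 20
print(domain_subsequence(toy_sequence, 3, 7))  # ISNEA
```

Applied to the full record, `len(read_fasta_sequence(...))` should agree with the reported sequence length of 4259, which is a quick check that no lines were lost when copying the sequence.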
Identical sequences: K9SL80, WP_015165841.1.5078, gi|428218848|ref|YP_007103313.1|
