SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|427729226|ref|YP_007075463.1| from Nostoc sp. PCC 7524

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|427729226|ref|YP_007075463.1|
Domain Number 1 Region: 1105-1302
Classification Level Classification E-value
Superfamily vWA-like 6.68e-16
Family Integrin A (or I) domain 0.012
Further Details:      
 
Domain Number 2 Region: 227-350
Classification Level Classification E-value
Superfamily beta-Roll 6.8e-16
Family Serralysin-like metalloprotease, C-terminal domain 0.001
Further Details:      
 
Domain Number 3 Region: 1325-1406
Classification Level Classification E-value
Superfamily Hypothetical protein PA1324 0.0000000000301
Family Hypothetical protein PA1324 0.011
Further Details:      
 
Domain Number 4 Region: 1897-1981
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000022
Family Cadherin 0.026
Further Details:      
 
Domain Number 5 Region: 590-679
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000877
Family Cadherin 0.016
Further Details:      
 
Domain Number 6 Region: 1610-1669
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000007
Family Cadherin 0.033
Further Details:      
 
Domain Number 7 Region: 1003-1077
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000864
Family Cadherin 0.026
Further Details:      
 
Domain Number 8 Region: 695-784
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000942
Family Cadherin 0.012
Further Details:      
 
Weak hits

Sequence:  gi|427729226|ref|YP_007075463.1|
Domain Number - Region: 1990-2082
Classification Level Classification E-value
Superfamily PKD domain 0.00157
Family PKD domain 0.01
Further Details:      
 
Domain Number - Region: 3028-3111
Classification Level Classification E-value
Superfamily N-terminal nucleophile aminohydrolases (Ntn hydrolases) 0.0712
Family Proteasome subunits 0.01
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) gi|427729226|ref|YP_007075463.1|
Sequence length 3457
Comment RHS repeat-associated core domain-containing protein [Nostoc sp. PCC 7524]
Sequence
MYNEFNVFDQGNSSLDFQYPLLDQSLFPSPERLRERPSGLMRVDNIEIPSLTPTTLAPNF
AIHSNFSGLHSLVDPLVGTSPVVNREAINLALSQVGQSFEDFLQKPNYIDSLQVAFGSGW
QQETATALIKDLAQGQNLLAIEVVTGQELEAQGAFSQQTNTIYLAQEFVQQNPTNAIATV
LTEELGHYIDSQLNPVDTPGDEGELFAAIVDGVELTPGQLQAIKVEDDHKIVTINGQTLL
VEQSVNEIVGTGGRDVLTGTPLSDRIIGGTGADIITGGLGADVFVYQSIRDAGDTIKDFE
LRQDVIDVSQVLSSFGYQGLNPIADGYIKFSTYTGGTIILIDSDGTGSLSARPYIYVENV
TPNDLSSYSSHFIPNPGEPPQIEANLVNDTGVSDSDRLTFDPTINGKVTATSNLVSLKAG
LNNQPVTVDIFDTLQPDGTFSLTSARLRQINGGSLNDGAYTLKLQAIDNKGNISSVYSFN
FVLDTTAPNLNLQLDPDFDSAPVGDLQTNFETVNLVGTTEANLIVSLQQTGVSNTSNNLG
QFSFADISLTQGNNLFTVVAKDLAGNQGEFSQTFQRLVQPVNAAPTNLSLIPASSAENVP
DNTTIGTFTTTDPDVGDTHTYSLVAGEGDTDNTAFTIVGNELRINDSPNFETKSSYSIRV
RTTDAGGLSYEQVFNISITNVNEAPTDILLDGDKVEENAVGAVVGTISVTDPDLVADFLN
NTVTVNDDRFEVVNNNGTLQLKLKDERNIDYEVETTVPLTLTATDVSDSSLTYSKNFTIN
VTDVNENVQLPTISAALANDTGVSNSDRLTQDPTVNGQITDATTLQGNLNGNGFVDISDA
LNEDGSFTISLEQYDVLSNGALPDGDYTLELKAKNITGQESEIVTISFILDLTPPPLTFG
LAPESDTGILGDGITSDRFVTLEGQTEPGLTVALLETQQMLTADSQGNFSFIDVPMPVAG
QAPFTIVAVDTAGNQGRSQQFLTREGINGAPEITSTPESIFDTQTQSTYTYQITATDPDG
DDLTYTLLNAPLGTEIDENRVLSFTPSADLKPFYEFTVEVNDGRGGTDIQTFTVEVPAFA
NFGTIRGIKWNDLNGNGVRDNELVQGANPDVVYVLDVSGSADFGFVGSPIGDFNGDGLEN
TRLDAEIAAFIALNQQLNSQGLGDRAQVAIVVYSGFAAHADMNLGTNGLQLTTTLGTDSN
SDGTTDVEEILRSIRSGAFGVGNNTGTNPEVALRKVEETFASIGTQTGNGNVIFLTDGEQ
NRGGSIVDEVERLKAQGFNLSAFGVGNDASLSVIQTIDPDGTKFTSIDDILDAFGGIGEG
SRSVLEPGLAGVSIYLDINNNGILDSGEPVQVTAQDDPNTLDIDETGQYEFNNLAPGTYI
VREIIPDGFEQTFPLRNVTRPGDGYADIILEFVSGGNAPSPLIEPYGSTGGLPSGSPFNG
NGRYTVEPVNPEVILGAPPPSPIIGRNPEVDWLALPLGSYVTVGFTDEVIIDGPGDDIFI
RSFDPIDSANEFADVFVSSNGIDFELLGTINQRGLVSLDLANINFTKPVIAVRVQGLDNR
GTSPGFDLISVEVLPGSIASPDFYTVQLEAGEIVENIDFGNVQIAINQDPIITSTPIIEA
VVGQPYEYLVRANDPDGDPLTFSLNQAPEGMVIDPQTGRISYTPTITAITSNFDTSDSQI
RPGADNQATFTNISQFNFTDAISDYAIGEGFNVFTNLPVGPVFRSAVSFDLSSLSSQVTS
AKLQLLKNRTSGDPIETLGLFEVTTEINQLYTNRIGLVSPEIFEDLGTGTSYGTFDVATG
GNPSEILEFELNEAAIAAINAASGDFFSIGLALLSANPNNTTQAAEYLFAFSGNAGIQRL
VLETENKETVEILVNDGKGGEAVQNYTLSIVESSDNQPPVITSTPTISIALGETYEYQIE
ATDPNADSLTYSLINFPDGMEIDEFGKITWIPTEIGEFTLEIAVSDGRGGADTQIYRIEV
VRDLAEDTEAPQVNLSFNSTVLKLGETLNLQIQGFDNIGLADLDLSFNGNSLVLTPDTVT
NGLINTASITLNKTGVFEVVATATDFSGNTDTETISIRVINPNDTQAPITELDLSGFDPL
NPVISELTDIVGTINDPDLEFYRVELAPVSLINLSNPAANDPDYITIAEGRANVDNGVLA
QIDPNLYRNDSYYIRVYTQDYSGNANVQGVVLGINSQNKPGRFALEFTDLSIPLTGIPIE
IQRRYDSLDAKFSGDFGYGWSLGLQDAQIQEAAPTGVDLSRDNFFGGNSFTVGTRVTLTT
PDGRRVGFTFNPVPDLAGLLGVRYKPTFTPDAGVYDRLEVDYTPLSIRSDGSVGLYLFGF
TYNPSQYRLITKDGTTYRYDQYKGLLDITDRNGNKLTYTDAGIFSSTGQSVNFNRDAQGR
ITEIIDPAGQGIIYNYDAQGNLVSVTDQAGLSATHTYSDSRPHYLEQIVDPRGNIVIKTE
YDPQGRVIGVTDALGNIISNSYDVNATGSSKTQIDPLGNTTTTVWDDRGNVISIRDQVGA
ITTFTYDANNNPITVTDPRGFTTTRTFDTRGNLTSITNALGNSRTFTYDQFNNVITDTNS
LGHITQFIYDANGNLVQVINATNQSNRFTYDNLGRVNSFIDANGNAITFSYANTTLGKPT
QITFSDGSTQQIEYNQFGQITRLVDENGHATTYITDSIGRLIAQRDPLGNEITYTYDAQL
ITSVTDSLGNAESYEYDNAGQLIRRIDPFNGVTQLSYDALGRRISETDPLGNTTTTTYRG
DGLITAITDAAGNTTYFEYDLAGNQTAVIDPLGNRTAFTYDALGRQIMQTDPLGNVTTYN
YDAVNNLIAIIDRNNRQRTFTYDAVNRLLQENWLVDDTPVQIINFTYDAVGNLISVTNPD
ITNTFVYDSRDRVIQASTEISGLSPVVLTYTYDRTGNRISASDNLGVSVNSTYDPRNLLT
SQTWQGTGIDPVRIDYAYNSRGDRTQIQRFSDLTGTQLIGSSTFNYDALQRLTEITHFDG
AGSTLASYNYNYNAASLINSEIYKGQTTNYTYDSVNQLTNADRSILPDGNYNYDENGNPI
GNGFVVGANNQILSDGTFNYSYDAEGSLVTKTNIATGNVTNYVYDFRNRLIEVVNKNADG
NTTQFVEFKYDGFGRRISKTVNGETTYFINEGNNLWSELNEVGEVINRYLHGAKVDELIA
RYSPNERTSWYLTDRLGTVRDVANTVGELINSIDYNSFGQILAQTNPSAGDRFTFTGREY
DEEIALYYYRARYYDANLGRFISQDPIGFAGQDVNLYRYVGNNPVNATDPSGLIAAIEYK
IIIDLFVLGEPGSFIGALIGFLQGFGATNLVFIGNILEIANAGGDVIAEWGTAIDRTIEK
MEEIQNELSRFNAVDIKQGLVSGFVDGAGLDVVKIQLKIPDLIVSEDDPFFGEIDLLERG
GAALGIPTSIDIPLKGGGFEDGYREARIYLESLNPRR
Download sequence
Identical sequences K9QRU9
gi|427729226|ref|YP_007075463.1| WP_015138314.1.5551

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]