SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|427728789|ref|YP_007075026.1| from Nostoc sp. PCC 7524

Domain assignment details

Strong hits

Sequence: gi|427728789|ref|YP_007075026.1|

Domain Number 1 Region: 1494-1672
Classification Level  Classification                                           E-value
Superfamily           Concanavalin A-like lectins/glucanases                   1.21e-34
Family                Laminin G-like module                                    0.023

Domain Number 2 Region: 521-728
Classification Level  Classification                                           E-value
Superfamily           Concanavalin A-like lectins/glucanases                   6.23e-21
Family                Leech intramolecular trans-sialidase, N-terminal domain  0.079

Domain Number 3 Region: 982-1120
Classification Level  Classification                                           E-value
Superfamily           Ricin B-like lectins                                     8.92e-20
Family                Ricin B-like                                             0.016

Domain Number 4 Region: 1290-1464
Classification Level  Classification                                           E-value
Superfamily           Concanavalin A-like lectins/glucanases                   1.86e-17
Family                Leech intramolecular trans-sialidase, N-terminal domain  0.055
 
Weak hits

Sequence: gi|427728789|ref|YP_007075026.1|

Domain Number - Region: 1155-1225
Classification Level  Classification                                           E-value
Superfamily           NAP-like                                                 0.00445
Family                NAP-like                                                 0.0088
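
The hits above are listed by E-value rather than by position along the protein. As a rough illustration only, the following Python sketch puts them in sequence order, checks whether neighbouring regions overlap, and counts the residues covered by the strong hits. The `Hit` type and the helper names are hypothetical and not part of SUPERFAMILY; the region coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    """One domain assignment (hypothetical container, not a SUPERFAMILY type)."""
    label: str        # domain number as shown on the page; "-" marks the weak hit
    start: int        # first residue of the region
    end: int          # last residue of the region
    superfamily: str
    evalue: float     # superfamily-level E-value


# Values copied from the strong and weak hit listings on this page.
HITS = [
    Hit("1", 1494, 1672, "Concanavalin A-like lectins/glucanases", 1.21e-34),
    Hit("2", 521, 728, "Concanavalin A-like lectins/glucanases", 6.23e-21),
    Hit("3", 982, 1120, "Ricin B-like lectins", 8.92e-20),
    Hit("4", 1290, 1464, "Concanavalin A-like lectins/glucanases", 1.86e-17),
    Hit("-", 1155, 1225, "NAP-like", 0.00445),
]

SEQUENCE_LENGTH = 2351  # "Sequence length" reported on this page


def span(hit: Hit) -> int:
    """Residues in a region, ASSUMING 1-based inclusive coordinates."""
    return hit.end - hit.start + 1


def in_sequence_order(hits: List[Hit]) -> List[Hit]:
    """Sort hits from N-terminus to C-terminus by start position."""
    return sorted(hits, key=lambda h: h.start)


def overlapping_pairs(hits: List[Hit]) -> List[Tuple[str, str]]:
    """Labels of neighbouring hits (in sequence order) whose regions overlap."""
    ordered = in_sequence_order(hits)
    return [(a.label, b.label)
            for a, b in zip(ordered, ordered[1:])
            if b.start <= a.end]


if __name__ == "__main__":
    for hit in in_sequence_order(HITS):
        print(f"{hit.start:>5}-{hit.end:<5} {span(hit):>4} aa  {hit.superfamily}")
    strong = [h for h in HITS if h.label != "-"]
    covered = sum(span(h) for h in strong)
    print(f"strong hits cover {covered} of {SEQUENCE_LENGTH} residues")
    print("overlapping neighbours:", overlapping_pairs(HITS))
```

Summing the spans is only meaningful as coverage when `overlapping_pairs` returns an empty list, which is why the sketch reports both.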
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): gi|427728789|ref|YP_007075026.1|
Sequence length: 2351
Comment: laminin G domain-containing protein, putative carbohydrate-binding protein [Nostoc sp. PCC 7524]
Sequence:
MFLQQEKLQHINSISHEGKIVVFATDASGKIYYTVKQDGFEDSYGNTQVTGWEDWQELEF
PQEEKGDRSVIEQEQAELTYTKNGKNFYFLQSVYHTQNQSDVAPVQLISGLGHLYVFRQS
KANTLLVDRFVLDGLENKLVRKLEVRYKRSRQKYQPLEQTNGNKLSLVDSLDFRDVDGNP
FYEPTTELSIINNLYSGWFSVVLLPTNEHDQHRWHIFAYNSQTQRVELTSIRASKDGLFD
VKDYTILEPQPGDETVLLPRSIPGIIKRTLDIKGVNLTDGLAATKYDIQVERQTQDGEQQ
LLKEATRVMLVIPTEQGDTAALSFPVAGDGTLAQIDETPATEILRSNDRHVLLPLNTLDE
IKAIGESMPPAQGAITGMERTEADNVKIISNQSEKLKYGDIVQISGTNHYNGHYVAKTID
ANTFEIEAKWLESELGNWEVVPKEESGLVFDGIITAYEKTVDGKLRVTAPNHGLHHGDEV
QITDTQAYNDTYPIMEVEGDSFTIGMKWQSGEAVNLKLESKKRRGITFNGDRDYISIPPL
DLKKPHADISCGETYSAWVYVSDSQTQEQLIVGQKDELMQLFVHQGKAVLKVHFIDGFKQ
IEDTELLPEKEWVHLAGVFSYNKKTQRTTLSLCRNGQEVAKSEFQQLVSPLLPAISALEE
PRNNGKGSNAQWTPEFWIGGAGDRQAFFAGKISDVQIWNQPRTAQEIKDSMYLQLTGREV
GLVGYWRLGAIMEDKQRHVVDFSIYGNDGTVYGEAFVSAVSLERTLRDGKTPAIKYVNNE
LFAVSQRATYLETFEFRGIKPNNFEFSYWGKPNRNSQQQIAFAGGVTEFKDLGNGWYLAS
CRFTVPDEITLVRSFEISHVQGDWKQLEIRKHYIRIVSDSITQDNYTDSVTLKTLDAKNA
ELEQILKKLPPLERQEADLLKERNKLEADLVILRDAKLREQQKQQLVKDISEYEKKISLL
NQEVNKYRQQYRAELNNPLNYYCYIASSDSSKVWDIEGGQGRNEADIHLWKKDPRSHTQQ
WKFEPVNNYYMIVNRKFNAYVADIEGDKDRNKADVHLWAKSQNQSKQQWQIEPFNRHYVL
VNRRFNTRVAAVEGDPNRDKADVHLWEKTADRNKQQWRLEKTNVVVNNKIDQARNVLNKK
QAELTQAQNELQKLKQELSKFELKENQIEQEIKDLAARLQTVIEQLQAVQAELNRLNNEF
LKQVRNIQQSPQSMPELSTDSRGLVTKGALLGFVRPLSRINAIETSEGNVQLSYFDTQGR
MRQTNYDATSDSRNAAFEQWIPDSVRACPNFDKSKSVVLIEDSIALNEEWTIEAWFSYPL
PETTAWNTLTRGKEKDHHIIVKEGRQLGTFIGSFQDSGYNMEQLSPGWHHIAAVGRGEGE
QATTTFYIDGRQVGECQAKTTSDVYAIANYQTGGQQFGKVAEVRIWQIALSAEEIAVNSK
TLLSSNEPGLLGYYPMNEAQGTEIRDHSENNRHGKMQSTTWFACTALIGHPGHTVMQFDG
KDDYINCGKVDFAGDEYTLEAWFKTTATAIGDIFAATTTDHGILLEVENEGTLRYLHRHP
IGVSGGTNLRTKEKYNDGQWHHIAAVKSSTAMLLYVDGYLQQEDDAPSGFSQPFNVDIGR
IGSHALARLFNGEIADVRVWNKARSLSEIQTDMHQRLTGKEANLIAYWPLNEINMERSPY
TVSDLTGNHNGTVHEALTTEDNTLPIGHSALISAEYSAISIDPDTQRKSAMMRRFFAMPA
LDGAIVLPDKRVELLELKWIGNAQFEPTLLGYIEGAPPVPSENLTVLDNYNNATSVELTA
SEDVAYSWNREQKGGLGTSTSLFIGAKTDIHIGLIKTIGALSSRQGFIGELSTAYSFLNS
SNVSANSNLSISDKLELRGTTEQIPKFPHLGNRFLPKNVGYALVVSGLADVYITRLTRSK
RMVGYQVQPNEDVPPDVNTITFLMNPAYVMNGSLDGLVGSSAADERFYQHVPQMRSQYGS
LYPASYYRLQEAYDLERQIEQQDRERQSYFENFNSRLVDETSLNREIGANASEYDNYGQI
SVNLEENRTEENQENQEDQEETNEEKITRILEILEGDREQQEQKNEEKRQEIESKIKEQE
KRVQATESFAGWQRKLEDVQIRAGKRNIVNTYVWDADGGLRSESQQFASTVEHTIGGSFN
LQGGLGLETAGNVLVAAAELKARGTFHLTQTLTKTERRSKGFALNVELRTGFRGLESTGI
TDADDRPILPGEKVNRYRLKSFYLEGGVQHFHDFFNYVVDPEWLISNSEEARALRQTKAG
KANKTWRVLHRVTYVERPALMGFGRDIRQLPTATTAPDNTLLLAKIQKLERNNQELEQKI
DQILNLLQSQQ
Identical sequences: K9QQU2, WP_015137882.1.5551, gi|427728789|ref|YP_007075026.1|
