SUPERFAMILY 1.75 HMM library and genome assignments server

Superfamily is undergoing a server migration - you are now browsing on the new server. Please contact us if you experience any problems.

Domain assignment for gi|16331135|ref|NP_441863.1| from Synechocystis sp. PCC 6803

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|16331135|ref|NP_441863.1|
Domain Number 1 Region: 2583-2980
Classification Level Classification E-value
Superfamily Integrin alpha N-terminal domain 1.96e-33
Family Integrin alpha N-terminal domain 0.0013
Further Details:      
 
Domain Number 2 Region: 1318-1460
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.19e-18
Family Clostridium neurotoxins, the second last domain 0.044
Further Details:      
 
Domain Number 3 Region: 3650-3748
Classification Level Classification E-value
Superfamily CalX-like 1.02e-16
Family CalX-beta domain 0.0011
Further Details:      
 
Domain Number 4 Region: 1913-2173
Classification Level Classification E-value
Superfamily Integrin alpha N-terminal domain 0.000000000000244
Family Integrin alpha N-terminal domain 0.0021
Further Details:      
 
Domain Number 5 Region: 2285-2411,2459-2568
Classification Level Classification E-value
Superfamily Sialidases 0.00000000000334
Family Sialidases (neuraminidases) 0.017
Further Details:      
 
Domain Number 6 Region: 3382-3535
Classification Level Classification E-value
Superfamily beta-Roll 0.00000000000645
Family Serralysin-like metalloprotease, C-terminal domain 0.0059
Further Details:      
 
Weak hits

Sequence:  gi|16331135|ref|NP_441863.1|
Domain Number - Region: 3083-3329
Classification Level Classification E-value
Superfamily WD40 repeat-like 0.000311
Family WD40-repeat 0.047
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|16331135|ref|NP_441863.1|
Sequence length 4199
Comment integrin subunit alpha [Synechocystis sp. PCC 6803]
Sequence
MTINLADRLSALTTDSAGTTHVVWVENTSLWHAVYDPNSAMWVNAQVIANVGNQSLTSLN
LIANDKLIQPPGNNSNAAPAPGLVVVYQQGSGNDSDIFYTAARYNSGGRTEWLSTPQALT
SDQVGDLAPRAIAEENGTVLVVGQKVDAEQAKNQAIREDTDLYYQSFTVNQNQFSSNQNS
PASGNIIQGNSYNYLGRNRNAPAASYEVLSAGEASFLGNKPTYQGLGLNWNASNEFNINA
LSLILAKKLGQEPEKVLKSMFGPLILSISLQGSSGNNPEWSLFGGGESTKGRLLLNSLAT
VGRKQRRNVGNSEDSNRFDKWRSQGRGALSPDSQEGFDLSFTLSSLYSYGNSNGNSDGKF
PLYLETGGALASLAGSIPLFGFDDILNRGLNFNFGLFFNAGVALQWQVAPKDPGNYFPLL
APYLDLSGNTDRLFITDGFYNALPTYSEVTAALALLSNLEAAFASIDQNSSVELESLLVG
IPVTLGFEGTLEIPFIFKLFGSIGFFADGTFTIKGDTPSSITIGAPFNLTVSLLGLIGAR
IAGFPTWTQTFAASSSQNNQTGLEANNTASVDGFLLTINLAQFLEPNLSLNPTDFTVTVT
NANGQISTIPVFGVVVQTDSQSNNSEVILRLEQEIPYTTVNGQPFSNIIVKYSGNQLGNF
TSSPVALQSPDTLIYTYNPTSGTGNDYTQNNQQITIAFNGPLDPDILPTENQLLNWFAVT
NSSNQAIAITDVAVNQSRIVLTLASNTAINPGEIYTVQYNPDANSDQQLKAANQKTITAF
TVSNGTAPTAWGLINRTFAGVINVQTQPVLSNLTSDFAQDTSPALALTSQGDILLAWSSD
TPPITPISVLAEGDYLYLVFADNLKNDSANPPSNSQFTIKTSDGNTTTPTNVSLAQNTIT
LTLTNSVNASQIVEVSYSLSGTNLTSNLYLADATNTSFWVPDFTNSVQSAGSTSTAPTSL
LGSVISNLITIPFNQTLNVNQIPNGGQFQVTVNGNSTSPITVTSVTVADTSITLVLNQII
GQGQLVTITYTPNSNGENNLVSNTGTPQTVASFSTNNFLTTASSTGTVIKTAFSPFGDSG
ISSITTIPGTTGINSDVVATLYQNPSTKALQNVVAWVNVDTSALSLKTIPGQNYGPNEAA
LITTAAQQSDIYYAVLGPDNQWGLAAPIFSSQPGQDQKVTLGVGPGGNLLAAWLNTQLDS
DGDPNTTIQLATFNGTTWTNPTILGGANAGINPNSFSELSISSINGQPAIFWTESRPPSY
SNLVSEQNPLVYLRLGELSGTTVINQGQLSVAGNGTYSTAGYTLGQVGALENTNTNTGDF
NPAVLFNGGGITINSPVPVSVQGFSVEFWFKLPTSDGVVGLANLAGVFDLSLNEDSLTFT
LNNGSNPQISGTVTTGNWHYVVGTYDPVKQILDLYLDGQLVNTLENIAFANLPQSGTLTL
AGSGGSVYLDEFAFYNSILSYVDNGSSPSSSNNNFLNLTGSQLINGIWGVNEVGSHYQAR
FFEPVTAGPETNYSVWDSSGNSWQSPVSINPVDEVVPTILSAANNPIWDIVSANPAGNNN
AQIAPNGNPDTIFQVNLTGQQGSEITGFTVTTSNNQLWTVGTDGTGNAFSESWQLGVILA
ENADSTTPQLEFISGDKLLNSLNPGATFSHRVMGATETFTIFVDTEGSPLTSPATVNIYL
QGQTDPITFTSLSPIPNQGGPVSANSPDYLDNQVLGIATIKEANDASLSLVDSGFVIDTD
NPAIAAVMASGFSNGALAYVAVGNRGYTTQGNAVQGSIQILFAGGDVLSQKSTLPLTTTD
LSGNGDGVLITGITDAGDINNNVPMALVTGDVDGDGVDDLVIGNANANGGTGSIYVINGH
YLNGLKGKNQIIDLSNASNWTSDQGFVIDGVDAEGGAGFSVAIGNFTGNDPQIAFGAPFA
KNGNGVAVGKVYLVSPSNPSQLSPIHIGNTFNLTNPQNPAQTVTVGETAGYSLGVSRKIS
GGPVTFTNNSGDDLFIGSSTYGVQVSNQWVGKSALPSNNQGNYPDTTMIAAGAVHVYSQT
SSQPFGKVATYTGPNIPAANGVGANYLAGAAISLGDFTDLDGDGHQDLAISALGVNGSAG
AVYALSGSKFTPSSSLQALNEAGNLIINGGIAGGRAGMTIMTPGDVNGDGYQDFLITAPQ
AGNGTGQSYLLFGPLDLSTEIVPIIELNAIANDSKQVFVLNGSLPNQLAGTAVVSLGNIT
GTQGPNNRPIDSFLISAPNAQQFYVVFGQPWLAADGSLNLADVASDNGFVIDGNLIGNPP
TTFETTSQYIDTTPAILINGSNLYLAYKGFGGNNQIYFTVSTNNGQSWNSEVQLPQSAQT
IFPPAIAFFNNVLYLAYVDGNNGLNIITSQDQGQTWNAPLALGGTSSTPPTLFVYQGTLS
LLFAANNSTSTVLQFYLNSSNEWIYANEIGSNQTAISAISATVLGDTLYLVYKGGTRNTP
STLDYITSTTNADLSANDWSSIPIPGVSSQGGPSLTNDGTNLYLSYLDSSNQLNFVSSGN
GINWSSPQVITNNISQSPPAIAFANNELYLSYPGPQGSQELNVTSFPLPFTGSILGNGSL
VRFLGDVNGDGFADVFSGGTNAGAIIFGNSTKDLLTTASGSEDLVISVPNATLRDVISVG
DFNGDGIKDLGVLDGNGNFYVVLGNTSLGDLKTLSITSSSSPVVINQVGGVTKSMAIGDY
NGDGYDDVLLWGDNGNQVAWGNSTGVLNSFTNIDYPETQTTATGVDLNSDGIPEIAIGSD
ERKIAGQISTSGSFSLLPTPTTSSVINTLAAANQLENIGDFNGDGIADLAVLASNYYAAA
IGEPNNLPNYLSRPGNQGGVFIFYGNSNGLSNTAQPDVILAAPFTNPSGQISTYQLSRIA
QAGDVNGDGFDDLLISSPYTVDAENNQGGVFVVFGGDDWNNQPFDLGQLRANQSQGSNPR
GFAIDGSPNSQAGIALNGGGDINGDGFADFIIGAPGENNLQYNQQIVFIENGELSDDDKY
SYILYLDGNQTIQMGGGDWQANQVWTNQVATNWNNSSRPPEAVIGQSNGDIWYYPGGNQN
WQSWGKLPAEINELAVNWNTSGNPQIIAGLGGKGGIEYYNGSTWVNNGPYQGDGWRSAIT
QMAVQWGEDGSPSQIVVGLADGAVIYYNTQSGWRTINNFGKSVTQLSVQWQEASNPNIVV
GLDNSEVQYYQGSNGVWTQFHDDGWVYPVQQLAVQWTSNDAQPLVVVGLGDDNGNNGSVW
YYQGSGEQGGWTFLSGLPSGAAIAQMAVQWNFSSSPNPPNNVNDLKIVVGQADSTVSYYN
GNGWTATPAINSSLQIPTLNAITVQWSANGQPQITVGLGDPEYDNGQLWYLPNPSQSWQE
LQGSVNYASPITQIDSSWTESLVPNSQTDNLSYVFFGSDFNDTVNQTGTIGDDVMVGSAT
GESFLAGQGDDQILTKGGLDVVYAGPGDDWVSVSDTYFRRLNGGTGFDILALQGYNGQNW
DLTTLSPGLRLQDFETIDIRDYGANQLTLNSLSVINLSSNNTVIVLMDESGDSLQLSSDF
GADGTTYQYGQRFYQYKSNNSAAIVLVNQPTMPSFTAPSQNKPQPVLPNGNGTSNTAALN
TNIANTGNANTGNFNDENINTGNANTGNFNNGNTNTGNVGDINIATKLFVSSPTASEALG
EVDFTIERTGDLDKYVVVSYLTQDMDGKAGDRYLPVAGQLVFKPGETKKTIKVKVPNDSI
YTGDKQFGLLVSLLEEGLQPGNGEQAFFLAADANGSQIRNWNYLPGESANSLTGGVINFS
TTVNAGQALVKLDVNGLAEFNDFGSYNPVAGNYESLMLNGMTGAKFTNFDNENNPQGLEL
QLWDGDRGDADGLTNGLVQTKGYLGRVIPGLISNDNRVFWAPTSADGQVQLRLINSPNQN
YAMGWIEVDDTNGSIDGLLPDDPSYEAAALARKQLIFSDQNGASTKALTRSLAQQSFTNV
DNLIATESQFFGDFSNANLEPNRYYILYSQQGDETTFSIDTAPIIETDSRGYHQLNFAGI
TAEIGSKTLVVPGVLGQSVTAEVSISRAGAYDNAIALYKVDSLTGGLDLNGDRQIDLKPG
DSGYTEAALGRAQAPLTGVSLTAPDGFFSTTQQTVNLLGNQMYGMVIIPNSTIAEVLSQN
PSNDPNFGPVALFSFNGANHNGISQMSRLGSNLFGFEDMVGGGDQDYNDLILQFDFLPA
Download sequence
Identical sequences P74440
gi|16331135|ref|NP_441863.1| gi|383326047|ref|YP_005386900.1| gi|383491931|ref|YP_005409607.1| gi|16331135|ref|NP_441863.1| 1148.slr0408 WP_010873164.1.11876 WP_010873164.1.1889 WP_010873164.1.18904 WP_010873164.1.33690 WP_010873164.1.35395 WP_010873164.1.47586 WP_010873164.1.99424 gi|383322878|ref|YP_005383731.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]