SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|111225961|ref|YP_716755.1| from Frankia alni ACN14a

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|111225961|ref|YP_716755.1|
Domain Number 1 Region: 617-801
Classification Level Classification E-value
Superfamily Fibronectin type III 4.84e-33
Family Fibronectin type III 0.0013
Further Details:      
 
Domain Number 2 Region: 122-245,308-376
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.0000000000165
Family Legume lectins 0.034
Further Details:      
 
Domain Number 3 Region: 817-891
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000267
Family Cadherin 0.04
Further Details:      
 
Domain Number 4 Region: 989-1064
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000534
Family Cadherin 0.042
Further Details:      
 
Domain Number 5 Region: 897-979
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000691
Family Cadherin 0.035
Further Details:      
 
Weak hits

Sequence:  gi|111225961|ref|YP_716755.1|
Domain Number - Region: 1728-1800
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 0.000816
Family Cellulose-binding domain family II 0.014
Further Details:      
 
Domain Number - Region: 2637-2713
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 0.00121
Family Cellulose-binding domain family II 0.031
Further Details:      
 
Domain Number - Region: 1339-1411
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 0.00184
Family Cellulose-binding domain family II 0.014
Further Details:      
 
Domain Number - Region: 2110-2184
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 0.046
Family Cellulose-binding domain family II 0.01
Further Details:      
 
Domain Number - Region: 1469-1547
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 0.067
Family Cellulose-binding domain family II 0.023
Further Details:      
 
Domain Number - Region: 2772-2845
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 0.0732
Family Cellulose-binding domain family II 0.012
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) gi|111225961|ref|YP_716755.1|
Sequence length 3144
Comment hypothetical protein FRAAL6628 [Frankia alni ACN14a]
Sequence
MLLKVIDYKLYVMNGVRGTRRRSPRKMAAIRGGRAFKALVIGLVSLFLSGSISALMPTVA
YAAGTLLFNQPFHNNTPDGLGSVALPLPPVSATGGNQACLTASGNSNTGVLRSCTSSNDA
QGAGKLRLTANATSLQGGVFGATSVPTSQGLDVTFNSYQYGGSSADGIAFVLAAVDPAAP
TAPARMGQLGGGLGYSAWLAAGLSGLSTAYLGVGLDVFGNFSNTTYQGSGCTNPAYISTT
GAQVKGQVVVRGPGAGTVGYCAVNSTATTTSSPVVPLHAATRAASVVPVEVVVNPTAQTI
VTDSGVSATPGSYKVIFTPVGGTSRTLSGTLPTVPSGLYPSSSWTTPAGIPRQLAFGWVG
STGGSTDFHEIDNTRVVTFNPVPQLNVAATSFTQTTLAPGDPVTYSVTAGVSAGADESLP
IAVTQTMPAGVVPVGAFGSGWVCQAPSGQTITCTNSNGPFTNGTSLTPITVVAIVTGASV
TPAFVRSATTTTASASDANPGFGAAATAGTIAATPTGIAISPTSGSISGGGAATVTGTNI
TGATAIEIGTTAQQQAGTPVVLLPCQSGPAAGCFTVNANGSLAISSMPARTSAATVGVTV
VTVGVAGVTSYVYTSAPATPVAPTATAGVTSATVTWVAPASNGSTIIGYVVTPIRNGVAQ
TPVSFDASTTTRTLTGLTAAASYTFTVAAVNAVGTGAASPASAAVVPYALPAAPTITAVS
AGSTSANLSWTAPASNGSAITGYVVTPYIGGVAQTAQTFTSTATTQSITGLTGGTTYTFR
VAAINAAGTGPQSAASTAVTINVSPSLALPAPPLGEVGAAYTDQFTVSGGTAPFTWSIST
GSLPPGLSLNAATGLLSGTPTTAGSYPFTVRVADASGQAATQSLTISIASAPTLPFPPPP
AGEVGVGYSNQLTVSGGTSPFVWSVSAGSLPPGVTLNSSTGLLSGTPTAAGTASFTVRVV
DAFGQAVTKSVSLVIVPRPNLAFPAPPAGQVGVAYSNTLVVTGGTAPFTWSVSAGSLPPG
LTLNSSTGVLSGTPTTAGSTPFTVQVSDAFGATDTQAVTLTVGSGPIVIVKSSNATSAAP
GGVVTYTVTATNTGAAAFSGVTFTDALAGVLDDATYNADATATIGAVAFTSPNLTWTGNL
AAGAAVTVTYSVTVNNSDTGNQILSDTVTSPTVGTTCPVGGTDPRCSTAVTVSVLTITSA
STVTTAEPDQVVGFTYTAVNDGQTPFPNATFSVPFANVVDDATYNQDGATTTGQIANVGG
SLVWTGSLAPGASVVVTLSLTVKNPDTGNKVLTTIVSSATQGSTCPVGNALPACTSTVPV
LTPGLAITNTADVSTVTPGGTVTYTVSLTNTGETAYTGTTVTSSLAGVLSDATYNADATA
STGTVAYSAPNLTWTGSLAAGASATITYTITVLDPDPGDKLLVNTVTSPAIGSNCPVGGS
DSACTAVVQVLVPDLTIAKTASSATTTPGGVVTYTVTVTNSGPTPYTGASFTDSLSGLLD
DATYNADATATSGTVGYTAPAVTWLGDLAPSASATITYSVTTHSSLTGDAILTDTITSPT
VGSNCPTGGTDARCTVAVPVSQLIFNSSFGSPTATPGSVVGLNITFTNTGQTAYNGITVG
FNGTGITDDAVGNGDQTASSGTISVNPGQGAIWRGDIPVGGVVTLASTVTVKNPDPGNLV
MTLVTQSAAPGSNCPAGSSDPRCTATANVVVPGLTIATSANAATVQPGDTVDYTVTVTNS
GQTPYTGVTVTDALAGLLDDAVYNGDVSASSGAATYTAPTISWTGNVALGAVVTITFSVT
VLDPDPGDKILASAVTSEAVGSSCLPAGGNPACRSSVVVLTPALTIEQTADENNAVPGQV
VVYTVTVTNSGQTAYPAATFSNPLSAVLDDATYNADVTASTGTPSFAGATLSWTGSLNPG
ATATITYSVTVRNPDPGNESLASTIVSTTGGSNCPSGGSDPRCTVVLPVVAAALLTFTKE
ADAPSVAAGGTVHYTVTVANAGLTPYLGAAFTDDLTDVLTDATYNADAVASAGIVSYTAP
VLSWTGDVPASGSVTITYSVGVTGPGTGDDILVDSVASASVGSNCQAASTDPRCTATVTV
SELTFAESANVTSTTPGGIVTFTTTFTNTGQTPYTGITASLVGDDVVDDSSPYGGQSASS
GTMVVGATGLQWTGNIPVGGTVTIIGSVQVNDPDTGNRVLKGSIVSDAPGSNCPTGGTDP
ACFESVPVLLPGLTMTTATNVNATVPGGVVTYTVTITNSGETPYTGATVTNDLGGALDDA
VYGGNATASAGTVSFASPTVSWSGDLAVDEVVIVTFSVTVRNPDPGDKILASVLASTDVG
SSCLPASGSTGCGHNVVVLTPALTIVKSASAATTVPGGTVTFTIAVTNSGQTPYSGAVVS
DALTGVLDDATYNNDAAATVGTVGFASPTLTWTGNLNPGSSTTITYSVTVLTPDTGDAKL
ANSVSSTVAGNNCATGSTDSRCSASVLVSELVTTFTADTSTAIPTQTVLFTLTTTNTGAT
AYTNAEVNAAFLGLLDDATYNADAVASSGLLQLNLTTGQLNWVGDLAIGQTLTITGSVTV
NNPDTGDRTMTAAASSPTAGSNCPAGNTNPDCSATVTLLAPRLAIAKAADVTTVTPGGTV
TYTITVTNNGESAYQGASVSDNLTGLLADAVYNADASATSGTVTYAAPTVTWTGDLAIGA
SVTISYSVTVNDPDLGDRTLNNAVTSPAIGSTCPTDGSGGLGCRVAVPVLVPALTITKAA
ATGTGNATVVAGSAISYTVTVVNTGQTPYTGASFTDDLADVLDDAAYNGDATASTGTVAF
TSPDLTWTGNLALGATATITYSVTTALPANGDHVAINAVASTTAGSTCLTGSADAACTTT
TAVLVPALAITKTVDQTSAVVGSTVQYTITATNNGQAAYTGATITDSLATVVNNATYNAD
AAASAGTVTYAAPTLTWTGNLAVGAGVTITYSVTVDDAATAGADLVNRVASTAAGSTCTG
TGTEPACTTATAITAQSLALTDLTPAFTLTGEPNSSVVQNGAVTMTVTTNSTDGYTVAVQ
ATSPTLTGQTAGNGDSIPVSSLLVRQSGTSTFTPLSDTVAVPVYSKGQPSAPGGDAVSND
YAIDVPFVASDTYSTTLDYIAASQ
Download sequence
Identical sequences Q0RBD5
gi|111225961|ref|YP_716755.1| WP_011607665.1.38265 326424.FRAAL6628

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]