SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for D2BBA4 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  D2BBA4
Domain Number 1 Region: 607-795
Classification Level  Classification                          E-value
Superfamily           Fibronectin type III                    7.74e-32
Family                Fibronectin type III                    0.0011
 
Domain Number 2 Region: 115-217,277-364
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  0.00000000000000736
Family                Legume lectins                          0.017
 
Domain Number 3 Region: 974-1054
Classification Level  Classification                          E-value
Superfamily           Cadherin-like                           0.00000000173
Family                Cadherin                                0.03
 
Domain Number 4 Region: 794-881
Classification Level  Classification                          E-value
Superfamily           Cadherin-like                           0.000000014
Family                Cadherin                                0.041
 
Domain Number 5 Region: 894-968
Classification Level  Classification                          E-value
Superfamily           Cadherin-like                           0.0000000816
Family                Cadherin                                0.031
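The strong-hit regions above can be handled programmatically. The following is a minimal Python sketch, not part of SUPERFAMILY itself: the `DomainHit` class and its method names are hypothetical, and it assumes that region coordinates are 1-based with both ends inclusive, which this page does not state. The E-values are copied as printed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One superfamily-level hit (hypothetical container, not a SUPERFAMILY API)."""
    superfamily: str
    region: str    # region string exactly as printed on this page
    evalue: float  # superfamily-level E-value as printed

    def segments(self):
        """Split a region such as '115-217,277-364' into (start, end) pairs."""
        pairs = []
        for part in self.region.split(","):
            start, end = part.split("-")
            pairs.append((int(start), int(end)))
        return pairs

    def length(self):
        """Residues covered, ASSUMING both ends of each segment are inclusive."""
        return sum(end - start + 1 for start, end in self.segments())


# Values transcribed from the "Strong hits" list on this page.
STRONG_HITS = [
    DomainHit("Fibronectin type III", "607-795", 7.74e-32),
    DomainHit("Concanavalin A-like lectins/glucanases", "115-217,277-364",
              0.00000000000000736),
    DomainHit("Cadherin-like", "974-1054", 0.00000000173),
    DomainHit("Cadherin-like", "794-881", 0.000000014),
    DomainHit("Cadherin-like", "894-968", 0.0000000816),
]

if __name__ == "__main__":
    for hit in STRONG_HITS:
        print(hit.superfamily, hit.segments(), hit.length())
```

Domain 2 is the only discontinuous assignment here, which is why `segments()` returns a list rather than a single pair.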
 
Weak hits

Sequence:  D2BBA4
Domain Number - Region: 3148-3222
Classification Level  Classification                          E-value
Superfamily           Carbohydrate-binding domain             0.00314
Family                Cellulose-binding domain family II      0.016
 
Domain Number - Region: 1722-1796
Classification Level  Classification                          E-value
Superfamily           Carbohydrate-binding domain             0.0251
Family                Cellulose-binding domain family II      0.06
 
Domain Number - Region: 3274-3354
Classification Level  Classification                          E-value
Superfamily           Carbohydrate-binding domain             0.0523
Family                Cellulose-binding domain family II      0.031
 
Domain Number - Region: 3018-3093
Classification Level  Classification                          E-value
Superfamily           Carbohydrate-binding domain             0.0607
Family                Cellulose-binding domain family II      0.019
 
Domain Number - Region: 1852-1928
Classification Level  Classification                          E-value
Superfamily           Carbohydrate-binding domain             0.0732
Family                Cellulose-binding domain family II      0.019
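Strong and weak hits are listed by E-value rather than by position, so the order of domains along the protein is not obvious at a glance. The sketch below, in Python, sorts every region from both lists by start coordinate and reports overlaps between neighbouring regions. The labels and function names are hypothetical, coordinates are assumed to be inclusive, and only adjacent regions are compared, which is a simplification.

```python
# (label, start, end) transcribed from the strong and weak hit lists on this page.
# Labels are illustrative; the two segments of strong domain 2 are listed separately.
HITS = [
    ("strong 1 Fibronectin type III", 607, 795),
    ("strong 2a Concanavalin A-like", 115, 217),
    ("strong 2b Concanavalin A-like", 277, 364),
    ("strong 3 Cadherin-like", 974, 1054),
    ("strong 4 Cadherin-like", 794, 881),
    ("strong 5 Cadherin-like", 894, 968),
    ("weak Carbohydrate-binding", 3148, 3222),
    ("weak Carbohydrate-binding", 1722, 1796),
    ("weak Carbohydrate-binding", 3274, 3354),
    ("weak Carbohydrate-binding", 3018, 3093),
    ("weak Carbohydrate-binding", 1852, 1928),
]


def sort_by_start(hits):
    """Return hits ordered from N-terminus to C-terminus."""
    return sorted(hits, key=lambda hit: hit[1])


def adjacent_overlaps(hits):
    """List (label_a, label_b, shared_residues) for neighbouring regions that overlap.

    Assumes inclusive coordinates and compares each region only with the next one.
    """
    ordered = sort_by_start(hits)
    found = []
    for prev, cur in zip(ordered, ordered[1:]):
        shared = min(prev[2], cur[2]) - cur[1] + 1
        if shared > 0:
            found.append((prev[0], cur[0], shared))
    return found


if __name__ == "__main__":
    for label, start, end in sort_by_start(HITS):
        print(f"{start:>5}-{end:<5} {label}")
    print(adjacent_overlaps(HITS))
```

With the regions on this page, the only neighbouring pair that shares residues is strong domain 1 (607-795) and strong domain 4 (794-881), which have positions 794 and 795 in common under the inclusive-coordinate assumption.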
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) D2BBA4
Sequence length 3911
Comment (tr|D2BBA4|D2BBA4_STRRD) Uncharacterized protein {ECO:0000313|EMBL:ACZ84127.1} KW=Complete proteome; Reference proteome OX=479432 OS=NI 9100). GN=Sros_1128 OC=Streptosporangium.
Sequence
MSEWPGVRAGTRPGRRGRDVVGAVAWTPRHRRRVARIMALALTGSALTGTQAAAAAAAGT
LLFSQPFRNNTANGTGAVVLPALPSGTGTTNFACLTASGNTSTGVLRSCTTSTDSAGSGK
LRLTNATTSKAGGVFSATSVPTSQGLDVTFNTYQYGGGGADGITFVLAAVDPANPQSPAN
IGQLGGALGYSANGGSPGLAYGYLGIGFDVYGNFSNSTYQGSGCTNPAYIGTGSVRVPGQ
VLVRGPGNGTVGYCALNSTATSTSSSALALRASARTAVPVQIGINPTSSVLTTAAGLAVP
ANSYRMVVTLVGGATRTLTGTLPSVTSGLYPSSWLNASGIPRQLAFGWVASTGGVTDFHE
IDEAAVSTISAVPELTVAQTGYIASTLAPGDPVTYNVVAGVAAGLPETSPVSITQTMPAG
TVPVGAYGTGWVCDAPSGRSITCTNGNGPFAAGAALPALTVVGIVTGGNVTPALVQSATV
ATASSIDASPAYSSSTTAGTLPAAPGGIAVSPALGSIAGGNTVTVSGTNISNATAVEIGT
TAQQEAGTPVVLLPCAAGVTTGCFTINANGTLTIPSMPARATNGAVNINIVTRGLDAAAT
YTYVASPGTPTTPTAVAGVTSATVSWTAPAGNGGAITGYIVTPYRNGVAQTPVSFDASAT
SRTLTGLTADVPYTFTVAAVNAIGTGSAGPASNPVVPYNVPGRPVITAATAGTSSATLTW
TAPAGNGSAITGYVVTPYVNGVAQPTQTFNSAATTQSVTGLTPGTAYTFTVTAVNAAGPG
QPSEPSATATPNSPPAFTFPAPPAGEVGAAYSVPLTVSAGTAPYTWSVGAGSLPPGLTLN
ASTGVLSGTPTAAGGYSFTARVTDAGNVSTTREVTLVIAPRPAFTFPAPPGGEVGVAYSV
PLTVSGGTAPYTWSVGAGSLPPGLTLNASTGVLSGTPTAAGGYSFTVKVLDAQNQSDTTA
VSLTIVPQPAFTFPAPPAAQVGVTYSVPLTVSGGTAPYTWSVGAGSLPPGLTLNASTGEL
SGTPAATGSHPVTFRAVDANGQATTRAVTLVVTSGPLVVVKTASASSAVAGGTVGYTITV
NNTGPSAFTGVTVNDALAGILDDAAYNGDAAATAGAVSFAAQTLTWTGDVAAGTTVTITY
SITVNSPGTGNKVLANAVTSPTVGSTCPAGGGDPRCSATVTVAGLSIVKTADVTTATPGG
TVRFTVTATNNGQTPYTGATFGDALAGVLDDAVYNGNATATSGSLSFSGSTLTWTGNLAV
GASTTVTYTVTVRNPDPGDRSLAGTVLSGTPGSTCPQGNPGPQCTAVVTVLVPALAITSS
ADATTTTPGSVVRYTFTASNTGQTPYAGTSFTTSLVGALDDAAFNGDLAATSGSAVLNPD
GTITWTGDLAVGAAVTVTGSVTVKSPDNGDRVLRTSVTSGAPGSTCPVGNQSPACLTGVS
VLVPGLTITKTADVSATTPGSVVRHTIAVTNSGQTPYTAATVADALAGVLDDATYNADAA
ATSGSVGYAGSTLTWVGDLDVGASATITYSVTVRDPDPGDMTLTGTVSSPTTGSNCPAGS
GDSRCAGSVTVLVPQLTITTATGGATTTPGAVVPYTVTLANTGQTPYTGAGARFVIADVL
DDATYNGDLTTDAGSLSVAPDGAILWAGDIAAGATVTITGSVTVHAPVTGDKVLRTSVTS
AAPGSTCPVIGATSPGCFTVVTVLVPALTITNTADTQSATPGDTVTYTITVANTGETPYT
GARVTESLTRVLDDAVYNGDAAATTGTVTFAGTDLSWSGDLAVGASATITYSVTVRDPDP
GDRQIAAVVISPTQGGNCPAGGTDPRCAAAVAVLVPELTISKSADATTAAPGSTVQYTVT
VTDSGQTPYTGATVTDLLAGVLDDAVYNGDAAATTGTVGVAGTDLSWSGDLAVGASATIT
YSVTVRDPDPGDALLTSTAVSPARGSNCQAGSTDPRCTVSVPVARLVLEQGYTRTGAAPG
SVVRLNATFTNTGQVPYTGIRVFSASGDTVDDAIPNGDQVADSGTLVLDAQGITWTGNIP
VGGVVNITGTLTLKNPPTGDRTLTGTLVSEAPGTTCPPGGSDPRCTSRLDVLVPGLTITK
AADTAATVQGGTVGYTVTVTNSGQTPYTGAAFTDALAGVLDDAVYNGDAAATTGTVGVAG
TDLSWSGDLAVGASATITYSVTVRAPDPGDRSLTGTVSSPTTGNNCAPASGDPRCTSSVI
VLIPALTITKSVTPTTAVPGSTLTYTITAANTGQLPYTGAAFTDALAGVLDDAVYNGDAA
ATTGTVTFAGTDLSWSGDLAVGASATITYTVTVDNPVTGDRNLASTITSATPGTTCPAGG
TDPRCGTGVPVTQATTLTFDKSADTRSVAQGEVVTYTITISNSGLIPYNGAAFTDSLAGV
LDDAAYNGDAAAGTGLVSVAGPLLSWTGNVPANGSTTVTYSVTAGTPGTGDDILTSTLVS
PSPGGNCEAGGGDPRCAATVTVARLSIVTTADAPTTEPGDVVRYTTVMTNTGQTPYNGTS
VLFNGYGGLDDAVPGGDQVATSGSLSLGLDGLTWTGSIPVGGSVTLTGSVTVNNPDLGDR
VIPLTVVSAAQGSTCPVATAPGCTVIVNVLIPELTITKAADRNAAVPGGAVAYTITIANT
GQTPYTGATATDSLAGLLDDAAYNGDAAATTGTVGFAGQTLTWSGDLAVGATATVTYSAT
ADTPDVGDKLLTNSVVSTEAGSTCPPASANAACSARVVVLTPALTIVKTADRASATPGDT
VTYTVNVTNTGQVPFAAADFADALAGVLDDAVYNGDATATTGTVTFAGQALGWTGGLAPG
QGATVTYSVTTGSPGTGDQRLTGEVTSTTAGTTCPAGGTDTRCSNTVLISRITITASADV
ATAIPTGVVHHTVTIANTGQTPYGSAVVDGLLADVFDDAAYNGDGTASAGNLTFVPGSGQ
ARWEGPLAVGDTVTVTFSVTVRNPDPGDKVMNAVMTSGTPGNNCPAGSPAPACASAVTVL
TPVLAVSKSADRSTVTPGGTIAYTITVANTGQAPYTGATVTDRLTRVLPDAVYNGDAAAT
AGTVTFAGSDLTWTGDLAAGASATITYTVTVRDPDPGDKQIVNRAFSDTLGSTCPSTGSV
PACTTLVTVLVPALRIVKAANTVVATPGETVGYTVTVTNTGQTPYTGATVADALAGVLDD
AVYNGDAVATSGTVTFAGSDLTWTGDLAAGASATVDYSVTVDIPDTGDRLLTGAATSNAP
GSTCPAGTTDPACVSTVTVLIPGLAVSTVADRATTTPGGTARYTVTIANTGQTAYSGISV
SDVLTEVLDDAAYNGDATATAGTVVFSGPVLTWTGDLATGETVTVAYTVTVADPDTGDKV
MTGTVASSAPGSTCPVGSTAPACGATVTVLIPALDIVKTAGAPATVPGGTVGYTITVTNS
GQTPYTGASVADSLQGLLDDAAYNGDAAATTGVLAYAEPVLTWTGDLAVGASATITYSIT
ANGTATGDKTLTNVVTSDAPGSTCPAQGTAPACSTLVRLLVPELTIVRSADRATVVAGGT
VRYTITVTNTGETGYPGATVTDRLAGALDDAVHNGDAVATTGVLAYAEPELTWTGDLAVG
ATVTITYSVAVAYPARGDRLLSGTVVSAVPGSTCPAGGTDPRCTATATVLVPALGITKTA
DTGGEVVAGGTLRYTVVVTNTGEAPYDAATVTDRLAGVLDDAVYNGDAVATTGVLAYAEP
ELTWTGALPVDASAVVTFSVTVADPATGNAELDNQVTSTTTGSTCPAGGTDPRCSVVTSV
AATSMTLTGATEDFTLTGPPNTTVRGEDVVTMTVVTNSVDGYTVTARAAAAELSPAQPGV
TVGIPVANLRVREHGTSTFRSLSTTDPVLVYDKPLPSAPGGDGISNDYEVDIPFVPTGRY
TVTIDYVATAR
Identical sequences D2BBA4
WP_012887872.1.100029 479432.Sros_1128 gi|271962674|ref|YP_003336870.1|
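
The Comment entry in the protein sequence table packs several fields into one line: a `db|accession|entry name` prefix followed by two-letter `KEY=value` tags. A minimal Python sketch for splitting it is shown here. The function name and regular expressions are assumptions made for illustration, not a documented SUPERFAMILY or UniProt parser, and the comment string is used exactly as printed, including the truncated OS value.

```python
import re

# Comment line copied verbatim from this page (the OS value is truncated in the source).
COMMENT = (
    "(tr|D2BBA4|D2BBA4_STRRD) Uncharacterized protein "
    "{ECO:0000313|EMBL:ACZ84127.1} KW=Complete proteome; Reference proteome "
    "OX=479432 OS=NI 9100). GN=Sros_1128 OC=Streptosporangium."
)


def parse_comment(comment):
    """Return ((db, accession, entry_name), {tag: value}) for a comment line.

    Hypothetical helper: assumes a leading '(db|accession|entry)' prefix and
    two-letter uppercase tags of the form 'XX=value' separated by whitespace.
    """
    ids = re.match(r"\((\w+)\|(\w+)\|(\w+)\)", comment)
    if ids is None:
        raise ValueError("no (db|accession|entry) prefix found")
    # Each value runs lazily up to the next ' XX=' tag or the end of the line.
    fields = dict(re.findall(r"\b([A-Z]{2})=(.*?)(?=\s[A-Z]{2}=|$)", comment))
    return ids.groups(), fields


if __name__ == "__main__":
    identifiers, tags = parse_comment(COMMENT)
    print(identifiers)
    for key, value in tags.items():
        print(key, "->", value)
```

The values are returned untouched, so trailing punctuation such as the full stop after the OC value is preserved rather than guessed away.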
