SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for gi|113476963|ref|YP_723024.1| from Trichodesmium erythraeum IMS101

Domain architecture
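
The order of domains along the protein follows from the regions of the strong hits listed under "Domain assignment details", which are ranked by E-value rather than by position. As a minimal sketch, assuming those twelve regions are transcribed correctly from this page, the hits can be sorted by start coordinate and consecutive hits of the same superfamily collapsed. The names `STRONG_HITS`, `architecture`, and `covered_residues` are illustrative only and are not part of SUPERFAMILY.

```python
from itertools import groupby

# (superfamily, start, end) for the twelve strong hits, copied from the
# "Domain assignment details" listing on this page (1-based, inclusive).
STRONG_HITS = [
    ("Integrin alpha N-terminal domain", 2878, 3170),
    ("Collagen-binding domain", 505, 612),
    ("Collagen-binding domain", 990, 1089),
    ("Collagen-binding domain", 624, 731),
    ("Collagen-binding domain", 862, 969),
    ("Collagen-binding domain", 743, 850),
    ("Collagen-binding domain", 386, 493),
    ("Collagen-binding domain", 267, 374),
    ("Collagen-binding domain", 147, 255),
    ("Collagen-binding domain", 1100, 1203),
    ("Cadherin-like", 2790, 2876),
    ("Cadherin-like", 2690, 2776),
]
SEQUENCE_LENGTH = 3193  # from the "Sequence length" field


def architecture(hits):
    """Return [(superfamily, count)] in N- to C-terminal order."""
    ordered = sorted(hits, key=lambda hit: hit[1])
    return [
        (name, len(list(group)))
        for name, group in groupby(ordered, key=lambda hit: hit[0])
    ]


def covered_residues(hits):
    """Total residues inside the hit regions (the regions do not overlap)."""
    return sum(end - start + 1 for _, start, end in hits)


if __name__ == "__main__":
    for name, count in architecture(STRONG_HITS):
        print(f"{count} x {name}")
    print(f"{covered_residues(STRONG_HITS)} of {SEQUENCE_LENGTH} residues covered")
```

Read from N- to C-terminus, the strong hits give nine Collagen-binding domains, then two Cadherin-like domains, then one Integrin alpha N-terminal domain. The weak hits are left out of this sketch.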


Domain assignment details

Strong hits

Sequence:  gi|113476963|ref|YP_723024.1|
Domain Number 1 Region: 2878-3170
Classification Level Classification E-value
Superfamily Integrin alpha N-terminal domain 2.35e-34
Family Integrin alpha N-terminal domain 0.0056
 
Domain Number 2 Region: 505-612
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0000000000000981
Family Collagen-binding domain 0.0029
 
Domain Number 3 Region: 990-1089
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000000000111
Family Collagen-binding domain 0.003
 
Domain Number 4 Region: 624-731
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000000000144
Family Collagen-binding domain 0.0029
 
Domain Number 5 Region: 862-969
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000000017
Family Collagen-binding domain 0.0028
 
Domain Number 6 Region: 743-850
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000000000235
Family Collagen-binding domain 0.0029
 
Domain Number 7 Region: 386-493
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000000000314
Family Collagen-binding domain 0.0038
 
Domain Number 8 Region: 267-374
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000000000654
Family Collagen-binding domain 0.0036
 
Domain Number 9 Region: 147-255
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000000418
Family Collagen-binding domain 0.0044
 
Domain Number 10 Region: 1100-1203
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0000000000222
Family Collagen-binding domain 0.0078
 
Domain Number 11 Region: 2790-2876
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000458
Family Cadherin 0.011
 
Domain Number 12 Region: 2690-2776
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000903
Family Cadherin 0.0076
 
Weak hits

Sequence:  gi|113476963|ref|YP_723024.1|
Domain Number - Region: 1216-1300
Classification Level Classification E-value
Superfamily E set domains 0.00035
Family E-set domains of sugar-utilizing enzymes 0.055
 
Domain Number - Region: 1578-1678,1741-1776
Classification Level Classification E-value
Superfamily WD40 repeat-like 0.0122
Family WD40-repeat 0.027
 
Domain Number - Region: 2592-2682
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0871
Family Fibronectin type III 0.0043
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by the dcGO Predictor.


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) gi|113476963|ref|YP_723024.1|
Sequence length 3193
Comment YD repeat-containing protein [Trichodesmium erythraeum IMS101]
Sequence
MNGALSGVKVVILNSIEDGVEQITKVIGKYLHLSSVHIVSHGRPGCLFLGNGQLSLDNIN
NSYTSDLEGWSVSNLLLYGCNVAAGDGGREFLERLHGLTGANIAATAGLTGSLALGGDWE
LEVNAGNVDLRLPFREDAIATYSHVLSDNTKDTAKSMGNLGPIQIFEDWVGSSDRYDYYQ
FNLLQNSIVNFNLSGLDEAAYIDIYDQNNESLRSQYFSGQDTDEPLNFNLNSGTYYARIS
NSSGWYSVTNTPYQLEASAIEIADRAGDTEEDPKDIGNLNEEQTITDWVGDIDNYDYYQF
NLQENSIVNFNLSGLDEAAYIDIYDQNNQSLKSQYFSGQDTDEPLNFNLNSGTYYARISN
SSGWYSVTNTPYQLEASATEIADGVGDTEEDAKDIGNLNEEQIFTDWVGDIDNYDYYQFN
LQENSIVNFNLSGLDEAAYIDIYDQNNQSLKSQYFSGQDTDEPLNFNLNSGTYYARISNS
SGWYSVTNTPYQLEASATEIADRAGDTEENAKDIGNLNEEQTFTDWVGDIDNYDYYQFNL
QENSIVNFNLSGLDEAAYIDIYDQNNQSLKSQYFSGEDTDEPLNFNLNSGTYYARISNSS
GWYSQTNTPYQLEASATKIADGVGDTEENAKDIGNLNEEQTFTDWVGDIDNYDYYQFNLQ
ENSIVNFNLSGLDEAAYIDIYDQNNQSLKSQYFSGEDTDEPLNFNLNSGTYYARISNSSG
WYSQTNTPYQLEASATKIADGVGDTEENAKDIGNLNEEQTFTDWVGDIDNYDYYQFKLQE
NSIVNFNLSGLDEAAYIDIYDQNNQSLKSQYFSGEDTDEPLNFNLNSGTYYARISNSSGW
YSQTNTPYQLEASATEIADRAGDTEEDAKKVGNLNQKLTITDWVGDIDNYDYYQFSLQEN
SIVNFNLSGLDEAAYIHIYDQNNQSLKSEYFSGEDTDEPLNFNLNSGTYYARISNSSGWY
SQTNTPYELEMSAVSIADNSGNTIGEAKDVGILKGNKTFQDWVGDIDRYDYYQFELKEDS
RINFHLSGLDDAAYIHLYNQNNESLRSEYFSGQDTDEPMNFNLDSGTYYARISNSSGWYS
QTNTPYNLEMSAVEIVDKAGNSFNTARDLNILVGNHSFNDFLSDIDTFDYYRFELNKDSS
FELEIENLDRNAGVILYDNHGQVIKSVTATSWNGASFNQGLEAGDYFVRVANFGDDNFYT
LNLEATPNEEELPLKISDITPETGSNSGETTISLQGSSFSLASQVSLVGNGSSTNADEIV
WQNERNLQATFDLGGLPPGKYQVKVTDEGETATAQETFSVNEVPQGELTINLNVPERVRP
WWQGDVVIHYTNDTDSNFPAPLLNLVAEGAELQDPFTGEWTDEPVQFLAINSQGVAGVLP
PGAKDSFNILVKPTVGVGEQINFTVNAVTPDEEVDWNSFKERLRPDNVPDSAWDEVWNNF
VASIGNTAADYERVLAENATTLSKLGEYTNDAGRLLGFELQQASNNQDIVVRYRLGSLGR
GQTFFGDIRAVLDEDGNVAIENGPSRLVFESQDDGSFASPPNITMVLTKVEDVYQLRQSD
GTIIRFRSNGRLDYLENTNGDRLQTTYTNGHLTSWENSNGEKTDFVYNSQGRIIRTTDSS
GWVTNYGYDGTGEKLLSVTTPDGTVSYTYNDSFAITSVTDIDGTKSLFKYDGQGRLIEES
WGDGSEKVEYTYRDDGSVQVRDANGATTNMLLNDRGQVGQMTDALGRQTQMRYDQNGNLS
QVVAPDGSVTGFIYCGCGSLLAQTGADGNTTNFEYEPNFNQLSVVTDAKDNQLKYSYDDR
GNATDIIYADGSRDRFKYDSEGNLILTTNRREGQVGYSYDDVDRLIHKQFGNGDSLNYEY
DDRGNLIKVTDESGETVMVYDDADRLTKITYPTERWLEFTYENNRRSSMVDDSGGIVKYG
YDGVGRLRKLTDGENELIVQYSYDNIGRLAKEENGNGTATVYEYDDVGQLVKLTNFDADN
QVNSQFEYSYDDLGRRSTAKTLDGDWEYGYDAVGQLISAKFDSSNSEIPDQNLSYTYDAV
GNRISTKVNGQNTDYETNNLNQYGQVGDIEYEYDADGNLIEKTEGGNVWQYEYNVENRLV
KVVEPDGIETEYEYDVLGNRIATVYDGNRTEYLVDPFGFGNVVGEFQDGDLVARYVHGLG
LVSRVDSDGEGNFYDFNAIGSTVGLSDGQGDYVNRYHYAPFGKDVFEQEQVANQFEFVGQ
FGVMEEANGLDFMRARFYDGETGRFVSMDPIGLNGGDENLYRYVGNGPVDYIDPEGLKSK
DKNDSIKRMIERAKKLREREKKYLGNGGKSGLLTRGAAIDMVNDKNLKDSERGKRAKDLD
FDSADKRAGTGAEFLSDATGIFAPTTPIGTPGYIIGLCITGACNTVERQELPIGGAVDPN
DIIGPAGVGEENWLTSPQTLPYTIRFENDAEKADAPAVFVTVTQQLDSDLDWNSFELGNF
GFGDINIEIPPGYQNYTERIDLTETIGYFVDFNAEMNAETGAVKYTLETIDPETGEYPTD
FDAGFLPPNVNPPEGDGFISYTISPGQDVQTGDVIDAEASIVFDTNDPIATPPVFNTIDI
TAPTSKVEELPETTSGAEIEVTWTGEDQGSGVATYDIYVSENGGDFELWLDDTTETSATY
TGEIDKTYAFYSVATDKVGQVETTTAQAQATTKVKESNKAPTALELDKETIDENVAANSV
VGTFSTTDPDPGDTFTYALVAGEGDTDNQTFTIEGNQLKINSSPDFETKPTYNIRVQTTD
RNVASYQEQLTINVNDINEAPTDLDLGNKNIDENVAPNSVVGEFYTVDPDSGDSFTYELV
TGEGDKDNQAFTIEADQLRINDSPDYESQSSYNIRVQTTDRNVASYQEQLTINVNDIDES
EPVQFDFNADGVADILWRHKSHKNGPNRIWLMNDDGTRNQTVNPGSFGSAWDVAGVADFN
TDGVADILWRHQSHKNGPNRIWLMNDDGTRNQTVNPGNFHSAWDVAGVADFNTDGVDDIL
WRHQSHKNGPNRIWLMNDDGTRNQTVNPGSFRSAWDVAGVADFNADGVDDILWRHQSHNN
GQNRIWLMNDDGTRNQTVHPGILGLAWDVAGVADFNADGVDDILWRHQSHNNGQNRIWLM
NDDGTRNQTVHPGILGLAWDVAGVADFNTDGVADIHWRDQSGANRIWLMNDDGTPNQTVN
PGGFGSAWDVAGM
Identical sequences Q10YX3
203124.Tery_3459 gi|113476963|ref|YP_723024.1|
