SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|113475561|ref|YP_721622.1| from Trichodesmium erythraeum IMS101

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|113475561|ref|YP_721622.1|
Domain Number 1 Region: 2275-2377
Classification Level Classification E-value
Superfamily Collagen-binding domain 4.45e-18
Family Collagen-binding domain 0.0027
Further Details:      
 
Domain Number 2 Region: 1568-1676
Classification Level Classification E-value
Superfamily Collagen-binding domain 6.28e-16
Family Collagen-binding domain 0.0025
Further Details:      
 
Domain Number 3 Region: 1450-1557
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0000000000000196
Family Collagen-binding domain 0.0023
Further Details:      
 
Domain Number 4 Region: 1687-1795
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0000000000000902
Family Collagen-binding domain 0.0025
Further Details:      
 
Domain Number 5 Region: 1232-1324
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000000196
Family Collagen-binding domain 0.0048
Further Details:      
 
Domain Number 6 Region: 2511-2628
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000000366
Family Collagen-binding domain 0.0032
Further Details:      
 
Domain Number 7 Region: 2048-2152
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000000732
Family Collagen-binding domain 0.0053
Further Details:      
 
Domain Number 8 Region: 384-469
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000000915
Family Collagen-binding domain 0.005
Further Details:      
 
Domain Number 9 Region: 1116-1197
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0000000000379
Family Collagen-binding domain 0.0018
Further Details:      
 
Domain Number 10 Region: 1336-1437
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000000392
Family Collagen-binding domain 0.0037
Further Details:      
 
Domain Number 11 Region: 738-829
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000000471
Family Collagen-binding domain 0.0014
Further Details:      
 
Domain Number 12 Region: 111-228
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000000981
Family Collagen-binding domain 0.0044
Further Details:      
 
Domain Number 13 Region: 630-710
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000109
Family Collagen-binding domain 0.0085
Further Details:      
 
Domain Number 14 Region: 509-589
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000144
Family Collagen-binding domain 0.01
Further Details:      
 
Domain Number 15 Region: 266-351
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0000000017
Family Collagen-binding domain 0.0063
Further Details:      
 
Domain Number 16 Region: 1806-1915
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000615
Family Collagen-binding domain 0.01
Further Details:      
 
Domain Number 17 Region: 1930-2037
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000942
Family Collagen-binding domain 0.01
Further Details:      
 
Domain Number 18 Region: 2420-2501
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000209
Family Collagen-binding domain 0.01
Further Details:      
 
Domain Number 19 Region: 2185-2259
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000275
Family Collagen-binding domain 0.0083
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|113475561|ref|YP_721622.1|
Sequence length 2632
Comment peptidase-like protein [Trichodesmium erythraeum IMS101]
Sequence
MREEVIGLEPSQNILGFELESKHQSVNMMFPVYSVQEPDVLEQVPMDEVDNQLPQIIVPS
KIINSSQLDLTTVASTPEDIDGKDPLTGLTTEEANIPPEKNAYTIKQRKARKKDPGNNVK
TALNLGRVGDDVITYNDNIGMKSGKARDKNDYYKFNLKGKENDVSIVVDGLKDNANLELR
NKNGKSVLFNSTQKGKKPETIEQELAQGTYFLRVYPKGNAKTKYELSISAEEILQDPDSK
SPGATNLGPLGKKKKVITDEIGFKEKGTRDNRDFYKFTLTEKLNNLNIVLDGLKDDANIQ
LLDEDGKTIIDQSRSKGKKKEIIDQILEGGTYYLKVTPTGNARTKYRLSLDGDKIDDPDG
TRTRAENLGTLGNKPIKKRDEIGFEVGGKRDQKDYYQFNLNRDSEVNFSLDGLNQNVDLK
LLGSKGNLLYSSTNKGKEVEKISTILDKGKYYALVEPMGSDRSEYTLSIDGDSKIQDPDA
KLPGKNLGKLKGKTISRIDDIGLKKKGFLDTSDFWRIELTKETDLTITLDRLSQNADLEL
YDKDGTTLIYSSKEKGKKPEEISSILDKGIYYIKVKSKGGSSSKYKLSVTGNTQIGEKDD
ALPGTNLGELGNQKIVKKDEIGFESSQNIRDVGDFYNFSLAEQRGVNITLDGLTGDANLF
LVGNDGGEIEVSNNKGKRNEIINEILQPGNYYIGVKPNGGKVKTNYNLEVTTNLQKDDFN
NRQNAQQLGTLNLQDQVTQNNRVGFKEGSLIDQADFYSFKLTETSDVNLSLDGLDGDANL
LLLDNKNTLDESTAKGLKAENIKKKLEAGTYYAAIFPVSGAKTDYELTLGVTQKTVDLSS
LKFEALDVQDGLKAGERLKVNYQVNNIGNTKADAFEVGFYLSKDEGIDKGDFLLGTAKVK
SLSGDKNTKKLAKQLTLPESGDNFWQGSGDYYLGMMVDVQDAIVENNELNNTGSQRVGVD
LSSDLIVSGFDVQEKTVNPGQKFNAKVTLNNAGGKLDDFRVGFYVSEDNEITTSDELLGS
KMVTSLGGKKSTTINKQLTLPVASVSGVKKYFIGAIADDQDVLREVNETNNVSKEKVSLS
PVPQDNAGNKLNKGRKVGVLGNKTQKFSDWVGDFYGVSEDTHDYYKIELDDRSRLNLNLK
GLSANANLYLYDSDGDLLQKSEESENKEEAIDIKLLPGTYYGLVWKDDGGNTNYDLEMSA
LPLEYPVPQITGWNRNTALNIGAVSATPIAKTEYVGNPHGFVDDLNDFYKFQVEGSGSQV
QIDLTGLNKNADLYLYNSESSSSIARSETLQKADESIVKNLSPGTYFIEVQSINNAKTNY
NLQVVGTPLPDNGAGENPDQALDLGTLGETIVNDWVGDIDGTDYYMFAVSENSTVDMSLT
GMSGNANLKLFDNKEKSLSSSSQEGNADDGIAINLTPSNYFVEVERYSGVVGTPYNFQIS
ATSRAEDLVGNDLETAKDIGALTTFNQTEWVGSFDVSDYYKFSLAEDSEVDLNLTGMNSN
GQLYLYDSEGEVLTSSRNDSNTDEAIITNLNPGTYYVGVWRYKLYSREATSVNYNLTASA
TTIPDSGGNTRETAGDIGDISTPKTLSNWVGNVDPNDYYKFTLTQDSDVSLNLTGMDDNG
QLYLYDSEGEVLTSSINDSNTDEAIITNLNPGTYYVRVWRYKSYSREATSVNYNLTASAT
TIPDSGGDTLETAGDMGDISTPKTLSNWVGSADPNDYYKFTLTEDSNVNLNLTGMDDNGQ
LYLYDSEGEVLTSSINDSNTDEAIITNLNPGIYYVRILRYKSYSREATSVNYNLTASATT
LPNSAGDDFETARDLGMVSAAQTIRDWVGDLQYDDYYKFSVAANSSINLKLAGLTANTGL
YLYDSKEKLIGSSDNDSNADESIVYNLASGNTYYVKVEGRSPYYGGDNTYYNLELSATPL
TYSPDNIVGNTEAQTKNIGALGATQSFTDFVGNQYVDRDKNDFYQFVLGETSTVNLNLGS
MTADADLVLLDSEYSQITESSQLTNVDEVIIRNLQAGTYIAGVKSIENGNTGYNLQLSAT
AHPDGAGDTVGTARLIGTLAGPQTFNDWIGNVDEKDYYQFVIDAVSTVNINLGGMTGNGN
IGLYDIEESPLLYSQNTENADDVIVADLNPGTYYVQVWNDRGNFGTNYSLQLSPTVRVDN
AGGADSPLDLGALTTINATNWVSPGIDEEDYYKFSVGENSAVSLNLTGLTDAINFNLYNS
AMEEIASAPNDENTETAIAKNLTPGNYLVEISNFWGNGTEYNFVATATPISDNAGNDFDV
ARNIGAVSATQAFSDWVGRIDRNDYYKFDIPQNSEVTINLTGLASDADLYLYDSQENEIV
SSTNDDTNNELITEKLPSGTYYALVWQSSGHTNYNLDFSATPLTYSAPNIVGNTEAQTKN
IGALGEAQTFTDFVGISQNIDSDKEDFYQFTLAEDSTVSLSLNGLSANADLTLRNNEYYW
LDDSTNFLNASENITRNLVAGTYIVEVESIGDASTGYNLQLSAVAHPDAAGNTFDTSKPV
ENLAEVQTFNDWVSNIDENDYYKFNLSEKRTVNIELSGLSDEASLYLYNSSGDTFSYDAN
GHYQGSSYVWKINSGEMGSISRLLNSGNYFVGVFYESYNGLGTNYSLSMSAT
Download sequence
Identical sequences Q114C5
203124.Tery_1893 gi|113475561|ref|YP_721622.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]