SUPERFAMILY 1.75 HMM library and genome assignments server

Superfamily is undergoing a server migration - you are now browsing on the new server. Please contact us if you experience any problems.

Domain assignment for gi|428775225|ref|YP_007167012.1| from Halothece sp. PCC 7418

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|428775225|ref|YP_007167012.1|
Domain Number 1 Region: 1131-1416
Classification Level Classification E-value
Superfamily C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase 0.0000000000000262
Family C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase 0.03
Further Details:      
 
Domain Number 2 Region: 734-905
Classification Level Classification E-value
Superfamily Lysozyme-like 0.00000000332
Family Phage lysozyme 0.025
Further Details:      
 
Domain Number 3 Region: 380-431
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 0.000000863
Family Pre-dockerin domain 0.016
Further Details:      
 
Domain Number 4 Region: 163-255
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000419
Family Cadherin 0.0048
Further Details:      
 
Weak hits

Sequence:  gi|428775225|ref|YP_007167012.1|
Domain Number - Region: 2036-2134
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000981
Family Collagen-binding domain 0.017
Further Details:      
 
Domain Number - Region: 2337-2434
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0034
Family Collagen-binding domain 0.0093
Further Details:      
 
Domain Number - Region: 2934-3031
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0068
Family Collagen-binding domain 0.018
Further Details:      
 
Domain Number - Region: 3133-3230
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0126
Family Collagen-binding domain 0.019
Further Details:      
 
Domain Number - Region: 2235-2333
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.0157
Family Collagen-binding domain 0.014
Further Details:      
 
Domain Number - Region: 2735-2832
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.017
Family Collagen-binding domain 0.02
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|428775225|ref|YP_007167012.1|
Sequence length 3389
Comment cadherin [Halothece sp. PCC 7418]
Sequence
MGITENRARVAAGLDPLFTDRELPVVLAANNGVTWDIDGNGEVSALTDGILVIRFLAGFT
GEALISGAVATEGATRTEAEAILSYLESAENLLDVDGNGTTAALSDGILAIRAFAGFTGE
ALIEGAIGDGATRTDDSAITAHIESFLPSAEDTTSPEVTADQVLSFQENLEQGSPIATVE
ASDNREVVGFAITGGNEDGFFTIDENGVLSLTATGVTSPANDFETSPNTFSLEITAEDAA
GNVSTAETVTVQVNDDTADNIEPVQISPTNGEEMVALTRETIVRFGQEIDPATINEESIF
LIANGEEIPGRIQVSSTNEFATFFYDDPLPASTEVRAVVDGDEITLTDGTALDADGDGTP
GGRETADFTTLPVTPIAGTDVFGYVFDSYNQNPDGSNIPLQGVNIRLDALDIEATTDETG
FFRLEDVPAPEFFAYIDGSEAVPVANEGVEIQSTQYASLGKPFHSVPGEEVQLSMEGEVM
DIFLPQMAVEDQVELSPDQDTDVSFGETSLQLLQQTVPDADPEMLAQTTVTFPAGSAQDD
EGNLATRAQIVPVAPDRIPGSLPPGQDPELVISVQAGGENGFNREAQGGATNFDVPAPVE
FPNMEGLEPGETSLLWSFNHDAGEWEVNGTMTVSEDGQTIVSDEGVGIQAPGWHFTNPGS
PVGPGNPTGGDGSGDGNGDGSGDGSGDGSGDGSGDGNGDGSGDGSGDGNGDGSGDGSGDG
DGGNDPAEEQIDIDFISNEEGGQQTEGYIPDAPGNMSGVTIGTGVDLGGLDIDELDIPDN
LKDKLRPYEGKQGEEAEEFLNENPLSINEQEADTLDRAVKDEIFSDLQRRYNQNSDIPFN
QLPPEAQTVIADVATQYGPNLENRTPNFWNEVTEQRWEDAINNLRDFGDDFPSRRNREAD
LLEDALAQGNIDQNSGGDGGGQIVAPTQGSSANGQSEPLQQENNQLVRANSINTSRTESL
LNSSTTTVTLANQTTLVPQGEKTHYFAIFNLDRDTIEIRGKSETGVALESPVVLAPNTNY
RVFVLQADTQWHGYHDFTTLDSGVGVGLTDIRIGPKVFIDSDEDGLSNTAEFILGTNAGQ
KSTSNDGISDGAKFEQGLDLFGGQGFPTGIISTLPLQGEAQEIVVEGSTENARDQFAYLA
TGSHGLAIVDVSQFDNPIVFGQLDLPGNATDIDVDADLQLAAVATNDGGLQIVDVSDPML
PVLTRTAEINTNQTNRVEVVEGIAYAAVDNQLQAIDLLTGDLLETITLQGSQITGIAQEG
SFLYTMSDNNTLQAVDISNEIPVARGNISLDNGGGQLFVGNGIAYATASSFFRGGFSTVD
VSDPENLTLISGSDVASPNIAPQEAIIANGSGQGILIGTAGDNAFDLIDLSDPANTDTFS
TLLTRFSLSAAPQDVAVGSGIAFVATNSGLEVVNYLPFDAQGQAPNVTISTPADIDADTE
GIQALEGATIPINTDVTDDVQVRNVQLLVDGEVVSNDVSFPFDFSTIAPSITPDVNSTEI
QVRATDTGGNSTLSNPLTINLVEDTFAPEVVDTTPAEGGRRRIINSIAVRFNEGIDTDLL
DLNGITLTNLGEDGTLGTEDDAVVPLTDLRTRNLDRTVVVTPNEEEFTPNDYQLSIDANI
ISDRAGNALASPFSFNFTKRPLQIDLSLTETVAETLFPGENEIFTFEGTSGQRLYYDGLT
TVDGIRSTLFSPTGAELFDINTERDRDPFSLIESGTYRLIVDGGFDNANGDFSFQLSSLD
AIPTLELDTTITGALNPLESQVFQFSGIAGQRLYLDNIETDDRSGGNWRVYNQANQVINS
DSIGDDFEITLPTDGNFILVVESSSSDPLNYSFQVVSAETTTTELTLGETVTDSIAEPGE
INIYTFEGTPGQQLYFDGLDQESNIYAELLSPSGNRIFGEWTNRDETPVTLTESGTYQLM
IDPSGETTGDYNFNLFDIGASTELTFDTAITGTIDPGLESQVFQFAGTEGQQLYFDSLAD
VSGADWLLYNPGNDYLFWRSLGSDQEVTLPSDGLYSLILDGTSSDSTTNYNFQVTTPETT
TTELTLNTPVANSISESGETDIYTFTGTAGQQLYFDGLDESSDIYVELLTPSGNRLFWEW
TDQDGNPVTLIESGTYQLMIDPSGETTGDYNFNLFDIGASTELTFDTAITGTIDPGLESQ
VFQFTGTEGQQLYFDSLADVSGGRWLLYDPANQNLFNVGLSSDREVTLPSDGNYSLILDG
STPNGTINYNFEVTTLETVTTALTLGEAISSEISQAGEADIFTFTGSTGQRLYFDGLSSD
NNLDIELFSPSGDRLLNVDTDRDRAPFSLVESGTYEVVINPSGNTTGNYNFRLTDITTAT
DLTFGTAVSDTLDPGKETDLYKFTGSEGQRLDFDNLGSGSSANWRLYDPANQIVFSRSLN
SDQNNVLLPGDGTYVLEIDGEAPDSTINYNFQVNDVSDAAVTPSGFGTAQTGTLEAGNTQ
TFTFDAPAGLQVFFDDLGTPDDLRFDLDHPDGSRVFSSESSDRETFTLTQSGTYTLTVRG
EDETSTGDFNFQFLNLAADSTELTFNTPTTETLAAATTKIFRFDGTAGQQLNFDSRSNTD
FDVDVKLWSPSDQRLVRETSGDLDTEPFTLTETGTHFLVIDHQEDVANDYSFTLFDIATT
TALTFDTPINGTLDPGLEKQIFQFTGTEGQRLYFDSLNSATGGRWFLYNSANQVILNGRL
DSDGEVTLPTDGGYYLGLDGRTADSTVDYNFQVVTPETTTTELTLGTPISSSIADSGEID
RYTFTGSAGQRLYFDGLDSDSNLNAELFTPSSDRRLSIDTDRDGTPFTLIENGTYELIID
PNGATTGDYNFNLFDLGAATELEFDTPITGTLDPGIESQVFQFTGTEGQRLYFDSLNTAT
GGRWRLYDPANQEILDRRLDSDQEVTLPSDGNYSLILDGSNNESTVNYNFEVVTPETTTT
ELTLGTPISSSIADSGDIDRYTFTGSAGQRLVFDGLDSDSNLFAELFTPSSDRPLSLDTD
RDGTPFTLIENGTYELIIDPNGATTGDYNFNLFDLGAATELEFDTPITGTLDPGIESQVF
QFTGTEGQRLYFDSLNTATGGRWRLYDPANQEILDRRLDSDQEVTLPSDGNYSLILDGSS
SDTTVNYNFQVVTPETTTTELTLGTPTSGSITNSGEIDKFTFTGAAGQKLFFDGLDSDSN
LFAELFSPSGDRRFSEFTDRNENPFSLLEKGTYELIIDPNGATTGDYNFNLFDLDAATEL
EFETEISGTLDPGMESQVFQFTGTEGQELTFDSLSSGSNGTWQVYSPANESISFSNSLNR
DFDVELPGDGIYFLVLEGFATSGTINYQFQVSSDEATESLQASVFSSPSSNSLIAATPQN
LDAIIPDRDSITSNGINTSDLSEDDFLFT
Download sequence
Identical sequences K9Y9N0
gi|428775225|ref|YP_007167012.1| WP_015224676.1.10129

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]