SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for Q114C9 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  Q114C9
Domain Number 1 Region: 2111-2236,2474-2538
Classification Level Classification E-value
Superfamily Hedgehog/intein (Hint) domain 5.3e-38
Family Intein (protein splicing domain) 0.0013
Further Details:      
 
Domain Number 2 Region: 4-256
Classification Level Classification E-value
Superfamily PHP domain-like 1.43e-31
Family PHP domain 0.012
Further Details:      
 
Domain Number 3 Region: 720-820,2037-2084
Classification Level Classification E-value
Superfamily Hedgehog/intein (Hint) domain 1.37e-18
Family Intein (protein splicing domain) 0.01
Further Details:      
 
Domain Number 4 Region: 2583-2669
Classification Level Classification E-value
Superfamily Hedgehog/intein (Hint) domain 6.75e-17
Family Hedgehog C-terminal (Hog) autoprocessing domain 0.082
Further Details:      
 
Domain Number 5 Region: 2313-2414
Classification Level Classification E-value
Superfamily Homing endonucleases 0.00000000000112
Family Intein endonuclease 0.057
Further Details:      
 
Domain Number 6 Region: 925-1011
Classification Level Classification E-value
Superfamily Homing endonucleases 0.00000516
Family Group I mobile intron endonuclease 0.058
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Q114C9
Sequence length 2684
Comment (tr|Q114C9|Q114C9_TRIEI) DNA-directed DNA polymerase {ECO:0000256|SAAS:SAAS00367720} KW=Complete proteome; Reference proteome OX=203124 OS=Trichodesmium erythraeum (strain IMS101). GN=Tery_1889 OC=Microcoleaceae; Trichodesmium.
Sequence
MSFVGLHIHSDYSLLDGASQLPQLIDRAVELGMPAIALTDHGVMYGAIQLIKLCRNKNIK
PIIGNEMYVIKGDIEKQQRGKKFHQVVLAKNTQGYKNLVKLTTLSHLHGFQGKGIFARPC
INKELLEKYHEGLIVTSGCLAGEVPQNIMRGELEEAKKIAKWYKDLFGEDYYLEIQDHGF
QEDRVVNTGIVTIAKKLKIKIVATNDSHFISCRDVEAHDALLCINTQKLIAEEKRMRYSG
TEYLKSAEEMKQLFRDHLENEVIEEAIANTLEVANKVKAYEGILGEPRIPDYPIPPDHNA
DTYLEKLAWSGLLERLKLKQKSEISPIYKERMETELKVLQDKGFSTYFLVVWDYIKYARD
NNIPVGPGRGSAAGSLVAYSLRITNIDPVHHGLLFERFLNPERKSMPDIDTDFCIENRDV
MIKYVTQRYGEERVAQIITFNRMTSKAVLKDVGRVLGISFGEANKMAKLIPVARGKPAKL
KVMISDETPSPEFKKAYDNQETPIEDNKAGKISTISVRQWIDMAIRIEGTNKTFGVHAAG
VVISKEPLDEIVPLQRNNDGSVITQYHMEDIESLGLLKMDFLGLKNLTIIQNTAELIKKN
HHLPLVPDDLPANERKAIEILAKGNTKKMPEDVKKTYDLIKSGDLEGVFQLESSGMVDVV
KKLKPTSIEDISSILALYRPGPLDAGLIPKFIDRKHGSEKIEYQHPKLEPILKETYGVLC
LPKGTLIDQPDGSREAIENIKSGEVILTSDGRKVWEAKVAKQWRSGVREILKITLSSGTV
IYSGKNHRFLTPEGDKFAWELQPQVGRVKNALIYGSAVYEKWQVSSNQKQLRKNDAYLLG
LLVGKSNLISSTPNVSFSTQGAITWGKNLIDETWGGEAKHYFDTSRRQVYLNFNTQSKPT
ALTEFLDGIYGAQNWQVESVAKHLPEDILDYSEKDRIDLLRGLWDSGGFDGKKLLYYPGS
SPQLLSQVCQLLGSLKIDYYLADNSVRISDRSRFIDILENYQMSSQQKEEISESYLPASS
WFLKGGSENNIQKTDSSSRKTGEASQQKATLFTQNLFSAQTPAENWEKVGENHLLSSWFL
TDASENNIQKTDSSSRKTGEASQQKATLFTQNLFSAQTPAENWEKVRENHLLSSWFLTNA
SEIYLQRIDSSSRKTGEASQQKATLFTQNLFSVQTPAENWEKVRENHLLSSWFLTDASEN
NIQKTDSSSRKTGEASQQKATLFTQNLFSAQTPAENWEKVRENHLLSSWFLTNASENNIQ
KTDSSSRKTGEASQQKATLFTQNLFSAQTPAENWKKSRKNHLPSSWFLKGGSENNIQKTD
SSSRKTGEASQQKATLFTQNLFSAQTPAENWEKVRENHLLSSWFLKDASENNIQKTDSSS
RKTGEASQQKATLFTQNLFSAQTPAENWEKVRENHLLSSWFLTDASENNIQKTDSSSRKT
GEASQQKATLFTQNLFSAQTPAENWEKVRENHLLSSWFLTDASENNIQKTDSSSRKTGEA
SQQKATLFTQNLFSAQTPAENWEKVRENHLLSSWFLTNASENNIQKTDSSSRKTGEASQQ
KATLFTQNLFSAQTPAENWKKARENHLLSSWFLTNASEIYLQRTDSSSRKTGEASQQKAT
LFTQNLFSVQTPAENWKKARENHLLSSWFLTNASEIYLQRTDSSSRKTGGASQQKATLFN
QNLFSVQTPAENWEKVRENYLLSSWFLTNASEIYLQRTDSSSRKTGEASQQKATLFTQNL
FSVQTPAENWKKARENHLLSSWFLTNASEIYLQRTDSSSRKTGGASQQKATLFNQNLFSV
QTPAENWKKARENHLLSSWFLTNASEIYLQRTDSSSRKTVEASQQKATLFTQNLFSAQTP
AENWEKVRENYLLSSWFLTNASEIYLQRIDSSSRKTGEACQQKATLFNQNLFSAQTPAEN
WKKVRENHLLSSWFLTDASENNIQKTDSSSRKTVEASQQKATLFTQNLFSAQTPAENWKK
SRKNHLPSSWFLTDASENNIQKTDSSSRKTGEASQQKATLFTQNLFSVQTPELENWECEK
TYLQDVRVVHVVSVEEVGEAECFDLEMEDQSSPYFLAEGVVVHNCYQEQIMKMAQDLAGY
SLGEADLLRRCLSGSTKVIDAATGNLFSLKEIAAQPEYWLSRKVFSLDLKSQQVVQQPIT
EIHPNGVRDVWQITTRTNRKVCATDDHLFYTVLGWKPLKDFSVGDRLGLPNKIPINYRSQ
ISDSKVKFTAYLIGEGYLYTNSFSCSYFCNSDGELIADFYGCAEELFGSSAPIEKQLHLG
NKSVIYVRIGLISGLKNWVDSYLQCANSRVQEIPNWIFSLSQSQLQLFLGILWSTSGIFD
ETIGYTYYSSNSEVLVRQVQHLFLRLGIVSLFNVNKVKGQGELDVSYVVEVRGREDMLKF
YKLIKPYLSSYKQGLCESCYLVIKYQQSYQFKYFLTPDFFDLIVKAKKASSMTRALGVCG
GEISSVWNFQNTSNRSLSFDKFNNFSTVLADEELTAIANSDVFWDEIISIEYIGKEEVFD
LTIPETHNFIANDFIVHNCMGKKKVSEMEKHREKFIDGAAQRGVSSVVAKDLFEQMIKFA
EYCLTYETEIMTVEYGPLPIGKIVEYRIECTVYTVDKNGYIYTQPIAQWHNRGMQEVYEY
SLEDGTVIRATPEHKFMTEDGQMLPIDEIFERNLDLKCLGTLEL
Download sequence
Identical sequences Q114C9
gi|113475557|ref|YP_721618.1| 203124.Tery_1889

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]