SUPERFAMILY 1.75 HMM library and genome assignments server

Superfamily is undergoing a server migration - you are now browsing on the new server. Please contact us if you experience any problems.

Domain assignment for 31234.CRE09737 from STRING v9.0.5

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  31234.CRE09737
Domain Number 1 Region: 761-1192
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 1.45e-30
Family Reverse transcriptase 0.015
Further Details:      
 
Domain Number 2 Region: 1481-1641
Classification Level Classification E-value
Superfamily Ribonuclease H-like 1.73e-19
Family Retroviral integrase, catalytic domain 0.018
Further Details:      
 
Domain Number 3 Region: 347-387
Classification Level Classification E-value
Superfamily Retrovirus zinc finger-like domains 0.000023
Family Retrovirus zinc finger-like domains 0.0033
Further Details:      
 
Weak hits

Sequence:  31234.CRE09737
Domain Number - Region: 513-581
Classification Level Classification E-value
Superfamily Acid proteases 0.015
Family Retroviral protease (retropepsin) 0.032
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) 31234.CRE09737
Sequence length 2796
Comment (Caenorhabditis remanei)
Sequence
MSLAGKKAALTKAVKGLEEKLTATTTQLDFIEDKTTEETLPYKEDLQLLLTTIETKSDNL
DKALNNFEVEVDKIPPANEEATKNAETRIAEALDVREDAIDSLIRLRHQLNRISSLTTQQ
ASREDSRTLPIQPNNIPAPNPPQQFGFREYLIENTRISKFKGNVWEFEAFWTQFEELIHK
SEQPDLFKFNKLLNLLEGEPRELIARFKITGDNYNKAIALLKKRYNDQEQIVSQLTAQLK
KETATSGHTTDQRKLFEKILITTNQLKDYQENVDTRMMKDEIVSKFAHRIQEDVYKKKLD
SPGDWTLDKILEDLENVIIREESLNMLLKKEEKTKNMDNSTQKQQKSKDNKRDNKTPFRK
NDDPCIFCKEKGHFFGHCPTKPNPMDRLQILKTEARCTRCTKTGHTPKDCKSKMCPVCNK
DHHSSCCFEKHKEALPPKTFKKQDQKKSSSSSTTTAAMALQGDNTVCEMDNSENKPDEIH
TIASAKTRGTNRGFIPTIVTKAYNHSTGQWEGITVMLDSGSDQTFITRSLLNRWNLPNLG
EVKVDANAFDSTCQKKQFGRSQIQLRLKDTRIQMDVYVADSLVGRISKAPLTHQDMQFLL
KEKLEINEDSLRTTSEPDMILGTDYWMEIVTGQLIQMPSGVGLIETKDGFATMGSTKDNS
CQPRYEEDKVIVMALNSDPHDPGRKTEDEQMRDTLMKKPHEFSGSLKEEQSERDKKTIQF
FEDTVEKRDEAYFVRIPYKEEHPPLPDNFSIALARLTQMRRQHSTENLQMIKDVFEDYKA
KKFIEEVNVYEETPNKLHYNALQAVITPSKTTTKCRIVVDASAHYKDKPCLNDCIEQGPT
ILPDIQDMIIRFRSGQTVLISDVEKAFLQVFLHEDDRDVTRVLWFKDINKPVNEDNIIVY
RFTRVLFGLNVSPFLLGQTIIHHLRSLKDDPIIREMPHNLYVDNSIITTDENAENVIQIY
KKVKKTFKDANMNLREFRSNCKTVNDGIAEEDKSKEEDMKVLGIWWTSSEDTITMDTTFD
LALTNSRRTVSSDMASKFDPMGYLTPLLLPPKLFQRELWDTTQYGWDTKLSEQHEDEYRK
LIQNINGFTIKMPREIVLKTGKNSIITFCDASKEATACCVYVKNDKGTHIILGKSHARPL
KEKWTIPKLEMHALLLGTEKTMKVVKALQLGQTTIDQVVIMSDSAIALAWIKSLPTQKEV
GTLIHNRLRDIVSHVEEMETMVTTVKFGHVRTHENPADLGTRGCTKEEFENSIWWKGPNF
IQTDTHTWSPEHQLFQVERPGQIHTAALVSKESEPLLNSQATNSLQKMIRIALRVLKAAK
IFSKPLGSERFPSLKDITLNDIANRVELKTAETLVIKDHQKGISCKTLQQYGNLGIIPNK
DNILVAKGRMELAGLEENARNPIFILPNSQLAKQIIADCHGSFHKTMEHTMDSVRRRFWI
PKLRQQTKSFIARCIPCQRNSKQPCRYPDMGRIPRDRVNKQRPFGSTGLDNFGPIQYRKD
DGTLANAYGTIFTCTTTRLIHVETVKNASALEFIQAFRKFVAIRGRPTKIVSDNGTNFVL
GQKIIEEAFERSDCPPDMHKIDWKFITPYAPWKGGVYERMVKSVKEAFYKAVGRSKLTFE
ELTTVLYEATASINQRPLTKLEDDINAEAPIRPCDFINQEMEIRLPLEGALDIKEDFCPA
AELQSKESMLNTVDALKSSIKASERVWKVWNSKYLAEMREGHKLRMDKKRGSPKLPKVGQ
IVLMCEDLQPRNVWKMAKILRLNESSDGVVRDVDILTPNGRTLNRAINLIVPLELDEEDK
EDETEHPSPQLDKPKEDPEKSSDNKKRYNLRSRKVVNYNEEQPVNNFVFSSGTKWTNMMF
ICTLLMMFSGTTATNNIIHCTPNGIKIEGQFESFESCVENYCTTKNRWEWSRGQGVNVWF
PPAIKIYPHHVTTKIKDGDLLQINEMDCEAVPFCQTIDCTVCWTNIINPECHISWAILGV
GALIFLLLFVIHATCKAPVKCKDVLGTGWTIIRVLWWILSTPIRKAWKWFRKETPVRRST
WKRLFTIICETLTEEIMHLNNAHKEGCLRIERNRTILRDIRVQLRAGCGCFYLSSGCLFY
RIYALPRGNDSMEILKCVDYIPIVNLRITVTTLNTWKNKVETVNISSPLGRSTWFKDMMF
TVVDINTPPSPSLNTWFITNATSMATWPENLLPHYQCNQKLNQCVLDEECQCSPAEDTMI
CTCKDTDMRELFRQPDRVLPVQAGHLRLEQDGNNVKGKMKFSTSTTMSIKMTDKWTTSIV
KTKESCSVASTTASGCYKCEKGATAEITCKTNEESTTANIECGEEEFAVECSPTGTKTSI
KFFGNKASFQRHCTVDCGGKQKGHFEVTGVLKYSGSIWTAMWHLLDGNTTIFNEINLPDM
GHIATSYMSFMKTMVAVTATVGIIFLLTYTVITNAGLAIVKTFVKICAWILWQPIRGTIH
LTSLITTKCRRRRGHLHVMILLTLVHNITPTNLPTTHHLAHAPDTLEHLIIPNLNISHSQ
SSDIPNFSLLTLSHLNFSNPHSSSPNRPSSQSHPSRWITMILTDMDTRQDAPVRAMIGRL
AAVEAKLDLILDMLVPERGGASSPIGNDSHSPRPYPDSPLIVSNVPSPDNYVEIVEEPIV
HGSDNDSHQDTDNTIHHEPDNDAHHEPDNNVPNEPDNAPRAGKLQDQDDVHKRRRSSKRQ
NRKSDKEEKSTKRFKEDNRLRCVFCGHGHYSSDCQRYKTYEERVRRGGPNMCRKCLGIMG
TNGHECRRRTKPCHHCRSTAHHTAFCKIQEKIRGPE
Download sequence
Identical sequences E3N4X7
XP_003096544.1.11157 CRE09737 31234.CRE09737

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]