SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSMUSP00000059757 from Mus musculus 63_37 (longest transcript per gene)

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSMUSP00000059757
Domain Number 1 Region: 2023-2216,2268-2451,2485-2540
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 4.11e-112
Family DNA polymerase I 0.00000094
Further Details:      
 
Domain Number 2 Region: 102-270,303-358,408-496
Classification Level Classification E-value
Superfamily P-loop containing nucleoside triphosphate hydrolases 3.62e-44
Family Tandem AAA-ATPase domain 0.00027
Further Details:      
 
Domain Number 3 Region: 661-828
Classification Level Classification E-value
Superfamily Sec63 N-terminal domain-like 6.8e-41
Family Achaeal helicase C-terminal domain 0.0035
Further Details:      
 
Domain Number 4 Region: 1855-1978
Classification Level Classification E-value
Superfamily Ribonuclease H-like 0.0000288
Family DnaQ-like 3'-5' exonuclease 0.01
Further Details:      
 
Weak hits

Sequence:  ENSMUSP00000059757
Domain Number - Region: 526-614
Classification Level Classification E-value
Superfamily "Winged helix" DNA-binding domain 0.00707
Family RecQ helicase DNA-binding domain-like 0.032
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSMUSP00000059757   Gene: ENSMUSG00000034206   Transcript: ENSMUST00000054034
Sequence length 2544
Comment pep:known chromosome:NCBIM37:16:37011872:37095502:1 gene:ENSMUSG00000034206 transcript:ENSMUST00000054034
Sequence
MSLPRRSRKRRRSSSGSDTFSGDGDSFVSPQLRCGPVLSPPPGLGRGRRLTGTGTNKRRV
SDDQIDQLLLANWGLPKAVLEKYHSFGVRKMFEWQAECLLLGHVLEGKNLVYSAPTSAGK
TLVAELLILKRVLETRKKALFILPFVSVAKEKKCYLQSLFQEVGLKVDGYMGSTSPTGQF
SSLDIAVCTIERANGLVNRLIEENKMDLLGMVVVDELHMLGDSHRGYLLELLLTKICYVT
RKSASHQAESASTLSNAVQIVGMSATLPNLQLVASWLNAELYHTDFRPVPLLESIKIGNS
IYDSSMKLVREFQPLLQVKGDEDHIVSLCYETIQDNHSVLIFCPSKKWCEKVADIIAREF
YNLHHQPEGLVKSSEFPPVILDQKSLLEVMDQLKRSPSGLDSVLKNTVPWGVAFHHAGLT
FEERDIIEGAFRQGFIRVLAATSTLSSGVNLPARRVIIRTPIFSGQPLDILTYKQMVGRA
GRKGVDTMGESILVCKNSEKSKGIALLQGSLEPVHSCLQRQGEVTASMIRAILEIIVGGV
ASTSQDMQTYAACTFLAAAIQEGKQGMQRNQDDAQLGAIDACVTWLLENEFIQVAEPGDG
TGGKVYHPTHLGSATLSSSLSPTDTLDIFADLQRAMKGFVLENDLHIVYLVTPVFEDWIS
IDWYRFFCLWEKLPTSMKRVAELVGVEEGFLARCVKGKVVARTERQHRQMAIHKRFFTSL
VLLDLISEIPLKDINQKYGCNRGQIQSLQQSAAVYAGMITVFSNRLGWHNMELLLSQFQK
RLTFGIQRELCDLIRVSLLNAQRARFLYASGFLTVADLARADSAEVEVALKNSLPFKSAR
KAVDEEEEAAEERRSMRTIWVTGKGLSAREAAALIVEEAKMILQQDLIEMGVRWDPKSPL
SSSTHSRTSTSEVKEHTFKSQTKSSHKRLASMGRNSIRASGSNDKPSPDAERGIDDCSEH
ADSLCKFQGNFEPQTPSICTARKRTSLGINKEMLRKSLKEGKPSTKEVLQTFSSEKTRKT
ALSFSSEQVNNTLPSGRDRKYQKKSWGSSPVRDSGMHRGDLQGQTMCTSALCEDSQKSLE
EQNAEFRSPGLFAKHLPSCAKEKCKKPSLPLQRQQACSRRSTESCAAVGHPAAGSSPAAA
RDRRGLAARETEKGNEALTENGGESQLQDTYPVSQYLEYHSEKHTNTCTRQKTLTEGQAG
SSYVARDSNDAAPIKCERMKLNSKDRDSNPCRQALGSYTGRTEALQSTAKLGQAGGQCEN
LLNSSGVQGKTGAHATNRTEHSHASNPAFCDFGDSLDLDTQSEEIIEQMATENTMQGAKA
VVIMEEGSAMQNKCHSTPGDQHVPGAANTDHVDSKKVESVKANTEKNINRGAPVSLIFHT
QGENGACFKGNEHSVTDSQLNSFLQGFETQEIVKPIIPLAPQMRTPTGVEEESLPETSLN
MSDSILFDSFGEDGFGQGQSPDIKANQPLLSEMTPNHFSNPPHPQEDPVMTPTVSEPQGT
QQQGVCLSGESIIFSDIDSAQVIEALDNMAAFHVQENCNSVALKTLEPSDSAVLGNECPQ
GKLVRGDQNEGSPKPKLTETNQDNSFTWSGASFNLSPELQRILDKVSSPRENEKPKMIHV
NLSSFEGNSKESHEREEINSDLGTVQRTSVFPSNEVKNRTEGLESKARHGGASSPLPRKE
SAAADDNGLIPPTPVPASASKVAFPEILGTSVKRQKASSALQPGESCLFGSPSDNQNQDL
SQELRDSLKDYDGSVADTSFFLQSQDGLLLTQASCSSESLAIIDVASDQILFQTFVKEWQ
CQKRFSISLACEKMTSSMSSKTATIGGKLKQVSLPQEATVEDAGFPVRGCDGAVVVGLAV
CWGAKDAYYLSLQKEQKQSEISPSLAPPPLDATLTVKERMECLQSCLQKKSDRERSVVTY
DFIQTYKVLLLSCGISLEPSYEDPKVACWLLDPDSKEPTLHSIVTSFLPHELALLEGMET
GPGIQSLGLNVNTEHSGRYRASVESVLIFNSMNQLNSLLQKENLHDIFCKVEMPSQYCLA
LLELNGIGFSTAECESQKHVMQAKLDAIETQAYQLAGHSFSFTSADDIAQVLFLELKLPP
NGEMKTQGSKKTLGSTRRGNESGRRMRLGRQFSTSKDILNKLKGLHPLPGLILEWRRISN
AITKVVFPLQREKHLNPLLRMERIYPVSQSHTATGRITFTEPNIQNVPRDFEIKMPTLVR
ESPPSQAPKGRFPMAIGQDKKVYGLHPGHRTQMEEKASDRGVPFSVSMRHAFVPFPGGLI
LAADYSQLELRILAHLSRDCRLIQVLNTGADVFRSIAAEWKMIEPDAVGDDLRQHAKQIC
YGIIYGMGAKSLGEQMGIKENDAASYIDSFKSRYKGINHFMRDTVKNCRKNGFVETILGR
RRYLPGIKDDNPYHKAHAERQAINTTVQGSAADIVKIATVNIQKQLETFRSTFKSHGHRE
SMLQNDRTGLLPKRKLKGMFCPMRGGFFILQLHDELLYEVAEEDVVQVAQIVKNEMECAI
KLSVKLKVKVKIGASWGELKDFDV
Download sequence
Identical sequences Q80XB7 Q8CGS6
NP_084253.1.92730 ENSMUSP00000059757 ENSMUSP00000059757 ENSMUSP00000071396

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]