SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for XP_001415454.1.19716 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  XP_001415454.1.19716
Domain Number 1 Region: 6-345
Classification Level Classification E-value
Superfamily Clathrin heavy-chain terminal domain 7.85e-122
Family Clathrin heavy-chain terminal domain 0.0000000371
Further Details:      
 
Domain Number 2 Region: 1199-1531
Classification Level Classification E-value
Superfamily ARM repeat 3.57e-97
Family Clathrin heavy chain proximal leg segment 0.0000000152
Further Details:      
 
Domain Number 3 Region: 457-792
Classification Level Classification E-value
Superfamily ARM repeat 5.47e-78
Family Clathrin heavy-chain linker domain 0.0025
Further Details:      
 
Domain Number 4 Region: 348-500
Classification Level Classification E-value
Superfamily ARM repeat 1.32e-46
Family Clathrin heavy-chain linker domain 0.0000119
Further Details:      
 
Domain Number 5 Region: 900-1066
Classification Level Classification E-value
Superfamily ARM repeat 1.29e-35
Family Clathrin heavy-chain linker domain 0.029
Further Details:      
 
Domain Number 6 Region: 1050-1196
Classification Level Classification E-value
Superfamily ARM repeat 1.87e-30
Family Clathrin heavy chain proximal leg segment 0.0069
Further Details:      
 
Domain Number 7 Region: 809-914
Classification Level Classification E-value
Superfamily ARM repeat 0.00000213
Family MIF4G domain-like 0.066
Further Details:      
 
Weak hits

Sequence:  XP_001415454.1.19716
Domain Number - Region: 1499-1592
Classification Level Classification E-value
Superfamily Pseudo ankyrin repeat-like 0.0732
Family Pseudo ankyrin repeat 0.027
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) XP_001415454.1.19716
Sequence length 1688
Comment predicted protein [Ostreococcus lucimarinus CCE9901]; AA=GCF_000092065.1; RF=representative genome; TAX=436017; STAX=242159; NAME=Ostreococcus lucimarinus CCE9901; strain=CCE9901; AL=Complete Genome; RT=Major
Sequence
MAAPAVPVTVKEAIQLKTCGVNPQCISFTNLTMESEKYVCARESGTTNNVVIVEVNNPLQ
PMKKPITADSALMNPTQNVIALKARVENENGVEDSLQIFNIDQKAKIKGHDMEPVVFWKW
ITPKMLGIVTNTAVFHWSIDDANAPVKVFDRTANLNGNQIISYKASEDMQWFTLIGIAQG
DASRPALVKGNMQLYSVAQQRSQPLEAHMAAFTTHQVPGNAQKSQLVCFAQKMVQADGSV
VSKLHVIELGAPAGQTPFTKRTSELFFPPEFADDFPVVMQVSDKYGVIYIVTKSGLLFVY
DVETASPIYRSRISQDPVFVGASATSVGGLYVVNRGGQVLLITLNEAAVVPFISSTLNNL
ELALSVASRGNLPGADALVMPKFDMLFNSADYKGAAELAASMSSLRTDQTIARFRGVPTQ
PGQSSPLLQYFGACLQRGKLNKLESVELAKLVLAQNKKQLLDTWLSEDKLEASEELGDML
APTDSDTALKIYVKARASPKVTAAFAQRGEFDKMAQYCSAVDYKPDYMYMLQALMMKDPA
SAVQLAQKISQMTPPPCDMGAIADLFLQRNMIREATSILLDLLKGDDESQAALQTKVLEI
NLVTYPNVADAILAQGKLTHYDRPRIAQLCEKAGLYIRAMEHYTELADLKRCVVNTHSID
PQALTEFFGTLSREWALDCLKELLTFNMRQNLQMAVNIAKEYTEQLEIHSVVKMFDKFES
AEGLFYYLGYFVNTCEDKDLVYKFIEAASKTGQIKEVERVTRESDHYDAERVKVFLMEAK
LSDARPLINVCDRYEFVPDLTTYLYNNNMLRYIEGYVQKVNPKQAPKVVGTLLDLECPDD
FIKTLILSVRSLLPVAPLVEEVEKRNRLKILTQFLEHLVNEGSVDPQVHNAMGKMLIDSN
QNPEHFLLTNEYYESAIVGRYCEKRDPYLACVAYKRGNCDAELVDCTNRNSMFKVQARYV
VERMDADLWASVLTEENKYCRQLIDQVVSTALPESKNPEQVSVTVKAFMTAEMPHELIEL
LEKIVLQNSAFSNNPNLQNLLILTAIKADASRVMDYVNRLDSFNGPEVGEIAAGNELYEE
AFAIFKKFDLHVDAMKILLESLEDLDRGIEYARKVDLPEVWVQIGKAQLKVGTPEAVKAA
IKSYIKAQDGSDFVDVIHAARQADMYEDMVPYLLMVRKNKKEARVDTELVYAYAKINDLA
KLEDFLATPNSANQQSVADRCFGEGLYEAARLLYTALSNWGCLASTLLKLRMFQGAVDAA
KKANSPRTWKEVCFTCLEEGENKLAQLAGLNIIIQADELDSVSEYYQANGKFTELIQLME
AGVGVDRAHMGIFTELGILYANHMADKLMEHIRLFSARINIPRLITTCNHVALWPELAYL
YRCYDEYDNACEVMMKHPDAWEHVVFKDVCVKLANADLYYQAIEFYLREHPTEMTNLLGV
LQSRLDHSRVVSLMRKEGKLAMVKEYLLAVQGANLTAVNDAVNELAIEEEDHAALKTSLD
MYDNCDQLSLAVQCESHELIEFRRISSYIYQRNARWQQAIDLSKRDGLLKDAMEIAAKSG
DATIVDELLDYFIDQGNKECFSAALCTCYDLLKPDEVMQKAWLKGLSDWVMPYMIQVMRD
MNGKLEILMKDKADRNEEKVNEEKERVAAEMNSNLYAQLMPAALPAPPMPGMPGYEQPQP
GYGQPQYY
Download sequence
Identical sequences A4RQV5
XP_001415454.1.19716 jgi|Ost9901_3|28794|eugene.0100010188 436017.A4RQV5

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]