SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for XP_001978024.1.56816 from NCBI 2017_08 genome

Domain assignment details

Strong hits

Sequence:  XP_001978024.1.56816
Domain Number  Region     Classification Level  Classification                             E-value
1              4-330      Superfamily           Clathrin heavy-chain terminal domain       1.09e-147
                          Family                Clathrin heavy-chain terminal domain       0.000000000107
2              1184-1517  Superfamily           ARM repeat                                 1.78e-114
                          Family                Clathrin heavy chain proximal leg segment  0.0000000000579
3              445-780    Superfamily           ARM repeat                                 1.18e-97
                          Family                Clathrin heavy-chain linker domain         0.00056
4              332-488    Superfamily           ARM repeat                                 1.78e-54
                          Family                Clathrin heavy-chain linker domain         0.000000444
5              888-1051   Superfamily           ARM repeat                                 2.88e-35
                          Family                Clathrin heavy-chain linker domain         0.023
6              1038-1181  Superfamily           ARM repeat                                 1.01e-28
                          Family                Clathrin heavy chain proximal leg segment  0.0035
7              804-889    Superfamily           ARM repeat                                 0.00000558
                          Family                Clathrin heavy chain proximal leg segment  0.028
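
The strong hits above are listed by E-value rather than by position, and some neighbouring regions share residues. The following is a minimal Python sketch, not part of SUPERFAMILY itself, that reorders the seven regions along the sequence and reports overlaps between neighbours. The names STRONG_HITS, order_by_position and adjacent_overlaps are hypothetical, and the sketch assumes each region is a single contiguous range with 1-based, inclusive coordinates.

```python
# Strong-hit regions copied from the assignment details above:
# (domain number, region start, region end).
# Assumption: coordinates are 1-based and inclusive.
STRONG_HITS = [
    (1, 4, 330),
    (2, 1184, 1517),
    (3, 445, 780),
    (4, 332, 488),
    (5, 888, 1051),
    (6, 1038, 1181),
    (7, 804, 889),
]


def order_by_position(hits):
    """Return the hits sorted by region start (N-terminus first)."""
    return sorted(hits, key=lambda hit: hit[1])


def adjacent_overlaps(hits):
    """Return (domain_a, domain_b, shared_residues) for each pair of
    neighbouring regions, in sequence order, that share residues."""
    ordered = order_by_position(hits)
    overlaps = []
    for (a, _, a_end), (b, b_start, _) in zip(ordered, ordered[1:]):
        shared = a_end - b_start + 1  # inclusive coordinates
        if shared > 0:
            overlaps.append((a, b, shared))
    return overlaps


if __name__ == "__main__":
    print([hit[0] for hit in order_by_position(STRONG_HITS)])
    # prints [1, 4, 3, 7, 5, 6, 2]
    print(adjacent_overlaps(STRONG_HITS))
    # prints [(4, 3, 44), (7, 5, 2), (5, 6, 14)]
```

Under these assumptions, domains 4 and 3 share 44 residues (445-488), domains 7 and 5 share 2 (888-889), and domains 5 and 6 share 14 (1038-1051).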
 
Weak hits

Sequence:  XP_001978024.1.56816
Domain Number  Region     Classification Level  Classification                             E-value
-              1539-1581  Superfamily           Insect pheromone/odorant-binding proteins  0.0377
                          Family                Insect pheromone/odorant-binding proteins  0.016
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s): XP_001978024.1.56816
Sequence length: 1678
Comment: uncharacterized protein Dere_GG19369, isoform A [Drosophila erecta]; AA=GCF_000005135.1; RF=representative genome; TAX=7220; STAX=7220; NAME=Drosophila erecta; strain=TSC#14021-0224.01; AL=Scaffold; RT=Major
Sequence:
MTQPLPIRFQEHLQLTNVGINANSFSFSTLTMESDKFICVREKVNDTAQVVIIDMNDATN
PTRRPISADSAIMNPASKVIALKAQKTLQIFNIEMKSKMKAHTMNEDVVFWKWISLNTLA
LVTETSVFHWSMEGDSMPQKMFDRHSSLNGCQIINYRCNASQQWLLLVGISALPSRVAGA
MQLYSVERKVSQAIEGHAASFATFKIDANKEPTTLFCFAVRTATGGKLHIIEVGAPPNGN
QPFAKKAVDVFFPPEAQNDFPVAMQVSAKYDTIYLITKYGYIHLYDMETATCIYMNRISA
DTIFVTAPHEASGGIIGVNRKGQVLSVTVDEEQIIPYINTVLQNPDLALRMAVRNNLAGA
EDLFVRKFNKLFTAGQYAEAAKVAALAPKAILRTPQTIQRFQQVQTPAGSTTPPLLQYFG
ILLDQGKLNKFESLELCRPVLLQGKKQLCEKWLKEEKLECSEELGDLVKASDLTLALSIY
LRANVPNKVIQCFAETGQFQKIVLYAKKVNYTPDYVFLLRSVMRSNPEQGAGFASMLVAE
EEPLADINQIVDIFMEHSMVQQCTAFLLDALKHNRPAEGALQTRLLEMNLMSAPQVADAI
LGNAMFTHYDRAHIAQLCEKAGLLQRALEHYTDLYDIKRAVVHTHMLNAEWLVSFFGTLS
VEDSLECLKAMLTANLRQNLQICVQIATKYHEQLTNKALIDLFEGFKSYDGLFYFLSSIV
NFSQDPEVHFKYIQAACKTNQIKEVERICRESNCYNPERVKNFLKEAKLTDQLPLIIVCD
RFDFVHDLVLYLYRNNLQKYIEIYVQKVNPSRLPVVVGGLLDVDCSEDIIKNLILVVKGQ
FSTDELVEEVEKRNRLKLLLPWLESRVHEGCVEPATHNALAKIYIDSNNNPERYLKENQY
YDSRVVGRYCEKRDPHLACVAYERGLCDRELIAVCNENSLFKSEARYLVGRRDAELWAEV
LSESNPYKRQLIDQVVQTALSETQDPDDISVTVKAFMTADLPNELIELLEKIILDSSVFS
DHRNLQNLLILTAIKADRTRVMDYINRLENYDAPDIANIAISNQLYEEAFAIFKKFDVNT
SAIQVLIDQVNNLERANEFAERCNEPAVWSQLAKAQLQQGLVKEAIDSYIKADDPSAYVD
VVDVASKVESWDDLVRYLQMARKKARESYIESELIYAYARTGRLADLEEFISGPNHADIQ
KIGNRCFSDGMYDAAKLLYNNVSNFARLAITLVYLKEFQGAVDSARKANSTRTWKEVCFA
CVDAEEFRLAQMCGLHIVVHADELEDLINYYQNRGYFDELIALLESALGLERAHMGMFTE
LAILYSKFKPSKMREHLELFWSRVNIPKVLRAAESAHLWSELVFLYDKYEEYDNAVLAMM
AHPTEAWREGHFKDIITKVANIELYYKAIEFYLDFKPLLLNDMLLVLAPRMDHTRAVSYF
SKTGYLPLVKPYLRSVQSLNNKAINEALNGLLIDEEDYQGLRNSIDGFDNFDNIALAQKL
EKHELTEFRRIAAYLYKGNNRWKQSVELCKKDKLYKDAMEYAAESCKQDIAEELLGWFLE
RDAYDCFAACLYQCYDLLRPDVILELAWKHKIVDFAMPYLIQVLREYTTKVDKLELNEAQ
REKEDDSTEHKNIIQMEPQLMITAGPAMGIPPQYAQNYPPGAATVTAAAGRNMGYPYL
Identical sequences: B3NXJ2, FBpp0137915, XP_001978024.1.56816, XP_015011275.1.56816, XP_015011276.1.56816
