SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSAPLP00000003781 from Anas platyrhynchos 76_1.0

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSAPLP00000003781
Domain Number 1 Region: 1268-1356,1385-1607
Classification Level  Classification  E-value
Superfamily           NHL repeat      6.02e-25
Family                NHL repeat      0.0016
 
Domain Number 2 Region: 1505-1550,1577-1595,1622-1755
Classification Level  Classification                                     E-value
Superfamily           YVTN repeat-like/Quinoprotein amine dehydrogenase  0.00000000275
Family                YVTN repeat                                        0.046
 
Domain Number 3 Region: 963-1034
Classification Level  Classification                           E-value
Superfamily           Carboxypeptidase regulatory domain-like  0.00000275
Family                Carboxypeptidase regulatory domain       0.088
 
Domain Number 4 Region: 677-702
Classification Level  Classification   E-value
Superfamily           EGF/Laminin      0.0000381
Family                EGF-type module  0.038
 
Weak hits

Sequence:  ENSAPLP00000003781
Domain Number - Region: 811-836
Classification Level  Classification   E-value
Superfamily           EGF/Laminin      0.000233
Family                EGF-type module  0.047
 
Domain Number - Region: 711-737
Classification Level  Classification   E-value
Superfamily           EGF/Laminin      0.000502
Family                EGF-type module  0.075
 
Domain Number - Region: 781-805
Classification Level  Classification   E-value
Superfamily           EGF/Laminin      0.000921
Family                EGF-type module  0.085
 
Domain Number - Region: 643-670
Classification Level  Classification                  E-value
Superfamily           EGF/Laminin                     0.00577
Family                Integrin beta EGF-like domains  0.044
 
Domain Number - Region: 840-873
Classification Level  Classification   E-value
Superfamily           EGF/Laminin      0.00819
Family                EGF-type module  0.048
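
The Region fields in the hits above list one or more residue ranges separated by commas, so a single domain can be discontinuous (for example 1268-1356,1385-1607 for Domain Number 1). As a minimal sketch, assuming the ranges are 1-based and inclusive (this page does not state the convention), the following Python parses a region string and sums the residues it covers. The function names parse_region and region_length are hypothetical and are not part of any SUPERFAMILY tool.

```python
def parse_region(region):
    """Split a region string such as '1268-1356,1385-1607' into (start, end) tuples."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Total residues covered, assuming 1-based inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Region of Domain Number 1 from the strong hits above.
print(parse_region("1268-1356,1385-1607"))   # [(1268, 1356), (1385, 1607)]
print(region_length("1268-1356,1385-1607"))  # 312
```

Under the same assumption, the single-segment region 677-702 of Domain Number 4 covers 26 residues.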
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSAPLP00000003781   Gene: ENSAPLG00000004223   Transcript: ENSAPLT00000004388
Sequence length 2805
Comment pep:known_by_projection scaffold:BGI_duck_1.0:KB743590.1:1429851:2016355:1 gene:ENSAPLG00000004223 transcript:ENSAPLT00000004388 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDVKERKPYRSLTRRRDTERRYTSSSAESEDSKVPQKSYSSSETLKAYDQDSRLTYSNRV
KDMVHQEADEFCRAGANFSLRELGLEDVTPTHGTLYRTDIGLPHCGYSISTGSDADTEAD
VVMSPEHPVRLWGRNTKSGRSSCLSSRANSNLTLTDTEHENTETGVGFPLTCTLTSSALV
ESHPTPPHPSPTAKDCVQGLLDHGTAAAPSARPAADSDSEEEFIPNSFLVKSGSGNLCVA
ANDHPPNLQNHSRLRTPPPPISHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTDHSLSG
EQPASTQEPAHAQDNWLLNSNIPLETRHFLFKPGGTSPLFCTTSPGYPLTSSTVYSPPPR
PLPRSTFSRPAFNLKKPYKYCNWKCAALSAIVISVTLVILLAYFIAMHLFGLNWHLQPME
GQMYEITEDTASSWPVPTDVSLYPSGGTGLELPDRKGKGTSEGKPSSFFPDDSFIDSGEI
DVGRRATQKIPPGIFWRSQVFIDHPVHLKFNVSLGKAALVGIYGRKGLPPSHTQFDFVEL
LDGRRLLTQEARSLEGPQRQHRAFVPLSSHETGFIQYLDSGIWHLAFYNDGKESEVVSFL
TTAIESVDNCPSNCYGNGDCVSGTCHCFLGFLGPDCGRASCPVLCSGNGQYMKGRCLCHS
GWKGAECDVPTNQCIDVTCNNHGTCIMGTCICNPGYKGESCEEVVDCMDPTCSGRGVCVR
GECHCSVGWGGTNCETPRATCLDQCSGHGTFLPDTGLCSCDPNWTGHDCSIGNTEICAAD
CGGHGICVGGTCRCEEGWMGTACDQRACHPRCNEHGTCRDGKCECSPGWNGEHCTIEGCP
GLCNGNGRCTLDMNGWHCVCQLGWRGAGCDTSMETACGDGKDNDGDGLVDCMDPDCCLQP
LCHVNALCLGSPDPLDIIQETQAPVSQQSLHSFYDRIKFLIGKDSTHIIPGDNPFEGGHA
CVIRGQVMTADGTPLVGVNISFVNNPLFGYTISRQDGSFDLVTIGGISIILHFERAPFIT
QEHTLWLPWDRFFVMETIVMRHEENEIPSCDLSNFARPNPVVSPSPLTAFASSCSEKGPI
VPEIQALQEEINVSGSKIKVSYLSSRTAGYKSVLRISMTHPTIPFNLMKVHLMVAVEGRL
FRKWFAAAPDLSYYFIWDKTDVYSQKVYGLSEAFVSVGYEYESCPDLILWEKRTAVLQGY
EIDASKLGGWSLDKHHALNIQSGILHKGNGENQFISQQPPVIGSIMGNGRRRSISCPSCN
GLADGNKLLAPVALTCGSDGSLYVGDFNYIRRIFPSGNVTNILELRNKDFRHSHSPAHKY
YLTTDPITGSIYLSDTNSRRIYKIKSTTSVKDIVKNSEVLAGTGDQCLPFDDTRCGDGGK
GTDATLTNPRGITVDKFGLIYFVDGTMIRRIDQNGIISTVLGSNDLTSARPLSCDSVMDI
SQVRLEWPTDLAVNPMDNSLYVLDNNVVLQISENHQVRIVAGRPMHCQVPGMDHFLLSKV
AIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTTNGEISLVAGAPSSCDCKNDANC
DCFSGDDGYAKDAKLNAPSSLAVCADGELYVADLGNIRIRFIRKNKPFLNTQNMYELSSP
IDQELYLFDTSGKHLYTQSLTTGDYLYNFTYSGDGDITLITDNNGNLVNIRRDSTGMPLW
LVVPDGQVYWVTIGTNSALKSVTTQGHEVAMMTYHGNSGLLATKSNENGWTTFYEYDSFG
RLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQDQVRNSYY
IGADGSLRLMLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQAR
GQVTVFGRRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLN
GVNVTYSPGGHIAGIQRGTMSERMEYDQSGRITSRIFADGKSWSYTYLEKSMVLLLHSQR
QYIFEFDKNDRLSSVTMPNVARQTLETIRSIGYYRNIYRPPEGNASVIQDFTEDEQLLHT
LYLGTGRRVIYKYGKLSKLAEMLYDTTKISFTYDETAGMLKTINLQNEGFTCTIRYRQIG
PLIDRQIFRFTEEGMVNARFDYNYDNSFRVTSMQAVINETPLPIDLYRYDDVSGKTEQFG
KFGVIYYDINQIITTAVMTHTKHFDAYGRMKEVQYEIFRSLMYWMTVQYDNMGRVVKKEL
KVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSARLTPLRYDL
RDRITRLGDVQYKMDEDGFLKQRGNDIFEYNSAGLLIKAYNKASGWSVKYRYDGLGRRVS
SKTSHGHHLQFFYADLTNPTKVTHLYNHSSSEITSLYYDLQGHLFAMELSSGDEFYIACD
NIGTPLAVFSGTGLMIKQILYTAYGEIYMDTNPNFQIIIGYHGGLYDPLTKLIHMGRRDY
DVLAGRWTSPDHDMWKHLSSNNIMPFNLYMFKNNNPISNSQDIKCYMTDVNSWLLTFGFQ
LHNVIPGYPKPDMDAMEPSYELIHTQMKTQEWDNSKSILGVQCEVQKQLKAFVTLERFER
IYSSSIAGCRHVKKNKNFASGGSIFGKGVKFAMKDGRVATDVANEDGRRIAAILNNAHYL
ENLHFTIDGVDTHYFIKQGPSEGDLSILGLSGGRRTLENGVNVTVSQINTVLGGRTRRYT
DIQLQYGALCLNTRYGTTLDEEKARVLELARQRAVAQAWAREQQRLRDGEEGIRSWTEGE
KQQVLNTGRVQGYDGYFVISVEQYPELSDSANNIHFMRQSEMGRR
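
The sequence above appears to be wrapped at 60 residues per line, so it has to be joined into one string before any region from the domain assignments can be cut out of it. The sketch below shows one way to do that in Python, assuming region coordinates are 1-based and inclusive (an assumption, not stated on this page). A short toy input stands in for the full protein, and the helper names join_wrapped and extract_region are hypothetical.

```python
def join_wrapped(lines):
    """Concatenate wrapped sequence lines, dropping surrounding whitespace."""
    return "".join(line.strip() for line in lines)


def extract_region(sequence, region):
    """Return the residues for a region string such as '1-5,11-13'.

    Coordinates are assumed to be 1-based and inclusive (an assumption),
    so each segment maps to the Python slice [start - 1:end].
    """
    pieces = []
    for part in region.split(","):
        start, end = (int(x) for x in part.strip().split("-"))
        pieces.append(sequence[start - 1:end])
    return "".join(pieces)


# Toy input: the first 20 residues of the sequence above, split over two lines.
toy = join_wrapped(["MDVKERKPYR", "SLTRRRDTER"])
print(len(toy))                          # 20
print(extract_region(toy, "1-5,11-13"))  # MDVKESLT
```

Applied to the full joined sequence, len() should return the stated sequence length of 2805 if no residues were lost in copying.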
Identical sequences: U3I960, ENSAPLP00000003781
