SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSAPLP00000012337 from Anas platyrhynchos 76_1.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSAPLP00000012337
Domain Number 1 Region: 1059-1399
Classification Level Classification E-value
Superfamily NHL repeat 8.11e-25
Family NHL repeat 0.0014
Further Details:      
 
Domain Number 2 Region: 1241-1333,1372-1603
Classification Level Classification E-value
Superfamily DPP6 N-terminal domain-like 0.000000118
Family DPP6 N-terminal domain-like 0.027
Further Details:      
 
Domain Number 3 Region: 757-836
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 0.0000049
Family Carboxypeptidase regulatory domain 0.055
Further Details:      
 
Domain Number 4 Region: 497-524
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000461
Family Integrin beta EGF-like domains 0.076
Further Details:      
 
Domain Number 5 Region: 565-589
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000796
Family Integrin beta EGF-like domains 0.071
Further Details:      
 
Weak hits

Sequence:  ENSAPLP00000012337
Domain Number - Region: 466-490
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000461
Family EGF-type module 0.074
Further Details:      
 
Domain Number - Region: 633-667
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000503
Family EGF-type module 0.056
Further Details:      
 
Domain Number - Region: 431-458
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00067
Family Integrin beta EGF-like domains 0.064
Further Details:      
 
Domain Number - Region: 595-620
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00114
Family EGF-type module 0.061
Further Details:      
 
Domain Number - Region: 1966-2064,2156-2216
Classification Level Classification E-value
Superfamily NHL repeat 0.0141
Family NHL repeat 0.014
Further Details:      
 
Domain Number - Region: 401-425
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0411
Family EGF-type module 0.043
Further Details:      
 
Domain Number - Region: 531-558
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0838
Family Integrin beta EGF-like domains 0.032
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSAPLP00000012337   Gene: ENSAPLG00000012443   Transcript: ENSAPLT00000013072
Sequence length 2601
Comment pep:known_by_projection scaffold:BGI_duck_1.0:KB743323.1:1077459:1320050:-1 gene:ENSAPLG00000012443 transcript:ENSAPLT00000013072 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MPGSPHNQFTFRPLPPPPPPPHACTCTRKPPPAADTLQRRSMTTRSQPSPAAPTPPTSTQ
DSVHLHNSWVLNSNIPLETSNAGKIPSYMEIKMNLSHLIGRLSLPTATTDKHFLFKHGSG
SSAIFSAASQNYPLTSNTVYSPPPRPLPRSTFSRPAFTFNKPYRCCNWKCTALSATAITV
TLALLLAYVIAVHLFGLTWQLQPVEGQLYENGVSKGNRGTESMDTTYSPIGGKVSDKTEK
KGMCEEADKTKQLVFQKGRTIDTGEVEIGAQVMQMIPPGLFWRFQITIHHPMYLKFNISL
AKDSLLGIYGRRNIPPTHTQFDFVKLMDGKQLIKQETKNSEEPQQAPRNLILTSLQETGF
IEYMDQGAWHMAFYNDGKKMEQVFVLTTAIEIEVLDDCSTNCNGNGECISGHCHCFPGFL
GPDCAKDSCPVLCSGNGEYEKGHCVCRNGWKGPECDVPEEQCIDPTCFGHGTCIMGVCIC
VPGYKGEICEEEDCLDPMCSGHGVCVQGECHCSAGWGGVNCETSLPVCQEQCSGHGTFLL
DAGLCSCEPQWTGSDCSTELCTLDCGSHGVCSRGICQCEEGWVGPTCEERTCHSHCAEHG
QCKDGKCECSPGWEGDHCTIAHYLDAVRVLDGCPGLCYGNGRCTLDQNGWHCVCQVGWSG
SGCNVVMEMVCGDNLDNDGDGLTDCVDPDCCQQNNCYASPLCQGSPDPLDLIQHSQPPFS
QHPPRLFYDRIRFLIGKESTHVIPGDVSFESSRACVIRGQVLAIDGTPLVGVNVSFLHHN
EYGYTISRQDGSFDLVAVGGISVTLVFDRSPFISEKRTLWLPWNRFIIVDKVVMQRTESD
IPSCDISSFISPNPIVLPSPLTAFGGSCPERGTVIPELQVVQEEIPIPSSFVKLSYLSSR
TPGYKTLLRIILTHATIPSGMTKVHLIVAVEGRLLQKWFPAAANLVYTFAWNKTDIYGQK
VSGLAEAMVSVGYEYETCPDFILWEKRTVTLQGFEMDASNLGGWSINKHHVLNPQSGIVH
KGNGENMFISQQPPVISTVMGNGHQRSVSCSNCNGLALNSKLFAPVALASGPDGSLYIGD
FNFVRRIFPSGNSIGILELRNRDTRHSTSPAHKYYLAVDPVSESLYLSDTNTRKIYKAKS
LVETKDLAKNADVVAGTGDQCLPFDQSHCGDGGKASEASLNSPRGITVDKHGFIYFVDGT
MIRKIDENGVITTIIGSNGLTSTQPLSCDSGMDITQVRLEWPTDLTVNPLDNSLYVLDNN
IVLQISENRRVRIIAGRPIHCQVPGIDHFIISKVAIHSTLESARAIAVSHSGILYIAETD
ERKINRIQQVTTNGEISIIAGTPSDCDCKIDPNCDCFSGDGGYAKDAKLKAPSSLAVSPD
DTLYVADLGNVRIRAVSRNKAHLSDTNMYEIASPADQELYQFTINGTHLHTLNLITRDYI
YNFTYSSEGDIGTITSSNGNSVHIRRDTSGLPLWVVVPGGQVYWLTISSNGVLKRVYAQG
YNLALMTYPGNTGLLATKSDENGWTTVYEYDSDGHLTNATFPTGEVSSFHSDVEKLTRVE
LDTSNRENVITATNFSATSTIYTLKQDNTQNIYRVSLDGSLRVTFASGMEITLNTEPHIL
AGVISPTLGKCNISLPGEHNSNLIEWRQRREQTKGNISTFERRLRAHNRNLLSIDFDHVT
RTGKIYDDHRKFTLRIMYDQTGRPVLWSPISKYNEVNITYSHSGLVTYIQRGTWTEKMEY
DPSGNIISRTWADGKIWSYTYLEKSVMLLLHSQRRYIFEYDQSDYLLSVTMPSMVRHALQ
TMLSVGYYRNIYTPPDSTAAFIQDITRDGRLLQTLYPGTGRRVLYKYTKQSRLSEILYDT
TQVTFTYEESSGVIKTIHLMHDGFICTIRYRQTGPLISRQIFRFSEEGLVNARFDYSYNN
FRVTSMQAMINETPLPIDLYRYVDVSGRTEQFGKFSVINYDLNQVITTTVMKHTKIFSAN
GQVIEVQYEILKSIAYWMTIQYDNMGRMVICDTRVGVDANITRYFYEYDADGQLQTVSVN
DKTQWRYSYDLNGNINLLSHGNSARLTPLRYDLRDRITRLGEIQYKMDEDGFLRQRGNEI
FEYNSNGLLNKAYNKVSGWTVQYCYDGLGRRVASKSSLGQHLQFFYADLSNPIRVTHLYN
HTSSEITSLYYDLQGHLIAMELSSGEEYYVACDNTGTPLAVFSSRGQVIKEILYTPYGEI
YQDTNPDFQVVIGFHGGLYDFLTKLVHLGQRDYDVIAGRWTTPNHHIWKHLNAVPQPFNL
YSFENNYPVGRIQDVAKYTTVDIGSWLELFGFQLHNVLPGFPKPEIEALETTYELLQLQT
KTQEWDPGKTILGIQCELQKQLRNFISLDQLPMTPSYSDGKCYESVKQPRFAAIPSVFGK
GIKFAIKDGIVTADIIGVANEDSRRIAAILNNAHYLENLHFTIEGRDTHYFIKLGSLEED
LALIGNTGGRRILENGVNVTVSQMTSVINGRTRRFADIQLQHGALCFNVRYGTTVEEEKN
HVLEIARQRAVAQAWTKEQRRLQEGEEGIRAWTDGEKQQLLSTGRVQGYDGYFVLSVEQY
LELSDSANNIHFMRQSEIGRR
Download sequence
Identical sequences U3IYL1
ENSAPLP00000012337

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]