SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSTGUP00000002105 from Taeniopygia guttata 76_3.2.4

Domain assignment details

Strong hits

Sequence:  ENSTGUP00000002105
Domain Number 1 Region: 2099-2229
Classification Level Classification E-value
Superfamily Cadherin-like 2.43e-34
Family Cadherin 0.00042
 
Domain Number 2 Region: 3913-4120
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.96e-34
Family Laminin G-like module 0.0062
 
Domain Number 3 Region: 196-316
Classification Level Classification E-value
Superfamily Cadherin-like 1.71e-32
Family Cadherin 0.00052
 
Domain Number 4 Region: 422-542
Classification Level Classification E-value
Superfamily Cadherin-like 4.28e-32
Family Cadherin 0.0008
 
Domain Number 5 Region: 2620-2730
Classification Level Classification E-value
Superfamily Cadherin-like 3e-30
Family Cadherin 0.00027
 
Domain Number 6 Region: 640-743
Classification Level Classification E-value
Superfamily Cadherin-like 2.86e-29
Family Cadherin 0.0011
 
Domain Number 7 Region: 1267-1377
Classification Level Classification E-value
Superfamily Cadherin-like 5.5e-28
Family Cadherin 0.0019
 
Domain Number 8 Region: 3139-3250
Classification Level Classification E-value
Superfamily Cadherin-like 7.71e-28
Family Cadherin 0.00076
 
Domain Number 9 Region: 1365-1494
Classification Level Classification E-value
Superfamily Cadherin-like 8.77e-28
Family Cadherin 0.0016
 
Domain Number 10 Region: 3040-3151
Classification Level Classification E-value
Superfamily Cadherin-like 9.85e-28
Family Cadherin 0.0012
 
Domain Number 11 Region: 1162-1273
Classification Level Classification E-value
Superfamily Cadherin-like 2.14e-27
Family Cadherin 0.00042
 
Domain Number 12 Region: 1792-1901
Classification Level Classification E-value
Superfamily Cadherin-like 2.14e-27
Family Cadherin 0.00084
 
Domain Number 13 Region: 3350-3471
Classification Level Classification E-value
Superfamily Cadherin-like 1.43e-26
Family Cadherin 0.0012
 
Domain Number 14 Region: 2003-2104
Classification Level Classification E-value
Superfamily Cadherin-like 1.96e-26
Family Cadherin 0.0015
 
Domain Number 15 Region: 4135-4364
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.77e-26
Family Laminin G-like module 0.0025
 
Domain Number 16 Region: 532-639
Classification Level Classification E-value
Superfamily Cadherin-like 3.43e-26
Family Cadherin 0.00054
 
Domain Number 17 Region: 947-1050
Classification Level Classification E-value
Superfamily Cadherin-like 6.14e-26
Family Cadherin 0.0011
 
Domain Number 18 Region: 1890-2012
Classification Level Classification E-value
Superfamily Cadherin-like 6.85e-26
Family Cadherin 0.00085
 
Domain Number 19 Region: 2817-2947
Classification Level Classification E-value
Superfamily Cadherin-like 9.99e-26
Family Cadherin 0.0019
 
Domain Number 20 Region: 2309-2410
Classification Level Classification E-value
Superfamily Cadherin-like 6.71e-25
Family Cadherin 0.00073
 
Domain Number 21 Region: 839-946
Classification Level Classification E-value
Superfamily Cadherin-like 1.13e-24
Family Cadherin 0.0028
 
Domain Number 22 Region: 1690-1791
Classification Level Classification E-value
Superfamily Cadherin-like 1.7e-24
Family Cadherin 0.0012
 
Domain Number 23 Region: 2210-2321
Classification Level Classification E-value
Superfamily Cadherin-like 3.28e-24
Family Cadherin 0.0011
 
Domain Number 24 Region: 2928-3053
Classification Level Classification E-value
Superfamily Cadherin-like 2e-23
Family Cadherin 0.0014
 
Domain Number 25 Region: 2724-2829
Classification Level Classification E-value
Superfamily Cadherin-like 2.36e-22
Family Cadherin 0.0012
 
Domain Number 26 Region: 746-851
Classification Level Classification E-value
Superfamily Cadherin-like 2.43e-22
Family Cadherin 0.0017
 
Domain Number 27 Region: 2411-2519
Classification Level Classification E-value
Superfamily Cadherin-like 2.88e-21
Family Cadherin 0.0037
 
Domain Number 28 Region: 1046-1163
Classification Level Classification E-value
Superfamily Cadherin-like 5.57e-21
Family Cadherin 0.002
 
Domain Number 29 Region: 3249-3354
Classification Level Classification E-value
Superfamily Cadherin-like 1.16e-20
Family Cadherin 0.002
 
Domain Number 30 Region: 3460-3563
Classification Level Classification E-value
Superfamily Cadherin-like 7.42e-19
Family Cadherin 0.001
 
Domain Number 31 Region: 87-208
Classification Level Classification E-value
Superfamily Cadherin-like 4.28e-18
Family Cadherin 0.0018
 
Domain Number 32 Region: 2521-2626
Classification Level Classification E-value
Superfamily Cadherin-like 1.07e-17
Family Cadherin 0.0043
 
Domain Number 33 Region: 1474-1581
Classification Level Classification E-value
Superfamily Cadherin-like 6.85e-16
Family Cadherin 0.0028
 
Domain Number 34 Region: 304-426
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000000101
Family Cadherin 0.001
 
Domain Number 35 Region: 3762-3774,3805-3899
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.0000000000267
Family Growth factor receptor domain 0.0095
 
Domain Number 36 Region: 1583-1697
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000357
Family Cadherin 0.0035
 
Domain Number 37 Region: 27-93
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000101
Family Cadherin 0.0098
 
Domain Number 38 Region: 4385-4423
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000432
Family EGF-type module 0.017
 
Weak hits

Sequence:  ENSTGUP00000002105
Domain Number - Region: 3565-3655
Classification Level Classification E-value
Superfamily Cadherin-like 0.000754
Family Cadherin 0.016
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSTGUP00000002105   Gene: ENSTGUG00000002042   Transcript: ENSTGUT00000002125
Sequence length 4942
Comment pep:known_by_projection chromosome:taeGut3.2.4:4:6366597:6489156:1 gene:ENSTGUG00000002042 transcript:ENSTGUT00000002125 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
ADPRQVFRVLEEQPPGTWVGTIATRPGFTYRLSEHHELFAINATSGALHTRATIDRESLA
SDVVDLVVLSSQPTYPSEVRVLVLDLNDNAPVFPDPSIVVTFKEDTGSGRQLILDTATDA
DSGTNGVDHGSYRIVAGNEEGRFRLNITLNPSGEGAFLHLVSRGGLDREATPTYQLLVQV
EDKGEPRRRGYLQVNVTIQDINDNPPIFSQTLYQARVPEDAPVGASVLQVTAADADEGTN
ADIRYRLEGGDLPFEVDPESGVIRIRERLDYEVRQQYSLTVQATDRGVPALSGRAEALIR
LLDVNDNEPRVKFRYFPATSRFASVDENAAPGTVVALLTVSDADSPAANGNISVSILAGN
EQRHFEVHSSKVPNLSLIKVAAALDRERIPAYNLTVAVADNYGAPPPPPGSPTRSSVASL
VIFVNDINDHPPVFGQSVYRVNISEDVPPGSYVRGLSATDRDSGLNANLKYSIVSGNELG
WFRISEHSGLVTTSSRLDRETASQVVLNVSARDQGVQPKFSYAQLVVTILDVNDNKPRFG
QPEGYQVSLAENSPSGTELLVLSATDGDLGDNGTVRFSLQEAEPALVAMFRLDPVSGKLS
TISQLDREEQSHFSLQVLATDLGSPPLSSVARVNVTILDVNDNSPVFYPVQYFAHIQENE
PAGTYVTTLSATDPDLGPNGTVKYSISAGDTSRFQVHGQTGVITTKIALDREEKTAYQLQ
IVATDGGHLQSQNQAIVTITVLDTQDNPPVFSQGTYSFVVFENVALGYHVGTVFASTMDL
NTNISYLITTGDQRGVFAINRVTGQITTASIIDREEQAFYQLKVVARGGAITGDAVVNIT
VKDLNDNSPHFIQVVESVNVVENWKAGHTIFQAKALDPDEGVNGVVLYSLKQNPKGLFSI
NEQTGAISLTGPLDINAGSYQVEILASDMGVPQLSSAFILTVSVHDVNDNPPVFDQLSYE
ITILESEPVNSRFFKVQASDKDSGVNGEIAYSIIEGNAGDAFGIFPDGQLYIKSELDREL
QERYVLLVVASDRAVEPLNATVNVTVILEDVNDNRPLFNSTNYVFYFEEEQRGGSYVGKI
NAVDKDFGPNGEVRYSFEHMQPDFELNTVTGEIRSTHQFDREALMRQRGAAVFSLTVIAT
DQGLPKPLKDQATVQIYMKDINDNAPKFLKDLYQATISELAANLTQVLRVSASDVDEGVN
GLIQYSVIKGNEENQFVIDTSTGQVTLVGRLDYEATASYSLVIQAVDSGAVSLSSTCMLS
IDVLDENDNSPSFPKSTLLVDVLENMRVGELVSSVTATDSDSGDNADLHYSITGTNNHGT
FSISPNTGSIFLAKKLDFETQYLYKLNITAKDQGRPQSSTMSVVIHVRDFNDNPPHFPPG
DIFKSIVENVPVGSSVISVTAHDPDADINGQLTYAIIQQMPRGNHFRIDEVRGTIFTNAE
IDREFANLFELTVKATDQAVPVESRRFALKNVTILVTDQNDNVPVFVSQNALAADPSVVI
GSILTTIIAADPDEGANGEVEYEIVNGDTETFIVDRYSGDLRVASALVPSQLIYNLIVSA
TDLGPERRKSTTEMTIILQGVDGPVFTQPKYITILKEGEPIGTNVISLEAASPRGSEAQV
EYYIVSVRCEDKSLGRLFTIGRHTGVIQTAAILDREQGARLYLVDVYAIEKSSVLPRTQR
AEVEITLQDINDNPPVFPTDMLDLTVEENIGDGSKILQLTAMDADEGANALVTYTIISGA
DDSFHIDPESGELIATKRLDRERRSKYSLLVRADDGLQSSDVRINITVSDVNDHIPKFSK
PVYSFDIPEDATPGSLVAAILATDDDSGINGEITYTISEDDEEGIFFLNPVTGVFNLTRA
LDYEAQQYYILTVRAEDGGGQFTAIRVYFNILDVNDNPPVFGMASCSTSLMENLPPGSAI
LNFTVTDADDGPNSQLSFSIASGDSAGQFGIDNRGVLSIRKPLDRESQSFYSLVVQVHDM
APLPASRYTSTAQVSIILLDVNDSPPSFISPKLTYVPENTPIDTVVFKAQATDPDSGPNS
YIEYSLLPPPGNKFSIGTIDGEVRLTGELDREAVANYTLTVVATDKGQPSLSSSTDVVVI
VLDINDNNPLFAQKLYKVEVAENTLTGTDLIQVLAADGDEGTNGQVRYAIVSGDANSEFR
IDSVTGVITVAKPLDREKKPSYTLTVQSSDRGSSPRTDTTTVSIVLKDVNDFIPTFELSP
YSVNVPENLETLPKVILQVVARDDDQGLNSKLTYMLVAGNEEGAFTLSGSGELRLVQSLD
REAKEQYLLLVTAADTGSPALTGTGTIAVTVDDVNDNVPTFAFNMYFATVPEDAPTGTDI
LLVNSSDADASTNAVIRLMGGNSQFTINPSTGQIITSALLDREARENYTLVVVASDGGFP
TALSSSTSVLVSVDDVNDNPPKFQHHPYVTHVPSPTTSGSFVFAVTVTDADAGSNAELHY
SLVGKNSEKFHIDPARGAILAAKPLVGESEVTLSVHVRDSGRYPKTDSTTVTVRFVDKAE
FPRVQAEQETFTFPENQAVGTLVTTVSGSSARGGSLSYYIASGNLGSTFLVDQVTGQLSV
GRALDFESVQKYVVWIEARDMGFPPFSSYKKLEISVIDVNDNVPEFERDPFIAEIAENLS
PRKILTVAAVDRDSGLNGQLNYEIIEGNTENSFSINRATGEIRSIRPLDREKLSQYTLTI
KASDKGIPLQSTTVKVIINVLDENDNAPRFSQIFSASVPENAPLGFTVARVTTSDEDIGV
NAVSRYSIRDTSLPFTINPSTGDITISRPLNREDTDRYRIRVSAHDSGWTVSTDVTIFVS
DVNDNAPRFTKPSYYLECPELPGIGLKVTQVSATDPDEGSNGQVFYFIKSQSEFFRINAT
TGEIFNKQYLRYQNSSGSSNVNINRHSFIVTSSDRGSPPLVSETTVTINVVDSNDNAPLF
LTPKYFTPVTKNVRVGTNLIKVTAVDDKDFGLNSEVEYFIADESKTNKFRLDRNTGWISV
SSSLMADLNKNFLFKVKAKDKGNPPLSSEAAVEIVVTEENYHTPEFSQSRMSVTIPESYS
VGTVVRTVSARDRDAAMNGLIRYNISSGNEAGIFAINTTTGTLTLAKPLDFELDQKHELV
VTATDGGWVSRTGYCSVTVNVIDVNDNSPAFSPEEYFPTVLENAPSGTTVICLNATDADS
GSNAVIAYAIQSSDSDLFVIDPNTGTITTQGFLDYETKQSYHLTVKAFNVPDEERCSFAT
VNIQLEGTNEYVPRFVSKLYYFEVSEAASKGTVVGEVFASDRDMGIDGEVHYLIFGNSRK
KGFQIDEKSGQIYVSGPLDREKEERISLKVLAKNLGSIRGADIDEVTVNITVLDANDPPV
FTLGAYNIRISEGVPPGTHVTFVSAFDSDSVPSWSRFSYFIGSGNENSAFSINPQTGQVT
VTAELDRETLPVYNLSVLAIDSGTPSATGSASLVVTLEDINDNGPTLSTSQGEVLENNRA
GTLVMTLQSSDPDLPPNQGPFTYYLLSTGPATSYFSLSTAGVLTTTREIDREQISDFYLS
VITRDSGVPQMSSTGTVQIKVIDQNDNPSQPRTVEIFVHYYGNLFPGGILGNVKPQDPDV
LDSFQCSLTSGVTSLFSIPGGTCELHSQARSTDGTFDLAVLSNDGLHGAVTSSVRIFFAG
FSNTTIDNSVLLRLSAYSVRDFLTNHYLHFLRIASSQLTGLGTAIQLYGLYEDSNHTFLM
AAVKRGNNQYVNPSGVATFFESIKDVLFRQSGVRVEAVDHDWCLQSPCQNGGSCLRRLAV
SPALRTHESVPVIIVANEPLRPFVCRCLPGYDGSLCETDIDECLPSPCHNDGTCHNLVGG
FSCSCPEGFTGMACERDINECLSNPCKNGAACQNFPGSFNCVCKTGYTGKTCDSTVNYCE
CNPCFNGGSCQSGLEGYFCHCPFGVFGNHCELNSYGFEELSYMEFPSMDPNNNYIYIKFA
TIKSNALLLYNYDNQTGERAEFLALEIVEGRLRFSYNLGGGTYKLTTAKKVSDGQFHTAI
ARRAGMAASLTVDSCSEDQEPGYCTVSSVAVSTDWTLDVQPNRVTVGGIRSVEPILQRRG
QVESHDFVGCIMEFAVNGRPLEPSQALAAQGILDQCPRLEGACTTSPCQHGGTCVDQWSW
QQCHCKDGLTGRHCEKYVTADTALSLEGKGRLDYHMSPNKKRDYLMRLGARGAGAGRPGV
ERLEVKFMTRSESGILLHVQESSNFTTVKIKGGKVHYISDAGVAGKVERNIPEVYVADGQ
WHSVLLEKNGSATILSVDRTHSRDILHATQDFGGLNVLTISLGGAPSSQPFKSTAAGFNG
CISYIKYGGESLPFSGKHSLATPSKTDPSVKIGCRGPDVCASNPCWGELMCVNRWFAYQC
VPPGACASRPCLNGGSCEPGPRAGFTCSCPEAYAGRTCETLVACLGVLCAPGHECRASSH
GGHECLPSPHPTELSLPLWAVPAIVGSCATVLALLVLSLILCNQCRGKKSKGPKEEKKTK
QKKKKGSENVAFDDPDNIPPYGDDMTVRKQPEGNPKPDIIERENPYLIYDETDIPHATET
IPSAPLASPEPEIEHYDIDNASSIAPSDADIVQHYKQFRSHAPKFSIQRHSPLGFARQSP
MPLGASSLTYQPAYGQGLRTTSLSHSACPTPNPLSRHSPAPFSKSSTFYRHSPARELHLA
IREGSPLEMHGDVCQPGIFNYATRLGRRSKSPQTMASHSSRPGSRLKQPIGQIPLETSPP
VGLSIEEVERLNTPRPRNPSICSADHGRSSSEEDCRRPLSRTRNPADGIPAPESSSDSDS
HESFTCSEMEYDRDKPVTYTSRMPKLSQVNESDADDEDNYGSRLKPRRYPGRRGEGGPVG
AQATGTAAGESSLPGKLGQQAGSFNWDSLLNWGPGFGHYVDVFKDLASLPEKTAAAAAAA
SEESKGGAAKAVSKEGEAEQYV
Identical sequences H0YUY5
ENSTGUP00000002105 59729.ENSTGUP00000002105 ENSTGUP00000002105
