SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSTGUP00000012283 from Taeniopygia guttata 76_3.2.4

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSTGUP00000012283
Domain Number 1 Region: 1378-1511
Classification Level Classification E-value
Superfamily Cadherin-like 6.42e-33
Family Cadherin 0.00092
Further Details:      
 
Domain Number 2 Region: 1884-2015,2043-2108
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 7.32e-32
Family Laminin G-like module 0.0078
Further Details:      
 
Domain Number 3 Region: 511-639
Classification Level Classification E-value
Superfamily Cadherin-like 7.33e-30
Family Cadherin 0.0013
Further Details:      
 
Domain Number 4 Region: 2133-2326
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.16e-28
Family Laminin G-like module 0.0043
Further Details:      
 
Domain Number 5 Region: 1055-1168
Classification Level Classification E-value
Superfamily Cadherin-like 2.36e-25
Family Cadherin 0.00066
Further Details:      
 
Domain Number 6 Region: 302-411
Classification Level Classification E-value
Superfamily Cadherin-like 2.43e-25
Family Cadherin 0.0018
Further Details:      
 
Domain Number 7 Region: 620-744
Classification Level Classification E-value
Superfamily Cadherin-like 8.51e-25
Family Cadherin 0.002
Further Details:      
 
Domain Number 8 Region: 952-1052
Classification Level Classification E-value
Superfamily Cadherin-like 9.68e-25
Family Cadherin 0.0012
Further Details:      
 
Domain Number 9 Region: 1270-1397
Classification Level Classification E-value
Superfamily Cadherin-like 2e-24
Family Cadherin 0.0013
Further Details:      
 
Domain Number 10 Region: 837-951
Classification Level Classification E-value
Superfamily Cadherin-like 5.14e-24
Family Cadherin 0.00044
Further Details:      
 
Domain Number 11 Region: 1161-1274
Classification Level Classification E-value
Superfamily Cadherin-like 4.84e-23
Family Cadherin 0.00021
Further Details:      
 
Domain Number 12 Region: 405-530
Classification Level Classification E-value
Superfamily Cadherin-like 3.57e-21
Family Cadherin 0.0019
Further Details:      
 
Domain Number 13 Region: 1489-1603
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000000111
Family Cadherin 0.0035
Further Details:      
 
Domain Number 14 Region: 210-320
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000000126
Family Cadherin 0.0028
Further Details:      
 
Domain Number 15 Region: 729-837
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000443
Family Cadherin 0.0043
Further Details:      
 
Domain Number 16 Region: 1599-1701
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000614
Family Cadherin 0.014
Further Details:      
 
Domain Number 17 Region: 116-196
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000243
Family Cadherin 0.027
Further Details:      
 
Domain Number 18 Region: 2373-2410
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000299
Family EGF-type module 0.017
Further Details:      
 
Weak hits

Sequence:  ENSTGUP00000012283
Domain Number - Region: 2330-2356
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000937
Family EGF-type module 0.015
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSTGUP00000012283   Gene: ENSTGUG00000011921   Transcript: ENSTGUT00000012418
Sequence length 2621
Comment pep:novel chromosome:taeGut3.2.4:2:132758322:132809643:1 gene:ENSTGUG00000011921 transcript:ENSTGUT00000012418 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
ETQRARCSRPRGPGARRYTARLPAGARVGDTVFTVPRSRDRAAGGWFELASPGATPVGVD
RTSGRLYLRRELPAGGRAEVPVKVHRGGGGDDDWYLCHVTLLAPEEELLSWAMYPYPYLA
RVDPDARKGTLTYQLVAHCSSRENTTAGITYTLIAGGEERFRVDKDTGMIMTTGLPLTWN
KEYVVTVEASDEHGNKSPYASVSILAGSRPPQFTNMSYSVFVPENTPAGEKVAVVEAVSF
QSQPLSYTLLMNPSGLFRVRQESGELSLTHPVDYESEHHLYYLLLKAMEVESTLSSVTEV
VVHITDENDCSPEFQRSIYSRDNIPETIPIGTSLLQVLATDCDSGSNSEISYFIQSTDFS
ITRHGVINSNQRLNFERANHMYEFVVIAVDKGHPPRTGTASVRIRMANVNDEAPVFSQAV
YRTFLSEDAGPGTLVATVRAEDPDGDGLLYLITGGNEEGNFELDSQKGIIKLRRNPPPSL
KGPQYTLNVTAIDDNASGGPTPLSSFAEVIVGVNDVNNNKPVFRECAYYSDSTWVLENQP
PGTRVLQVEAYDADLGINGEVKYGLMHRDGASLGFSIDPDTGVITTTQSFDRERQREYTL
SVTATDQAQEPLIGVCQLTVLIADVNDNDPKFDKSRYQYFLSEDTPVGMSFLQVAAHDSD
QGVNAAVTYSMLEHQLEYFQINPSTGWVYVNGPLHNTMRISRYIVATDGGNRSSTAELTV
TVTSALSQPPRWEQSTYWVTIPENTIRDTKIVTIKATSPLGDPRVTYNLEEGQVPETNMP
VRFYLKPNRADGSASLLVAEPLDFETTKFFTLTVRAQNVAMTPLASFATVCVNITDVNDN
VPFFMSSNYEVSVPEGADVGTSVVQVSAMDLDSGLHGEVHYLILKDANEDYQFFTIEPET
GIIYTQASFDREKKASYLIEVQSQDLSESARPGVHGQSNTDTAYVRIFVSDVNDNAPAFP
RSVYEVSIDEDRDVGSPVVTATADDKDEGANAKLRYQITSGNVKGVFDVEPETGTVFIAQ
PLDYEQEQHYELRLVASDGKWENHTLIIINVVNKNDEAPVFTQSEYQGSVLEELTDLPVL
VLKVSATDPDQAADQNAINYSLHGQGASSEFSINENTGEISAHKRLDREKRSTWRFLVLA
TDEGGEGLTGFADVIIEVRDVNDNAPLFLCVSDGCFTGHIPEDSPADTPIMEMTAVDLDD
PKAGINAVLTYSIIQNVKNEINLNLFSIDSVSGTISTVLGSLDREKEDKYLVVVEARDGG
GLTGTGTATILVTDVNDHAPVFLQRIYTAFVSENASINTEVVMVSAVDRDEGENAMVAFS
IIDGDNDRKFSIETDEVNNCGFIRLRKRVDFEKPHERVFNLTVKAEDMDFFSITHCVIYV
EDSNDHAPVFYPQFYEVAALGEDVPVGTRVIQVSAVDLDSGLNGRFSFHLLNKSDPGGQF
SLASDGWLMVAGLLDYETVAQYQLVVIATDMGQPPLTGSATVLVTLQDVNDNGPEFEAHY
NPVVWENTASPQAVQMNETSTLLYAKDRDTAANGAPFSFRLLSDFDSLTSFSLQDFHNGS
AVLTALRTFDREVHKVFYLPILITDSGIPPMSSTNTLTVNIGDQNDHPHSAGYMECLIYS
YDGILPTTELGQVSAPDADDWDRKIYQFEGKTSRYFILNDNSGLLTIKEGTPPGTYNIRV
RVIDGVWPDVISTVKVIVKEIKDDALRNAGSLRIKGITPEEFISQSPEKQSKYYQMKKLL
SEIISVQLENVHIFSVLNSPSPIRGVDVWFAVYGPPYHKAEKLNGNVAASRAWLESILDI
NITQTGIDECVTADCTHSSGCISKHEQNHVPTITTAGSVSLVSVTVLSHAVCGCAARENP
HLSCSSYQTNPCLNGGTCVDTDLGYRCKCAANFHGPDCQQTKHSFRGHGYAWFPPLQPCF
ESRISLEFITEVVDGLLLYHGPAARGQPGEQENFLALELSGGVPSLTVSHSSGELFLQLS
QKVNVADRRWHNIKIINDGKAMKLILDNCMNVSVRDDGRVTKKISQMDLSVCEASGEIVG
SQSMGKLFSGHQPLQLGGVKKTLPYRDSQRHFRGFVGCIRNVIVDSKVYDLQHPAESLNS
APGCVLMDEMCQSGGMASCGTHGKCVGGWDFFRCDCSPGYAGLACEKVLPEWAFGRDSWI
HFEPRSILSTRSTRIQLLVRTRISHSTLLSLASVDGNRYIRLEVFDGFFSVNFSLGDKNH
SLRMRTLRINNGQWVLLTMERYNNEFTLRVNSGGGDQEVTSVLGVNRWFEMDWASIVLGN
RLPSHSESDFQGCMRDVQLDGQPLLVEGRSTEFGLILRRQGVTMGCHSSACSSQPCYSPF
LCVDLWRKYECRCPAGKVEVTDTLTGLRHCTSSPCGHWTCRNGGTCVAQSQDKTICQCPE
GYKGRWCEISQVKAGRPVGLSSGSILAISMCLLVFLALLVSYTVWSQWGSSGFRKGGIYH
IPEERESWEDVRENVFNYNEEGGGERDQNAYNIDELKKPLHKIPRSSLRAAAPHSRTPTN
PKRDSLPKHSHQKQSISAVTSIPDFKEYVSQIIRDADNDLKSLPADTIHFYCLEGQCSLA
GSLSSLDSISGDEDLNYDCLQEWGSKFEKLKELYAVSNENL
Download sequence
Identical sequences H0ZNV4
59729.ENSTGUP00000012283 ENSTGUP00000012283 ENSTGUP00000012283

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]