SUPERFAMILY 1.75 HMM library and genome assignments server

Superfamily is undergoing a server migration - you are now browsing on the new server. Please contact us if you experience any problems.

Domain assignment for ENSGALP00000035473 from Gallus gallus 69_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGALP00000035473
Domain Number 1 Region: 1158-1385
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 9.15e-41
Family Laminin G-like module 0.0015
Further Details:      
 
Domain Number 2 Region: 312-416
Classification Level Classification E-value
Superfamily Cadherin-like 1.57e-28
Family Cadherin 0.001
Further Details:      
 
Domain Number 3 Region: 207-319
Classification Level Classification E-value
Superfamily Cadherin-like 5.57e-28
Family Cadherin 0.0006
Further Details:      
 
Domain Number 4 Region: 623-735
Classification Level Classification E-value
Superfamily Cadherin-like 8.42e-28
Family Cadherin 0.0007
Further Details:      
 
Domain Number 5 Region: 99-205
Classification Level Classification E-value
Superfamily Cadherin-like 9.99e-26
Family Cadherin 0.0013
Further Details:      
 
Domain Number 6 Region: 1403-1614
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.39e-25
Family Laminin G-like module 0.0072
Further Details:      
 
Domain Number 7 Region: 520-622
Classification Level Classification E-value
Superfamily Cadherin-like 2.62e-25
Family Cadherin 0.0022
Further Details:      
 
Domain Number 8 Region: 723-840
Classification Level Classification E-value
Superfamily Cadherin-like 8.99e-25
Family Cadherin 0.00099
Further Details:      
 
Domain Number 9 Region: 417-518
Classification Level Classification E-value
Superfamily Cadherin-like 3.85e-22
Family Cadherin 0.001
Further Details:      
 
Domain Number 10 Region: 2-97
Classification Level Classification E-value
Superfamily Cadherin-like 6.02e-21
Family Cadherin 0.0081
Further Details:      
 
Domain Number 11 Region: 3543-3572,3643-3722
Classification Level Classification E-value
Superfamily SpoIIaa-like 0.000000000000844
Family Anti-sigma factor antagonist SpoIIaa 0.016
Further Details:      
 
Domain Number 12 Region: 1097-1136
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000031
Family EGF-type module 0.0076
Further Details:      
 
Domain Number 13 Region: 825-943
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000657
Family Cadherin 0.01
Further Details:      
 
Domain Number 14 Region: 2192-2472
Classification Level Classification E-value
Superfamily Family A G protein-coupled receptor-like 0.000000137
Family Rhodopsin-like 0.027
Further Details:      
 
Domain Number 15 Region: 1607-1640
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000944
Family EGF-type module 0.01
Further Details:      
 
Domain Number 16 Region: 1736-1775
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000201
Family Laminin-type module 0.0086
Further Details:      
 
Weak hits

Sequence:  ENSGALP00000035473
Domain Number - Region: 1642-1685
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000108
Family EGF-type module 0.025
Further Details:      
 
Domain Number - Region: 1755-1838
Classification Level Classification E-value
Superfamily Hormone receptor domain 0.00126
Family Hormone receptor domain 0.0092
Further Details:      
 
Domain Number - Region: 1077-1101
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00472
Family EGF-type module 0.048
Further Details:      
 
Domain Number - Region: 1140-1169
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0254
Family EGF-type module 0.017
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGALP00000035473   Gene: ENSGALG00000005730   Transcript: ENSGALT00000036251
Sequence length 3737
Comment pep:known chromosome:WASHUC2:12:9269058:9269472:1 gene:ENSGALG00000005730 transcript:ENSGALT00000036251 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
NYHAEMAENQPAGTAVVAVAAQDPDGGEAGRLVYSMDALMNSRSLELFSIDPHAGLITTT
QALDRESMELHYFRVTAADHGAPRLSATTMVAITVADRNDHDPVFEQGEYRETIRENVEE
GYPILQLRATDVDSPPNANIRYRFVNERAAHAVFEIDPRSGLITTSGPVDREKMERYSLV
VEANDQGREPGPRSATVRVYITVLDENDNTPQFSEKRYIVQVREDIRPHTEILRVTATDL
DKDNNALVHYNIISGNSRGQFSIDSVTGEIQVVAPLDFEVEREYALRIRAQDAGRPPLSN
NTGMASIQVVDINDHAPIFVSTPFQISVLENAPLGHSVIHIQAVDADYGENARLEYKLTG
VSADTPFVVNSATGWITVSGPLDRELVEHYFFGVEARDHGSPSLSASASVTITVMDVNDN
RPEFTQKEYFIRLNEDAAVGTSVLSVTAIDRDVNSAITYQITGGNTRNRFAISTQGGVGI
ITLSLPLDYKQERRYVLTVTASDRTLRDNCHVHINITDANTHRPVFQSAHYSVSINEDRP
VGSTVVVISATDDDVGENARITYYLEDNVPQFRIDPDSGAITLQAELDYEDQVTYTLAIT
AKDNGIPQKADTTYVEIMVNDVNDNAPQFVSTHYQGVISEDAPPFTSVLQISATDRDAHT
NGRVQYTFQNGEDGDGDFTIEPTSGIIRTVRRLDRENVPVYELTAYAVDRGIPAQRTPVH
IQVTIQDVNDNAPVFPAEEFEVLVKENSIVGSVVAQITAIDPDEGPNAQIMYQIVEGNIP
EIFQMDIFSGELTALIDLDYETKPEYVIVVQATSAPLVSRATVHIKLIDQNDNSPVLKNF
QILFNNYVSNKSNTFPSGVIGKVPAYDPDVSDRLFYTFERGNELHLLIVNQTSGELRLSR
KLDNNRPLVASMLVTVTDGIHSVTAQCVLRVIIITEDMLANSITVRLENMWQERFLSPLL
ATFLEGVATVLATPKEDVFIFNIQNDTDVGGTVLNVSFSALAPRGGHYFSSEELQEQLYM
KRMALTGASMLEVLPFDDNVCLREPCQNYMKCISVLKFDSSAPFIASPSTLFRPIHPITG
LRCRCPQGFTGDYCETEINLCYSNPCLHGGTCTRKEGGYTCVCRQHFSGENCEVDSRSGR
CQPGVCRNGGTCTNGADGGFRCQCPAGGFETPFCELSTRSFPPRSFVMFRGLRQRFHLTL
SLSFSTVEPGGLLLYNGRLNERHDFLAVEIIQGQVQLKYSTGESSTVVSPYLPGGVSDGQ
WHTLQLRYYNKPKVSALGVVQGPSKDKVAILTVDECDASVALQFGSEIGNYSCAAEGVQT
SSKKSLDLTGPLLLGGVPNLPENFPVSHRDFVGCMRDLYIDNKRIDLASYIANNNGTTAG
CHAKHSFCDSSPCKNGGTCSVSWGTYSCLCPVGFGGKDCRHAMHHAHYFQGNSVLSWDFK
ADMKISVPWYLGLAFRTRQMDGVLLQAHAGQYTTLLCQLSGGLLSFMVSRGSGRSTSLVL
DQLQLNDGRWHDLQLELRDVRSGRDSRYVITIMLDFGLYQDTVVVGNELHGLKVKHLHVG
GVLGSGEVQNGLRGCIQGVRLGDSVTGTVLPKPSHALRVEAGCSVPSPCDSNPCPANSIC
KDEWQSYSCVCQPGYYGGECVDACHLNPCKNKSVCRRKPGSPLGYVCECGGNFFGQYCEH
RMDQQCPKGWWGNPSCGPCNCDVSKGFDPDCNKTNGQCHCKDFHYRPKGSDTCLPCDCYP
VGSSSRSCNKETGRCHCRPGVIGRQCNSCDSPFAEVTPSGCKVLYDGCPKSLKAGVWWPQ
TKFGFSAVVLCPKGSLGAAVRHCDEEKGWLEPDLFNCTSPAFKELSMLLEGLERNETELN
TIEAKKLAHRLRAVTDHMDHYFGNDVHITYRLLSRLMAFESRQHGFGLTATQDAHFNENL
LRAGSSVLAPENREHWAMLPHSEHGSASLMEQLRDYSGTLASNMKLTYLNPVGVVTPNIM
LSIDRMENHSHIRRRYPRYHSSLFRGQPAWDPHTHVVLPLSVLSPPKAEAVPTVTGGEGN
YTVESSSPRQALPEPEPALTVIILIMYRTLGGLLPARYQVDRRSVRLPKNPVMNSPIVSV
SVFSNHTFLRGPLDTPLVLEFRLLETANRSKPLCVQWNHSSPTNPSGFWTARDCDLVYRN
TTHVHCQCSQFGTFGVLMDSSHREQLEGDLETLAIVTYSLVSLSLVALLLTFSFLTCLKG
LKSNTRGIHSNISVTLFFSELLFLLGINRTENQVRGHGRTISEWLLSLQFLCTVIAILLH
CFFLSTFAWLFVQGLHIYRMQTEARNVNFGAMRFYYAIGWGVPAIITGLAVGLDPEGYGN
PDFCWISVHDKLVWSFAGPITVVIVMNGVMFLLVAKMSCSPGQKETKKKSVLMTLRSSFV
LLLVISATWLFGLLAVNNSVLAFHYLYTVLCSLQGLAVLVLFCVLNEEVQEAWKLACLGK
KGQSEEAARSTQQGPNTYNNTALFEESGLIRITLGASTISSVSSVRSARTHSSQRGYLRD
NMTARQGSALDHSLLGHAGPTDIDVAMFHRDAGGDQDSDSDSDLSLDEERSLSIPSSESE
ENVRLRGRFPRQFKRAAHSERLLTNPTNTAPKADVDGNDLMSYWPALGECEVHPCSLQKW
GSERKLGFDINKDAANNNQPDLALTSGDENSLTQTQRQRKDVGILKNRLQYPPALQGLPA
VGRMTNELSWYKTSTLGHRAVPAASYGRIYSGAGSLSQPASRYSSREQLDMLMRRQMSRE
QLSRNNSENLPSRHGSRENLDLLAPRPSQRDHGNTLPRRQGSRDHLETLPCRFGSREQLD
CGLVREVSREWLNTLPSRQGSRDRIDRLPSRDTSREQLDLLSRRQPSRDQLASSRQTSRE
HLDFLSRKSNSREPLGTVPSRQPSQENLGSLSRRQLSRENLEAIPSRHPSTEQLDILSSI
LASFNTSVLSSVQSSSTPSGPQTTANPSGMHTSTPSAMCPSTPHSSTSHSISELSPDSDA
AELGAMAAEMVLSRRPGQPHSEVLSEADLEELGQRKPPSKTSTRDYLRCSASTAKSLLFR
FIPVLRWLPRYPVKDWLLGDIASGFSVGIMHLPQGLAYALLAGLPPVTGLYSSFYPVFLY
FFFGTSRHNSVGEPRDCVLPSVFPGPFAVISVMIGSLTDSLVPSDDFLEFVNGTNVTVVN
EAQRDAARVELVATITVLTGIFQVALGLLQFGFVVTYLSDPLVRGYTTAASVHVLISQLK
NVFGVSVGEHSGPLSLFVTFIEICKKLPETNVGTLVTAIIAMVAIFIVKELNHKFSAKLP
MPIPIELITIIISTGISYGVNLNSKFGISVVGNIPSGMKPPVVPNTRYFGQVVGNAFAIA
VVGYAICISLGKIFALKHGYKVDSNQELIALGLSNFLGGFFQCFAISCSMSRSLVQESTG
GNSQVAGVISSLVILVTILKIGELFHDLPKAILSAIIIINLKGMFKQFTDFRTLWKSNRV
DLMIWVVTFVATLLLNLDIGLGASVAFALLTVIFRTQLPHYSILGRVTDTDVYKDVAEYE
KAQEVPGIKIFRSSSTIYFANVEMYSDALKKKSGINVDRLIEKKKKALKKLKKQQKKAQK
EKAKRKKSGDTDAECNGPGVAVIELSGEEGGTPPEPTLRSLGLPQPNFHAVILDFSPVNF
VDTVSIKILKNIFKDFHEIEVDVFVASCPASVFAQLERGNFFSSTITKHCFFPSVNDAVL
HLNGRTCPASENLSTKM
Download sequence
Identical sequences 9031.ENSGALP00000009191 ENSGALP00000035473

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]