SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000022102 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGACP00000022102

Domain  Region     Superfamily                             Superfamily E-value  Family                         Family E-value
1       3920-4118  Concanavalin A-like lectins/glucanases  1.24e-33             Laminin G-like module          0.0037
2       2106-2235  Cadherin-like                           2.88e-33             Cadherin                       0.00092
3       2628-2738  Cadherin-like                           3.43e-32             Cadherin                       0.00035
4       209-327    Cadherin-like                           9.42e-32             Cadherin                       0.00069
5       1265-1383  Cadherin-like                           2.28e-30             Cadherin                       0.0019
6       1896-2021  Cadherin-like                           3.43e-29             Cadherin                       0.00056
7       2321-2423  Cadherin-like                           7.85e-29             Cadherin                       0.00085
8       645-748    Cadherin-like                           1.7e-28              Cadherin                       0.00092
9       3048-3159  Cadherin-like                           1.3e-27              Cadherin                       0.0014
10      4146-4368  Concanavalin A-like lectins/glucanases  2.18e-27             Laminin G-like module          0.003
11      1166-1277  Cadherin-like                           8.71e-27             Cadherin                       0.00038
12      433-544    Cadherin-like                           1.86e-26             Cadherin                       0.001
13      2216-2327  Cadherin-like                           2.75e-26             Cadherin                       0.0015
14      3147-3257  Cadherin-like                           3e-26                Cadherin                       0.00088
15      1370-1502  Cadherin-like                           3.14e-26             Cadherin                       0.0018
16      3358-3479  Cadherin-like                           5.37e-26             Cadherin                       0.00073
17      1797-1907  Cadherin-like                           5e-25                Cadherin                       0.0013
18      946-1054   Cadherin-like                           1.57e-24             Cadherin                       0.0012
19      1696-1803  Cadherin-like                           1.57e-24             Cadherin                       0.0013
20      2007-2109  Cadherin-like                           1.44e-23             Cadherin                       0.002
21      537-644    Cadherin-like                           1.57e-23             Cadherin                       0.00064
22      2830-2940  Cadherin-like                           5.42e-23             Cadherin                       0.0014
23      843-946    Cadherin-like                           1.57e-22             Cadherin                       0.0036
24      2419-2524  Cadherin-like                           4e-22                Cadherin                       0.004
25      2731-2830  Cadherin-like                           5.23e-22             Cadherin                       0.0022
26      2942-3045  Cadherin-like                           3.43e-21             Cadherin                       0.0015
27      744-832    Cadherin-like                           1.57e-19             Cadherin                       0.0025
28      3468-3571  Cadherin-like                           3.14e-19             Cadherin                       0.0016
29      3257-3362  Cadherin-like                           3.71e-19             Cadherin                       0.0018
30      100-221    Cadherin-like                           1.57e-18             Cadherin                       0.0027
31      2530-2634  Cadherin-like                           3.85e-18             Cadherin                       0.0035
32      316-431    Cadherin-like                           1.17e-17             Cadherin                       0.0019
33      1479-1583  Cadherin-like                           5.28e-17             Cadherin                       0.0025
34      1056-1172  Cadherin-like                           4.14e-16             Cadherin                       0.0066
35      1588-1702  Cadherin-like                           0.00000000000186     Cadherin                       0.0028
36      3815-3940  Growth factor receptor domain           0.0000000000173      Growth factor receptor domain  0.022
37      19-106     Cadherin-like                           0.000000000271       Cadherin                       0.0078
38      4394-4430  EGF/Laminin                             0.0000115            EGF-type module                0.0093
39      3767-3832  EGF/Laminin                             0.0000162            EGF-type module                0.009
40      3573-3664  Cadherin-like                           0.0000487            Cadherin                       0.015
 
Weak hits

Sequence:  ENSGACP00000022102

Domain  Region     Superfamily  Superfamily E-value  Family           Family E-value
-       4369-4392  EGF/Laminin  0.0913               EGF-type module  0.027
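
The strong hits above are listed by ascending superfamily E-value, not by position in the sequence, so the N-to-C order of the domains is not obvious from the listing. The following is a minimal Python sketch of one way to recover that order. It assumes the region boundaries are copied from the strong hits above and orders domains by region start. The names STRONG_HITS and architecture are illustrative only and are not part of SUPERFAMILY. Some neighbouring regions overlap by a few residues (for example 3815-3940 and 3920-4118), so the sketch makes no attempt to resolve overlaps.

```python
from itertools import groupby

# Superfamily names as they appear in the strong-hit listing.
CAD = "Cadherin-like"
CONA = "Concanavalin A-like lectins/glucanases"
EGF = "EGF/Laminin"
GFR = "Growth factor receptor domain"

# (start, end, superfamily) for strong hits 1..40, in listing order.
STRONG_HITS = [
    (3920, 4118, CONA), (2106, 2235, CAD), (2628, 2738, CAD), (209, 327, CAD),
    (1265, 1383, CAD), (1896, 2021, CAD), (2321, 2423, CAD), (645, 748, CAD),
    (3048, 3159, CAD), (4146, 4368, CONA), (1166, 1277, CAD), (433, 544, CAD),
    (2216, 2327, CAD), (3147, 3257, CAD), (1370, 1502, CAD), (3358, 3479, CAD),
    (1797, 1907, CAD), (946, 1054, CAD), (1696, 1803, CAD), (2007, 2109, CAD),
    (537, 644, CAD), (2830, 2940, CAD), (843, 946, CAD), (2419, 2524, CAD),
    (2731, 2830, CAD), (2942, 3045, CAD), (744, 832, CAD), (3468, 3571, CAD),
    (3257, 3362, CAD), (100, 221, CAD), (2530, 2634, CAD), (316, 431, CAD),
    (1479, 1583, CAD), (1056, 1172, CAD), (1588, 1702, CAD), (3815, 3940, GFR),
    (19, 106, CAD), (4394, 4430, EGF), (3767, 3832, EGF), (3573, 3664, CAD),
]


def architecture(hits):
    """Return the N-to-C domain order as (superfamily, run length) pairs."""
    ordered = sorted(hits, key=lambda hit: hit[0])  # sort by region start
    names = (hit[2] for hit in ordered)
    # Collapse consecutive domains of the same superfamily into one run.
    return [(name, len(list(run))) for name, run in groupby(names)]


if __name__ == "__main__":
    for name, count in architecture(STRONG_HITS):
        print(f"{count:>2} x {name}")
```

Under these assumptions the sketch reports a run of 35 Cadherin-like domains, followed by one EGF/Laminin domain, one Growth factor receptor domain, two Concanavalin A-like lectins/glucanases domains and a final EGF/Laminin domain. The weak hit at 4369-4392 is left out because it falls below the strong-hit threshold.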
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000022102   Gene: ENSGACG00000016736   Transcript: ENSGACT00000022144
Sequence length 4953
Comment pep:known_by_projection group:BROADS1:groupIV:3590016:3672899:1 gene:ENSGACG00000016736 transcript:ENSGACT00000022144 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
LLALWTLSRHSASSQVRQEFQVPEEQPVGTYVGTIETKPSFSYRFSESHKLFAINATTGV
IYTSSVIDRELLQSDVINVVVLSSQPTYPTEVRIVVLDINDNSPVFPDASIVVSFKEDAS
SGRQVILDTATDSDIGSNGVDHTTYRIVKGNEQRRFRLDITVNPSGEGAFLHLVSTGGLD
REVTPFYQLLVEVEDKGEPKKFGYMQVNVTIQDINDNPPVFDQEQYQNSVFEDAAVGSSI
LQITASDRDEGANSEVRYFLDEGTPFQIDPKAGTIIMKEGLDYESKREYSLTIHAVDNGV
PSLSGRTEATIKLLDVNDNDPVVKFRYFPTTSKFASVDENAQIGTVVALLTVSDSDSPTA
NGNISVSILGGNEQRHFEVHTSPVPNLSLIKVASVLDRERISSYNLTVSVSDNGAPVARS
SFASLVIFVNDINDHPPIFQQTLYRVDISEDVPKGSYVKGVSATDGDSGQNANLRYSLVS
GNALGWFSISENSGLVTSAALLDREIASEIVLNISAKDQGLQPKISYTKLVVNVTDVNDQ
VPTFAQGTYHVSLAEHAAAGTELLVLSASDDDDGANGTVRFAFDGETPAAVQQLFRLDAL
SGRLSTAAELDREERASYLLHVQAADAGSPALHSVGKVNITLRDINDNRPVFYPLQYFAN
VKENEPSGSYVTTASATDPDLGRNGTLKYMITAGDSSKFRINSNTGKITTLVPLDREEKT
AYQLQVTAADGGGLRSHTQAIVTVTVIDTQDNPPVFSQEVYSFVMFENVGLGTVIGTVSA
TTVDLNTNISYLITSGDHRGLFSVNGAGQITASSQIDREERDFYQLKVVARAGEITGEAL
ANVTVKDLNDNGPHFLYAVEHVSAVENWSAGHVIFQAKASDPDEGTNGAVVYSLKQNPRG
LFHIHEKHGLITLTGPLEVTTSSYEVEVTASDTGVPRHSSDLVLIVSVYDVNDNSPVFDQ
LSYEVVILESEPVNSRFFKVEAADMDSGLNGEIMYDIAGGNTRDVFGIFPDGQLYIKDEL
DREAQDRYNLVVVAKDRAVEPLSAAVNVSVILDDVNDNRPLFNSTNYVFHFEEEQKRGSL
AGHVYAEDKDFGPNSEVRYSFETPQPNFELNAITGELTSTLQFDREALMRQRGATVFGFV
VVSSDQGLPKPLRDQAKVQVYIQDINDNPPKFTKDIYQASISESAPNMTQLLRVSASDVD
ENKNGLVHYHISEGNEEGQFAIDSSSGQVTLVGKLDYESTSSYSLKILAADAGAVPLTSS
CMLSISILDENDNSPSFPKSSVSVDVLENMRIGELVASVTAADADSGQNADITYSITATN
NHGTFSISANTGSIFLDKKLDFETQSLYKLNVTAKDNGRPARSSSVPVVIHVRDFNDNPP
VFTPGDIFKSIPENLPLSASVMTIAAHDTDADINGQLEYSIVHQVPRGGHFGIDSGTGLI
YTSKEIDREFSNLFELTVKATDQAVPVEFRRFALKNVTVWVTDLNDNVPTFMSQNALVAE
PNIVIGSILTTVSAFDPDEGSNGEVEYELVEGDSDTFIMDRYSGDIRLASQLLQSRLMYT
LTVSATDHGDSRKTSRTELTIILRGLDGPVFSQPKYITILKEGQPPGTNVISLDASSPRG
SATKVEYFIVGVRSGGKAVGRLFTIGRHTGVIQTAAELDREQGSDLYLVVVYAIETDASQ
PRTQRAEVEITLQDVNDNPPVFPNDILDVTIEENVGDGFKIMQLTAADADKGPNALVTYT
IVSGADDSFRIDPESGDLIATKKLDRERRSKYSLLVRADDGKQSSDMRLNITVKDVNDHT
PEFSRVAYSFDIPEDMAPGSIVAAILASDSDSGANGEVTYSLEDEEDEDEAFLLNPVTGV
FNVTRPLDYETQRYYILTAKARDGGGQASGVRVYFNVLDVNDNPPVFNATAYSASVSESL
PPGSGVLTVGAADADDGPNAQLLYRIASGDSQGHFVISKDGVLQTKKALDREIQSFYNLV
ITVNDLAPPPATRFTSTAQVSIILLDVNDCPPTFTSQRMTYIQENTPVDTVVFTAQAADA
DSGPNSYVEYSLKGPFVNKFSIGTIDGDVRLVGELDREELSNYTLTVVATDKGEPHLSSM
MDVTMVVLDVNDNTPSFSQNIYDIEIEEDTLTGTDVIQVFASDADEGTNGQVRFSISGGN
ANSDFRIDSVTGVISVAKQLDREARGSYSLVVQAADRGSSPKVDRATVNIVLLDMNDCSP
EFELSPYTVNVQENFENLPKNILQVVARDDDQGANGQLSYMLSGENDNGAFSLSSSGQLS
VTRTLDREAQAKYVLLITATDSGSPSLSGTGTVTVVVDDVNDNVPVFTSSSFHTTVMENA
PTGADVLLVNSSDADVGVNGVISYSLAGGHGQFSINPATGQIITSSLLDREDRNNYQLLV
VATDGGQPQGLSSSATVSVTVADINDNPPHFHHHPYVTHIPASTAAGSLVFAVTVTDEDS
GSNAQLHYSLVGRNSEKFKIDPIRGAITANEKLTGSSEVTLTVRVKDGGANPKTDTTTVT
VRFVTGGSFPVIKLEERAFTFPESQPTNHVVTTVTGSSMRGGPLSYYVASGNLDNAFHID
QLSGELSIRHPLDYEHVQKYVLWIEARDQGFPPYSSYEKVELTVLDVNDNHPVFDKDPFH
AEILENLSPQRVLMVSAVDLDSGPNGQLEYAIVDGNKENSFSVNRATGEIRTTRPLDREK
AAQYTLRVKATDRGLPPKTTAVKVLINVLDVNDNAPRFSKIFSAAVAENAPVGYTVTRVT
TTDEDAGSNAVSRYSITDASLPFNINPNTGDITISRPLNREDTGHYIVKVSAHDSGWTVS
TDVTIFITDVNDNAPRFSRPSYYLDYPELTEVGSLVTQVSATDPDEGFNGKIFYFIRSQS
EFFRINASSGEIFVKQQLKYQNSTGASSININRHSFIVTASDRALKPLMSETTVIVNVVD
SNDNRPEFDSPSYFTPVTKSVKVGTRLIRIVAHDKKDFGLNSEVEYLITGGNSSRKFKLD
KTSGWITVASSLTSDTNTFFLIDITAKDRGNPPLSSRTSVRVAVTEENHHTPEFSQSQIS
ATVPESLVVGTAIRTLSARDKDKDMNGLITYDITSGNDKGLFSLHSKTGVLSLARPLDFE
EKQQHDLRVSATDGGWIAKTSHVSVTVRVSDVNDNPPVFNPDEYFPVVQENVPSGTTLVK
MNATDRDSGANAVMAYVIQSSDSDLFVIDPNTGTITTQGFLDYEAKQVYHLTVKAFNVPD
EERCSFANVNIQLKGANEYVPRFVSKQYYFEVSEAAPKRTVVGEVFASDRDQGDDGVVYY
LIFGKSRKKGFGINRKTGQIYVTGSLDREKEEKISLKVLAKNAGSIRGADIDEVFVNITV
LDANDPPVFTQELYDVQVSEGLSPGGLVTFVSAVDSDSVPSWSRFSYTIAPPFDKNVFTI
DPKTGQVSVAAELDRETTPVYNLTVLAVDTGTPPATGSTTLIVNLEDINDNGPTLTAAYA
EVMENQRAGIVVTTLTASDADLPPNQGPFLFSLIGSGSANSYFSLSPAGVLTTSREIDRE
QISNFYLSVVIKDSGVPQMSSTGTIHVKINDQNDNPSETRSMEIFVHYFGNMFPGGSLGD
VKPQDPDVQDRFHCSLIPPSSGLFNIPTGTCSLNSKARSTDGTFELTVRSSDGVHGSVSN
TARVLFMGFTNATVDNSILIRLQSQGVKNFLTNHYLSFLRIANSQLAGLGTGVLLYGAFE
LNNQTFIMAAVKRGHGQYVNPSGVATFFQSIKDILYRQSGVQIDAVDHDPCTRNPCQNGG
SCKRRLSVGPDMKTEESVPVILVSNHPLQPYACSCRPGYTGGLCETDIDECQPSPCHNGG
TCHNLVGGFSCTCPEGFTGMACERDVNECLSNPCKNGALCQNFPGGFNCLCNSGFAGKTC
DSIINHCECNPCFNGGSCQNRVDGYYCHCPFGVFGKHCELNSYGFEELSYMEFPSLDPNN
NYIYIKFATLKSNALLMYNHDNQTGDKAEFLALEIFEGRMRFSFNLGSGTYKLMTMIKVS
DGQFHTVIARRAGMAASLTVDLCGDDQEPGYCAVSNVAVHTDWILDVQPNRLSVGGVGSI
EPVLHRRGQVATHDFVGCIMEFAVNGRPLEPSQALASRGILDRPSLSSPRGEGGKSPCRH
GGTCLDRWSWQQCQCVDGFTGKFCEKYMTADTALSMDGAGRLDYSLKQGPRRDVLLRQSL
QGVTSEPARPSSLEVKFRTRSKSGTLLHVQESSNYTTVKLRNGNIHYVSDAGVGGKVERT
VGDAALSDGLWHTLHLLKNGSTTVLLVDGSHSRVIQHATQDFGGLSVVTFSLGGVPPGPA
QQKTAAGFDGCLAYVKYNGENLPFTGEHGLVALTKTSSSVKIGCRGPNLCESSPCWDGLM
CVNQWYTYQCVAPGDCGSSPCQNRGSCVPDPHSGFTCVCSEFYTGKSCETLVACLGVQCP
PGNVCKSANNGGFVCSPSPSSEKMVLPIWAVPAIVGSCATVLALVVLSLILCNHCRGRNK
TKVPKEPKEKKGSENVAFDDPDNIPPYGDDMTVRKQPEGNPKPDIIERENPYLIYDETDL
PHSNETVPSAPCAPCNGPEADIEHYDIDNASSIAPSDADIIQHYKQFRSHTPKFSIQRHS
PLGFARQSPLPLGATSYTYQPQYTQALRSTPLSHSHSACPTPNPLSRHSPAPFTKPSSFY
RNTPTRELNLGRREGSPLDHHGDMCQPPPPPMFNYATRLGRRSKSPQTMAAHGHASRPGS
RLKQPIEQIPLETGPPVGLSIEEVERLNTPRPRNPSICSADHGRSSSEEDCRRPLSRVRN
PADGIPAPESSSESDSHDSFTCSEMEYERDKPVSYSSRVPKLSQVNESDADDEDYGGRLK
QRRYSSRRAEGGPGVPQMPPSEQHYTLPHRLGQQAGGFNWDNLLSWGPGFGHYVDVFKDL
ALLPENAAANDIEMNSGDGSVTILNEGEAEQYV
Identical sequences: G3PWW3, ENSGACP00000022102, 69293.ENSGACP00000022102
