SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000012956 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000012956
Domain Number 1 Region: 2142-2271
Classification Level Classification E-value
Superfamily Cadherin-like 2.49e-34
Family Cadherin 0.00096
Further Details:      
 
Domain Number 2 Region: 3913-4119
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 6.73e-33
Family Laminin G-like module 0.0062
Further Details:      
 
Domain Number 3 Region: 238-357
Classification Level Classification E-value
Superfamily Cadherin-like 4.71e-31
Family Cadherin 0.00058
Further Details:      
 
Domain Number 4 Region: 984-1101
Classification Level Classification E-value
Superfamily Cadherin-like 3.28e-30
Family Cadherin 0.00092
Further Details:      
 
Domain Number 5 Region: 2620-2730
Classification Level Classification E-value
Superfamily Cadherin-like 2.86e-29
Family Cadherin 0.00036
Further Details:      
 
Domain Number 6 Region: 1309-1412
Classification Level Classification E-value
Superfamily Cadherin-like 3.27e-29
Family Cadherin 0.0017
Further Details:      
 
Domain Number 7 Region: 682-785
Classification Level Classification E-value
Superfamily Cadherin-like 1.7e-28
Family Cadherin 0.0011
Further Details:      
 
Domain Number 8 Region: 1933-2056
Classification Level Classification E-value
Superfamily Cadherin-like 1.86e-28
Family Cadherin 0.00048
Further Details:      
 
Domain Number 9 Region: 1204-1315
Classification Level Classification E-value
Superfamily Cadherin-like 2.14e-28
Family Cadherin 0.00033
Further Details:      
 
Domain Number 10 Region: 463-585
Classification Level Classification E-value
Superfamily Cadherin-like 2.43e-28
Family Cadherin 0.00085
Further Details:      
 
Domain Number 11 Region: 4135-4358
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.97e-28
Family Laminin G-like module 0.0016
Further Details:      
 
Domain Number 12 Region: 3350-3471
Classification Level Classification E-value
Superfamily Cadherin-like 5.63e-28
Family Cadherin 0.0014
Further Details:      
 
Domain Number 13 Region: 2046-2147
Classification Level Classification E-value
Superfamily Cadherin-like 1.21e-27
Family Cadherin 0.0011
Further Details:      
 
Domain Number 14 Region: 3040-3151
Classification Level Classification E-value
Superfamily Cadherin-like 3.14e-27
Family Cadherin 0.00074
Further Details:      
 
Domain Number 15 Region: 1408-1536
Classification Level Classification E-value
Superfamily Cadherin-like 5.63e-27
Family Cadherin 0.0014
Further Details:      
 
Domain Number 16 Region: 881-988
Classification Level Classification E-value
Superfamily Cadherin-like 6.81e-27
Family Cadherin 0.003
Further Details:      
 
Domain Number 17 Region: 3139-3247
Classification Level Classification E-value
Superfamily Cadherin-like 1.01e-26
Family Cadherin 0.0012
Further Details:      
 
Domain Number 18 Region: 1835-1944
Classification Level Classification E-value
Superfamily Cadherin-like 2e-26
Family Cadherin 0.0011
Further Details:      
 
Domain Number 19 Region: 2352-2458
Classification Level Classification E-value
Superfamily Cadherin-like 5.57e-26
Family Cadherin 0.00059
Further Details:      
 
Domain Number 20 Region: 2253-2364
Classification Level Classification E-value
Superfamily Cadherin-like 7.42e-26
Family Cadherin 0.0011
Further Details:      
 
Domain Number 21 Region: 575-681
Classification Level Classification E-value
Superfamily Cadherin-like 4.71e-25
Family Cadherin 0.001
Further Details:      
 
Domain Number 22 Region: 2817-2942
Classification Level Classification E-value
Superfamily Cadherin-like 7.57e-25
Family Cadherin 0.0021
Further Details:      
 
Domain Number 23 Region: 1733-1834
Classification Level Classification E-value
Superfamily Cadherin-like 5.37e-24
Family Cadherin 0.0009
Further Details:      
 
Domain Number 24 Region: 2454-2582
Classification Level Classification E-value
Superfamily Cadherin-like 6.45e-24
Family Cadherin 0.0041
Further Details:      
 
Domain Number 25 Region: 2934-3037
Classification Level Classification E-value
Superfamily Cadherin-like 1.57e-23
Family Cadherin 0.0011
Further Details:      
 
Domain Number 26 Region: 2724-2821
Classification Level Classification E-value
Superfamily Cadherin-like 2.75e-22
Family Cadherin 0.0016
Further Details:      
 
Domain Number 27 Region: 788-893
Classification Level Classification E-value
Superfamily Cadherin-like 3.14e-22
Family Cadherin 0.0015
Further Details:      
 
Domain Number 28 Region: 1088-1204
Classification Level Classification E-value
Superfamily Cadherin-like 2.14e-21
Family Cadherin 0.003
Further Details:      
 
Domain Number 29 Region: 3249-3354
Classification Level Classification E-value
Superfamily Cadherin-like 1.71e-20
Family Cadherin 0.0021
Further Details:      
 
Domain Number 30 Region: 3460-3563
Classification Level Classification E-value
Superfamily Cadherin-like 3.93e-20
Family Cadherin 0.0017
Further Details:      
 
Domain Number 31 Region: 129-250
Classification Level Classification E-value
Superfamily Cadherin-like 9.42e-19
Family Cadherin 0.0023
Further Details:      
 
Domain Number 32 Region: 346-467
Classification Level Classification E-value
Superfamily Cadherin-like 2.71e-16
Family Cadherin 0.0013
Further Details:      
 
Domain Number 33 Region: 1517-1623
Classification Level Classification E-value
Superfamily Cadherin-like 3e-16
Family Cadherin 0.0023
Further Details:      
 
Domain Number 34 Region: 3762-3774,3805-3899
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.0000000000377
Family Growth factor receptor domain 0.01
Further Details:      
 
Domain Number 35 Region: 1627-1732
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000103
Family Cadherin 0.002
Further Details:      
 
Domain Number 36 Region: 68-135
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000006
Family Cadherin 0.0084
Further Details:      
 
Domain Number 37 Region: 4385-4422
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000201
Family EGF-type module 0.014
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000012956
Domain Number - Region: 3565-3655
Classification Level Classification E-value
Superfamily Cadherin-like 0.00171
Family Cadherin 0.057
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000012956   Gene: ENSGGOG00000013277   Transcript: ENSGGOT00000013331
Sequence length 4938
Comment pep:known_by_projection chromosome:gorGor3.1:4:135181121:135362581:1 gene:ENSGGOG00000013277 transcript:ENSGGOT00000013331 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDLAPDRATGRPWLPLHTLSVSQLLRVFWLLSLLPGQAWVHGAEPRQVFQVLEEQPPGTL
VGTIQTRPGFTYRLSESHALFAINSSTGALYTTSTIDRESLPSDVINLVVLSSAPTYPTE
VRVLVRDLNDNAPVFPDPSIVVTFKEDSSSGRQVILDTATDSDIGSNGVDHRSYRIIRGN
EAGRFRLDITLNPSGEGAFLHLVSKGGLDREVTPQYQLLVEVEDKGEPKRRGYLQVNVTV
QDINDNPPVFGSSHYQAGVPEDAVVGSSVLQVAAADADEGTNADIRYRLQDEGTPFQMDP
ETGLITVREPLDFEARRQYSLTVQAMDRGVPSLTGRAEALIQLLDVNDNDPVVKFRYFPA
TSRYASVDENAQVGTVVALLTVTDADSPAANGNISVQILGGNEQRHFEVQSSKVPNLSLI
KVASALDRERIPSYNLTVSVSDNYGAPPGAAVQARSSVASLVIFVNDINDHPPVFSQQVY
RVNLSEEAPPGSYVSGISATDGDSGLNANLRYSIVSGNGLGWFHISEHSGLVTTGSSGGL
DRELASQIVLNISARDQGVHPKVSYAQLVVTLLDVNDEKPVFSQPEGYDVSVVENAPTGT
ELLMLRATDGDLGDNGTVRFSLQEAETDRRSFRLDPVSGRLSTISSLDREEQAFYSLLVL
ATDLGSPPQSSMARINVSLLDINDNSPVFYPVQYFAHIKENEPGGSYITTVSATDPDLGT
NGTVKYSISAGDRSRFQVNAQSGVISTRMALDREEKTAYQLQVVATDGGNLQSPNQAIVT
ITVLDTQDNPPVFSQVAYSFVVFENVALGYHVGSVSASTMDLNSNISYLITTGDQKGMFA
INQVTGQLTTANVIDREEQSFYQLKVVASGGTVTGDTMVNITVKDLNDNSPHFLQAIESV
NVVENWQAGHSIFQAKAVDPDEGVNGMVLYSLKQNPKNLFAINEKNGTISLLGPLDVHAG
SYQIEILASDMGVPQLSSSVILTVYVHDVNDNSPVFDQLSYEVTLSESEPVNSRFFKVQA
SDKDSGANGEIAYTIAEGNTGDAFGIFPDGQLYIKSELDRELQDRYVLMVVASDRAVEPL
SATVNVTVILEDVNDNRPLFNSTNYTFYFEEEQRAGSFVGKVSAVDKDFGPNGEVRYSFE
MVQPDFELHAISGEITNTHQFDRESLMRRRGTAVFSFTVIATDQGLPQPLKDQATVHVYM
KDINDNAPKFLKDFYQATISESAANLTQVLRVSASDVDEGNNGLIHYSIIKGNEERQFAI
DSTSGQVALIGKLDYEATPAYSLVIQAVDSGTIPLNSTCTLNIDILDENDNTPSFPKSTL
FVDVLENMRIGELVSSVTATDSDSGDNADLYYSITGTNNHGTFSISPNTGSIFLAKKLDF
ETQSLYKLNITAKDQGRPPRSSTMSVVIHVRDFNDNPPSFPPGDIFKSIVENIPIGTSVI
SVTAHDPDADINGQLSYTIIQQMPRGNHFTIDEVKGTIYTNAEIDREFANLFELTVKAND
QAVPIETRRYALKNVTILVTDLNDNVPMFISQNALAADPSAVIGSVLTTIMAADPDEGAN
GEIEYEIINGDTDTFIVDRYSGDLRVASALVPSQLIYNLIVSATDLGPERRKSTTELTII
LQGLDGPVFTQPKYITILKEGEPIGTNVISIEAASPRGSEAPVEYYIVSVRCEEKTVGRL
FTIGRHTGIIQTAAILDREQGACLYLVDVYAIEKSTAFPRTQRAEVEITLQDINDNPPVF
PTDMLDLTVEENIGDGSKIMQLTAMDADEGANALVTYTIISGADDSFRIDPESGDLIATR
RLDRERRSKYSLLVRADDGLQSSDMRINITVSDVNDHTPKFSRHVYSFDIPEDTIPGSLV
AAILATDDDSGVNGEITYIVNEDDEDGIFFLNPITGVFNLTRLLDYEVQQYYILTVRAED
GGGQFTTIRVYFNILDVNDNPPVFSLNSYSTSLMENLPVGSTVLVFNVTDADDGINSQLT
YSIASGDSLGQFTVDKNGVLKVLKALDRESQSFYNLVVQVHDLPQIPASRFTSTAQVSII
LLDVNDNPPTFLSPKLTYIPENTPIDTVVFKAQATDPDSGPNSYIEYTLLNPLGNKFSIG
TIDGEVRLTGELDREEVSNYTLTVVATDKGQPSLSSSTEVVVMVLDINDNNPIFAQALYK
VEINENTLTGTDIIQVFAADGDEGTNGQVRYGIVNGNTNQEFRIDSVTGAITVAKPLDRE
KTPTYHLTVQATDRGSTPRTDTSMVSIVLLDINDFVPVFEPSPYSVSVPENLGTLPRTIL
QVVARDDDQGSNSKLSYVLFGGNEDNAFTLSASGELGVTQSLDRETKERFVLMITATDSG
SPALTGTGTINVIVDDVNDNVPTFASNAYFTTIPEDAPTGTDVLLVNASDADASTNAVIR
IIGGNSQFTINPSTGQIITSALLDRETKDNYTLVVVCSDAGSPEPLSSSTSVLVTVTDVN
DNPPRFQHHPYVTHIPSPTLPGSFVFAVTVTDADIGPNSELHYSLSGRNSEKFHIDPLRG
AIMAAGPLNGASEVTFSVHVKDGGSFPKTDSTTVTVRFVNKADFPKVRAKEQTFMFPENQ
PVSSLVTTITGSSLRGEPMSYYIASGSSYEKLDITVLDVNDNAPIFKEDPFISEILENLS
PRKILTVSAMDKDSGPNGQLDYEIVNGNMENSFSINHATGEIRSVRPLDREKVSHYVLTI
KSSDKGSPSQSTSVKVMINILDENDNAPRFSQIFSAHVPENSPLGYTVTRVTTSDEDIGI
NAISRYSIMDASLPFTINPSTGDIVISRPLNREDTDRYRIRVSAHDSGWTVSTDVTIFVT
DINDNAPRFSRTSYYLDCPELTETGSKVTQVFATDPDEGSNGQVFYFIKSQSEYFRINAT
TGEIFNKQILKYQNVTGFSNVNINRHSFIVTSSDRGKPSLISETTVTINIVDSNDNAPQF
LKSKYFTPVTKNVKVGTKLIRVTAIDDKDFGLNSEVEYFISNDNHLGKFKLDNDTGWISV
ASSLISDLNQNFFITVTAKDKGNPPLSSQATVHITVTEENYHTPEFSQSHMSATIPESHS
VGSIVRTVSARDRDAAMNGLIKYSISSGNEEGIFAINSSTGILTLAKALDYELCQKHEMT
ISAIDGGWVARTGYCSVTVNVIDVNDNSPVFLSDDYFPTVLENAPSGTTVIHLNATDADS
GTNAVIAYTVQSSDSDLFVIDPNTGVITTQGFLDFETKQSYHLTVKAFNVPDEERCSFAT
VNVQLKGTNEYVPRFVSKLYYFEISEAAPKGTIVGEVFASDRDLGTDGEVHYLIFGNSRK
KGFQINKKTGQIYVSGILDREKEERVSLKVLAKNFGSIRGADIDEVTVNVTVLDANDPPV
FTLNIYSVQISEGVPIGTHVTFVSAFDSDSIPSWSRFSYFIGSGNENGTFSINPQTGQIT
VTAELDRETLPIYNLSVLAVDSGTPSATGSASLLVTLEDINDNGPMLTVSEGEVMENKQP
GTLVMTLQSTDPDLPPNQGPFTYYLLSTGPATSYFSLSTAGVLSTTREIDREQIADFYLS
VVTKDSGVPQMSSTGTVHITVIDQNDNPSQSRTVEIFVNYYGNLFPGGILGSVKPQDPDV
LDSFHCSLTSGVTSLFSIPGGTCDLNSQPRSTDGTFDLTVLSNDGVHSTVTSNIRVFFAG
FSNATVDNSILLRLGVPTVKDFLTNHYLHFLRIASSQLTGLGTAVQLYSAYQENNRTFLL
AAVKRNHNQYVNPSGVATFFESIKEILLRQSGVKVESVDHDSCVHGPCQNGGSCLRRLAV
SSVLKSRESLPVIIVANEPLQPFLCKCLPGYAGSWCEIDIDECLPSPCHNGGTCHNLVGG
FSCSCPDGFTGRACERDINECLQSPCKNGAVCQNFPGSFNCVCKTGYTGKMCESSVNYCE
CNPCFNGGSCQSGVDSYYCHCPFGVFGKHCELNSYGFEELSYMEFPSLDPNNNYIYVKFA
TIKSHALLLYNYDNQTGDRAEFLALEIAEERLRFSYNLGSGTYKLTTMKKVSDGHFHTVI
ARRAGMAASLTVDSCSENQEPGYCTVSNVAVSDDWTLDVQPNRVTVGGIRSLEPILQRRG
HVESHDFVGCIMEFAVNGRPLEPSQALAAQGILDQCPRLEGACTRSPCQHGGTCMDYWSW
QQCHCKEGLTGKYCEKSVTPDTALSLEGKGRLDYHMSQNEKREYLLRQSLRGAMLEPFGV
NSLEVKFRTRSENGVLIHIQESSNYTTVKIKNGKVHFTSDAGIAGKVERNIPEVYVADGH
WHTFLIGKNGTATVLSVDRIYNRDIIHPTQDFGGLDVLTISLGGIPPNQAHRDAQTAGFD
GCIASMWYGGESLPFSGKHSLASISKTDPSVKIGCRGPNICASNPCWGDLLCINQWYAYR
CVPPGDCASHPCQNGGSCEPGLHSGFTCSCPDSHTGRTCEMVVACLGVLCPQGKVCKAGS
PAGHVCVLSQGPEEISLPLWAVPAIVGSCATVLALLVLSLILCNQCRGKKAKNPKEEKKP
KEKKKKGSENVAFDDPDNIPPYGDDMTVRKQPEGNPKPDIIERENPYLIYDETDIPHNSE
TIPSAPLASPEQEIEHYDIDNASSIAPSDADIIQHYKQFRSHTPKFSIQRHSPLGFARQS
PMPLGASSLTYQPSYGQGLRTSSLSHSACPTPNPLSRHSPAPFSKSSTFYRNSPARELHL
PIRDGNTLEMHGDTCQPGIFNYATRLGRRSKSPQAMASHGSRPGSRLKQPIGQIPLESSP
PVGLSIEEVERLNTPRPRNPSICSADHGRSSSEEDCRRPLSRTRNPADGIPAPESSSDSD
SHESFTCSEMEYDREKPMVYTSRMPKLSQVNESDADDEDNYGARLKPRRYHGRRAEGGPV
GTQAAAPGTADNTLPMKLGQQAGTFNWDNLLNWGPGFGHYVDVFKDLASLPEKAAANEEG
KAGTTKPVPKDGEAEQYV
Download sequence
Identical sequences ENSGGOP00000012956 ENSGGOP00000012956

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]