SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000023300 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000023300
Domain Number 1 Region: 3869-4062
Classification Level Classification E-value
Superfamily Fibronectin type III 3.83e-37
Family Fibronectin type III 0.0012
Further Details:      
 
Domain Number 2 Region: 4525-4731
Classification Level Classification E-value
Superfamily Fibronectin type III 3.16e-35
Family Fibronectin type III 0.0023
Further Details:      
 
Domain Number 3 Region: 4264-4439
Classification Level Classification E-value
Superfamily Fibronectin type III 2.55e-34
Family Fibronectin type III 0.0015
Further Details:      
 
Domain Number 4 Region: 1238-1283,1314-1448
Classification Level Classification E-value
Superfamily Fibronectin type III 1.04e-32
Family Fibronectin type III 0.0027
Further Details:      
 
Domain Number 5 Region: 2525-2668
Classification Level Classification E-value
Superfamily Fibronectin type III 6.62e-32
Family Fibronectin type III 0.0035
Further Details:      
 
Domain Number 6 Region: 4728-4928
Classification Level Classification E-value
Superfamily Fibronectin type III 3.14e-31
Family Fibronectin type III 0.0025
Further Details:      
 
Domain Number 7 Region: 3509-3676
Classification Level Classification E-value
Superfamily Fibronectin type III 2.71e-30
Family Fibronectin type III 0.0011
Further Details:      
 
Domain Number 8 Region: 1049-1238
Classification Level Classification E-value
Superfamily Fibronectin type III 5.09e-30
Family Fibronectin type III 0.0025
Further Details:      
 
Domain Number 9 Region: 1701-1889
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.84e-30
Family Laminin G-like module 0.0017
Further Details:      
 
Domain Number 10 Region: 2323-2527
Classification Level Classification E-value
Superfamily Fibronectin type III 2.06e-28
Family Fibronectin type III 0.0015
Further Details:      
 
Domain Number 11 Region: 2136-2323
Classification Level Classification E-value
Superfamily Fibronectin type III 8.08e-28
Family Fibronectin type III 0.0026
Further Details:      
 
Domain Number 12 Region: 2813-2999
Classification Level Classification E-value
Superfamily Fibronectin type III 1.2e-27
Family Fibronectin type III 0.0032
Further Details:      
 
Domain Number 13 Region: 1531-1675
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.86e-26
Family Laminin G-like module 0.0031
Further Details:      
 
Domain Number 14 Region: 1976-2137
Classification Level Classification E-value
Superfamily Fibronectin type III 4.66e-26
Family Fibronectin type III 0.0025
Further Details:      
 
Domain Number 15 Region: 3683-3863
Classification Level Classification E-value
Superfamily Fibronectin type III 6.47e-25
Family Fibronectin type III 0.0051
Further Details:      
 
Domain Number 16 Region: 2682-2815
Classification Level Classification E-value
Superfamily Fibronectin type III 8.52e-22
Family Fibronectin type III 0.0041
Further Details:      
 
Domain Number 17 Region: 3014-3143
Classification Level Classification E-value
Superfamily Fibronectin type III 8.56e-21
Family Fibronectin type III 0.0032
Further Details:      
 
Domain Number 18 Region: 4019-4151
Classification Level Classification E-value
Superfamily Fibronectin type III 2.13e-20
Family Fibronectin type III 0.0046
Further Details:      
 
Domain Number 19 Region: 4145-4260
Classification Level Classification E-value
Superfamily Fibronectin type III 2.76e-17
Family Fibronectin type III 0.003
Further Details:      
 
Domain Number 20 Region: 121-289
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 9.5e-17
Family Pentraxin (pentaxin) 0.081
Further Details:      
 
Domain Number 21 Region: 4430-4535
Classification Level Classification E-value
Superfamily Fibronectin type III 0.00000000000000876
Family Fibronectin type III 0.0022
Further Details:      
 
Domain Number 22 Region: 3440-3504
Classification Level Classification E-value
Superfamily Fibronectin type III 0.00000000754
Family Fibronectin type III 0.0049
Further Details:      
 
Domain Number 23 Region: 842-892
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000134
Family Laminin-type module 0.013
Further Details:      
 
Domain Number 24 Region: 636-686
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000014
Family Laminin-type module 0.0058
Further Details:      
 
Domain Number 25 Region: 895-937
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000206
Family Laminin-type module 0.0059
Further Details:      
 
Domain Number 26 Region: 790-839
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000335
Family Laminin-type module 0.011
Further Details:      
 
Domain Number 27 Region: 742-784
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000474
Family Laminin-type module 0.0071
Further Details:      
 
Domain Number 28 Region: 689-739
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000586
Family Laminin-type module 0.012
Further Details:      
 
Domain Number 29 Region: 570-622
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000725
Family Laminin-type module 0.028
Further Details:      
 
Domain Number 30 Region: 1900-1953
Classification Level Classification E-value
Superfamily Fibronectin type III 0.00000148
Family Fibronectin type III 0.0037
Further Details:      
 
Domain Number 31 Region: 997-1045
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000335
Family Laminin-type module 0.014
Further Details:      
 
Domain Number 32 Region: 946-990
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000809
Family Laminin-type module 0.0093
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000023300
Domain Number - Region: 337-412
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.0306
Family beta-mannanase CBM 0.053
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000023300   Gene: ENSGGOG00000009561   Transcript: ENSGGOT00000026152
Sequence length 5169
Comment pep:known_by_projection chromosome:gorGor3.1:1:195786578:196594522:-1 gene:ENSGGOG00000009561 transcript:ENSGGOT00000026152 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
CPVLSLGSGFLFQVIEMLIFAYFASISLAESRGLFPRLENVGAFKKVSIVPTQAVCGLPD
RSTFCHSSAAAESIQFCTQRFCIQDCPYRSSHPTYTALFSAGLSSCITPDKNDLHPNAHS
NSTSFIFGNHKNCFSSPASPKLMPSFTLAVWLKPEQQGVMCVIERTVDGQIVFKLTISEK
ETMFYYRTVNGLQPPIKVMTLGRILVKKWIHLSVQVHQTKISFFIDGVEKDHTPFDARTL
SGSITDFASGTVQIGQSLNGLEQFVGRMQDFRLYQVALTNREILEVFSGDLLRLHAQSHC
RCPGSHPRVHPLAQRYCIPNDAGDTADNRVSRLNPEAHPLSFVNDNDVGTSWVSNVFTNI
TQLNQGVTISVDLENGQYQVFYIIIQFFSPQPTEIRIQRKKENSLGWEDWQYFARNCGAF
GMKNNGDLEKPDSVNCLQLSNFTPYSRGNVTFSILTPGPNYRPGYNDFYNTPSLQEFVKA
TQIRFHFHGQYYTTETAVNLRHRYYAVDEITISGSCMYKNVCGLNFYPYIFICLPNLYTE
SPQCDRCLPLYNDKPFRQGDQVYAFNCKPCQCNSHSKSCHYNISVDPFPFEHFRGGGGVC
DDCEHNTTGRNCELCKDYFFRQVGIDPSAIDVCKPCDCEKVGTRNGSILCDQIGGQCNCK
RHVSGRQCNQCQNGFYNLQELDPDGCSPCNCNTSGTVDGDITCHQNSGQCKCKANVIGLR
CDHCNFGFKFLRSFNDDGCEPCQCNLHGSVNKFCNPHSGQCECKKEAKGLQCDTCRENFY
GLDVTNCKACDCDTAGSLPGTVCNAKTGQCICKPNVEGRQCNKCLEGNFYLRQNNSFLCL
PCNCDKTGTINGSLLCDKSTGQCPCKLGVTGLRCNQCEPHRYNLTIDNFQHCQMCECDSL
GTLPGTICDPISGQCLCVPNRQGRRCNQCQPGFYISPGNATGCLPCSCHTTGAVNHICNS
LTGQCVCQDASIAGQRCDQCRDHYFGFNPQTGRCQPCNCHLSGALNETCHLVTGQCFCKQ
FVTGSKCDACVPSASHLDVNNLLGCSKTPFQQPPPRGQVQSSSAINLSWSPPDSPNAHWL
TYSLLRDGFEIYTTEDQYPYSIQYFLDTDLLPYTKYSYYIETANVHGSTRSVAVTYKTKP
GVPEGNLTLSYIIPIGSDSVTLTWTTLSNQSGPIEKYILSCAPLAGGQPCVSYEGHETSA
TIWNLVPFAKYDFSVQACTSGGCLHSLPITVTTAQAPPQRLSPPKMQKISSTELHVEWSP
PVELNGIIIRYELYMRRLRSTKETTSEESRVFQSSGWLSPHSFVESANENALKPPQTMAT
ITGLEPYTKYEFRVLAVNMAGSVSSAWVSERTGESAPVFMIPPSVFPLSSYSLNISWEKP
ADNVTRGKVVGYDINMVSEQSPQQSIPTAFSQLLHTAKSQELSYTVEGLKPYRIYEFTIT
LCNSVGCVTSASGAGQTLAAGKKAQLMIAVIVREASVTMHLIFLPCPGVNMQPEIFNIKR
RKSSLPSPVSAITKIFRMLGHGCNCTSSAFSHISIKRIKASFRTKVPEGLIVFAASPGNQ
EEYFALQLKKGRLYFLFDPQGSPVEVTTTNDHGKQYSDGKWHEIIAIRHQAFGQITLDGI
YTGSSAILNGSTVIGDNTGVFLGGLPRSYTILRKDPEIIQKGFVGCLKDVHFMKNYNPSA
IWEPLDWQSSEEQINVYNSWEGCPASLNEGAQFLGAGFLELHPYMFHGGMNFEISFKFRT
DQLNGLLLFVYNKDGPDFLAMELKSGILTFRLNTSLAFTQVDLLLGLSYCNGKWNKVIIK
KEGSFISASVNGLMKHASESGDQPLVVNSPVYVGGIPQELLNSYKHLCLEQGFGGCMKDV
KFTRGAVVNLASVSSGAVRVNLDGCLSTDSAVNCRGNDSILVYQGKEQSVYEGGLQPFTE
YLYRVIASHEGGSVYSDWSRGRTTGAAPQSVPTPSRVRSLNGYSIEVTWDEPVVRGVIEK
YILKAYSEDSTRPPHMPSASAEFVNTSTLTGILTGLLPFKNYAVTLTACTLAGCTESSHA
LNISTPQEAPQEVQPPVAKSLPSSLLLSWNPPKKANGIITQYRLYMDGRLIYSGNEENYT
VTDLAVFTPHQFLLSACTHVGCTNSSWVLLYTAQLPPEHVDSPVLTVLDSRTIYIQWKQP
RKISGILERYVLYISNHTHDFTIWSVIYNSTELFQDHMLQYVLPGNKYLIKLGACTGGGC
TVSEASEALTDEDIPEGVPAPKAHSYSPDSFNVSWTEPEYPNGVITSYGLYLDGILIHNS
SELSYRAYGFAPWSLHSFRVQACTAKGCALGPLVENRTLEAPPEGTVNVFVKTQGSRKAH
VRWEAPFRPNGLLTYSVLFTGIFYVDPVGNNYTLLNVTKVMYSGEETNLWVLIDGLVPFT
NYTIQVNISNSQGSLITDPITIAMPPGAPDGVLPPRLSSATPTSLQVVWSTPARNNAPGS
PRYQLQMRSGDSTHGFLELFSNPSASLSYEVSDLQPYTEYMFRLVASNGFGSAHSSWIPF
MTAEDKPGPVVPPILLDVKSRMMLVTWQHPRKSNGVITHYNIYLHGRLYLRTPGNVTNCT
VMHLHPYTAYKFQVEACTSKGCSLSPESQTVWTLPGAPEGIPNPELFSDTPTSVIISWQP
PTHPNGLVENFTIERRVKGKEEVTTLVTLPRSHSMRFIDKTSALSPWTKYEYRVLMSTLH
GGTNSSAWVEVTTRPSRPAGVQPPVVTVLEPDAVQVTWKPPLIQNGDILSYEIRMPDPHI
TITNVTSAVLSQKVTHLIPFTNYSVTIVACSGGNGYLGGCTESLPTYVTTHPTVPQNVGP
LSVIPLSESYVVISWQPPSKPNGPNLRYELLRRKIQQPLASNPPEDLNRWHNIYSGTQWF
YEDKGLSRFTTYEYMLFVHNSVGFTLSREVTVTTLAGLPERGANLTASVLNHTAIDVRWA
KPTVQDLQGEVEYYTLFWSSATSNDSLKILPDVNSHVIGHLKPNTEYWIFISVFNGVHSI
NSAGLHATTCDGEPQGMLPPEVVIINSTAVRVIWTSPSNPNGVVTEYSIYVNNKLYKTGM
NVPGSFILRDLSPFTIYDIQVEVCTIYACVKSNGTQITTVEDTPSDIPTPTIRGITSRSL
QIDWVSPRKPNGIILGYDLLWKTWYPCTKTQKLVQDQSDELCKAVRCQKPEYICGHICYS
SEAKVCCNGVLYNPKPGHRCCEEKYIPFVLNSTGVCCGGRIQEAQPNHQCCSGYYARILP
GEVCCPDEQHNRVSVGIGDSCCGRMPYSTSGNQICCAGRLHDGHGQKCCGGQIVSNDLEC
CGGEEGVVYSRLPGMFCCGQDYVNMSDTICCSASSGESKAHVKKNDPVPVKCCETELIPK
SQKCCNGVGYNPLKYVCSDKISTGMMMKETKECRILCPASMEATAHCGRCDFNFTRHICT
VIRGSHNSTGKASIEEMCSSAEETIHTGSVNTYSYTDVNLKPYMTYEYRISAWNSYGRGL
SKAVRARTKEDVPQGVSPPTWTKIDNLEDTIVLNWRKPIQSNGPIIYYILLRNGIERFRG
TSLSFSDKEGIQPFQEYSYQLKACTVAGCATSSKVVAATTQGVPESILPPSITALSAVAL
HLSWSVPEKPNGVIKEYQIRQVGKGLIHTDTTDRRQHTVTGGLQPYTNYSFTLTACTSAG
CTSSEPFLGQTLQAAPEGVWVTPRHIIINSTTVELYWSLPEKPNGLISQYQLSRNGNLLF
LGGSEEQNFTDKNLEPNSRYTYKLEVKTGGGSSTSDDYIVQTPMSTPEEIYPPYNITVIG
PYSVFVAWIPPGILIPEIPVEYNVLLNDGSVTPLAFSVGHHQSTLLENLTPFTQYEIRIQ
ACQNGGSCGVSSRMFVKTPEAAPMDLNSPVLKALGSACIEIKWMPPEKPNGIIINYFIYR
RPAGIEEESVLFVWSEGALEFMDEGDTLRPFTLYEYRVRACNSKGSVESLWSLTQTLEAP
PQDFPAPWAQATSAHSVLLNWTKPESPNGIISHYRVVYQERPDDPTFNSPTVHAFTVKGT
SHQAHLYGLEPFTTYRIGVVAANHAGEILSPWTLIQTLESSPSGLRNFIVEQKENGRALL
LQWSEPMRTNGVIKTYNIFSDGFLEYSGLNRQFLFRRLDPFTLYTLTLEACTTAGCAHSA
PQPLWTDEAPPDSQLAPTVHSVKSTSVELSWSEPVNPNGKIIRYEVIRRCFEGKAWGNQT
IQADEKIVFTEYNTERNTFIYNDTGLQPWTQCEYKIYTWNSAGHTCSSWNVVRTLQAPPE
GLSPPVISYVSRNPQKLLISWIPPEQSNGIIQSYRLQRNEMLYPFSFDPVTFNYTDEELL
PFSTYSYALQACTSGGCSTSKPTSITTLEAAPSEVSPPDLWAISATQMNVCWSPPTVQNG
KITKYLVRYDNKESLAGQGLCLLVSHLQPYTQYNFSVVACTNGGCTASVSKSAWTMEALP
KNMDSPTLQVTGSESIEITWKPPRNPNGQIRSYELRRDGTIVYTGLETRYHDFTLTPGVE
YGYTVTASNSQGGILSPLVKDRTSPSAPSGMEPPKLQAGGPQEILVNWDPPVRTNGDIIN
YTLFIRELFERETKIIHINTTHNSFGTQSYIVNQLKPFHRYEIRIQACTTLGCASSEWTF
IQTPEIAPLMQPPPHLEVQMAPGGFQPTVSLLWTGPLQPNGKVLYYELYRRQIATQPGKS
NPVLIYNGSSTSFIDSELLPFTEYEYQVWAVNSAGKAPSSWTWCRTGPAPPEGLRAPTFH
AISSTQAVVNISAPGKPNGIVSLYRLFSSSAHGAETVLSEGMATQQTLHGLHAFTNYSIG
VEACTCFNCCSKGPTAELRTHPAPPSGLSSPQIKTLASRTASFRWSPPMFPNGVIHSYEL
QLHVVCPPDSALSCTPSQIETKYTGLGQKASLGGLQPYTTYKLRVVAHNEVGSTASEWIS
FTTQKELPQYRAPFSVDSNLSVVCVNWSDTFLLNGQLKEYVLTDGGRRMYSGFDTTLYIP
RTADKTFFFQVICTTDQGSVKTPLIQYDTSTGLGLVLTTSGEKKGSRSKSTEFYSELWFI
VLMAMLGLILLAIFLSLILQRKIHKEPYIRERPPLVPLQKRMSPLNVYPQGENHMGLADT
KIPRSGTPVSIRSNRSACVLRIPSQSQTSLTYSQGSLHRSVSQLMDIQDKKVLMDNSLWE
AIMGHNSGL
Download sequence
Identical sequences ENSGGOP00000023300 ENSGGOP00000023300

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]