SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000006875 from Gorilla gorilla 69_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000006875
Domain Number 1 Region: 400-533
Classification Level Classification E-value
Superfamily C-type lectin-like 7.7e-25
Family C-type lectin domain 0.0019
Further Details:      
 
Domain Number 2 Region: 1087-1209
Classification Level Classification E-value
Superfamily PKD domain 8.11e-23
Family PKD domain 0.002
Further Details:      
 
Domain Number 3 Region: 2022-2144
Classification Level Classification E-value
Superfamily PKD domain 2.98e-22
Family PKD domain 0.0022
Further Details:      
 
Domain Number 4 Region: 1681-1801
Classification Level Classification E-value
Superfamily PKD domain 1.18e-21
Family PKD domain 0.0068
Further Details:      
 
Domain Number 5 Region: 68-158
Classification Level Classification E-value
Superfamily L domain-like 1.28e-21
Family Ngr ectodomain-like 0.011
Further Details:      
 
Domain Number 6 Region: 1461-1545
Classification Level Classification E-value
Superfamily PKD domain 0.0000000000000209
Family PKD domain 0.0036
Further Details:      
 
Domain Number 7 Region: 983-1121
Classification Level Classification E-value
Superfamily PKD domain 0.00000000000147
Family PKD domain 0.0068
Further Details:      
 
Domain Number 8 Region: 1550-1631
Classification Level Classification E-value
Superfamily PKD domain 0.00000000000314
Family PKD domain 0.0077
Further Details:      
 
Domain Number 9 Region: 264-354
Classification Level Classification E-value
Superfamily PKD domain 0.00000000000445
Family PKD domain 0.00000894
Further Details:      
 
Domain Number 10 Region: 1310-1377
Classification Level Classification E-value
Superfamily PKD domain 0.0000000000144
Family PKD domain 0.0046
Further Details:      
 
Domain Number 11 Region: 1815-1885
Classification Level Classification E-value
Superfamily PKD domain 0.0000000000327
Family PKD domain 0.0055
Further Details:      
 
Domain Number 12 Region: 1883-1970
Classification Level Classification E-value
Superfamily PKD domain 0.000000000051
Family PKD domain 0.0088
Further Details:      
 
Domain Number 13 Region: 1221-1291
Classification Level Classification E-value
Superfamily PKD domain 0.0000000017
Family PKD domain 0.0072
Further Details:      
 
Domain Number 14 Region: 1387-1461
Classification Level Classification E-value
Superfamily PKD domain 0.0000000128
Family PKD domain 0.006
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000006875
Domain Number - Region: 904-1012
Classification Level Classification E-value
Superfamily PKD domain 0.0373
Family PKD domain 0.01
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000006875   Gene: ENSGGOG00000007016   Transcript: ENSGGOT00000007057
Sequence length 3621
Comment pep:novel chromosome:gorGor3.1:16:2224385:2281081:-1 gene:ENSGGOG00000007016 transcript:ENSGGOT00000007057 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MPPAAPVRLALALGLGLWLGALAGGPGRGCGPCEPPCLCGPAPGAACRVNCSGRGLRTLG
PALRIPADATALDVSHNLLRALDVGLLANLSALAELDISNNKISTLEEGIFANLFNLSEI
NLSGNPFECDCGLAWLPRWAEEQQVRVVQPEAATCAGPGSLAGQPLLGIPLLDSGCGEEY
VACLPDNSSGTVAAVSFSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSA
SFACLSCSSPPPPPAPTCRGRTLLQHVFPASPGAALVGPHGPLASGRLAAFHIAAPLPVT
ATRWDFGDGSPEVDAAGPAASHRYVLPGRYHVTAVLALGAGSALLGTDVQVEAAPATLEL
VCPSSVQSDESLDLSIRNRGGSGLEAAYSIVALGEEPARAVHPLCPSDTEIFPGSGHCYR
LVVEKAAWLQAQEQCRAWAGAALAMVDSPAVQRFLVSRVTRSLDVWIGFSTVQGVEVGPA
PQGEAFSLESCQNWLPGEPHPATAEHCVRLGPTGWCNTDLCSAPHSYVCELRPGGPVQDA
ENLLVGAPSGDLQGPLTPLAQQDGLSAPHEPVEVMVFPGLRLSREAFLTTAEFGTQELRR
PAQLRLQVYRLLSTAGTPENGSEPESRSPDNRTQLVPACMPGGCWCPGANICLPLDASCH
PQACANGCTSGPGLPGAPYALWREFLFSVPAGPPAQYSVTLHGQDVLMLPGDLVGLQHDA
GPSALLHCSPAPSHPGPQAPYLSANASSWLPHLPAQLEGTWACPACALRLLAATEQLTVL
LGLRPNPGLRLPGRYEVRAEVGNGVSRHNLSCSFDVVSPVAGLRVIYPAPHHGRLYVPTN
GSALVLQVDSGANATATARWPGGSVSARFENACPALVATFVPGCPWETNDTLFSAVALPW
LSEGEHVMDVVVENSASRANLSLRVTAEEPICGLRATPSPEARVLQGVLVRYSPVVEAGS
DMVFRWTINDKQSLTFQNVVFNVIQSAAVFKLSLTASNHVSNVTVNYNVTVERMNRMQGL
RVSTVPAVLSPNATLALTAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSV
AQVLVEHNVTHTYAAPGEYILTVLASNAFENLTQQVPVSVRTSLPSVAVGVSDGVLVAGR
PVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAA
QADVRVFEELRGLSVDMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEH
VYLRAQNCTVTVGAASPAGHLAQSLHVLVFVLEVLRIEPAACIPTQPHARLTAYVTGNPA
HYLFDWTFGDGSSNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAHYFTSICVEPEVGN
VTLQPERQFVQLGDEARLVACAWPPFPYRYTWDFGTEEAGPARAGGPEVTCIYRDPGSYL
VTVTASNNISAANDSALVEVQEPVLVTSIKVNGSLGLELQQPYLFSVVGRGRPASYVWDL
GDGGRLEGPEVTHAYNSTGDFTVRVAGWNEVSHSEAWLNVTVKRRVRGLVVNASRTVVPL
NGSVSFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQD
SIFVYVLQLIEGLQVVGGGRYFPTNHTVQLQAVVRDGTNISYSWTAWRDRGPALAGSGKG
FSLTALEAGTYHVRLRATNMLGSAWADCTVDFVEPVGWLMVAASPNPAAVNTSVTLCAEL
AGGSGVVYTWSLEEGLSWETTEPFTTHSFPTPGLHLVTMTAGNPLGSANATMEVDVQVPV
SGLSIRASEPGGSFVAAGSSVPFWGQLATGTNVSWCWVVPGGSSKRGPHVTMVFPDAGTF
SIRLNASNAVSWVSATYNLTVEEPIVGLVLWASSKVVAPGQPVHFQILLAAGSAVTFRLQ
VGGASPEVLPGPRFSHSFPRVGDHVVSVQGKNHVSWAQAQVRIVVLEAVSGLQVPNCCES
GIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLVILSGRDVTYTPVAAGLLEIQVRA
FNALGSENRTLVLEVQDAVQYVALRSGPCFTNRSAQFEATTSPSPRRVAYHWDFGDGSPG
QDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVAQATVTVQVLACREPEVDVVLPLQVLMR
RSQRNYLEAHVDLRDCVTYQTEYRWEVYRTASCQRPGRPARVALPGVDVSRPQLVLPRLA
LPVGHYCFVFVVSFGDTPLARSIQANVTVAPERLVPIIEGGSYRVWSDTQDLVLDGSESY
DPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRGSSTVTIPRERLAAGVEYTFSLTV
WKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKR
GRWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVLRDGEGYTFTLSVLGRSGEEEGCAS
IRLSPNRPPLGGSCRLFPLGAVHALTTKVHFECTGWYDAEDAGAPLVYALLLRRCRQGHC
EEFCVYKGSLSSYEAVLPPGFRPHFEVGLAVVVQDQLGAAVVALNRSLAITLPEPNGSAT
GLTVWLHGLTASVLPGLLRQANPQHVIEYSLALVTVLNEYEQALEVAAEPKHERQRRAQI
RKNITETLVSLRVHTVDDIQQIAAALAQCMTRDPAGSYHLNLSSHFRWSALEVSVGLYTS
LCQYFSEEDVVWRTEALLPLEETSPRQAVCLTRHLTAFGASLFVPRSHVRFVFPEPTVDV
NYIVMLTCAVCLVTYMVMAAIVHKLDQLDASRGHAIPFCGQWGRFKYEILVKTGWGRGSS
STPGEKTETVALQRLGELGPPSPGLNWEQPQAARLSRTGLVEGLRKRLLPAWCASLAHGL
SLLLVAVAVAVSGWVGASFPPGVSVAWLLSSSASFLASFLGWEPLKVLLEALYFSLVAKR
LHPDEDDTLVESPAVTPVSARVPRVRPPHGFALFLAKEEARKVKRLHGMLRSLLVYMLFL
LVTLLASYGDASCHGHAYRLQSAIKQELHSRAFLAITRSEELWPWMAHVLLPYVHGNQSS
PELGPPRLRQVRLQEALYPDPPGPRVHTCSAAGGFSTSDYDVGWGSPHNGSGTWAYSAPD
LLGAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFVELTRYSPAV
GLHAAVTLRLEFPAAGRALAALSIRPFALRRLSAGLSLPLLTSVCLLLFALHFAVAEART
WHREGRWRVLRLGAWARWLLVALTAATALVRLAQLGAADRQWTRFVRGRPRRFTSFDQVA
QLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYA
QLAVLLVSSCVDSLWSVAQALLVLCPGTGLSTLCPAESWQLSPLLCVGLWALRLWGALRL
GAVILRWRYHALRGELYRPAWEPQDYEMVELFLRRLRLWMGLSKVKEFRHKVRFEGMEPL
PSRSSRGSKVSPDVPPPSAGSDASHPSTSSSQLDGLSVSLGRLGTRCEPEPSRLQAVFEA
LLTQFDLNQATEDVYQLEQQLHSLQGRRSSRAPAGPSRGPSPGLRPALPSRLVRASRGVD
LATGPSRTPLRAKNKVHPSST
Download sequence
Identical sequences ENSGGOP00000006875 ENSGGOP00000006875

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]