SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000010859 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000010859
Domain Number 1 Region: 35-502
Classification Level Classification E-value
Superfamily Sema domain 1.57e-127
Family Sema domain 0.0000154
Further Details:      
 
Domain Number 2 Region: 1343-1471,1687-1875
Classification Level Classification E-value
Superfamily GTPase activation domain, GAP 1.96e-50
Family p120GAP domain-like 0.027
Further Details:      
 
Domain Number 3 Region: 1036-1138
Classification Level Classification E-value
Superfamily E set domains 5.23e-16
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.081
Further Details:      
 
Domain Number 4 Region: 948-1036
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000019
Family E-set domains of sugar-utilizing enzymes 0.082
Further Details:      
 
Domain Number 5 Region: 850-950
Classification Level Classification E-value
Superfamily E set domains 0.00000000000014
Family Other IPT/TIG domains 0.013
Further Details:      
 
Domain Number 6 Region: 505-556
Classification Level Classification E-value
Superfamily Plexin repeat 0.00000000000146
Family Plexin repeat 0.0019
Further Details:      
 
Domain Number 7 Region: 1136-1226
Classification Level Classification E-value
Superfamily E set domains 0.000000504
Family E-set domains of sugar-utilizing enzymes 0.043
Further Details:      
 
Domain Number 8 Region: 804-836
Classification Level Classification E-value
Superfamily Plexin repeat 0.0000196
Family Plexin repeat 0.0021
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000010859   Gene: ENSGACG00000008199   Transcript: ENSGACT00000010882
Sequence length 1906
Comment pep:known_by_projection group:BROADS1:groupXVII:7545775:7673484:1 gene:ENSGACG00000008199 transcript:ENSGACT00000010882 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
LLLSLSFPHPLNLASRYPSMSSSLSGGQPPPLKRFVPAEWGLTHLAIHNKSGEVYVGAVN
WIFKLSSNLTKLRSHMTGPVVDNEKCYPPPSVQSCPHDLAQTPNVNKLLLVDYAQNRLIA
CGSTSQGICQFLRLDDLFKLGEPHHRKEHYLSSVAESGTMSGVIISSSHSPTSKLFIGTP
IDGKSEYFPTLSSRKLMENEENADMFSFVYQDEFVSSQLKIPSDTLSKFPAFDIYYVYSF
SSEQFVYYLTMQLDTQLTSPDASGEQFFTSKIVRLCVDDPKFYSYVEFPIGCTKDGVEYR
LVQDAFLARPGRQLATSLGISENEDILFTVFSQGQKNRAKPPKESALCLFTLRKIKEKIK
ERIQSCYKGSGKLSLPWLLNKELACINSPLQIDDNFCGQDFNQPLGGTSTIEGTPLFIDK
DDGMTSVAAYDYRGNTVAFVGTRNGKLKKILVNSVNPTRPAALYEKVTVSEVGSPLLRDM
LFSPDMQYIYTLTDRQTSRVPVESCEQYTTCGECLGSRDPHCGWCVLHNVCSRKDRCERA
GEPQRFASDQRQCVELTVQPRNISVTMADVQLVLQARNVPDLLAGVNCSFEDYVETEGQI
QGGHIFCRSPSLRDIIPITRNKGDKRVVKLYLKSKETGKKFASVDFVFYNCSVHQSCLSC
VNGSFPCHWCKYRHMCTQNANDCSFQEGRVNISEDCPQIVPSAQIFIPVGVTKPITLAAK
NLPQPQSGQRNYECVFHIQGETQSVPALRFNSTSIQCQKTAYAYDGNDISDLPVDLSVVW
NGDFVIDNPYNIQAHLYKCYALRESCGMCLKANPRFECGWCVQEKKCSMRQECTPPESTW
MHATTGNSRCAHPKITKLSPETGPRQGGTMLTITGENLGLQFKDIQSGVRIGKVACNPQE
DQYISAEQIVCRLNDATGYRVQDAQVEVCVRDCIHPDYKAVSSKAFTFVSPYFTRVLPST
GPLSGGTRITIEGSHLNAGSAVSVKIGLHPCRFERRGNKEIVCVTPAGQTPGTTPVMVDI
NSAELRNPEVKFNYSDDPTILKIEPDWSIASGGTMLTITGTNLDTIKEPKMRAKYGAAKS
ENNCTVLNNTVMVCLAPSVAGSDKGFLESGSSPDEIGFVMDDVRSVLVVNETFSYHPDPV
FEPLSPSGMLELKPSSPLILKGRNLIPAAPGNAKLNYTVLIGETPCVLTLSESQLLCEWP
NLTGEHKVTVRVGGFEYSPGTLQIYSDSLLTLPAIIGIGGGGGLLLLVIIVVLIAYKRKS
RDADRTLKRLQLQMDNLESRVALECKEAFAELQTDIHELTQELDGAGIPFLEYRTYAMRV
LFPGIEDHPVLKEMEVPANTEKALTLFGQLLTKKHFLLTFIRTLEAQRSFSMRDRGNVAS
LIMTALQGEMEYATGVLKQLLSDLIDKNLESKNHPKLLLRRTESVAEKMLTNWFTFLLYK
FLKECAGEPLFMLYCAMKQQMEKGPIDSITGEARYSLSEDKLIRQQIDYKTLTQCIDMDI
FCDNTDNLQLPKTLHCVNPENENAPEVTVKSLNCDTVTQVKEKLLDAVYKGTPYSQRPKA
SDMDLGKWRQGRMARIILQDEDITTKIDNDWKRLNTLAHYQVTDGSLIALVPKQNSAYNI
SNSSTFTKSLSRYESMLRTASSPDSLRSRTPMITPDLESGTKLWHLVKNHDHSDQREGDR
GSKMVSEIYLTRLLATKGTLQKFVDDLFETIFSTAHRGSALPLAIKYMFDFLDEQADKHS
ISDSDVRHTWKSNCLPLRFWVNVIKNPQFVFDIHKNSITDACLSVVAQTFMDSCSTSEHK
LGKDSPSNKLLYAKDIPNYKNWVERYYSDISRMSAISDQDMSAYLAEQSRLHANQFNSMS
ALNEIYSYIVKYKDEILSALERDEQARRQRLRSKLEQVIDTMALSS
Download sequence
Identical sequences G3NZT6
ENSGACP00000010859 69293.ENSGACP00000010859 ENSGACP00000010859

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]