SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSGGOP00000021744 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSGGOP00000021744

Domain  Region               Level        E-value            Classification
1       44-513               Superfamily  1.01e-131          Sema domain
                             Family       0.000017           Sema domain
2       1357-1483,1678-1867  Superfamily  7.85e-53           GTPase activation domain, GAP
                             Family       0.032              p120GAP domain-like
3       1046-1150            Superfamily  5.1e-16            E set domains
                             Family       0.075              NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain
4       959-1047             Superfamily  9.29e-16           E set domains
                             Family       0.029              Other IPT/TIG domains
5       862-961              Superfamily  0.000000000000233  E set domains
                             Family       0.011              Other IPT/TIG domains
6       516-568              Superfamily  0.000000000157     Plexin repeat
                             Family       0.0023             Plexin repeat
7       1147-1237            Superfamily  0.000000392        E set domains
                             Family       0.046              E-set domains of sugar-utilizing enzymes
8       815-848              Superfamily  0.0000275          Plexin repeat
                             Family       0.0027             Plexin repeat
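
The strong hits above appear to be listed by ascending superfamily E-value rather than by position, and domain 2 is assigned to two separate segments. As a minimal sketch (not part of SUPERFAMILY; the names DOMAINS, parse_region, region_length and n_to_c_order are hypothetical, and regions are assumed to be 1-based and inclusive), the region strings can be parsed and reordered along the sequence like this:

```python
# Illustrative sketch only. All names here are hypothetical helpers, not a
# SUPERFAMILY API. Values are copied from the strong hits listed above.
DOMAINS = [
    # (domain number, region string, superfamily, superfamily E-value)
    (1, "44-513", "Sema domain", 1.01e-131),
    (2, "1357-1483,1678-1867", "GTPase activation domain, GAP", 7.85e-53),
    (3, "1046-1150", "E set domains", 5.1e-16),
    (4, "959-1047", "E set domains", 9.29e-16),
    (5, "862-961", "E set domains", 0.000000000000233),
    (6, "516-568", "Plexin repeat", 0.000000000157),
    (7, "1147-1237", "E set domains", 0.000000392),
    (8, "815-848", "Plexin repeat", 0.0000275),
]


def parse_region(region):
    """Split 'a-b,c-d' into a list of (start, end) integer pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Residues covered by a region, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


def n_to_c_order(domains):
    """Sort domains by the start of their first segment."""
    return sorted(domains, key=lambda d: parse_region(d[1])[0][0])


if __name__ == "__main__":
    for number, region, superfamily, evalue in n_to_c_order(DOMAINS):
        # .3g prints every E-value in one consistent notation
        print(f"{number}\t{region}\t{region_length(region)}\t{evalue:.3g}\t{superfamily}")
```

Read in sequence order, the assignments run Sema domain, Plexin repeat, Plexin repeat, four E set domains, then the split GTPase activation domain. Some neighbouring regions share a few boundary residues (for example 862-961 and 959-1047).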
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000021744   Gene: ENSGGOG00000000847   Transcript: ENSGGOT00000026988
Sequence length 1897
Comment pep:known_by_projection chromosome:gorGor3.1:3:126779626:126828440:1 gene:ENSGGOG00000000847 transcript:ENSGGOT00000026988 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MPLPPRSLQVLLLLLLLLLLLLPGMWAEAGLPRAGGGSQPPFRTFSASDWGLTHLVVHEQ
TGEVYVGAVNRIYKLSGNLTLLRAHVTGPVEDNEKCYPPPSVQSCPHGLGSTDNVNKLLL
LDYAANRLLACGSASQGICQFLRLDDLFKLGEPHHRKEHYLSSVREAGSMAGVLIAGPPG
QGQAKLFVGTPIDGKSEYFPTLSSRRLMANEEDADMFGFVYQDEFVSSQLKIPSDTLSKF
PAFDIYYVYSFRSEQFVYYLTLQLDTQLTSPDAAGEHFFTSKIVRLCVDDPKFYSYVEFP
IGCEQAGVEYRLVQDAYLSRPGRALARQLGLAEDEDVLFTVFAQGQKNRVKPPKESALCL
FTLRAIKEKIKERIQSCYRGEGKLSLPWLLNKELGCINSPLQIDDDFCGQDFNQPLGGTV
TIEGMPLFVDKDDGLTAVAAYDYRGRTVVFAGTRSGRIRKILVDLSNPGGRPALAYESVV
AQEGSPILRDLVLSPNHQYLYAMTEKQVTRVPVESCVQYTSCELCLGSRDPHCGWCVLHS
ICSRRDACERADEPQRFAADLLQCVQLTVQPRNVSVTMSQVPLVLQAWNVPDLSAGVNCS
FEDFTESESILEDGRIHCRSPSAREVAPITRGQGDQRVVKLYLKSKETGKKFASVDFVFY
NCSVHQSCLSCVNGSFPCHWCKYRHVCTHNVADCAFLEGRVNVSEDCPQILPSTQIYVPV
GVVKPITLAARNLPQPQSGQRGYECLFHIPGSPARVTALRFNSSSLQCQNSSYSYEGNDV
SDLPVNLSVVWNGNFVIDNPQNIQAHLYKCPALRESCGLCLKADPRFECGWCVAERRCSL
RHHCAADTPASWMHARHGSSRCADPKILKLSPETGPRQGGTRLTITGENLGLRFEDVRLG
VRVGKVLCSPVESEYISAXXIVCEIGDASSVRAHDALVEVCVRDCSPHYRALSPKRFTFV
TPTFYRVSPSRGPLSGGTWIGIEGSHLNAGSDVAVSVGGRPCSFSWRNSREIRCLTPPGQ
SPGSAPIIININRAQLTNPEVKYNYTEDPTILRIDPEWSINSGGTLLTVTGTNLATVREP
RIRAKYGGIERENGCLVYNDTTMVCRAPSVANPVRSPPELGERPDELGFVMDNVRSLLVL
NSTSFLYYPDPVLEPLSPTGLLELKPSSPLILKGRNLLPPAPGNSRLNYTVLIGSTPCTL
TVSETQLLCEAPNLTGQHKVTVRAGGFEFSPGTLQVYSDSLLTLPAIVGIGGGGGLLLLV
IVAVLIAYKRKSRDADRTLKRLQLQMDNLESRVALECKEAFAELQTDIHELTNDLDGAGI
PFLDYRTYAMRVLFPGIEDHPVLKEMEVQANVEKSLTLFGQLLTKKHFLLTFIRTLEAQR
SFSMRDRGNVASLIMTALQGEMEYATGVLKQLLSDLIEKNLESKNHPKLLLRRTESVAEK
MLTNWFTFLLYKFLKECAGEPLFMLYCAIKQQMEKGPIDAITGEARYSLSEDKLIRQQID
YKTLTLNCVNPENENAPEVPVKGLDCDTVTQAKEKLLDAAYKGVPYSQRPKAADMDLEWR
QGRMARIILQDEDVTTKIDNDWKRLNTLAHYQVTDGSSVALVPKQTSAYNISNSSTFTKS
LSRYESMLRTASSPDSLRSRTPMITPDLESGTKLWHLVKNHDHLDQREGDRGSKMVSEIY
LTRLLATKGTLQKFVDDLFETIFSTAHRGSALPLAIKYMFDFLDEQADKHQIHDADVRHT
WKSNCLPLRFWVNVIKNPQFVFDIHKNSITDACLSVVAQTFMDSCSTSEHKLGKDSPSNK
LLYAKDIPNYKSWVERYYADIAKMPAISDQDMSAYLAEQSRLHLSQFNSMSALHEIYSYI
TKYKDEILAALEKDEQARRQRLRSKLEQVVDTMALSS
Identical sequences ENSGGOP00000000836 ENSGGOP00000021744
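
The Comment field in the record above packs several key:value tokens into one line. The sketch below shows one way to split it, assuming the location token follows the layout coordinate system:assembly:name:start:end:strand. That layout is an assumption read off this record, and the function names parse_comment and parse_location are hypothetical.

```python
# Illustrative sketch only. The string is copied from the Comment field above.
# The function names and the assumed location layout are not from SUPERFAMILY.
COMMENT = (
    "pep:known_by_projection "
    "chromosome:gorGor3.1:3:126779626:126828440:1 "
    "gene:ENSGGOG00000000847 transcript:ENSGGOT00000026988 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Map each token's first colon-separated part to the rest of the token."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Split 'assembly:name:start:end:strand' (assumed layout) into typed parts."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    location = parse_location(fields["chromosome"])
    print(fields["gene"], fields["transcript"], location)
```

Because only the first colon is used as the key separator, the location token keeps its internal colons and can be split again in a second step.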
