SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000011210 from Gorilla gorilla 76_3.1


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000011210
Domain Number 1 Region: 1577-1855
Classification Level  Classification                                                E-value
Superfamily           (Phosphotyrosine protein) phosphatases II                     8.84e-99
Family                Higher-molecular-weight phosphotyrosine protein phosphatases  0.00000109
 
Domain Number 2 Region: 271-458
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  1.92e-44
Family                Fibronectin type III  0.00083
 
Domain Number 3 Region: 460-657
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  3.27e-38
Family                Fibronectin type III  0.0011
 
Domain Number 4 Region: 849-1034
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  4.08e-38
Family                Fibronectin type III  0.0017
 
Domain Number 5 Region: 1-50,134-258
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  7.74e-36
Family                Fibronectin type III  0.00096
 
Domain Number 6 Region: 661-846
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  2.99e-30
Family                Fibronectin type III  0.0012
 
Domain Number 7 Region: 1032-1220
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  1.18e-29
Family                Fibronectin type III  0.0028
 
Domain Number 8 Region: 1237-1336
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  0.000000000000417
Family                Fibronectin type III  0.0023
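
The strong hits above appear in order of increasing superfamily E-value rather than in order along the protein. As a minimal sketch (not part of the SUPERFAMILY server or any API it offers), the Python snippet below re-enters the eight regions by hand and sorts them by start position. The `Domain` class and the helpers `parse_region`, `in_sequence_order` and `covered_residues` are hypothetical names introduced only for this illustration, and regions are assumed to be 1-based and inclusive.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    number: int          # "Domain Number" as listed on the page
    superfamily: str     # superfamily-level classification
    evalue: float        # superfamily-level E-value
    segments: tuple      # ((start, end), ...), assumed 1-based inclusive


def parse_region(text):
    """Turn a region string such as '1-50,134-258' into ((1, 50), (134, 258))."""
    return tuple(
        tuple(int(x) for x in part.split("-")) for part in text.split(",")
    )


# Values copied by hand from the strong-hits listing for ENSGGOP00000011210.
RAW = [
    (1, "(Phosphotyrosine protein) phosphatases II", "8.84e-99", "1577-1855"),
    (2, "Fibronectin type III", "1.92e-44", "271-458"),
    (3, "Fibronectin type III", "3.27e-38", "460-657"),
    (4, "Fibronectin type III", "4.08e-38", "849-1034"),
    (5, "Fibronectin type III", "7.74e-36", "1-50,134-258"),
    (6, "Fibronectin type III", "2.99e-30", "661-846"),
    (7, "Fibronectin type III", "1.18e-29", "1032-1220"),
    (8, "Fibronectin type III", "0.000000000000417", "1237-1336"),
]

DOMAINS = [Domain(n, sf, float(ev), parse_region(rg)) for n, sf, ev, rg in RAW]


def in_sequence_order(domains):
    """Sort domains by the start of their first segment."""
    return sorted(domains, key=lambda d: d.segments[0][0])


def covered_residues(domains):
    """Count residues covered by at least one domain (overlaps counted once)."""
    covered = set()
    for d in domains:
        for start, end in d.segments:
            covered.update(range(start, end + 1))
    return len(covered)


if __name__ == "__main__":
    for d in in_sequence_order(DOMAINS):
        spans = ",".join(f"{s}-{e}" for s, e in d.segments)
        print(f"{spans:>14}  domain {d.number}  {d.superfamily}  {d.evalue:.3g}")
    print("residues covered:", covered_residues(DOMAINS))
```

Read this way, the listing shows the Fibronectin type III assignments ahead of the single phosphatase assignment, which sits near the C-terminal end of the 1894-residue sequence.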
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s): Protein: ENSGGOP00000011210; Gene: ENSGGOG00000011494; Transcript: ENSGGOT00000011539
Sequence length: 1894
Comment: pep:known_by_projection chromosome:gorGor3.1:12:79171808:79368495:1 gene:ENSGGOG00000011494 transcript:ENSGGOT00000011539 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
PGAVFDLQLAEVEATQVRITWKKPRQPNGIINQYRVKVLVPETGIILENTLLTGNNEYIN
DPMAPEIVNMVEPMVGLYEGSAEMSSDLHSLATFIYNSHPDKKFPARNRAEDQTSPVVTT
RNQYITDIAAEQLSYVIRRLVPFTEHMISVSAFTIMGEGPPTFLSVRTRQQVPSSIKIIN
YKNISSSSILLYWDPPEYPNGKITHYTIYAMELDTNRAFQITTIDNSFLITGLKKYTKYK
MRVAASTHVGESSLSEENDIFVRTPEDEPESSPQDVEVIDVTADEIRLKWSPPEKPNGII
IAYEVLYKNTDTLYMKNTSTTDIILRNLRPHTLYNISVRSYTRFGHGNQVSSLLSVRTSE
TVPDSAPENITYKNISSGEIELSFLPPSSPNGIIHKYTIYLKRSNGNEERTINTTSLTQN
IKGLKKYTQYIIEVSASTLKGEGVRSAPISILTEEDAPDSPPQDFSVKQLSGVTVKLSWQ
PPLEPNGIILYYTVYVWNRSSLKTINVTETSLELSDLDCNVEYSAYVTASTRFGDGKTRS
NIISFQTPEGAPSDPPKDVYYVNLSSSSIILFWTPPSKPNGIIQYYSVYYRNTSGTFMQN
FTLHEVTNDFDNITVSTIIDKLTIFSYYTFWLTASTSVGNGNKSSDIIEVYTDQDIPEGL
VGNLTYESISSTAINVSWVPPAQPNGLVFYYVSLILQQTPRHVRPPLVTYERSIYFDNLE
KYTDYILKITPSTEKGFSDTYTAQLYIKTEEDVPETSPIINTFKNLSSTSVLLSWDPPVK
PNGAIISYDLTLQGPNENYSFITSDNYIILEELSPFTLYSFFAAARTRKGLGPSSILFFY
TDESVPLAPPQNLTLINCTSDFVWLKWSPSPLPGGIVKVYSFKIHEHETDTIYYKNISGF
KTEAKLVGLEPVSTYSIRVSAFTKVGNGNQFSNVVKFTTQESVPDVVQNMQCMATSWQSV
LVKWDPPKKANGIITQYMITVERNSTKVSPQDHMYTFVKLLANTSYVFKVRASTSAGEGD
ESTCHVSTLPETVPSVPTNIAFSDVQSTSATLTWIRPDSILGYFQNYKITTQLRAQKCKE
WESEECVEYQKIQYLYEAHLTEETVYGLKKFRWYRFQVAASTNAGYGNASNWISTQTLPG
PPDGPPENVHVVATSPFSISISWSEPAVITGPTCYLIDVKSVDNDEFNISFIKSNEENKT
IEIKDLEIFTRYSVVITAFTGNISAAYVEGKSSAEMIVTTLESAPKDPPNNMTFQKIPDE
VTKFQLTFLPPSQPNGNIQVYQALVYREDDPTAVQIHNLSIIQKTNTFIIAMLEGLKGGH
TYNISVYAVNSAGAGPKVPMRITMDIKAPARPKTKPTPIYDATGKLLVTSTTITIRMPIC
YYSDDHGPIKNVQVLVTETGAQHDGNVTKWYDAYFNKARPYFTNEGFPNPPCTEGKTKFS
GNEEIYIIGADNACMIPGNEDKICNGPLKPKKQYLFKFRATNIMGQFTDTDYSDPVKTLG
EGLSERTVEIILSVTLCILSIILLGTASFAFARIRQKQKEGGTYSPQDAEIIDTKFKLDQ
LITVADLELKDERLTRPISKKSFLQHVEELCTNNNLKFQEEFSELPKFLQDLSSTDADLP
WNRAKNRFPNIKPYNNNRVKLTADASVPGSDYINASYISGYLCPNEFIATQGPLPGTVGD
FWRMVWETRAKTLVMLTQCFEKGRIRCHQYWPEDNKPVTVFGDIVITKLMEDVQIDWTIR
DLKIERHGDCMTVRQCNFTAWPEHGVPENSAPLIHFVKLVRASRAHDTTPMIVHCSAGVG
RTGVFIALDHLTQHINDHDFVDIYGLVAELRSERMCMVQNLAQYIFLHQCILDLLSNKGS
NQPICFVNYSALQKMDSLDAMEGDVELEWEETTM
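
The Comment line of this record packs several fields into one space-separated string of `key:value` tokens. The following Python sketch splits it into a dictionary and then unpacks the location token. It assumes the location follows the pattern `coordinate_system:assembly:name:start:end:strand`, which is how the values in this record read; the function names `parse_comment` and `parse_location` are hypothetical and exist only for this example.

```python
# Comment string copied verbatim from the record above.
COMMENT = (
    "pep:known_by_projection "
    "chromosome:gorGor3.1:12:79171808:79368495:1 "
    "gene:ENSGGOG00000011494 "
    "transcript:ENSGGOT00000011539 "
    "gene_biotype:protein_coding "
    "transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split 'key:value' tokens on the first colon only."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Unpack 'assembly:name:start:end:strand' (assumed layout)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    location = parse_location(fields["chromosome"])
    print(fields["gene"], fields["transcript"], fields["gene_biotype"])
    print(location)
```

Splitting on the first colon only keeps the multi-part location value intact, so it can be unpacked in a separate step.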
Identical sequences: ENSGGOP00000011210
