SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000002522 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000002522
Domain Number 1 Region: 95-282,428-634
Classification Level Classification E-value
Superfamily Oligoxyloglucan reducing end-specific cellobiohydrolase 2.14e-49
Family Oligoxyloglucan reducing end-specific cellobiohydrolase 0.0081
Further Details:      
 
Domain Number 2 Region: 760-1011
Classification Level Classification E-value
Superfamily YWTD domain 6.28e-43
Family YWTD domain 0.0000092
Further Details:      
 
Domain Number 3 Region: 1559-1744
Classification Level Classification E-value
Superfamily Fibronectin type III 3.85e-29
Family Fibronectin type III 0.0018
Further Details:      
 
Domain Number 4 Region: 1934-2107
Classification Level Classification E-value
Superfamily Fibronectin type III 2.54e-17
Family Fibronectin type III 0.0054
Further Details:      
 
Domain Number 5 Region: 1416-1454
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000223
Family LDL receptor-like module 0.00092
Further Details:      
 
Domain Number 6 Region: 1196-1233
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000956
Family LDL receptor-like module 0.00072
Further Details:      
 
Domain Number 7 Region: 1323-1359
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000105
Family LDL receptor-like module 0.0011
Further Details:      
 
Domain Number 8 Region: 1075-1113
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000157
Family LDL receptor-like module 0.0011
Further Details:      
 
Domain Number 9 Region: 1370-1403
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000262
Family LDL receptor-like module 0.00095
Further Details:      
 
Domain Number 10 Region: 1238-1271
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000034
Family LDL receptor-like module 0.0018
Further Details:      
 
Domain Number 11 Region: 1152-1193
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000353
Family LDL receptor-like module 0.001
Further Details:      
 
Domain Number 12 Region: 1470-1507
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000144
Family LDL receptor-like module 0.0016
Further Details:      
 
Domain Number 13 Region: 1510-1549
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000314
Family LDL receptor-like module 0.001
Further Details:      
 
Domain Number 14 Region: 1120-1153
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000301
Family LDL receptor-like module 0.0012
Further Details:      
 
Domain Number 15 Region: 1749-1878
Classification Level Classification E-value
Superfamily Fibronectin type III 0.000000548
Family Fibronectin type III 0.0079
Further Details:      
 
Domain Number 16 Region: 1275-1310
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000034
Family LDL receptor-like module 0.0026
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000002522   Gene: ENSGGOG00000002552   Transcript: ENSGGOT00000002575
Sequence length 2215
Comment pep:known_by_projection chromosome:gorGor3.1:11:119475320:119666876:1 gene:ENSGGOG00000002552 transcript:ENSGGOT00000002575 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MATRSSRRESRLPFLFTLVALLPPGALCEVWTQRLHGGSAPLPQDRGFLVVQGDPRELRL
WARGDARGASRADEKPLRRKRSAALQPEPIKVYGQVSLNDSHNQMVVHWAGEKSNVIVAL
ARDSLALARPKSSDVYVSYDYGKSFKKISDKLNFGVGNRSEAVIAQFYHSPVDNKRYIFA
DAYAQYLWITFDFCNTLQGFSIPFRAADLLLHSKASNLLLGFDRSHPNKQLWKSDDFGQT
WIMIQEHVKSFSWGIDPYDKPNTIYIERHEPSGYSTVFRSTDFFQSRENQEVIFEEVRDF
QLRDKYMFPTKVVHLLGSEQQSSVQLWVSFGRKPMRAAQFVTRHPINEYYIADASEDQVF
VCVSHSNNRTNLYISEAEGLKFSLSLENVLYYSPGGAGSDTLVRYFANEPFADFHRVEGL
QGVYIATLINGSMNEENMRSVITFDKGGTWEFLQAPAFTGYGEKINCELSQGCSLHLAQR
LSQLLNLQLRRMPILSKESAPGLIIATGSVGKNLASKTNVYISSSAGARWREALPGPHYY
TWGDHGGIITAIAQGMETNELKYSTNEGETWKTFIFSEKPVFVYGLLTEPGEKSTVFTIF
GSNKENVHSWLILQVNATDALGVPCTENDYKLWSPSHERGNECLLGHKTVFKRRTPHATC
FNGEDFDRPVVVSNCSCTREDYECDFGFKMSEDLSLEVCVPDPEFSGKSYSPPVPCPVGS
TYRRTRGYRKISGDTCSRGDVEARLEGELVPCPLAEENEFILYAVRKSIYRYDLASGATE
QLPLTGLRAAVALDFDYEHNCLYWSDLALDIIQRLCLNGSTGQEVIINSGLETVDALAFE
PLSQLLYWVDAGFKKIEVANPDGDFRLTIVNSSVLDRPRALVLVPQEGVMFWTDWGDLKP
GIYRSNMDGSAAYRLVSEDVKWPNGISVDDQWIYWTDAYLDCIERITFSGQQRSVILDNL
PHPYAIAVFKNEIYRDDWSQLSIFRASRYSGSQMEILANQLTGLMDMKIFYKGKNTGSNA
CVPRPCSLLCLPKANNSRSCRCPEGVSSSVLPSGDLMCDCPQGYQLKNNTCVKEENTCLR
NQYRCSNGNCINSIWWCDFDNDCGDMSDERNCPTTICDLDTQFRCQESGTCIPLSYKCDL
EDDCGDNSDESHCEMHQCRSDEYNCSSGMGIRSSWVCDGDNDCRDWSDEANCTAISHTCE
ASNFQCRNGHCIPQRWACDGDTDCQDGSDEDPVNCEKKCNGFRCPNGTCIPSSKHCDGLR
DCSDGSDEQHCEPLCTHFMDFVCKNRQQCLFHSMVCDGIIQCRDGSDEDAAFAGCSQDPE
FHKVCDEFGFQCQNGVCISLIWKCDGMDDCGDYSDEANCENPTEAPNCSRYFQFRCENGH
CIPNRWKCDRENDCGDWSDEKDCGDSHILPFSTPGPSTCLPNYYRCSSGTCVMDTWVCDG
YRDCADGSDEEACPSLANVTAASTPTQLGRCDRFEFECHQPKKCIPNWKRCDGHQDCQDG
RDEANCPTHSTLTCMSREFQCEDGEACIVLSERCDGFLDCSDESDEKACSDELTVYKVQN
LQWTADFSGDVTLTWMRPKKMPSASCVYNVYYRVVGESIWKTLETHSNKTNTVLKVLKPD
TTYQVKVQVQCLSKAHNTNDFVTLRTPEGLPDAPRNLQLSLPREAEGVIVGHWAPPIHTH
GLIREYIVEYSRSGSKMWASQRAASNFTEIKNLLVNTLYTVRVAAVTSRGIGNWSDSKSI
TTIKGKVIPPPDIHIDSYGENYLSFTLTMESDIKVNGYVVNLFWAFDTHKQERRTLNFRG
SILSHKVGNLTAHTSYEISAWAKTDLGDSPLAFEHVMTRGVRPSAPSLKAKAINQTAVEC
TWTGPRNVVYGIFYATSFLDLYRNPKSLTTSLHNKTVIVSKDEQYLFLVRVVVPYQGPSS
DYVVVKMIPDSRLPPRHLHVVHTGKTSVVIKWESPYDSPDQDLLYAIAVKDLIRKTDRSY
KVKSRNSTVEYTLNKLEPGGKYHIIVQLGNMSKDSSIKITTVSLSAPDALKIITENDHVL
LFWKSLALKEKHFNESRGYEIHMFDSAMNITAYLGNTTDNFFKISNLKMGHNYTFTVQAR
CLFGSQICGEPAILLYDELGSGADASATQAARSTDVAAVVVPILFLILLSLGVGFAILYT
KHRRLQSSFTAFANSHYSSRLGSAIFSSGDDLGEDDEDAPMITGFSDDDVPMVIA
Download sequence
Identical sequences ENSGGOP00000002522

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]