SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000016831 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000016831

Domain  Region               Superfamily         Superfamily E-value  Family                               Family E-value
1       2208-2301            C-type lectin-like  4.24e-33             Link domain                          0.00011
2       508-646              FAS1 domain         1.44e-30             FAS1 domain                          0.0015
3       1472-1520,1598-1712  FAS1 domain         2.75e-28             FAS1 domain                          0.0028
4       1714-1867            FAS1 domain         1.96e-26             FAS1 domain                          0.0042
5       964-1121             FAS1 domain         1.16e-22             FAS1 domain                          0.0046
6       385-498              FAS1 domain         1.7e-20              FAS1 domain                          0.0037
7       2299-2462            FAS1 domain         5.76e-17             FAS1 domain                          0.01
8       1139-1230            FAS1 domain         0.00000000249        FAS1 domain                          0.0084
9       1544-1581            EGF/Laminin         0.00000838           Merozoite surface protein 1 (MSP-1)  0.065
 
Weak hits

Sequence:  ENSGGOP00000016831

Domain  Region     Superfamily                    Superfamily E-value  Family                               Family E-value
-       1997-2136  Growth factor receptor domain  0.000377             Growth factor receptor domain        0.01
-       1369-1501  Growth factor receptor domain  0.000659             Growth factor receptor domain        0.014
-       904-960    EGF/Laminin                    0.00082              EGF-type module                      0.046
-       198-231    EGF/Laminin                    0.00102              Merozoite surface protein 1 (MSP-1)  0.039
-       829-861    EGF/Laminin                    0.00261              EGF-like domain of nidogen-1         0.072
-       1968-2009  EGF/Laminin                    0.00395              EGF-type module                      0.02
-       2140-2176  EGF/Laminin                    0.00768              EGF-like domain of nidogen-1         0.06
-       230-283    EGF/Laminin                    0.0123               EGF-type module                      0.046
-       863-910    EGF/Laminin                    0.0127               EGF-like domain of nidogen-1         0.078
-       736-770    EGF/Laminin                    0.0636               EGF-type module                      0.038
-       109-152    EGF/Laminin                    0.0737               EGF-type module                      0.025
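
The Region values listed above are start-end residue ranges, and a single domain can span several comma-separated segments (strong hit 3, for example). The following is a minimal sketch of how such a string could be split into segments and its covered length computed. The function names are hypothetical, and treating both endpoints as inclusive is an assumption, not something this page states.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '1472-1520,1598-1712' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total residues covered, assuming both endpoints are inclusive."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Regions copied from strong hits 1 and 3 on this page.
for number, region in {1: "2208-2301", 3: "1472-1520,1598-1712"}.items():
    print(number, parse_region(region), region_length(region))
```

Under the inclusive-endpoint assumption, the split region of strong hit 3 covers 49 + 115 = 164 residues.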
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000016831   Gene: ENSGGOG00000015946   Transcript: ENSGGOT00000028045
Sequence length 2570
Comment pep:known_by_projection chromosome:gorGor3.1:3:53838594:53867994:1 gene:ENSGGOG00000015946 transcript:ENSGGOT00000028045 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MAGPRGLLPLCLLAFCLAGFSFVRGQVLFKGCDVKTTFVTHVPCTSCAAIKKQTCPSGWL
RELPDQITQDCRYEVQLGGSMVSMSGCRRKCRKQVVQKACCPGYWGSRCYECPGGAETPC
NGHGTCLDGMDRNGTCVCQENFRGSACQECQDPNRFGPDCQSVCSCVHGVCNHGPRGDGS
CLCFAGYTGPHCDQELPVCQELRCPQNTQCSAEAPSCRCLPGYTQQGSECRAPNPCWPSP
CSLLAQCSVSPKGQAQCHCPENYHGDGMVCLPKDPCTDNLGGCPSNSTLCVYQKPGQAFC
TCRPGLVSINSNASAGCFAFCSPFSCDRSATCQVPADGKTSCVCRESEVGDGRACYGHLL
HEVQKATQTGRVFLQLRVAVAMMDQGCREILTTAGPFTVLVPSVSSFSSRTMNASLAQQL
CRQHIIAGQHILEDTRTQQTRRWWTLAGQEITVTFNQFTKYSYKYKDQPQQTFNIYKANN
IAANGVFHVVTGLRWQAPSGTPGDPKRTIGQILASTEAFSRFETILENCGLPSILDGPGP
FTVFAPSNEAVDSLRDGRLIYLFTAGLSKLQELVRYHIYNHGQLTVEKLISKGRILTMAN
QVLAVNISEEGRILLGPEGVPLQRVDVMAANGVIHMLDGILLPPTILPILPKHCSEEQHK
IMAGSCVDCQALNTSTCPPNSVKLDIFPKECVYIHDPTGLNVLKKGCASYCNQTIMEQGC
CKGFFGPDCTQCPGGFSNPCYGKGNCSDGIQGNGACLCFPDYKGIACHICSNPNKHGDQC
QEDCGCVHGLCDNRPGSGGVCQQGTCAPGFSGQFCNESMGDCGPTGLAQHCHLHARCVSQ
EGVARCRCLDGFEGDGFSCTPSNPCSHPDRGGCSENAECVPGSLGTHHCTCHKGWSGDGR
VCVAIDECELDVRGGCHTDALCSYVGPGQSRCTCKLGFAGDGYQCSPIDPCRAGNGGCHG
LATCRAVGGGQRVCTCPPGFGGDGFSCYGDIFRELEANAHFSIFYQWLKSAGITLPADRR
VTALVPSEAAVRQLSPEDRAFWLQPRMLPNLVRAHFLQGALFEEELARLGGQEVATLNPT
TRWEIRNISGRVWVQNASVDVADLLATNGVLHILSQVLLPPRGDVPGGQGLLQQLDLVPA
FSLFRELLQHHRLVPQIEAATAYTIFVPTNRSLEAQGNSSHLDADTVRHHVVLGEALSME
TLRKGGHRNSLLGPAHWIVFYNHSGQPEVNHVPLEGPMLEAPGRSLIGLSGVLTVGSSRC
LHSHAEALREKCVNCTRRFRCTQGFQLQDTPRKSCVYRSGFSFSRGCSYTCAKKIQVPDC
CPGFFGTLCEPCPGGLGGVCSGHGQCQDRFLGSGECHCHEGFHGTACEVCELGRYGPNCT
GVCDCAHGLCQEGLQGDGSCVCNVGWQGLRCDQKITSPQCPRKCDPNANCVWDSAGASTC
ACAAGYSGNGIFCSEVDPCAHGHGGCSPHANCTKVAPGQRTCTCQDGYMGDGELCQEINS
CLIHHGGCHIHAECIPTGPQQVSCSCREGYSGDGIRTCGLLDPCSKNNGGCSPYATCKST
GDGQRTCTCDTAHTVGDGLTCRARVGLELLRDKHASFFSLHLLEYKELKGDGPFTIFVPH
ADLMSNLSQDELARIRAHRQLVFRYHVVGCRRLRSEDLLEQGYATALSGHPLRFSEREGS
IYLNDFARVVSSDHEAVNGILHFIDRVLLPPEALHWEPDDAPIPRRNVTAAAQGFGYKIF
SGLLKVAGLLPLLREASHRPFTMLWPTDAAFRALPPDRQAWLYHEDHRDKLAAVLRGHMI
RNVEALASDLPNLGPLRTMHGTPISFSCSRTRPGELMVGEDDARIVQRHLPFEGGLAYGI
DQLLEPPGLGARCDHFETRPLRLNTCSICGLEPPCPEGSQEQGSPEACWRFYPKFWTSPP
LHSLGLRSVWVHPSLWGRPQGLGRGCHRNCVTTTWKPSCCPGHYGSECQACPGGPSSPCS
DRGVCMDGMSGSGQCLCRSGFAGTACELCAPGAFGPHCQACRCTVHGRCDEGLGGSGSCF
CDEGWTGPRCEVQLELQPVCTPPCAPEAVCRAGNSCECSLGYEGDGRVCTVADLCQDGHG
GCSEHANCSQVGTVVTCTCLPDYEGDGWSCRARNPCTDGHRGGCSEHADCLSTGLNTRRC
ECHAGYVGDGLQCLEESEPPVDRCLGQPPPCHSDAVCTDLHFQEKRAGVFHLQATSGPYG
LNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPAADCGNGRV
GIVSLGARKNLSERWDAYCFRVQDVACRCRDGFVGDGISTCNGKLLDVLAATANFSTFYG
MLLGYANATQRGLDFLDFLDDELTYKTLFVPVNEGFVDNMTLSGPDLELHASNATLLSAN
ASQGKLLPAHSGLSLIISDAGPDNSSWAPVAPGTVVVSHIIVWDIMAFNGIIHALASPLL
APPQLQAVLAPEAPPVAAGVGAVLAAGALLGLVAGALYLRARGKPMGFGFSAFQAEDDAD
NDFSPWQEGTNPTLVSVPNPVFGSDTFCEPFDDSLLEEDFPDTQRILTVK
Identical sequences ENSGGOP00000015582 ENSGGOP00000016831
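
The Comment line above is a run of space-separated key:value tokens, so it can be split into fields mechanically. This is a minimal sketch with a hypothetical function name. It assumes the colon-separated parts of the chromosome token are assembly, sequence name, start, end and strand, following the usual Ensembl FASTA header layout, which this page does not itself spell out.

```python
def parse_comment(comment: str) -> dict[str, str]:
    """Map each 'key:value' token to its value, splitting on the first colon only."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


# Comment line copied from this page.
comment = (
    "pep:known_by_projection chromosome:gorGor3.1:3:53838594:53867994:1 "
    "gene:ENSGGOG00000015946 transcript:ENSGGOT00000028045 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)
fields = parse_comment(comment)

# Assumed order of the location parts: assembly, name, start, end, strand.
assembly, name, start, end, strand = fields["chromosome"].split(":")
print(fields["gene"], assembly, name, int(start), int(end), strand)
```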
