SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for G3WGQ2 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  G3WGQ2
Domain Number 1 Region: 350-490
Classification Level Classification E-value
Superfamily C-type lectin-like 1.52e-42
Family C-type lectin domain 0.00046
Further Details:      
 
Domain Number 2 Region: 929-1079
Classification Level Classification E-value
Superfamily C-type lectin-like 7.52e-39
Family C-type lectin domain 0.00025
Further Details:      
 
Domain Number 3 Region: 1216-1355
Classification Level Classification E-value
Superfamily C-type lectin-like 5.6e-38
Family C-type lectin domain 0.00021
Further Details:      
 
Domain Number 4 Region: 633-778
Classification Level Classification E-value
Superfamily C-type lectin-like 1.1e-37
Family C-type lectin domain 0.00000345
Further Details:      
 
Domain Number 5 Region: 786-922
Classification Level Classification E-value
Superfamily C-type lectin-like 8.93e-36
Family C-type lectin domain 0.00033
Further Details:      
 
Domain Number 6 Region: 496-637
Classification Level Classification E-value
Superfamily C-type lectin-like 2.53e-32
Family C-type lectin domain 0.00097
Further Details:      
 
Domain Number 7 Region: 22-153
Classification Level Classification E-value
Superfamily Ricin B-like lectins 9.07e-32
Family Cysteine rich domain 0.00000146
Further Details:      
 
Domain Number 8 Region: 1084-1211
Classification Level Classification E-value
Superfamily C-type lectin-like 4.22e-31
Family C-type lectin domain 0.00079
Further Details:      
 
Domain Number 9 Region: 230-343
Classification Level Classification E-value
Superfamily C-type lectin-like 1.69e-28
Family C-type lectin domain 0.00071
Further Details:      
 
Domain Number 10 Region: 153-211
Classification Level Classification E-value
Superfamily Kringle-like 1.31e-19
Family Fibronectin type II module 0.00071
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) G3WGQ2
Sequence length 1450
Comment (tr|G3WGQ2|G3WGQ2_SARHA) Mannose receptor C-type 1 {ECO:0000313|Ensembl:ENSSHAP00000014607} KW=Complete proteome; Reference proteome OX=9305 OS=Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). GN=MRC1 OC=Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
Sequence
MKVSLFLVFLSLTCGASQLLDTKQFLIYNDDHKRCVNVISSSAVQTAVCNPESESQKFRW
VSNTQLMSISLKLCLGVSSKTDWVQVTLFPCDSKSDLQKWECKNDTLFGILGEDLYFNYG
NKQEKNIMLYKGSGLWSRWKIHGTSDDLCSRGYEEIYTLLGNSNGAPCVFPFKFENKWYA
DCTSAGRSDGWLWCGTTTDYEVTRLFGFCPRKFAGSESLWTKDPLTGISYQINSKSALTW
HQARKSCQQQQSELLSITEVSEQTYLTGLTNSLTTGLWIGLNSLSFNSGWQWAGYSPFRY
LNWLPGSPALEPGKSCVLLNPAKNAKWENVECSQKLGYICKKGNSSLNQFIIPSESDVPI
SCPSQWWPYAGHCYKISREPKIQKDAKTTCRKEGGDLASIHSIEEFDFIFSQLGYEDTDT
LWIGLNDLKHQMFFEWSDDTPVTFTKWLPGEPSHANNRQEDCVVMKGKNGYWADHPCEGH
HGYVCKSKPLSTPPVVEVDTGCKKGWKRHGPYCYMIGQTLSTFEVANHTCINQNAYLVTV
EDRYEQAFLTSLVGLRPEKYFWIGLSDVQNKGTFKWTIDENVQFSHWNSLMPGRKTGCVA
MKTGIAGGLWDVLKCEEKAKFVCKHWAEGVTPPPIPTTTPAPKCPDGWISSSKTNLCFKV
FGKSKYSKKTWFDARDFCRAIGGDLARITNKEDQSTIWQNIGTSNYHDVFWLGLTIANTD
EGFTWSDGSPVTYENWSYGEPNNYGNIEFCGELKADPSMRWNDINCELLRKWICHIKKGD
ELKPEPTPSPESNPPVTEDGWVIYKDYQYYFSKEKSTMENARAFCKKNFGDIVTISGESE
KKFLWKYISALDLEATYFIGLLISLDKKFSWMDGSKVDYVAWAPGEPNFANDDENCVVLY
TSSGLWNDINCGFPNAFICQRHNSSINTTASPTPLPPLGGCRWGWKLFQNKCYKIFGAKE
EERKNWQAARKDCQGYGGNLVSIHSRKEQAFLTTELLESTYDSWIGMNDINSENKFLWTD
GRGVQYTNWAKGFPAGRRSFLSYEDVDCVVIVGGPSLEAGKWIDEICENEKGYICQTDSD
PAQPHPSTPTSGNFIRFGDSSYSVVTSKMKWKDAKEYCEGSSSQIASILDPYVNSFVWLE
MQKHNEPMWIGLNSNLTQGEYAWIDRWRMRYTNWGPGEPQVKSGCVYMDLEGFWKTAPCN
ESYQFLCKKSDVSPATEPPQLPGRCPDSEQSSWIPFSGHCYYIESSSTRSWGQALLECSR
MGASLVSIESAVESTFLTYKVEPLKSKTNFWIGMFKNVEGNWLWIDNTAVSFVNWKTGEP
SNDRNEDCVELYSSSGFWNNLYCSSYKGYICKREKIIDAKATEAIKPNDGPETRKAPKSH
NSAGIIVVVVLLILTGVGAAAYFFYKKRQVQIPQEGNFDNTLYFNTGSISGVSDTKDLMG
NIEQNEHALI
Download sequence
Identical sequences G3WGQ2
ENSSHAP00000014607 ENSSHAP00000014607 XP_012406716.1.9362

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]