SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for G3WGQ3 from UniProt 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  G3WGQ3
Domain Number 1 Region: 349-489
Classification Level  Classification              E-value
Superfamily           C-type lectin-like          1.52e-42
Family                C-type lectin domain        0.00046
 
Domain Number 2 Region: 928-1078
Classification Level  Classification              E-value
Superfamily           C-type lectin-like          7.52e-39
Family                C-type lectin domain        0.00025
 
Domain Number 3 Region: 1215-1354
Classification Level  Classification              E-value
Superfamily           C-type lectin-like          5.6e-38
Family                C-type lectin domain        0.00021
 
Domain Number 4 Region: 632-777
Classification Level  Classification              E-value
Superfamily           C-type lectin-like          1.1e-37
Family                C-type lectin domain        0.00000345
 
Domain Number 5 Region: 785-921
Classification Level  Classification              E-value
Superfamily           C-type lectin-like          8.93e-36
Family                C-type lectin domain        0.00033
 
Domain Number 6 Region: 495-636
Classification Level  Classification              E-value
Superfamily           C-type lectin-like          2.53e-32
Family                C-type lectin domain        0.00097
 
Domain Number 7 Region: 21-152
Classification Level  Classification              E-value
Superfamily           Ricin B-like lectins        9.07e-32
Family                Cysteine rich domain        0.00000146
 
Domain Number 8 Region: 1083-1210
Classification Level  Classification              E-value
Superfamily           C-type lectin-like          4.22e-31
Family                C-type lectin domain        0.00079
 
Domain Number 9 Region: 229-342
Classification Level  Classification              E-value
Superfamily           C-type lectin-like          1.69e-28
Family                C-type lectin domain        0.00071
 
Domain Number 10 Region: 152-210
Classification Level  Classification              E-value
Superfamily           Kringle-like                1.31e-19
Family                Fibronectin type II module  0.00071
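
The strong hits above appear in order of increasing superfamily E-value, not in order along the sequence. As a minimal sketch (not part of the SUPERFAMILY server), the regions can be sorted by start position to read the architecture from N- to C-terminus. The `Domain` type and the helper names `architecture` and `adjacent_overlaps` are hypothetical, and the coordinates are copied by hand from the assignments listed above.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    number: int       # "Domain Number" as listed on the page
    start: int        # first residue of the assigned region
    end: int          # last residue of the assigned region
    superfamily: str  # superfamily-level classification


# Regions transcribed from the strong hits for G3WGQ3.
DOMAINS = [
    Domain(1, 349, 489, "C-type lectin-like"),
    Domain(2, 928, 1078, "C-type lectin-like"),
    Domain(3, 1215, 1354, "C-type lectin-like"),
    Domain(4, 632, 777, "C-type lectin-like"),
    Domain(5, 785, 921, "C-type lectin-like"),
    Domain(6, 495, 636, "C-type lectin-like"),
    Domain(7, 21, 152, "Ricin B-like lectins"),
    Domain(8, 1083, 1210, "C-type lectin-like"),
    Domain(9, 229, 342, "C-type lectin-like"),
    Domain(10, 152, 210, "Kringle-like"),
]


def architecture(domains):
    """Return the domains ordered by start position (N- to C-terminus)."""
    return sorted(domains, key=lambda d: d.start)


def adjacent_overlaps(ordered):
    """Return (number, number) pairs of neighbouring regions that share residues."""
    return [
        (a.number, b.number)
        for a, b in zip(ordered, ordered[1:])
        if b.start <= a.end
    ]


ordered = architecture(DOMAINS)
for d in ordered:
    print(f"{d.start:>5}-{d.end:<5} {d.superfamily} (domain {d.number})")
print("Overlapping neighbours:", adjacent_overlaps(ordered))
```

Read this way, the assignments give one Ricin B-like lectin region, one Kringle-like region, and then eight C-type lectin-like regions in sequence order.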
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) G3WGQ3
Sequence length 1449
Comment (tr|G3WGQ3|G3WGQ3_SARHA) Mannose receptor C-type 1 {ECO:0000313|Ensembl:ENSSHAP00000014608} KW=Complete proteome; Reference proteome OX=9305 OS=Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). GN=MRC1 OC=Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
Sequence
IRLPIIIIIISCPILNLLRDTKQFLIYNDDHKRCVNVISSSAVQTAVCNPESESQKFRWV
SNTQLMSISLKLCLGVSSKTDWVQVTLFPCDSKSDLQKWECKNDTLFGILGEDLYFNYGN
KQEKNIMLYKGSGLWSRWKIHGTSDDLCSRGYEEIYTLLGNSNGAPCVFPFKFENKWYAD
CTSAGRSDGWLWCGTTTDYEVTRLFGFCPRKFAGSESLWTKDPLTGISYQINSKSALTWH
QARKSCQQQQSELLSITEVSEQTYLTGLTNSLTTGLWIGLNSLSFNSGWQWAGYSPFRYL
NWLPGSPALEPGKSCVLLNPAKNAKWENVECSQKLGYICKKGNSSLNQFIIPSESDVPIS
CPSQWWPYAGHCYKISREPKIQKDAKTTCRKEGGDLASIHSIEEFDFIFSQLGYEDTDTL
WIGLNDLKHQMFFEWSDDTPVTFTKWLPGEPSHANNRQEDCVVMKGKNGYWADHPCEGHH
GYVCKSKPLSTPPVVEVDTGCKKGWKRHGPYCYMIGQTLSTFEVANHTCINQNAYLVTVE
DRYEQAFLTSLVGLRPEKYFWIGLSDVQNKGTFKWTIDENVQFSHWNSLMPGRKTGCVAM
KTGIAGGLWDVLKCEEKAKFVCKHWAEGVTPPPIPTTTPAPKCPDGWISSSKTNLCFKVF
GKSKYSKKTWFDARDFCRAIGGDLARITNKEDQSTIWQNIGTSNYHDVFWLGLTIANTDE
GFTWSDGSPVTYENWSYGEPNNYGNIEFCGELKADPSMRWNDINCELLRKWICHIKKGDE
LKPEPTPSPESNPPVTEDGWVIYKDYQYYFSKEKSTMENARAFCKKNFGDIVTISGESEK
KFLWKYISALDLEATYFIGLLISLDKKFSWMDGSKVDYVAWAPGEPNFANDDENCVVLYT
SSGLWNDINCGFPNAFICQRHNSSINTTASPTPLPPLGGCRWGWKLFQNKCYKIFGAKEE
ERKNWQAARKDCQGYGGNLVSIHSRKEQAFLTTELLESTYDSWIGMNDINSENKFLWTDG
RGVQYTNWAKGFPAGRRSFLSYEDVDCVVIVGGPSLEAGKWIDEICENEKGYICQTDSDP
AQPHPSTPTSGNFIRFGDSSYSVVTSKMKWKDAKEYCEGSSSQIASILDPYVNSFVWLEM
QKHNEPMWIGLNSNLTQGEYAWIDRWRMRYTNWGPGEPQVKSGCVYMDLEGFWKTAPCNE
SYQFLCKKSDVSPATEPPQLPGRCPDSEQSSWIPFSGHCYYIESSSTRSWGQALLECSRM
GASLVSIESAVESTFLTYKVEPLKSKTNFWIGMFKNVEGNWLWIDNTAVSFVNWKTGEPS
NDRNEDCVELYSSSGFWNNLYCSSYKGYICKREKIIDAKATEAIKPNDGPETRKAPKSHN
SAGIIVVVVLLILTGVGAAAYFFYKKRQVQIPQEGNFDNTLYFNTGSISGVSDTKDLMGN
IEQNEHALI
Identical sequences G3WGQ3
ENSSHAP00000014608
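
The Comment field above packs several annotations into one line as two-letter `KEY=value` tokens. The following is a rough sketch of how those tokens might be split out, assuming the keys follow the UniProt flat-file line codes (KW keywords, OX taxonomy identifier, OS organism, GN gene name, OC lineage). The regular expression and the `parse_comment` helper are illustrative assumptions, not a format guaranteed by the server.

```python
import re

# Comment line copied from the record above.
COMMENT = (
    "(tr|G3WGQ3|G3WGQ3_SARHA) Mannose receptor C-type 1 "
    "{ECO:0000313|Ensembl:ENSSHAP00000014608} "
    "KW=Complete proteome; Reference proteome OX=9305 "
    "OS=Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). "
    "GN=MRC1 OC=Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus."
)

# A two-letter uppercase key, '=', then the shortest value that runs up to
# the next " XX=" token or the end of the line.
FIELD = re.compile(r"\b([A-Z]{2})=(.*?)(?=\s+[A-Z]{2}=|$)")


def parse_comment(comment):
    """Split KEY=value tokens of a comment line into a dict (hypothetical helper)."""
    return dict(FIELD.findall(comment))


fields = parse_comment(COMMENT)
for key, value in fields.items():
    print(f"{key}: {value}")
```

A value that itself contained a space followed by two capitals and `=` would be split incorrectly, so this is only suitable for quick inspection of records shaped like this one.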
