SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for G1KF84 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  G1KF84
Domain Number 1 Region: 61-282
Classification Level  Classification                        E-value
Superfamily           p53-like transcription factors        4.06e-74
Family                STAT DNA-binding domain               0.000000383
 
Domain Number 2 Region: 830-1036
Classification Level  Classification                        E-value
Superfamily           p53-like transcription factors        4.39e-70
Family                STAT DNA-binding domain               0.0000000648
 
Domain Number 3 Region: 654-830
Classification Level  Classification                        E-value
Superfamily           STAT                                  3.92e-53
Family                STAT                                  0.000000535
 
Domain Number 4 Region: 516-635
Classification Level  Classification                        E-value
Superfamily           Transcription factor STAT-4 N-domain  2.35e-45
Family                Transcription factor STAT-4 N-domain  0.0000142
 
Domain Number 5 Region: 283-399
Classification Level  Classification                        E-value
Superfamily           SH2 domain                            5.65e-41
Family                SH2 domain                            0.00000418
 
Domain Number 6 Region: 1037-1177
Classification Level  Classification                        E-value
Superfamily           SH2 domain                            3.37e-39
Family                SH2 domain                            0.000000705
 
Domain Number 7 Region: 2-83
Classification Level  Classification                        E-value
Superfamily           STAT                                  1.16e-29
Family                STAT                                  0.00016
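
The strong hits above are listed in order of increasing superfamily E-value, not in order along the sequence. The following is a minimal Python sketch that re-sorts them by start position to read off the N- to C-terminal domain order and reports where neighbouring regions share residues. The `Domain` type and the helper names `by_position` and `overlaps` are hypothetical, introduced only for illustration, and the sketch assumes the listed regions are 1-based, inclusive residue ranges.

```python
from typing import List, NamedTuple, Tuple


class Domain(NamedTuple):
    """One strong hit, copied from the assignment table (hypothetical type)."""
    number: int
    start: int        # assumed 1-based, inclusive
    end: int          # assumed 1-based, inclusive
    superfamily: str
    evalue: float     # superfamily-level E-value


DOMAINS = [
    Domain(1, 61, 282, "p53-like transcription factors", 4.06e-74),
    Domain(2, 830, 1036, "p53-like transcription factors", 4.39e-70),
    Domain(3, 654, 830, "STAT", 3.92e-53),
    Domain(4, 516, 635, "Transcription factor STAT-4 N-domain", 2.35e-45),
    Domain(5, 283, 399, "SH2 domain", 5.65e-41),
    Domain(6, 1037, 1177, "SH2 domain", 3.37e-39),
    Domain(7, 2, 83, "STAT", 1.16e-29),
]


def by_position(domains: List[Domain]) -> List[Domain]:
    """Return the domains ordered by start residue (N- to C-terminus)."""
    return sorted(domains, key=lambda d: d.start)


def overlaps(domains: List[Domain]) -> List[Tuple[int, int, int]]:
    """Return (domain_a, domain_b, shared_residues) for adjacent overlapping regions."""
    ordered = by_position(domains)
    found = []
    for a, b in zip(ordered, ordered[1:]):
        shared = a.end - b.start + 1
        if shared > 0:
            found.append((a.number, b.number, shared))
    return found


if __name__ == "__main__":
    for d in by_position(DOMAINS):
        print(f"{d.start:>5}-{d.end:<5} {d.superfamily}")
    print(overlaps(DOMAINS))
```

Read in sequence order, the superfamilies run STAT, p53-like transcription factors, SH2 domain, then Transcription factor STAT-4 N-domain, STAT, p53-like transcription factors, SH2 domain. Under the inclusive-range assumption, regions 2-83 and 61-282 share 23 residues, and regions 654-830 and 830-1036 share residue 830.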
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) G1KF84
Sequence length 1217
Comment (tr|G1KF84|G1KF84_ANOCA) Signal transducer and activator of transcription 1 {ECO:0000313|Ensembl:ENSACAP00000006161} KW=Complete proteome; Reference proteome OX=28377 OS=Anolis carolinensis (Green anole) (American chameleon). GN=STAT1 OC=Toxicofera; Iguania; Dactyloidae; Anolis.
Sequence
MEELLDWKRRQQIACIGGPLHSGLDQLQNCFTLLAESLFQIKRQLEKLDELLVKLTYDGD
PIALQRPHLLERVNFLLYSLFQRLLIKLPELNYQIRVKATIDNNRRFVLCGTHVKAMNMD
ESENGSLSVEFRHLGPQMVTEELHSITFETQVCLYGLTIDLETGSLPVVMISNVSQLPNA
WASIIWYNLSTNESQNLAFFNNPPSVGLSQVLEVLSWQFSSYIGRGLNSDQLGMLAEKLT
GQQVIYTEHHISWSKFCKEHLPGKSFTFWAWLEAILELIKKHILPLWIDGCIMGFVSKEK
ERSLLKDKMPGTFLLRFSESNLGGITFTWVSQSENGEVDFHSVEPYNKGRLTALPFADIL
RDYKVITEDNVPENPLKYLYPDIPKDTAFGRHYSSQPSEVLPLKERGAVLKMGAAFHIFL
INKILYTKPQRKYCYLFKKEEGRENPGVAQNPNQNLSQTPSPVDRQILTAEIEKIVYTLF
PLKNRYSTQKDARNCGCSYQYTLSPPPPPGKSVPRMSQWYKLQQLDSKFLEQVHQLYDDS
FPMEIRQYLAQWLEGQDWDHASSDISLATLLFQNLLSQLDDQYSRFTQENNFLLQHNIRK
SKRNLQTTFEEDPMYMALMISKCLYEERKILQAAQSAEQVKVGNVQNTGTLCRVKERDSK
VKGVKDSVTEIEQLIKTLEDAQDEYDFKYKTLQVREGEANGMTQDDCKKETLQLHTMYLQ
VHNMRQDVLRMITAALNLAEHTQNVLIQEELVEWKSRQQIACIGGPPNACLDQLQNWFTT
VAESLQQIHQQLKKLEELEQKFSYETDPIPQQKQGLYDRTLSLFNQLIQRLLVKLQELNY
KLKVQVVFDKDIGEKNPPVKGYVVDTSLLHIIYRIRKFNILGTNTKVMNMEESNGSLSAE
FRHLQLKEQKNVARTNETTSLPIVVISNVSQLPSGWASILWYNMLTNEPKNVFFFLNPPC
ARWSQLSDVLSWQFSSVTKRGLNVDQLNMLGEKLLGGGCNSDDLISWARFCKENINDKNF
PFWLWVEGILELIKKHLLSLWNDGSIVGFISKERERALLKGKANGTFLLRFSESSREGAI
TFTWVEGPAHDYDPDFHSVEPYTRKELTAVSLPDIIRNYKVMAAENIPENPLKFLYPEIP
KDIAFGKYYSRPKDSAEPMDVDGGGKGYIKTELISVSEVHPSKLLTTENLLPLSPEDFSE
VAREAVGGAKDHVTSSV
Identical sequences G1KF84
ENSACAP00000006161
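
To pull the residues for one assigned region out of the protein sequence, a small helper along the following lines may be enough. This is a sketch only: `domain_subsequence` is a hypothetical name, and it assumes the region coordinates are 1-based and inclusive, which this record does not state explicitly. The example string is just the first ten residues of the sequence above.

```python
def domain_subsequence(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of `sequence`, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"region {start}-{end} lies outside 1-{len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]


if __name__ == "__main__":
    first_ten = "MEELLDWKRR"  # first ten residues of G1KF84, for illustration
    print(domain_subsequence(first_ten, 2, 5))
```

With the full 1217-residue sequence joined into one string (line breaks removed), the same call with `start=283, end=399` would return the first SH2 domain region under the stated assumption.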