SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSGACP00000020789 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGACP00000020789
Domain Number 1 Region: 105-328,372-422
Classification Level   Classification                       E-value
Superfamily            MFS general substrate transporter    3.66e-30
Family                 LacY-like proton/sugar symporter     0.08
 
Domain Number 2 Region: 537-676
Classification Level   Classification                       E-value
Superfamily            MFS general substrate transporter    1.06e-19
Family                 Glycerol-3-phosphate transporter     0.02
 
Domain Number 3 Region: 445-526
Classification Level   Classification                       E-value
Superfamily            Pentapeptide repeat-like             1.44e-09
Family                 Pentapeptide repeats                 0.0095
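
The Region fields above list one or more residue ranges per domain, with a comma separating the segments of a discontinuous domain (Domain 1). The following is a minimal sketch of how such strings could be parsed and how much of the 688-residue sequence the three domains cover. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the function names parse_region and region_length are hypothetical and not part of any SUPERFAMILY tool.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Split a region string such as '105-328,372-422' into (start, end) pairs.

    Assumption: coordinates are 1-based and inclusive.
    """
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Total number of residues spanned by all segments of a region."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Region strings copied from the domain assignment details above.
regions = {1: "105-328,372-422", 2: "537-676", 3: "445-526"}
SEQUENCE_LENGTH = 688  # "Sequence length" reported on this page

# The three regions do not overlap, so their lengths can simply be summed.
covered = sum(region_length(r) for r in regions.values())
coverage = covered / SEQUENCE_LENGTH

for number, region in regions.items():
    print(f"Domain {number}: {parse_region(region)} -> {region_length(region)} residues")
print(f"Covered residues: {covered} of {SEQUENCE_LENGTH} ({coverage:.1%})")
```

Keeping segments as (start, end) pairs preserves the gap in Domain 1 (residues 329-371), which would be lost if the region were collapsed to a single span.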
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000020789   Gene: ENSGACG00000015751   Transcript: ENSGACT00000020829
Sequence length 688
Comment pep:known_by_projection group:BROADS1:groupXIV:826445:840457:1 gene:ENSGACG00000015751 transcript:ENSGACT00000020829 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
KMDETHNNRTALAKGAKDIAKEAKRHAAKNIGKAVDRAANGYSASRSYDRFQYQGVPARG
EGQAAAAAGQPVCDSAKSRNELESERRSDEEELAQQYELIMQECGHGRFQWQLFFVLGLA
LMSDGVEVFVVGFVLPSAETDMCVPNSGAGWLVSVVEVFTGGLSWHPVFGSLSRSTRRKL
SLRSLEPSGGHFPLSVLVPGFASAFLLARRVNACSIGGAVPIVFSYFAEVLAREKRGEHL
SWLCMFWMIGGIYASAMAWAIIPHYGWSFSMGSAYQFHSWRVFVVVCALPCVSAVVALTF
MPESPRFFLETGKHDEAWMVLKHIHDTNMRARGEPERVFTVNRIKVPKQLDELVEMHNES
ANAALKVLLKIRTELRAIWSTFMRCFDYPVRDNSLKLAAVWFTLSFGYYGLSVWFPDVIK
HLQADEYASRVKIHSNERLEDFTFNFTLENQIHRNGVFLNDRFIGMKFKAVTFIDSSFIN
CYFEDVSSLGSFFKNCTFVDSFFYNTDIDDSKLTNSRVINSSFHHNKTGCQMTFDDDYSA
YWVYFVNFLGTLAVLPGNIVSALLMDKIGRLSMLGGSMVLSGISCFFLWFGTSESMMIFM
LCLYNGLSISAWNSLDVVTAELYPTDRRGTGFGFCNAMCKLAAVLGNLIFGSLVGITKAI
PILMASSVLVGGGLVGLRLPDTRANVLM
Identical sequences G3PT53
ENSGACP00000020789 69293.ENSGACP00000020789 ENSGACP00000020789
