SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gb|TGGT1_084220 from Toxoplasma gondii GT1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gb|TGGT1_084220
Domain Number 1 Region: 1136-1258,1295-1367
Classification Level Classification E-value
Superfamily Sec7 domain 1.3e-59
Family Sec7 domain 0.0000125
Further Details:      
 
Domain Number 2 Region: 2159-2250,2300-2365
Classification Level Classification E-value
Superfamily ARM repeat 0.0000842
Family HEAT repeat 0.072
Further Details:      
 
Weak hits

Sequence:  gb|TGGT1_084220
Domain Number - Region: 250-288,384-439,857-954
Classification Level Classification E-value
Superfamily ARM repeat 0.000196
Family Clathrin adaptor core protein 0.077
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) gb|TGGT1_084220
Sequence length 3007
Comment | organism=Toxoplasma_gondii_GT1 | product=protein transport protein sec7, putative | location=TGGT1_chrX:197590-209854(-) | length=3007
Sequence
MESSREEGANGGRRGGSSRFFSPSLPTNMATSSFVASLPAAPQPSSPPFAPSLISPADVY
FVPSPSLANFLRKSLERLDKNAPNIRRSKELREACSAALEALDALLSRRERWTQAAEGRD
AEDAPAATCGVSAAAPGVQKLVDGEKREEDGLEETEKFEADRPSGGEGIREQRDETESAG
KTAEEDQERKEEGKQVDTDEGQEEDEKEGKKGSPRPTSFFPFSLGLGSSSNRGKKSKRES
EKKRALTLSSVDEDERDVELLFDPLRLACESNCPKLQLPALDGIHRLLVSGFISPVLSKA
SLSRESLLAGRQAREERSEGSEENRHPRAPARPLSPRVQVDPSSAVSGFFSSSVEATGSS
AEACPQSRSGSQSEQRPSPSSSAPASASLPLLSRVVVAVCRCSSSADEAVVLQVLRCLLT
TLTSPSLEVHGGTLLTCLRTLFEVFQNPHRSKENQRTAQAALLQTVHTVMQRYEFSACPL
LCEREAEDEEERTRGVHRETANARSDREEERRATMQATVAGSRGEEDSREGQRDERGDSQ
DGGDTRQKEQRAGNVQLSSEHGTTSLAVFLESYTYQLVDQAIQQAVHQTQPARDSDSQSP
LSSSLSSSLSSSLSSSPSSPSSSSSHGVNGAALEALEFASTVNERGVERGRYGWCVGCRQ
AAAHYCVDSGDPVCGKACKREHLRRLQALEKEAQGAVAVKSLVPPSRPLGGPELPASFAQ
LGGAAACSPPQTEPALSPRSLCASPMSVQQRDVLLVLQALCRLASSEDAWPFSFADAETK
RGFGVFSFVPSGALTLGVASGDRRPEEKRRRATTRLALELTFNMLHASGECLRGSKLFLA
FVKRQLFFALIKSAIVSSLTSVSLRIFLYLVEHHHMHLEQETAFFLSEVLLRLVASPNLP
VEQRETVLAALREFLALVPPPFILSLFVNFDCSVHEKDVALPLLQTLCDLAADSGKADAS
TASSAQKTLRAEALRGLEVLLARLLAWLDKLEKKQKAETRRRLRRQREGNGLRLQSPALS
RRSDLRGDRETLTAKAETGFGHVSSDSSLPLSSSLSDEDSPFSSPRSLSSPRQLSINNVD
SSSSSAASSSSAASSSSAASSSSAASSSSASAGSPDGSGLLRTSALAGRVSSRLDEVVKQ
RERKDHIRQAVALFNRSPKKGLAQLEAQQLLEMQPKSVARFFLSQDGLSKTRIGEFLGED
APFNKKVLHALVDALDFRGKEIDAALKSFLQLFRLPGEAQKIDRMMEKFAEKFFLDNNAP
TPPAALQKLCGPAANLSARASVANARAREAVAEQNARLYASADCCYVLAFSLIMLHTDAH
SPEIKEEQRMTKAAFVRNNRGINNGRDVETSYLEALYDRIVQEEWRLEDDDVAVCLRGKK
SQKKSEKGDENKARAERACPPEGLGAGSASGNGEDSDVVSDDDTFVASPFSGAAASSFFW
SPLQAFQEDDGCRTAGPGASTQNASGASLWASPGFEEGVAIASAAAGEPAVWTAFRDLAQ
QVGTVSSGPTKKLFCPPSLHVAPPIVDPKTFAKKACQTLLARAAARRDSQRSSASRESSG
CGLQSPALSAVELMASTPYLLQLASWHLLRCFAQVLGRDPKEEKEESEATLVSAVNAFNS
ATRLCMRLRLGVQRNAFVAALSALTYLHCATTRTFRGKNLALIRLLLALGLECGEDLQEA
WLPLLHAASQIDFLHVVAHDLLQRAREKQMAHASLQAAGSPPGPPETTVCSGQQKATGPS
LAPAAASLAALPPGQKPSDAPCLQATLRRSGALPPASRPGEEEFEKVSEPSEPREDSNEG
EGELPDRRESERRKGVEREPRGEVEQSEGDLGEREESNHLEDVTGDRQEVQISPGERNAT
SVGSLESDGVSTRNLSLPHGPVLQTCPVDPSPPALPFLLASSSASDTVPVSSFPASSPAS
AMKQDAFFVAANPHAAFLTVGLLGVGDRAGVLRPLWQDDASGGPVKERGEGVSFVSLLAS
APSGSSHAHTSYTNLSSFAPFGSGAPPPLGLWLAPLREGAPQFVPFASLSGGAYSETLFQ
NALLVWREVAASVLDLLFTQSRALSSSAVIFFVLALSLVSSHELRPPEATGGGVGSQSLS
VPPSLTGRGRKPEQVSGTALEVSPRFFSLQKLVEVAHFNMDRLRFVWTRMWTILRSHFAG
ACLHPSLAVRLYAIDSLRQLTTKFLEKDELAQFTFQAEFLKLFLTVMTHPDTEDEVKEFL
MHILFNLIRTQASNIRSGWKTVLQTLHAAASEASVSLQHMSSSRLKAARLSHGSSPPSRE
GRRASREEGEGREELSKVLGPWKRLRLSFEVVEQILAHSLGMLTGDSLDEAVRCLLLFAS
NPVDESMAIRAIRYLELVVLCLIEGTEAARFSGASLLASLLNEETRDTVVLSSLLHLHTL
LREGSREERRLSRRSFREETKEKKKQRQGVIEPAPAKDGGRLQTDACDQESKGEEAGKVS
EEMREKDSKRGLAALSNNTATFFLCLPVLNALAHLACSPLPSTFSSSSSSSSSSERRDLG
REGKSMNAALDALFRLLLTYGASFNPVSAFCTNGSGVGSFSASSGDNETHWRALERGGRE
EEARGNAAGEQKSSRERQQETRLLIRSQSDTDSQEIIKKAGERQDNFEEEKTNVGVAEGV
VSHPPNELFAEERRMLTAEGRDVRWKMIFQGLLCPLFDDLFLLLRLNLLPDQRALPQQSL
VLPASQRASASEREESGERRKSHRRRSPAEFVLERREVPSPDEISRRVRPVCVDSQQQDE
AASDLSVASPDKVGSEQSGAVARASFSSASSSLSTFSLKRSERQLLARERDAEGRASETG
SGVPAENERKREKANAEEGDWGRGKERGGTLHWAEMSCCSALRQLVVLVDRHLGDLQTHL
KNFLSLIFAAIDEDSAVERLARLGVDAFRNFLVALGRRTTRRGRRETEGQRPSERRPVES
APGDCGVTESRGELRSEKPNEKQRENVANKSTQEGEKKDDLADDDVDLDACWRAVAEAAL
VSETRIL
Download sequence
Identical sequences gb|TGGT1_084220

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]