SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSCJAP00000030809 from Callithrix jacchus 69_3.2.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSCJAP00000030809
Domain Number 1 Region: 290-690
Classification Level   Classification   E-value
Superfamily            Ankyrin repeat   1.62e-100
Family                 Ankyrin repeat   0.00000239
 
Domain Number 2 Region: 1036-1364
Classification Level   Classification   E-value
Superfamily            Ankyrin repeat   1.41e-81
Family                 Ankyrin repeat   0.00000757
 
Domain Number 3 Region: 190-338
Classification Level   Classification   E-value
Superfamily            Ankyrin repeat   2.11e-32
Family                 Ankyrin repeat   0.00043
 
Domain Number 4 Region: 1680-1754
Classification Level   Classification                                 E-value
Superfamily            Eukaryotic type KH-domain (KH-domain type I)   0.000000000000146
Family                 Eukaryotic type KH-domain (KH-domain type I)   0.0021
 

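The strong hits above are listed in order of domain number, not in order along the sequence, and two of the reported regions share residues. The following is a minimal Python sketch, not part of SUPERFAMILY itself, that orders the four hits by start position and reports any overlap. The `Hit` type and the `find_overlaps` helper are hypothetical names introduced for illustration, and the sketch assumes the region coordinates are 1-based and inclusive.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    number: int       # "Domain Number" as listed on the page
    start: int        # region start (assumed 1-based, inclusive)
    end: int          # region end (assumed 1-based, inclusive)
    superfamily: str
    evalue: float     # superfamily-level E-value


# Values copied from the strong hits listed above.
HITS = [
    Hit(1, 290, 690, "Ankyrin repeat", 1.62e-100),
    Hit(2, 1036, 1364, "Ankyrin repeat", 1.41e-81),
    Hit(3, 190, 338, "Ankyrin repeat", 2.11e-32),
    Hit(4, 1680, 1754, "Eukaryotic type KH-domain (KH-domain type I)",
        0.000000000000146),
]


def find_overlaps(hits: List[Hit]) -> List[Tuple[int, int, int]]:
    """Return (first domain, second domain, shared residues) for each overlap."""
    ordered = sorted(hits, key=lambda h: h.start)
    found = []
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            if b.start > a.end:
                break  # later hits start even further right
            shared = min(a.end, b.end) - b.start + 1
            found.append((a.number, b.number, shared))
    return found


if __name__ == "__main__":
    for h in sorted(HITS, key=lambda h: h.start):
        print(f"{h.start:>5}-{h.end:<5} domain {h.number}  {h.superfamily}")
    for first, second, shared in find_overlaps(HITS):
        print(f"domains {first} and {second} share {shared} residues")
```

Under the stated coordinate assumption, domain 3 (190-338) and domain 1 (290-690) both cover residues 290-338, while the remaining hits are disjoint.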
Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSCJAP00000030809   Gene: ENSCJAG00000016624   Transcript: ENSCJAT00000032559
Sequence length 2600
Comment pep:known chromosome:C_jacchus3.2.1:2:37442936:37444820:1 gene:ENSCJAG00000016624 transcript:ENSCJAT00000032559 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MLTDGGGGGTSFEEDLDAVAPRSAPAGASEPPPPGGVGLGIRTVRLFGGAGGGDSALDFK
LAAAVLRTGGGGGASGSDEDEVSEVESFILDQEDLDNPVLKTTSEIFLSSTAEGADLRTV
DPETQARLEALLEAAGIGKLSTADGKAFADPEVLRRLTSSVSCALDEAAAALTRMKIENS
HNAGQVDTRSLAEACSDGDVNAVRKLLDEGRSVNEHTEEGESLLCLACSAGYYELAQVLL
AMHANVEDRGNKGDITPLMAASSGGYLDIVKLLLLHDADVNSQSATVGNTALTYACAGGF
VDIVKVLLNEGANIEDHNENGHTPLMEAASAGHVEVARVLLDHGAGINTHSNEFKESALT
LACYKGHLDMVRFLLEAGADQEHKTDEMHTALMEACMDGHVEVARLLLDSGAQVNMPADS
FESPLTLAACGGHVELAALLIERGANLEEVNDEGYTPLMEAAREGHEEMVALLLAQGANI
NAQTEETQETALTLACCGGFSEVADFLIKAGADIELGCSTPLMEASQEGHLELVKYLLAS
GANVHATTATGDTALTYACENGHTDVADVLLQAGADLEHESEGGRTPLMKAARAGHLCTV
QFLISKGANVNRATANNDHTVVSLACAGGHLAVVELLLAHGADPTHRLKDGSTMLIEAAK
GGHTNVVSYLLDYPNNVLSVPTTDVSQLTPPSQDQSQVPRVPMHTLAMVVPPQEPDRTSQ
ENSPALLGVQKGTSKQKPSSLQVADQDLLPSFHPYQPLECIVEETEGKLNELGQRISAIE
KAQLKSLELIQGEPLNKDKIEELKKNREEQVQKKKKILKELQKVERQLQMKTQQQFTKEY
LETKGQKDTVSLHQQCSHRGVFPEGEGDGSLPEDHFSELPQVDTILFKDNDVDDEQQSPP
SAEQIDFVPVQPLSSPQCNFSSDLGSNGTNSLELQKVSGNQQIVGQPQIAITGHDQGLLV
QEPDGLMVATPAQTLTDTLDDLIAAVSTRVPTASNSSSQTTECLTPESCSQTTSNVASQS
MPPVYPSVDIDAHTESNHDTALTLACAGGHEELVSVLIARDAKIEHRDKKGFTPLILAAT
AGHVGVVEILLDKGGDIEAQSERTKDTPLSLACSGGRQEVVDLLLARGANKEHRNVSDYT
PLSLAASGGYVNIIKILLNAGAEINSRTGSKLGISPLMLAAMNGHVPAVKLLLDMGSDIN
AQIETNRNTALTLACFQGRAEVVSLLLDRKANVEHRAKTGLTPLMEAASGGYAEVGRVLL
DKGADVNAPPVPSSRDTALTIAADKGHYKFCELLIHRGAHIDVRNKKGNTPLWLASNGGH
FDVVQLLVQAGADVDAADNRKITPLMSAFRKGHVKVVQYLVKEVNQFPSDIECMRYIATI
TDKELLKKCHQCVETIVKAKDQQAAEANKNASILLKELDLEKKFLRTEFRNYINYIKRKK
RKSQKKKKNKFYKKKQEEDEENKPKENSELPEDEDEENDEDVEQEVPIEPPSATTTTTIG
ISATSATFTNVFGKKRANVVTTPSTNRKNKKNKTKETPPTGHLILPEQHMPLAQQKADKN
KINGEPRGGGTGGNSDSDNLDSTDCNSESSSGGKSQELNFVMDVNSSKYPSLLLHSQEEK
TSTAISKTQTRLEGEVNPNSLSTSYKSVSLPLSSPNIKLNLTSPKRGQKREEGWKEVVRR
SKKLSVPASVVSRIMGRGGCNITAIQDVTGAHIDVDKQKDKNGERMITISRGGTESTRYA
VQLINALIQDPAKELEDLIPKNHIRTPASTKSIHANFSSGVGTTAASSKNAFPLGAPTLV
TSQATTLSTFQPTNKLNKNVPTNVRSSFPVSLPLAYPHPHFALLAAQTMQQIRHPRLPMA
QFGGTFSPSPNTWGPFPVRPVNPGNTNSSPKHNTSRLPNQNGTVLPSESAGLATASCPIT
VSSVVGANQQLCMTNTRTPSSVRKQLFACVPKTSPPATVISSVASTCSSLPSVSSAPVTS
GQAPTTFLCTSTSQAQLSSQKMESFSAVPPTKEKVSIQDQPMANLCTPSSAANSCNNSAN
NTPGAPEIHPSSSPTPPSSNTQEEAQPSNVSDVSPTSMPFASNSEPAPLTLTSPRMVAAD
NQDTSNLPQLAATAPRVSHRMQPRGSFYSVVPNATIHQDPQSIFVTNPVPLTPPQGPPAA
VQLSSAVNIMNGSQMHINPANKSLPPAFGPATLFNHFSSLFDSSQVPANQGWGDGPLSSR
VAADASFTVQSAFLGNSVLGHLENVHPDNSKAPGFRPPSQRVSTSPVGLPSIDPSGSSPS
SSSAPLTSFSGIPGTRVFLQGPAPVGTPSFNRQHFSPHPWTSASNSSTSAPPTLGQPKGG
SASQDRKIPPPIGTERLARIRQGGSVAQAPVGTSFVAPVGHSGIWSFGVNAVSEGLSGWS
QSVMGNHPMHQQLSDPSTFSQHQPMERDDSGMVAPSNIFHQPMASGFVDFSKGLPISMYG
GTIIPSHPQLADVPGGPLFNGLHNPDPAWNPMIKVIQNSTECTDAQQASLLPSVPALKGE
IPSPQLTRPKKRVGRPMVASPNQRHQDHLRPKVPAGVQELTHCPDTPLLPPSDSRSHNSS
NSPTLQAGGAKGAGDRGRDT
Identical sequences F7C441
ENSCJAP00000030824 ENSCJAP00000030809
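The Comment field above packs several values into one line of `key:value` tokens. The following is a minimal Python sketch of splitting it into named fields. The `parse_comment` helper is a hypothetical name, and reading the `chromosome` value as assembly, name, start, end and strand is an assumption based on the usual Ensembl FASTA header layout rather than something this page states.

```python
COMMENT = (
    "pep:known chromosome:C_jacchus3.2.1:2:37442936:37444820:1 "
    "gene:ENSCJAG00000016624 transcript:ENSCJAT00000032559 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment: str) -> dict:
    """Split a comment line into a dict keyed by the text before the first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value

    # Assumed layout of the location value: assembly:name:start:end:strand.
    # Splitting from the right keeps dots and digits in the assembly name intact.
    if "chromosome" in fields:
        assembly, name, start, end, strand = fields["chromosome"].rsplit(":", 4)
        fields["location"] = {
            "assembly": assembly,
            "name": name,
            "start": int(start),
            "end": int(end),
            "strand": int(strand),
        }
    return fields


if __name__ == "__main__":
    info = parse_comment(COMMENT)
    print(info["gene"], info["transcript"], info["location"])
```

Records placed on a scaffold or other coordinate system would carry a different key than `chromosome`, so the location step is skipped when that key is absent.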
