SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for CAOG_03066T0 from Capsaspora owczarzaki ATCC 30864

Domain assignment details

Strong hits

Sequence: CAOG_03066T0

Domain Number 1, Region: 1862-2019
Classification Level    Classification        E-value
Superfamily             Apolipoprotein A-I    0.0000157
Family                  Apolipoprotein A-I    0.031

Weak hits

Sequence: CAOG_03066T0

Domain Number -, Region: 661-705
Classification Level    Classification        E-value
Superfamily             EGF/Laminin           0.000138
Family                  Laminin-type module   0.01

Domain Number -, Region: 1460-1513
Classification Level    Classification        E-value
Superfamily             EGF/Laminin           0.00167
Family                  Laminin-type module   0.017

Domain Number -, Region: 895-946
Classification Level    Classification        E-value
Superfamily             EGF/Laminin           0.00247
Family                  Laminin-type module   0.029

Domain Number -, Region: 1294-1341
Classification Level    Classification        E-value
Superfamily             EGF/Laminin           0.00586
Family                  Laminin-type module   0.01

Domain Number -, Region: 1025-1077
Classification Level    Classification        E-value
Superfamily             EGF/Laminin           0.0159
Family                  Laminin-type module   0.011

Domain Number -, Region: 2077-2199
Classification Level    Classification        E-value
Superfamily             Apolipoprotein A-I    0.017
Family                  Apolipoprotein A-I    0.033

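The hits listed above can be loaded into a small data structure for downstream work, for example to order them along the sequence or to separate strong from weak assignments. The Python sketch below is illustrative only. The Hit type, the helper names and the 1e-4 strong/weak cutoff are assumptions introduced here; the cutoff is merely consistent with the E-values on this page and is not stated by it. The regions and superfamily-level E-values are copied from the listing.

```python
from itertools import combinations
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One superfamily-level domain hit (hypothetical helper type)."""
    superfamily: str
    start: int      # first residue of the region, 1-based
    end: int        # last residue of the region, inclusive
    evalue: float   # superfamily-level E-value


# Values copied from the strong and weak hit listings for CAOG_03066T0.
HITS: List[Hit] = [
    Hit("Apolipoprotein A-I", 1862, 2019, 0.0000157),
    Hit("EGF/Laminin", 661, 705, 0.000138),
    Hit("EGF/Laminin", 1460, 1513, 0.00167),
    Hit("EGF/Laminin", 895, 946, 0.00247),
    Hit("EGF/Laminin", 1294, 1341, 0.00586),
    Hit("EGF/Laminin", 1025, 1077, 0.0159),
    Hit("Apolipoprotein A-I", 2077, 2199, 0.017),
]

# Assumed cutoff: not given on this page, only consistent with its values.
STRONG_CUTOFF = 1e-4


def is_strong(hit: Hit, cutoff: float = STRONG_CUTOFF) -> bool:
    """Return True if the hit's E-value is below the assumed cutoff."""
    return hit.evalue < cutoff


def by_position(hits: List[Hit]) -> List[Hit]:
    """Return the hits ordered by start position along the sequence."""
    return sorted(hits, key=lambda h: h.start)


def any_overlap(hits: List[Hit]) -> bool:
    """Return True if any two regions share at least one residue."""
    return any(a.start <= b.end and b.start <= a.end
               for a, b in combinations(hits, 2))


if __name__ == "__main__":
    for h in by_position(HITS):
        label = "strong" if is_strong(h) else "weak"
        print(f"{h.start:>5}-{h.end:<5} {h.superfamily:<20} {h.evalue:<10g} {label}")
```

Ordering by position makes the layout easier to read than the E-value ordering of the page: the EGF/Laminin regions all fall between residues 661 and 1513, and both Apolipoprotein A-I regions lie after residue 1862.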
Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): CAOG_03066T0
Sequence length: 2395
Comment: | CAOG_03066 | Capsaspora owczarzaki ATCC 30864 conserved hypothetical protein (2396 aa)
Sequence:
MRSYSLFALLALLSLAASSLADVTFAPIDFGPVDANGVQIISNIPEGSPYNLQFSLVGGT
IACCGYAIQRRNLDSTNPADWAAFIDDRIVISAGSDSFGYTATVTITAVSSDTTYQYRIA
ATDGATSYSDAFSLEVTTFGVIGKALQLMKGPVGFVAYWTPARVGPNNLVGVYDYPSYTR
KLFTTAVASGGSYIKLYLTYSNPGVSTAIVAQDKLGHNSFWYVYDQFVEGSFTGVLFPSA
SVDSFRNAMISWLAPLLKAAGNSAHDYKVEIARMGTANSAAARTFLFYARDYHTGQLASS
TVTLAALTAGGVSVGSAFTVPGYGQVATNTVSLTALTYSVIRSASTTNSLNGGSSYLGFM
ASGVVGRSATGYTTAQPQLWMANLNFVQAEPATSDQLIFEAVTDVVVSENVATPFRSRQP
YRLRSNTGLYLVVNNDYTTTNGFATLSTTSTASAATAFVIGDNTDARYFLTDAAQDVVGT
DDGNLNVWAAPPCVELAVANCRNVEDAPQYRRLELDLGGTFSVDKISMWFDFYSVSKPQT
TTPFVSNSWPSTSRFPSNITSATGQNIELWLKSETGDYVDFSAALDGAALTNCVNRRTPI
AGLTCQLTFTIAAYGQASLGLVGNVGYRHVRIVLDGLPTTSESQIFYRTHAIRDIQVAGG
CNCNGHSYNCDLLTGDCSPAGTAGCQHDTAGVSCDTCAPGFFVNYGNKPITVPPNTEYTC
TSDVSTVVGSSGNSFQVTNCDGPSLPDLGYVGLAEIPDRSEAFDVYFYDGGPCVGCDCYL
HDNYTYGVPGDRGCEQVTGECHCHPSTLTEGHNCELCEDGACRTSTDLYQTCELCVCNQH
AVDTTSTGADSCAKDTCGCNCKPEDHVTGLHCEDCETGYWSVSGSGLLGDVCFACECNGH
KPSGDGVTEVCDVTGGNCDMSGDPCNGFTTGEHCEKCLPGYYRPYISGTLDVDLTDDCIP
CACANLADPLTWGDGTTAICDPNGGVCNCRSSANVVGDHCEECIDGFFSPNSGPGYTDTV
GCSACLCNDHKGVTNYCNRQGGECNEGPTGAPDPCQGNTEGSSCELCKATFFRPYVSGTS
VDLTDDCQSCDCNGHSIAPIAGVDTCNAEGGYCTCDGVSNSEGNHCEFCIENFWNEFGNS
EDGRICAPCACHGHSTTCSNFGVCSSCSGNTENGAGDSCDECVSGYRRNTQINPADAGNP
TVIEDARFDTCTLCTCNGHSTTCDSYTGSCTNCDGNTENDAWPNAYCNRCDPTFYRQITG
FVPTDDVCNVCGLTGGSCDVHTLADDCLSCADQCFGPNKECVETTGECTNCAASNAIGRR
CERCAPGFYGDPTQGIPCVACKGACYHTVPQIWTDTTLLGPILSCETANYPEVRQCVLDY
PLSGDVISAAIPQSDAVMCNCPEAPAGLHATLDTIPPYLNRDCSACNPSGFFGQPTVCLT
PGLGGSALGDCDAFHTCIECGCNGNINEALAANNCAYDDAKPSDLSCSNCEGRTTGDSCE
MCESGSYGSATRTDVSGVYPGFPVDLSGNNLKCFQCDCNGRSLAASTVDILSDTCTTTSP
TTYSCECKEPYTGATCRECKRGYSPIYDTTDGIDSAPGYVWCKPCPDCILALYSGGIVPL
EDDGALLNANFNSLNTSVANLWIEVNNINAALPYRVAALPTLRRDADSEAAVANLMEEAK
AALQESIAANQNKLADAESKLNELTANVNVAIASINDKLSQADIDTKLTPAAVQAAELSA
QLATASDKVAALQRNMRDYEARSTASIIEHLTRLVAAEKAAETAAADVVEFKAATLARLD
ALSPLSASLATQVASVADSFTALSASIHAAMEDKSRIVDDKITAVAALSTLVGDSLASVN
AFQADAEALNARFDSVSATVQSLVEASSALRANVNARLAASREEIVAATERIAAAQADVT
AQIAVAKSQLGDGIETMSAQFAARLTTLRQSFEDAVAAEKSAHASTATEIKSFVSSTTEE
ARAQLADHSTKLDDFERQLKGIQVAMDENVDELATKLSATVARLSNVVSSAAAGQFVLPD
EFHGFINRASYEMSTLQRTIKSRASQMEGYLGSNLLALTAKLNQIRAQAEVDNESVRQNA
QVRMDAALAELRNVISQIRTEQSSAIAAAEARLAAAVATAEQSVRDGVAAARREVAADAA
AVKTWTEDRIAHIQAEVDAKSAQISESAKTAADRAKQVATSTISTASQEIARQRELVLAP
LRTELASTLKSVAEVSASLSSLDQASKRMANKVQTVEADTALARAQLKNLQNNLKAFLSI
PSSINIAEDFSKTVSPFTGDASFVAKHLHPGCRNGSNQPCPADQADAAVAAAAPAGASSS
SSSSSSSDNNGLVLKVALPATAFVAFVALVVAGVFYRKYRAASANNGYHRVSTHI
Identical sequences: CAOG_03066T0 XP_004363905.1.32957
