SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for L8I786 from the UniProt 2018_03 genome

Domain assignment details

Strong hits

Sequence:  L8I786
Domain Number 1 Region: 372-480
Classification Level Classification E-value
Superfamily Anthrax protective antigen 7.98e-19
Family Anthrax protective antigen 0.012
 
Domain Number 2 Region: 1835-1915
Classification Level Classification E-value
Superfamily E set domains 2.7e-16
Family E-set domains of sugar-utilizing enzymes 0.01
 
Domain Number 3 Region: 2096-2183
Classification Level Classification E-value
Superfamily E set domains 0.00000000000000118
Family Other IPT/TIG domains 0.077
 
Domain Number 4 Region: 1919-2001
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000014
Family Other IPT/TIG domains 0.016
 
Domain Number 5 Region: 2005-2090
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000603
Family NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain 0.034
 
Domain Number 6 Region: 3329-3532
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.000000000000337
Family Galacturonase 0.062
 
Domain Number 7 Region: 1156-1234
Classification Level Classification E-value
Superfamily E set domains 0.0000000000014
Family E-set domains of sugar-utilizing enzymes 0.012
 
Domain Number 8 Region: 1238-1314
Classification Level Classification E-value
Superfamily E set domains 0.00000000000318
Family Other IPT/TIG domains 0.086
 
Domain Number 9 Region: 1663-1750
Classification Level Classification E-value
Superfamily E set domains 0.00000000000356
Family E-set domains of sugar-utilizing enzymes 0.048
 
Domain Number 10 Region: 1066-1141
Classification Level Classification E-value
Superfamily E set domains 0.0000000000182
Family E-set domains of sugar-utilizing enzymes 0.016
 
Domain Number 11 Region: 271-351
Classification Level Classification E-value
Superfamily E set domains 0.0000000000243
Family E-set domains of sugar-utilizing enzymes 0.038
 
Domain Number 12 Region: 1569-1646
Classification Level Classification E-value
Superfamily E set domains 0.000000000294
Family E-set domains of sugar-utilizing enzymes 0.016
 
Domain Number 13 Region: 143-241
Classification Level Classification E-value
Superfamily E set domains 0.000000000311
Family E-set domains of sugar-utilizing enzymes 0.026
 
Domain Number 14 Region: 1331-1388
Classification Level Classification E-value
Superfamily E set domains 0.00000000338
Family E-set domains of sugar-utilizing enzymes 0.044
 
Domain Number 15 Region: 1755-1830
Classification Level Classification E-value
Superfamily E set domains 0.00000000876
Family E-set domains of sugar-utilizing enzymes 0.03
 
Domain Number 16 Region: 32-125
Classification Level Classification E-value
Superfamily E set domains 0.0000000471
Family E-set domains of sugar-utilizing enzymes 0.048
 
Domain Number 17 Region: 1406-1503
Classification Level Classification E-value
Superfamily Cupredoxins 0.000000857
Family Plastocyanin/azurin-like 0.021
 
Domain Number 18 Region: 2315-2414,2498-2537,2572-2687
Classification Level Classification E-value
Superfamily Pectin lyase-like 0.000045
Family Pectate lyase-like 0.082
 
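The hits above are listed in order of superfamily E-value rather than sequence position, and one region (domain 18) is split into several segments. As a minimal sketch of how such region strings might be handled, the Python below parses them and orders a few domains along the sequence. The helper names `parse_region` and `region_length` are hypothetical and not part of any SUPERFAMILY tool, and the code assumes that region coordinates are 1-based and inclusive at both ends.

```python
from typing import List, Tuple


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '2315-2414,2498-2537' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Count residues covered by a region (assumption: both ends are inclusive)."""
    return sum(end - start + 1 for start, end in parse_region(region))


# A few regions copied from the listing above (domain number -> region string).
regions = {
    1: "372-480",
    16: "32-125",
    18: "2315-2414,2498-2537,2572-2687",
}

# Order the domains by the start of their first segment along the sequence.
by_position = sorted(regions, key=lambda d: parse_region(regions[d])[0][0])

for domain in by_position:
    print(domain, regions[domain], region_length(regions[domain]))
```

Sorting by the first segment start is only one possible convention; a discontinuous domain could equally be ordered by its last segment or its midpoint.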

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) L8I786
Sequence length 4252
Comment (tr|L8I786|L8I786_9CETA) Fibrocystin-L {ECO:0000313|EMBL:ELR51379.1} OX=72004 OS=Bos mutus (wild yak). GN=M91_14805 OC=Pecora; Bovidae; Bovinae; Bos.
Sequence
MGHLWLWATCGLWGLLLSIVEPRADGSKIIPKITEIIPKYGSINGATRLTIKGEGFAQAN
QFDYGVDNAELGNRVQLISSFRSISCDVEKDSSHSTHITCYTRAMPEDSYTVRVSVDGVP
VTENNTCKGHVSSWACIFSAKSFRTPTIRSITPLSGTPGTLITIRGRTFTDVYGSNTALS
SNGKNVRILRVYIGGMPCELLISQSDNLYGLKLDHPNGDMGSMVCKMSGTYIGHHNVSFI
LDSDYGRSLPEKMAYFVSSLNKISMFQTYAEITRISPSQGSTQGGTWLTISGRFFDQTDF
PVRVLVGGQACHILNVTENSICCKTPPKPDILRTIYPGGRGLMLEVWNNSRPARLEEILQ
YSEKTPGYLGAIWVDSASYVWPMEQDTFVARFSGFLVAPESDVYRFYIKGDDRYAIYFSQ
TGLPEDKVRIAYHSSNANNYFSNPTQRSDDIHLQEGKEYYIEILLQEYRLSAFVDVGLYQ
YKSVYTEQQTEDAVNEEQVIKSQSAVVQEVQVITLENWETTHATNEVQKVVVTSPCVEAN
LCSHYQYRLIYNMEKTVWLPADASEFILQSALNDLWSIKPDTVQVIRTEDPRSYVYLITF
ISTRGDFDLLSYEAFEGNNVMLAITEQTKGKPSLDTFTLNWDGITSRPLTPQSSEAEFQA
ALEEMVSSKCPPQIANFEEGFVVKYFRDYETDFNLEHINRGQKTAETDAYCGRYSLKNPA
ILFDSADVKQNRLPYGDILLFPYNQLCLAYKGFLANYIGLKFQYQDRSKITRSTDTQFTF
KFAYGNNWTYTCVDLLDLVQTKYTGTSFSLQRISLQKASESQSFYVDIVYIGQTSTMTTL
DEMSKRRLPALANKGIFLKDFQVNQTKSNGSSITNQYSVTMTSYNCSYNIPMMAVSFGQR
ITNETENESVYRGNNWPGESKIRIQRIQAASPPLRGSFDIHAYGHVLTGIPAAVSAADLQ
FALQSLEEVGQVSVTREGSCAGYSWRVKWRSTCGKQPLLQINDSNIFGENANMTVTKIKE
GGLFRQRILGDLLRTPSKQPQVEVFVNEIPAKCSGNCGFTWEPTTTPQIRAINPSQGSYE
ESTILTISGSGFSPSSSVSVSVGPAGCSLLSLSENEIKCQILNGSAGRFPVAVSIAGTGL
AQNAEEKGFHFIYQSQISHIWPASGSLAGGTLLTVSGFGFHEYSKVLVGNETCNVIEGDL
NKITCRTPKRIEGTVDISVITSGVQATAKNAYSYSCLQTPVITDFSPKVRIILGEVNLMI
KGYNFGNELTQNMEVYVGGKSCQVLHRNFTDIRCLLPKLSPGKHDICVEVRNWGFASTRD
KLSASIQYILEVTNMFPQKGSLYGGTEITIMGFGFSTIPTENTVLLGSFPCNVTSSSQTV
IKCVLHSTGNVFRITNNGEDSVHGLGYAWSPSVLNVSVGDTVTWLWQAHPYLRGIGYSVF
SVSSPGSVIYDGKGFTNGREKSASGSFSYQFTSPGIHYYSSGYVDEAHSIFLQGVINVFP
AETSHIPLHLFVGGTEATYAQVLLFKGKPVSLHLGSSVAGCQAREPLCALNSTRVENSER
LLFELSSCLSPSISHISPSVGTLNELITITGRGFSNLTCANKVNIGSYPCVVEESSNNSI
VCRIDPQNSMDVGIREIVSVTVYNLGTAINTLSSELDRRFVLLPNIDMISPNAGSTTGMT
KVTIKGSGFAVSSAGAQVLMGHSPCKVLSVNYTAIECETSPAPQQLVKVNLLIQGVPAQC
QGNCSFSYLESIAVFVTRIFPNSIQGSEKVLIEGEGFGTVLEDISVFIGNQQFRAIDVNE
NNITVLMTPLPAGLHSLSVVVGTKGLALGNLTVSSPTVASVTPTSGSIGGGTTLMVTGNG
FSPGNTTVTVGDEPCQILSVNSSDIHCHTPAGTAGRVSVKISVNAVAYPPLSFMYALEDT
PLLKGIVPSTGPPETEIRITGSNFGTDILEISVMISDTQCNVTMVSDTVLQCIVGDHAGG
TFPVTMHHKTKGFAMSAVVFKYPLTIDSIHPSQGSFGGGQTMTVTGTGFNPQNSVVLVCG
SKCAVNKLKSNRTTLLCDIPPSNGRGPEQACEVSVVNGNDSSQSTAPFTYNMSMTPFITE
IAPKRGSTAGGTRLTVLGSGFSENVEDVLVTVAEAMCDVEYSNETCVICMTNAHSPSGWA
PVHVSIRSTGLAKRANADFLYVDTWSSNSSWGGKSPPEEGSLVVITKGQTILLDQNTPIL
KMLLIQGGTLIFHEADIELQAENILITDGGILQIGTEAAPFQHRAVITLHGHLRSPELPV
YGAKTLAVREGILDLHGLPVPVIWTRLAHTAKAGERTLILQEAVTWKPGDKIVIASTGHR
HSQRENEKRTIASVSADGRNITLTDPLNYTHLGITVTLSDGTLFEARAEIGILTRNILIR
GSNNVEWNNKIPACPDGFDTGEFATQTCLQGKFGEEIGSDQFGGCIMLHAPIAGANMVTG
RIEYVEIFHAGQAFRLGRYPIHWHLLGDLQFKSYVKGCAIHQTYNRAITIHNTHHLLVER
NIIYDIRGGAFFIEDGIEHGNILQYNLAVFVHQSTSLLNDDVTPAAFWVTNPNNTIRHNA
AAGGTHFGFWYRMNNHPDGPSYDRNICQKRVPLGEFFNNTVHSQGWFGLWIFEEYFPMQT
GSCTSSVPMPAVFHSLTTWNCQKGAEWVNGGALQFHNFVMVNNYEAGIETKRILAPYVGG
WGETNGAMIKNAKIVGHLDELGMGSAFCTTKGLVLPFSEGLTVSSVHFMNFDRPGCAALG
VTSITGVCNHRCGGWSAKFAGVQYSHTPNKAGFRWEHEVVLIDVDGSLTGHKGHTVIPHS
PLLDPSHCTQEAEWSTGFPGSVCDTSVSFHRIAFNKPSPVSLLEKDVVLSDSFGTSIVPF
QKKRLTHMSGWMALIPNAKHINWYFKGVDHITNISYTSTFYGFKEEDYVIISHNFTQNPD
MFNIIDMRNGSSNPLNWNTNKNGDWHLEANTSTLYYLVSGKNDHHQSQSISGTLDPDVKD
VIINFQAYCCVLQDCFPVHPPPRKPIPRKRPAAYNLWSNNSFWQSSQENNYTIPYPGANV
VIPEGTWIVADTDTPPMERLVIWGVLELEDKHNVGAAESSYRKVVLNATYISLQGGRLIG
GWEDNPFKGELQIVLRGNHSTPEWALPEGPNQGSKVLGVFGELDLHGIPHSIYKTKLSET
AAAGSRVLSLMDAVDWQEGEEIVITTTSYDFHQTETRSIVKILHDHKILILNDTLSYTHF
AERYHVPGTSQSYTLAADVGILSRNIKILGEDYPGWLKESFGARVLVSSFTENMVTFKGN
ARISNVEFYHSGQEAFRDSTDPRYAVTFLNLGQIQERGSSYVRGCAFHNGFSPAIGVFGT
DGLDIDDNIIHFTVGEGIRIWGNANRVRGNLVTLSVWPGTYQNRKDLSSTLWHAAIEINR
GTNTVLQNNVVAGFGRAGYRIDGEPCSGQSNSLEKWFDNEVHGGLFGIYMNQDGLPGCSL
IQGFTIWSCWDYGIYFQTTESVHIYNVTLIDNGMAIFSMIYMPSAVSHKISSKTVQIKNS
LIVGSSPEFNCSDVLTNDDPNIELSAAHRSSRPPSGGRSGICWPTFSSAHNMAPRKPHAG
IMSYNSISGLLDISGKFKHNTTFVGFKNVCSGETNVIFITNPLNEDLQHPIHVKNIQLVD
TTEQSKIFIHRPDLSKVNPSDCVDMVCDAKRKSLLRDMDGSFLGNSGSVIPEAEYEWNGN
SQFGIGDYRIPKVMLTFPNGSRIPVTEKAPYKGIIRDSTCKYIPEWQGHRCFGMEYAMMV
IESLDSDTETRRLSPVAIVSNGYVDLINGPQDHGWCAGYTCQRRLSLFHSIVALGKSYEV
YFTGTSPQNLRLMLLNADHDKAVLVGIFFPTRQRLDVYVNNTLVCPQNTVWNSQQKHCEL
TRQLYTEQLLPNLNSTLPGENYFDRTYQMLYLLVKGTIPVEVHTTAVIFVSFQLPAVTED
DFYSSHNLVRNLALFLKIPSDKIRVSKVIRGESLRRKRSMELTVELEIGDPPPQLITNDT
AGQMQLPELQEIAGSLGQAVILGKTNGILGFNVSSMSITDPIPSPSDSGWAKVAAQPVGR
LSFPVHHVALVTSLSVVTQPVATQLGQPFSQQPSVKAEDADGNCVSVGITALALKAVLKD
SNNNQISGLSGNTTIPFSSCWANYTDLTLLRTGKNFKIEFILDEFVRVESQPLGLASQSV
SSPGSSSSGSSNSKASTLGTAAQIVTTVISCLVGRVLLLEVFMTAVLTLNVL
Identical sequences L8I786