SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOARP00000022331 from Ovis aries 76_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSOARP00000022331
Domain Number 1 Region: 3989-4356
Classification Level  Classification                             E-value
Superfamily           RCC1/BLIP-II                               6.02e-98
Family                Regulator of chromosome condensation RCC1  0.0041

Domain Number 2 Region: 367-734
Classification Level  Classification                             E-value
Superfamily           RCC1/BLIP-II                               1.74e-95
Family                Regulator of chromosome condensation RCC1  0.00071

Domain Number 3 Region: 4484-4833
Classification Level  Classification                             E-value
Superfamily           Hect, E3 ligase catalytic domain           4.97e-90
Family                Hect, E3 ligase catalytic domain           0.0000541

Domain Number 4 Region: 2023-2205
Classification Level  Classification                             E-value
Superfamily           Concanavalin A-like lectins/glucanases     8.71e-34
Family                SPRY domain                                0.01

Domain Number 5 Region: 3420-3708
Classification Level  Classification                             E-value
Superfamily           WD40 repeat-like                           5.95e-32
Family                WD40-repeat                                0.008

Domain Number 6 Region: 3685-3813
Classification Level  Classification                             E-value
Superfamily           WD40 repeat-like                           0.0000000000114
Family                WD40-repeat                                0.0066
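
The strong-hit regions above are listed in order of E-value rather than position, so it can help to reorder them along the 4857-residue sequence. The following is a minimal sketch, not part of the SUPERFAMILY output: it assumes the regions are 1-based, inclusive residue coordinates, and the names `STRONG_HITS`, `order_by_position`, `neighbour_overlaps`, and `covered_residues` are hypothetical helpers introduced only for illustration.

```python
# Strong-hit regions copied from the assignment above: (domain number, start, end).
# Assumption: coordinates are 1-based and inclusive.
STRONG_HITS = [
    (1, 3989, 4356),
    (2, 367, 734),
    (3, 4484, 4833),
    (4, 2023, 2205),
    (5, 3420, 3708),
    (6, 3685, 3813),
]
SEQUENCE_LENGTH = 4857


def order_by_position(hits):
    """Return the hits sorted by start coordinate (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[1])


def neighbour_overlaps(hits):
    """Return (domain_a, domain_b, shared_residues) for neighbouring hits that overlap.

    Only hits adjacent in positional order are compared, which is
    sufficient for this record but not a general interval-overlap check.
    """
    ordered = order_by_position(hits)
    found = []
    for (a, _a_start, a_end), (b, b_start, _b_end) in zip(ordered, ordered[1:]):
        shared = a_end - b_start + 1
        if shared > 0:
            found.append((a, b, shared))
    return found


def covered_residues(hits):
    """Count distinct residues that fall inside at least one hit."""
    covered = set()
    for _number, start, end in hits:
        covered.update(range(start, end + 1))
    return len(covered)


if __name__ == "__main__":
    print([number for number, _s, _e in order_by_position(STRONG_HITS)])
    print(neighbour_overlaps(STRONG_HITS))
    print(covered_residues(STRONG_HITS), "of", SEQUENCE_LENGTH)
```

Under the stated coordinate assumption, the positional order is domains 2, 4, 5, 6, 1, 3, and the two WD40 repeat-like regions (domains 5 and 6) share residues 3685-3708.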
 
Weak hits

Sequence:  ENSOARP00000022331
Domain Number - Region: 1744-1886,1917-1988
Classification Level  Classification                             E-value
Superfamily           ARM repeat                                 0.00437
Family                HspBP1 domain                              0.054

Domain Number - Region: 2739-2801
Classification Level  Classification                             E-value
Superfamily           UBA-like                                   0.0134
Family                UBA domain                                 0.011
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor



Protein sequence

External link(s) Protein: ENSOARP00000022331   Gene: ENSOARG00000020775   Transcript: ENSOART00000022636
Sequence length 4857
Comment pep:known_by_projection chromosome:Oar_v3.1:7:43237435:43432873:1 gene:ENSOARG00000020775 transcript:ENSOART00000022636 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MASMVPPVKLKWLEHLNSSWITEDSESIATREGVAVLYSKLVSNKEVVPLPQQVLCLKGP
QLPDFERESLSSDEQDHYLDALLSSQLALAKMVCSDSPFAGALRKRLLVLQRVFYALSNK
YHDKGKVKQQQHSPESSSGSADVHSVSERPRSSTDALIEMGVRTGLSLLFALLRQSWMMP
VSGPGLSLCNDVIHTAIEVVSSLPPLSLANESKIPPMGLDCLSQVTTFLKGVTIPNSGAD
TLGRRLASELLLGLAAQRGSLRYLLEWIEMALGASAVVNSMEKSKLLSSQEGMISFDCFM
TILMQMRRSLGSSADRSQWREPTRTSDGLCSLYEAALCLFEEVCRMASDYSRTCASPDSI
QTGDTPIVSETCEVYVWGSNSSHQLVEGTQEKILQPKLAPSFSDAQTIEAGQYCTFVIST
DGSVRACGKGSYGRLGLGDSNNQSTLKKLTFEPHRSIKKVSSSKGSDGHTLAFTTEGEVF
SWGDGDYGKLGHGNSSTQKYPKLIQGPLQGKVVVCVSAGYRHSAAVTEDGELYTWGEGDF
GRLGHGDSNSRNIPTLVKDISNVGEVSCGSSHTIALSKDGRTVWSFGGGDNGKLGHGDTN
RVYKPKVIEALQGMFIRKVCAGSQSSLALTSTGQVYAWGCGACLGCGSSEATALRPKLIE
ELAATRIVDISIGDSHCLALSHDNEVYAWGNNSMGQCGQGNSTGPITKPKKVSGLDGIAI
QQISAGTSHSLAWTALPRDRQVVAWHRPYCVDLEESTFSHLRSFLERYCDKINSEIPPLP
FPSSREHHNFLKLCLKLLSNHLALALAGGVATSILGRQAGPLRNLLFRLMDSTVPDEIQE
VVIETLSVGATMLLPPLRERMELLHSLLPQGPDRWESLSKGQRMQLDIILTSLQDHTHVA
SLLGYSSPSDAADLSSVCTGYGNLSDQPYGTQSCHPDTHLAEILMKTLLRNLGFYTDQAF
GELEKNSDKFLLGTSSSENSQPAHLHELLCSLQKQLLAFCHINNISENSSSVALLHKHLQ
LLLPHATDIYSRSANLLKESPWNGSVGEKLRDVIYVSAAGSMLCQIVNSLLLLPVSVARP
LLSYLLDLLPPLDCLNRLLPAAALLEDQELQWPLHGGPELIDPAGVPLPQPAQSWVWLVD
LERTIALLIGRCLGGMLQGSPVSPEEQDTAYWMKTPLFSDGVEMDTPQLDKCMSCLLEVA
LSGNEEQKPFDYKLRPEIAVFVDLALGCSKEPARSLWISMQDYAISKDWDSATLSNESLL
DTVSRFVLAALLKHTNLLSQACGESRYQPGKSLSEVYRCVYKVRSRLLACKNLELIQTRS
SSRDRWISENQDSADVDPQEHSFTRTIDEEAEMEEQAERDREEGHPEPEDEEEEREHEVM
TAGKIFQCFLSAREVARSRDRDRMNSGAGSGARADDPPPQAQQERRVSTDLPEGQDVYTA
ACNSVIHRCALLILGVSPVIEELQKRREEGQLQQPSTSTSEGGGLMTRSESLTAESRLVH
ASPNYRLIKSRSESDLSQPESDEEGYALSGRRNVDLDLASSHRKRGPIHSQLESLSDSWT
RLKHNRDWLYSSSYSFESDFDLTKSLGVHTLIENVVSFVSGDVGNAPGFKEPEESMSTSP
QASIIAMEQQQLRAELRLEALHQILVLLSGMEEKGSISLTGSRLSSGFQSSTLLTSVRLQ
FLAGCFGLGTVGHAGAKGESGRLHHYQDGIRAAKRNIQIEIQVAVHKIYQQLSATLERAL
QANKHHIEAQQRLLLVTVFALSVHYQPVDVSLAISTGLLNVLSQLCGTDTMLGQPLQLLP
KTGVSQLSTALKVASTRLLQILAITTGTYADKLSPKVVQSLLDLLCSQLKNLLSQAGVLL
MASFGEGEEDEEGKKIDSSGETEKRDFRAALRKQHAAELHLGDFLVFLRRVVSSKAIQSK
MASPKWTEVLLNIASQKCSSGIPLVGNLRTRLLALHVLEAVLPACESGVEDDQMAQVVER
LFSLLSDCMWETPIAQAKHAIQIKEKEQEIKLQKQGELEEEDENLPIQEVSFDPEKAQCC
IVENGQILTHGSGGKGYGLASTGVTSGCYQWKFYIVKENRGNEGTCVGVSRWPVHDFNHR
TTSDMWLYRAYSGNLYHNGEQTLTLSSFTQGDFITCVLDMEARTISFGKNGEEPKLAFED
VDAAELYPCVMFYSSNPGEKVKICDMQMRGTPRDLLPGDPICSPVAAVLAEATIQLIRIL
HRTDRWTYCINKKMIERLHKIKICIKESGQKLKKSRSVQSREENEMREEKENKEEEKGKH
SRHGLADLSEPQLRTLCIEVWPVLAVIGGVDAGLRVGGRCVHKQTGRHATLLGVVKEGST
SAKVQWDEAEITISFPTFWSPSDTPLYNLEPCEPLPFDVARFRGLTASVLLDLTYLTGIH
EDMGKQSTKRHEKKHRHESEEKGDIEQKIESESALDTRTGLTSDDVKGTTSSKSENEIAS
FSLDSAVPGVESQHQITEGKRKNHEHMSKNHDIAQSEIRAVQLSYLYLGAMKSLSALLGC
SKYAELLLIPKVLAENGHNSDCASSPVVHEDVEMRAALQFLMRHMVKRAVMRSPIKRALG
LADLERAQAMIYKLVVHGLLEDQFGGKIKQEIDQQAEESDQAQQAQTPVTTSPSASSTTS
FMSSSLEDTTTATTPVTDTETVPASESPGVMPLSLLRQMFSSYPTTTVLPTRRAQTPPIS
SLPTSPSDEVGRRQSLTSPDSQSARPTNRTALSDPSSRLSTSPPPPAIAVPLLEMGFSLR
QIAKAMEATGARGEADAQNITVLAMWMIEHPGHEDEEEPQSGSTADSRHGAAVAGSGGKS
NDPCYLQSPGDIPSADAAEMEEGFSESADNLDHAENAASGSGPPARGRSAVTRRHKFDLA
ARTLLARAAGLYRSVQAHRNQSRREGISLQQDPGALYDFNLDEELEIDLDDEAMEAMFGQ
DLTSDNDILGMWIPEVLDWPTWHVCESEDREEVVVCELCECSVVNFNQHMKRNHPGCGRS
ANRQGYRSNGSYVDGWFGGECGSGNPYYLLCGSCREKYLALKTKSKATSSERYKGQAPDL
IGKQDSVYEEDWDMLDVDEDEKLTGEEEFELLAGPLGLNDRRIVPEPVQFPDSDPLGASV
AMVTATNSMEETLMQIGCHGSVEKSSSGRITLGEQAAALANPHDRVVALRRVTAAAQVLL
ARTMVMRALSLLSVSGSSCSLAAGLESLGLTDIRTLVRLMCLAAAGRAGLSTSPSAMASA
SERSRGGHSKANKPISCLAYLSTAVGCLASNTPSAAKLLVQLCTQNLISAATGVNLTTVD
DPIQRKFLPSFLRGIAEENKLVTSPNFVVTQALVALLADKGAKLRPNYDKSEVEKKGPLE
LANALAACCLSSRLSSQHRQWAAQQLVRTLAAHDRDNQTAPQTLADMGGDLRKCSFIKLE
AHQNRVMTCVWCNKKGLLATSGNDGTIRVWNVTKKQYSLQQTCVFNRLEGDAEESLGSPS
DPSFSPVSWSISGKYLAGALEKMVNIWQVNGGKGLVDIQPHWVSALAWPEEGPSTAWSGE
SPELLLVGRMDGSLGLIEVVDVSTMHRRELEHCYRKDVSVTCIAWFSEDRPFAVGYFDGK
LLLGTKEPLEKGGIVLIDAHKDTLISMKWDPTGHILMTCAKEENVKLWGPISGCWRCLHS
LCHPSIVNGIAWCSLPGKGSKLHLLMATGCQSGLVCVWRIPQDTTQTSVTSSEGWWDQES
SCQDGYRKSVGAKCVYQLRGHITPVRTVAFSSDGLALVSGGLGGLMNIWSLRDGSVLQTV
VIGSGAIQTTVWIPDVGVAACSNRSKDVLVVNCTAEWIAANHVLATCRTALKQQGILGLN
MAPCMRAFLERLPMMLQEQYAYEKPHVVCGDQLVHSPYMQCLASLAVGLHLDQLLCNPPV
PPHHQHCLPDPASWNPNEWAWLECFSTTIKAAEALTNGAQFPESFTVPDLEPVPEDELVL
LMDNSKWINGMDEQIMSWATSRPEDWHLGGKCDVYLWGAGRHGQLAEAGRNVMVPAAAPS
FSQAQQVICGQNCTFVIQANGTVLACGEGSYGRLGQGNSDDLHVLTVISALQGFVVTQLV
TSCGSDGHSMALTESGEVFSWGDGDYGKLGHGNSDRQRRPRQIEALQGEEVVQMSCGFKH
SAVVTSDGKLFTFGNGDYGRLGLGNTSNKKLPERVTALEGYQIGQVACGLNHTLAVSADG
SMVWAFGDGDYGKLGLGNSTAKSSPQKVDVLCGIGIKKVACGTQFSVALTKDGHVYTFGQ
DRLIGLPEGRARNHNRPQQIPVLAGVVIEDVAVGAEHTLALASTGDVYAWGSNSEGQLGL
GHTNHVREPTLVTVLQGKNVRQISAGRCHSAAWTAPPVPPRAPGVSVPLQLGLPDTVPPQ
YGALREVSIHTARARLRLLYHFSDLMYSSWRLLNLSPNNQNSTSHYNAGTWGIVQGQLRP
LLAPRVYTLPMVRSIGKTMVQGKNYGPQITVKRISTRGRKCKPIFVQIARQVVKLNASDL
RLPSRAWKVKLVGEGADDAGGVFDDTITEMCQELETGIVDLLIPSPNATAEVGYNRDRFL
FNPSACLDEHLMQFKFLGILMGVAIRTKKPLDLHLAPLVWKQLCCVPLTLEDLEEVDLLY
VQTLNSILHIEDSGITEESFHEMIPLDSFVGQSADGKMVPIIPGGNSIPLTFSNRKEYVE
RAIEYRLHEMDRQQVAAVREGMSWIVPVPLLSLLTAKQLEQMVCGMPEISVEVLKKVVRY
REVDEQHQLVQWFWHTLEEFSNEERVLFMRFVSGRSRLPANTADISQRFQIMKVDRPYDS
LPTSQTCFFQLRLPPYSSQLVMAERLRYAINNCRSIDMDNYMLSRNVDNAEGSDTDY
Identical sequences: W5QHY1, ENSOARP00000022331
