SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOCUP00000021305 from Oryctolagus cuniculus 76_2

Domain assignment details

Strong hits

Sequence: ENSOCUP00000021305

Domain  Region     Superfamily                             Superfamily E-value  Family                 Family E-value
1       1577-1803  Concanavalin A-like lectins/glucanases  7.6e-43              Laminin G-like module  0.0013
2       725-830    Cadherin-like                           2.28e-29             Cadherin               0.00073
3       620-732    Cadherin-like                           8.42e-29             Cadherin               0.00045
4       1036-1148  Cadherin-like                           3.43e-28             Cadherin               0.00071
5       1820-2036  Concanavalin A-like lectins/glucanases  2.18e-26             Laminin G-like module  0.0037
6       403-506    Cadherin-like                           1.57e-25             Cadherin               0.00085
7       508-618    Cadherin-like                           1.07e-24             Cadherin               0.001
8       933-1035   Cadherin-like                           1.33e-24             Cadherin               0.0014
9       1142-1242  Cadherin-like                           1.43e-24             Cadherin               0.00081
10      831-931    Cadherin-like                           6.57e-20             Cadherin               0.00086
11      1516-1556  EGF/Laminin                             0.000000000152       EGF-type module        0.0082
12      1244-1346  Cadherin-like                           0.000000000314       Cadherin               0.01
13      2064-2110  EGF/Laminin                             0.00000012           EGF-type module        0.01
14      2028-2062  EGF/Laminin                             0.000017             EGF-type module        0.024
15      2158-2197  EGF/Laminin                             0.0000544            Laminin-type module    0.0084
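
The strong hits above are listed by superfamily E-value rather than by position in the sequence. As a minimal sketch, the regions can be re-sorted by start coordinate to read the architecture from N- to C-terminus and to flag neighbouring assignments that share residues. The names `strong_hits`, `by_position` and `adjacent_overlaps` are illustrative and not part of SUPERFAMILY; the coordinates are copied from the listing and assumed to be 1-based and inclusive.

```python
# Strong hits transcribed from the listing above:
# (domain number, start, end, superfamily).
# Assumption: coordinates are 1-based and inclusive.
strong_hits = [
    (1, 1577, 1803, "Concanavalin A-like lectins/glucanases"),
    (2, 725, 830, "Cadherin-like"),
    (3, 620, 732, "Cadherin-like"),
    (4, 1036, 1148, "Cadherin-like"),
    (5, 1820, 2036, "Concanavalin A-like lectins/glucanases"),
    (6, 403, 506, "Cadherin-like"),
    (7, 508, 618, "Cadherin-like"),
    (8, 933, 1035, "Cadherin-like"),
    (9, 1142, 1242, "Cadherin-like"),
    (10, 831, 931, "Cadherin-like"),
    (11, 1516, 1556, "EGF/Laminin"),
    (12, 1244, 1346, "Cadherin-like"),
    (13, 2064, 2110, "EGF/Laminin"),
    (14, 2028, 2062, "EGF/Laminin"),
    (15, 2158, 2197, "EGF/Laminin"),
]


def by_position(hits):
    """Return hits ordered by start coordinate (N- to C-terminus)."""
    return sorted(hits, key=lambda h: h[1])


def adjacent_overlaps(hits):
    """List (first domain, next domain, shared residues) for
    positionally adjacent hits whose regions overlap."""
    ordered = by_position(hits)
    found = []
    for prev, cur in zip(ordered, ordered[1:]):
        shared = prev[2] - cur[1] + 1  # inclusive coordinates
        if shared > 0:
            found.append((prev[0], cur[0], shared))
    return found


if __name__ == "__main__":
    for number, start, end, superfamily in by_position(strong_hits):
        print(f"{start:>4}-{end:<4}  domain {number:<2}  {superfamily}")
    print(adjacent_overlaps(strong_hits))
```

Read in this order, the strong hits form a run of nine Cadherin-like domains (403-1346) followed by alternating EGF/Laminin and Concanavalin A-like regions. Only neighbouring pairs are compared, which is enough here because no region spans more than one neighbour.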
 
Weak hits

Sequence: ENSOCUP00000021305

Domain  Region     Superfamily              Superfamily E-value  Family                   Family E-value
-       2208-2260  Hormone receptor domain  0.00129              Hormone receptor domain  0.0061
-       1495-1521  EGF/Laminin              0.00689              EGF-type module          0.071
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor



Protein sequence

Protein: ENSOCUP00000021305
Gene: ENSOCUG00000025952
Transcript: ENSOCUT00000022227
Sequence length: 3393
Comment: pep:known_by_projection chromosome:OryCun2.0:9:16328732:16352684:-1 gene:ENSOCUG00000025952 transcript:ENSOCUT00000022227 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
MLERREAAAASPMLWWGRQKPGPLAKGALARNRCAQGAAARSDRPRGGGPGRRQGPGRGP
ERRGGPKARTGAGGGGGRVGRWRVMARRPPWQGLERPSTPVLLLLLLLSLFPLSREELGG
GGDQGWDPGVDLPRARISGGALALCPKPPGAREDEGPGLGVGEPVSVALRGGKQSAQSGQ
GSSEQPNAELAAEHGIQALGSRARDTGQEPRSLLCWRPEVSSCRRTGPLRRGSLSPEALS
LGVPGPGNSLSLPSDFLIQPGGPKPASFQANSGRSARKRVGTVRCCGELWVTGRRGQGER
AAPSRRERTDPRRECPPWAAESGPGLDSAPRTARTAPLAGSAPRELRTAPEAVPRRMRSR
GLARRRFLPQRPGPRPPGDPAAPEAWRTPPASRARPRRAANRHPQFPQYNYQTLVPENEA
AGTAVLRVVAQDPDAGEAGRLVYSLAALMNSRSLELFSIDPQTGLIRTAAALDRESMERH
YLRVTAQDHGSPRLSATTMVAVTVADRNDHAPVFEQAQYRETLRENVEEGYPILQLRATD
GDAPPNANLRYRFVGPPAARTAAAAAFEIDPRSGLISTSGRVDREHMESYELVVEASDQG
QEPGPRSATVRVHITVLDENDNAPQFSEKRYVAQVREDVRPHTVVLRVTATDRDKDANGL
VHYNIISGNSRGHFAIDSLTGEIQVVAPLDFEAEREYALRIRAQDAGRPPLSNNTGLASI
QVVDINDHAPIFVSTPFQVSVLENAPLGHSVIHIQAVDADHGENARLEYSLTGVAPDTPF
VINSATGWVSVSGPLDRESVEHYFFGVEARDHGSPPLSASASVTVTVLDVNDNRPEFTMK
EYHLRLNEDAAVGTSVVSVTAVDRDANSAISYQITGGNTRNRFAISTQGGVGLVTLALPL
DYKQERYFKLVLTASDRALHDHCYVHINITDANTHRPVFQSAHYSVSMNEDRPVGSTVVV
ISASDDDVGENARITYFLEDNLPQFRINADSGAITLQAPLDYEDQVTYTLAITARDNGIP
QKADTTYVEVMVNDVNDNAPQFVASHYTGLVSEDAPPFTSVLQISATDRDAHANGRVQYT
FQNGEDGDGDFTIEPTSGIVRTVRRLDREAVPVYELTAYAVDRGVPPLRTPVSIQVTVQD
VNDNAPVFPAEEFEVRVKENSIVGSVVAQITAVDPDEGPNAHIMYQIVDGNIPELFQMDI
FSGELTALIDLDYEARQEYVIVVQATSAPLVSRATVHVRLVDQNDNSPVLNNFQILFNNY
VSNRSDTFPSGVIGRIPAYDPDVSDHLFYSFERGNELQLLVVNQTSGELRLSRKLDNNRP
LVASMLVTVTDGLHSVTAQCVLRVIIITEELLANSLTVRLENMWQERFLSPLLGHFLEGV
AAVLATPAEDVFIFNIQNDTDVGGTVLNVSFSALAPRGAGAGAAGPWFSSEELQEQLYVR
RAALAARSLLDVLPFDDNVCLREPCENYMKCVSVLRFDSSAPFLASASTLFRPIQPIAGL
RCRCPPGFTGDFCETELDLCYSNPCRNGGACARREGGYTCVCRPRFTGEDCELDTEAGRC
VPGVCRNGGTCTDAPHGGFRCQCPVGGAFEGPRCEVAARSFPPSSFVMFRGLRQRFHLTL
SLSFATVQPSGLLFYNGRLNEKHDFLALELVAGQVRLTYSTGESNTVVSPTVPGGLSDGQ
WHTVHLRYYNKPRTDALGSAQGPSKDKVAVLSVDDCDVAVALQFGAEIGNYSCAAAGVQT
SSKKSLDLTGPLLLGGVPNLPENFPVSHKDFIGCMRDLHIDGRRVDMATFVANNGTTAGC
QAKLHFCDSSPCKNSGSCSERWGGFSCDCPVGFGGKDCRLTMAHPHHFRGNGSLSWDFGG
DTVVSVPWYLGLVFRTRATQGVLMYMQAGQHSTLLCQLERGLLSVTVARGSGRAAHLLLD
QVTVSDGRWHDLRLELQEEPGGQRGRHVLMVSLDFSLFQDTLAVGSELQGLKVKQLHVGG
LPPSSKEAVPQGLVGCIQGVWLGSTPSGAPALPPPSHRVNVEPGCVVTNSCASAPCPPHA
DCRDLWQTFSCTCRPGYYGPGCVDACLLNPCQNQGLCRRLPGAPHGYTCDCASGYFGHHC
EHRMDQQCPRGWWGSPTCGPCNCDVHKGFDPNCNKTNGQCHCKEFHYRPRGSDLCLPCDC
YPVGSTSRSCAPHSGQCPCRPGALGRQCNSCDSPFAEVTASGCRVLYNACPKSLRSGVWW
PQTKFGALATVPCPRGALGAAVRLCDEDRGWLEPDLFNCTSAAFRELSLLLDGLELNKTA
LDTLEAKKLAQRLREVTGHAGHYFSQDVRVTARLLAYLLAFESHQQGFGLTATQDAHFNE
NLLWAGSALLAPETGDLWAALGQRAPGGSPGSAGLVRHLEEYAATLARNMELTYLNPVGL
VTPNIMLSIDRMEHPGPTRGARRYPRYHSNLFRGQDAWDPHTHVLLPSQPPRPSPSEVLP
TGSSAENSSVGPPPAPPEPEPEPGMSIVILLVYRTLGGLLPAQFQAERRGARLPQNPVMN
SPVVSVAVFHGRSFLSGLLESPISLEFRLLQTANRSKAICVQWDPPGPADQHGMWTARDC
ELVHRNGSHARCRCSRTGTFGILMDASPRERLEGDLELLAVFTHVVVALSVAALVLTAAI
LLSLRSLKSNVRGIHANVAAALGVAELLFLLGIHRTHNQLVCTAVAILLHYFFLSTFAWL
LVQGLHLYRMQVEPRNVDHGAMRFYHALGWGVPAVLLGLAVGLDPEGYGNPDFCWISIHD
PLIWSFAGPVVLVIMMNGTLFLLAAHTSCSTGQREAKKTSVLPLRSSFLLLLLVSTSWLF
GLLAVNHSVLAFHYLHAALSGLQGLAVLLLFCVLNADARAAWTPACLGRKAAPEEARPAP
GTGPGAYNNTALFEESGLIRITLGASTVSSVSSARSGRAQDQDSQRGRGYLRDNVLVRHG
SAADHTDHSLQAHAGPTDLDVAMFHRDAGADSDSDSDLSLEEERSLSIPSSESEDNGRTR
GRFQRPLRRAAQSERLLTHPKDVDGNDLLSYWPALGECEAAPCALQTWGSERRLGLDSSK
DAANNNNQPDLALTSGDETSVGRAQRQRKGILKNRLQYPLVPQTRGATELSWCRAATLGH
RAVPAASYGRIYAGGGTGSLSQPASRYSSREQLDLLLRRQLSRERLEEAPAPVLCPLNRP
GSQERLDAAPGRLEPRDRGSTLPRRQPPRDYPGAVAGRFGSRDALNLGAPREWLSTLPPP
RRPQDLDPQHPPVSPSPQRQLSRDPLLPSRPLDSVSRSSNSRERLDEVPSRHPSREALGP
PLQLLRAREDPASGPSHGPSTEQLDILSSILASFNSSALSSMQSSSTPSGPHTTATPSAT
ASALGPSTPRSATSHSISELSPDSEVPRSEGHS
Identical sequences: G1TW62, XP_008258836.1.1745, ENSOCUP00000021305
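
The Comment line in the protein record above packs several fields into space-separated `key:value` tokens. The sketch below shows one way to split it into a dictionary and to unpack the chromosome token. The function names `parse_comment` and `parse_location` are hypothetical, and the reading of the chromosome token as assembly, region name, start, end and strand is an assumption based on the usual Ensembl FASTA header layout, not something this page states.

```python
# Comment line copied verbatim from the protein record above.
comment = (
    "pep:known_by_projection chromosome:OryCun2.0:9:16328732:16352684:-1 "
    "gene:ENSOCUG00000025952 transcript:ENSOCUT00000022227 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(text):
    """Split space-separated key:value tokens into a dict.
    Only the first colon separates key from value."""
    fields = {}
    for token in text.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Unpack 'assembly:region:start:end:strand'.
    Assumption: this field order follows the Ensembl header convention."""
    assembly, region, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(comment)
    location = parse_location(fields["chromosome"])
    print(fields["gene"], fields["transcript"])
    # Length of the genomic interval, counting both endpoints.
    print(location["end"] - location["start"] + 1, "bp on strand", location["strand"])
```

Splitting on the first colon only keeps the multi-part chromosome value intact, so it can be unpacked in a separate step.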
