SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSHAP00000014511 from Sarcophilus harrisii 76_7.0

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSSHAP00000014511
Domain  Region     Superfamily                             Superfamily E-value  Family                 Family E-value
1       1188-1415  Concanavalin A-like lectins/glucanases  1.09e-39             Laminin G-like module  0.0015
2       228-340    Cadherin-like                           7.71e-29             Cadherin               0.00056
3       333-438    Cadherin-like                           7.98e-29             Cadherin               0.0012
4       644-756    Cadherin-like                           9.85e-28             Cadherin               0.00067
5       120-226    Cadherin-like                           5.85e-26             Cadherin               0.0015
6       744-861    Cadherin-like                           1.71e-25             Cadherin               0.00099
7       1431-1645  Concanavalin A-like lectins/glucanases  1.8e-25              Laminin G-like module  0.0065
8       15-118     Cadherin-like                           2.43e-24             Cadherin               0.0018
9       541-643    Cadherin-like                           2.43e-24             Cadherin               0.002
10      439-539    Cadherin-like                           1.71e-21             Cadherin               0.00065
11      1127-1166  EGF/Laminin                             0.000000000231       EGF-type module        0.0093
12      852-954    Cadherin-like                           0.000000000414       Cadherin               0.0099
13      1673-1719  EGF/Laminin                             0.00000666           EGF-type module        0.0097
14      1638-1671  EGF/Laminin                             0.00000737           EGF-type module        0.017
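
The strong hits above are listed by superfamily E-value, not by position along the protein. As a minimal sketch (not part of the SUPERFAMILY server or its API), the Python below re-orders the fourteen regions by start coordinate to read off the N-to-C-terminal domain order and reports where adjacent regions share residues. The names `DOMAINS`, `order_by_position`, and `adjacent_overlaps` are hypothetical; the coordinates are copied from the assignments listed above.

```python
# Hypothetical illustration: (domain number, start, end, superfamily),
# copied by hand from the strong hits listed above.
DOMAINS = [
    (1, 1188, 1415, "Concanavalin A-like lectins/glucanases"),
    (2, 228, 340, "Cadherin-like"),
    (3, 333, 438, "Cadherin-like"),
    (4, 644, 756, "Cadherin-like"),
    (5, 120, 226, "Cadherin-like"),
    (6, 744, 861, "Cadherin-like"),
    (7, 1431, 1645, "Concanavalin A-like lectins/glucanases"),
    (8, 15, 118, "Cadherin-like"),
    (9, 541, 643, "Cadherin-like"),
    (10, 439, 539, "Cadherin-like"),
    (11, 1127, 1166, "EGF/Laminin"),
    (12, 852, 954, "Cadherin-like"),
    (13, 1673, 1719, "EGF/Laminin"),
    (14, 1638, 1671, "EGF/Laminin"),
]


def order_by_position(domains):
    """Return the domains sorted by start coordinate (N- to C-terminus)."""
    return sorted(domains, key=lambda d: d[1])


def adjacent_overlaps(domains):
    """Return (first, second, shared_residues) for neighbouring regions that overlap.

    Coordinates are treated as 1-based and inclusive, as on the page.
    """
    ordered = order_by_position(domains)
    found = []
    for left, right in zip(ordered, ordered[1:]):
        if right[1] <= left[2]:
            found.append((left[0], right[0], left[2] - right[1] + 1))
    return found


if __name__ == "__main__":
    for number, start, end, superfamily in order_by_position(DOMAINS):
        print(f"{start:>5}-{end:<5} domain {number:>2}  {superfamily}")
    for first, second, shared in adjacent_overlaps(DOMAINS):
        print(f"domains {first} and {second} share {shared} residues")
```

Read this way, the assigned regions run as nine Cadherin-like domains, one EGF/Laminin domain, two Concanavalin A-like domains, and two further EGF/Laminin domains, all within the first 1719 of the 3010 residues.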
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSHAP00000014511   Gene: ENSSHAG00000012387   Transcript: ENSSHAT00000014633
Sequence length 3010
Comment pep:known_by_projection scaffold:DEVIL7.0:GL841583.1:1020890:1062164:-1 gene:ENSSHAG00000012387 transcript:ENSSHAT00000014633 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
LGSGQAXARRAANRHPHFPQYNYQALVAENQAPGTSVLSVAAQDPDAGEAGRLSYSMAAL
MNSRSLDLFRIDAASGLISTAEPLDRESMDLHYFRVTALDHGSPRLSATTMVAVTVADRN
DHAPVFEQAEYRETIRENVEEGYPILQLRATDGDAAANANIRYRFVDPPAAQAVFEIDPR
SGLITTSGRVDREQKENYELAVEASDQGGEPGPRSATVRVHITVLDENDNAPQFSEKRYL
AQVREDVRPHTEVLRVTATDLDKDTNALVHYNIISGNSRGHFAIDSLTGEIQVVAPLDFE
VEREYALRIRAQDSGRPPLSNNTGLASIQVVDINDHAPIFVSTPFQVSVLENAPLGHSVI
HIQAVDADHGENARLEYKLTGTALDTPFVVNSATGWITVSGPLDRESVEHYFFGVEALDH
GSPPLSASASVTITVMDVNDNRPEFTQKEYHLRLNEDAAVGTSVLSVTAVDRDINSAISY
QITGGNTRNRFAISTQGGMGLVTLALPLDYKQERYYKLIVTASDRTLHDNCYVHINITDA
NTHRPVFQSAHYSVSVNEDRPVGSTVVVISATDDDVGENARITYLLEDNLPQFRIDVDSG
AITLQAELDYEDQVTYTLAITAKDNGIPQKADTTYVEIMVNDVNDNAPQFVASYYPGVIS
EDAPPFTSVLQISATDRDAHANGRVQYTFQNGEDGDGDFTIEPTSGIVRTVRRLDRESVP
VYELTAYAVDRGVPPLRTPVNIQVTVQDVNDNAPVFPAEEFEVRVKENSIVGSVVAQITA
IDPDEGPNAQIMYQIVEGNIPELFQMDIFSGELTTLIDLDYEMRPEYVIVVQATSAPLVS
RATVHIRLIDQNDNSPVLKNFQILFNNYVSNRSNTFPSGIIGRIPAYDPDVSDRLFYTFE
RGNELELLVVNQTSGELRLSRKLDNNRPLVASMLVTVTDGLHSVTAQCVLRVTIITEDML
ANSLTVRLENMWQERFLSPLLASFLEGVASVLATPKEDVFIFNIQNDTDVGGSVLNVSFS
ALAPGGGAPGAGGGGGPFFSSEELQEQLYVRRAALAAASLLDVLPFDDNVCLREPCENYM
KCISVLRFDSSAPFLASASTLFRPIHPIAGLRCRCPAGFTGDYCETEINLCYSNPCRHGG
ACARREGGYTCICRPHFTGENCELDSRAGRCVPGVCRNGGTCTNTPSGGFRCQCPGGGGF
EGPYCEVAVRSFPPNSFLMFRGLRQRFHLTLSLSFATVRSHGLLFYNGRLNEKHDFLALE
IVAGQVRLTYSTGESSTVVSPIVPSGVSDGQWHTVQLKYYNKPRTSALGGVQGPSKDKVA
VLSVDECDTPVALQFGAEISNYSCAAEGVQTSSKKSLDLTGPLLLGGVPNLPENFPVSHK
EFVGCMRDLFIDSRRVDMAAFVANNGTIPGCHAKRAFCVSSPCQNGGTCSDGWAGFRCDC
PVGFGGKDCGLAMVHPYRFRGNGALSWDFGSEVTISVPWYLGLAFRTRTPHGVLLQIQAG
QHSSLLCQLERGLLSLVVDRGSGRSARLLLDQVTLSDGRWHDLRLELQDGRASRSGHYLL
TVTVDFGLFQDTMTVGSELHGLKVKHLHVGGLLKAHDVHRGLDGCIQGVWLGTVPSGAPA
LPPPSSKMNAEPGCSVLNSCASKPCPLHADCRDEWQTFSCVCHPGYYGKACVDACHLNPC
KNKGSCQRRAGAPHGYACECEDSHFGPHCEHRMDQQCPRGWWGSPACGPCNCDVNKGFDP
DCNKTNGQCYCKEFHYRPRGSDTCLPCDCYPVGSSSRSCSLDSGQCPCRPGVIGRQCNSC
DTPSAEVTASGCRVLYNGCPKSLKGGVWWPQTKFGLVATVPCPKGSLGAAIRHCDEEKGW
LEPDLFNCTSPAFRELGLLLEDLERNETELDTIEAKKLAQRLRAVTGHTERYFGNDVHIA
ARLLGRLLTFESRQQGFGLTATQDAHFNENLLWAGSAVLAPETRQLWDSPWPQVLGGTPA
RGASAGLVEQLEEYTATLAQNMELTYLNPVGLVAPNIMLSIDRLETPVPNLGGHRYPRYH
GSLFRGQDAWDPHTHVVLPPPVPRPPQPEARAGRSGGAGSTHCGLPSPALPTGGKGEGRG
SVTVILRLARLLRSLPADFAAVTAGAGLPRNPVMNSPVVSVSVFKERSFLQGVLESPVTL
EFRLLQTANRSKPLCVQWERPSPSDPWGVWTARDCELVHRNGTHVRCQCSRLGTFGVLMD
ASHRERLEGDLALLAVVTHVTVSVTVASLLLTAAVLLSLRGLKSNTRGIHANVAAALGLA
ELLFFLGIHRTHNQFLCTVIAIVLHYLFLGTFSWLFVQGLHLYRMQAEARNVDLGAMRFY
YALGWGVPAILLGLAVGLDPGGYGNSDFCWISIHDKLVWSFTGPIVLVISMNGVLFLLVA
RMSCSPGQREAKKMSVLVALCSSFLLLLLISASWLFGLLAVNHSVLAFHYLHAGFCGLQG
PAVLLLFCVLNEETRGAWASACLGKKAGTEEARPAPGTGPAAYNNTALFEESGLIRITLG
ASTVSSVSSARSARTQDSQRGRGGLLRENVLAKHGSTADHTDQSLLAHGGPTDLDVAMFH
RDAGGGMDSDSDSDLSLEEDRSLSIPSSESEDNVRPRGRFQHPLRRAAHSERLLTHPASS
TPKDVDGNDLLSYWPALGDCEVHPCSLQTWGSERRLGLDASKDAANNNQTELALTSGDEN
TMGPTQLGHTQRQRKGILKNRLQYPLALHGLPSTSARGAPELSWYRSSTLGHRAVPAASY
GRIYSGTGSLSQPTSRYSSREQLDLLLRRQMSREQLEEAVPLPMYSTSRHGSREQLETCL
GGSEPRFRGSTLPRRQPPRDHLDALATRFGSREPLDSGASQEWPSPLPHLRGSRDQLLPT
SQRSREQLDCLARSSNSPEQLDVRPCRHPSRENLEPLSRRLPSRENVAACPSRPPSTEQL
DILSSILASFNSSALSSSVQSSSTPSGPHTTATPSATASAPGHSTPRSATSHSISELSPD
SEVTRSEGHS
Identical sequences: G3WGF6, ENSSHAP00000014511
