SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSHAP00000017121 from Sarcophilus harrisii 69_7.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSSHAP00000017121
Domain Number 1 Region: 4249-4464
Classification Level Classification E-value
Superfamily GFP-like 3.14e-60
Family Domain G2 of nidogen-1 0.0023
Further Details:      
 
Domain Number 2 Region: 2890-2976
Classification Level Classification E-value
Superfamily Immunoglobulin 3.08e-23
Family I set domains 0.011
Further Details:      
 
Domain Number 3 Region: 4160-4251
Classification Level Classification E-value
Superfamily Immunoglobulin 4.02e-23
Family I set domains 0.022
Further Details:      
 
Domain Number 4 Region: 3524-3619
Classification Level Classification E-value
Superfamily Immunoglobulin 1.16e-22
Family I set domains 0.011
Further Details:      
 
Domain Number 5 Region: 2785-2881
Classification Level Classification E-value
Superfamily Immunoglobulin 1.53e-22
Family I set domains 0.014
Further Details:      
 
Domain Number 6 Region: 3064-3163
Classification Level Classification E-value
Superfamily Immunoglobulin 1.68e-22
Family I set domains 0.013
Further Details:      
 
Domain Number 7 Region: 2290-2387
Classification Level Classification E-value
Superfamily Immunoglobulin 2.24e-22
Family I set domains 0.0094
Further Details:      
 
Domain Number 8 Region: 2608-2696
Classification Level Classification E-value
Superfamily Immunoglobulin 2.96e-22
Family I set domains 0.019
Further Details:      
 
Domain Number 9 Region: 1914-2008
Classification Level Classification E-value
Superfamily Immunoglobulin 4.33e-22
Family I set domains 0.0072
Further Details:      
 
Domain Number 10 Region: 3886-3983
Classification Level Classification E-value
Superfamily Immunoglobulin 8.88e-22
Family I set domains 0.019
Further Details:      
 
Domain Number 11 Region: 448-545
Classification Level Classification E-value
Superfamily Immunoglobulin 9.2e-22
Family I set domains 0.01
Further Details:      
 
Domain Number 12 Region: 2206-2294
Classification Level Classification E-value
Superfamily Immunoglobulin 1.96e-21
Family I set domains 0.009
Further Details:      
 
Domain Number 13 Region: 730-825
Classification Level Classification E-value
Superfamily Immunoglobulin 2.02e-21
Family I set domains 0.014
Further Details:      
 
Domain Number 14 Region: 1728-1826
Classification Level Classification E-value
Superfamily Immunoglobulin 2.52e-21
Family I set domains 0.015
Further Details:      
 
Domain Number 15 Region: 2397-2483
Classification Level Classification E-value
Superfamily Immunoglobulin 4.5e-21
Family I set domains 0.023
Further Details:      
 
Domain Number 16 Region: 3175-3257
Classification Level Classification E-value
Superfamily Immunoglobulin 7.85e-21
Family I set domains 0.02
Further Details:      
 
Domain Number 17 Region: 365-461
Classification Level Classification E-value
Superfamily Immunoglobulin 2.36e-20
Family I set domains 0.045
Further Details:      
 
Domain Number 18 Region: 1345-1441
Classification Level Classification E-value
Superfamily Immunoglobulin 2.52e-20
Family I set domains 0.0091
Further Details:      
 
Domain Number 19 Region: 3253-3348
Classification Level Classification E-value
Superfamily Immunoglobulin 2.93e-20
Family I set domains 0.016
Further Details:      
 
Domain Number 20 Region: 3971-4070
Classification Level Classification E-value
Superfamily Immunoglobulin 2.94e-20
Family I set domains 0.047
Further Details:      
 
Domain Number 21 Region: 1817-1917
Classification Level Classification E-value
Superfamily Immunoglobulin 3.89e-20
Family I set domains 0.009
Further Details:      
 
Domain Number 22 Region: 3800-3892
Classification Level Classification E-value
Superfamily Immunoglobulin 4.1e-20
Family I set domains 0.0096
Further Details:      
 
Domain Number 23 Region: 2495-2581
Classification Level Classification E-value
Superfamily Immunoglobulin 6.23e-20
Family I set domains 0.008
Further Details:      
 
Domain Number 24 Region: 2686-2783
Classification Level Classification E-value
Superfamily Immunoglobulin 9.62e-20
Family I set domains 0.0059
Further Details:      
 
Domain Number 25 Region: 972-1009,1295-1349
Classification Level Classification E-value
Superfamily Immunoglobulin 1.11e-19
Family I set domains 0.028
Further Details:      
 
Domain Number 26 Region: 4080-4167
Classification Level Classification E-value
Superfamily Immunoglobulin 2.39e-19
Family I set domains 0.0071
Further Details:      
 
Domain Number 27 Region: 2969-3068
Classification Level Classification E-value
Superfamily Immunoglobulin 4.35e-19
Family I set domains 0.006
Further Details:      
 
Domain Number 28 Region: 546-631
Classification Level Classification E-value
Superfamily Immunoglobulin 5.47e-19
Family I set domains 0.02
Further Details:      
 
Domain Number 29 Region: 634-728
Classification Level Classification E-value
Superfamily Immunoglobulin 7.08e-19
Family I set domains 0.013
Further Details:      
 
Domain Number 30 Region: 2111-2194
Classification Level Classification E-value
Superfamily Immunoglobulin 1.25e-18
Family I set domains 0.0074
Further Details:      
 
Domain Number 31 Region: 3612-3711
Classification Level Classification E-value
Superfamily Immunoglobulin 1.33e-18
Family I set domains 0.0072
Further Details:      
 
Domain Number 32 Region: 40-211
Classification Level Classification E-value
Superfamily vWA-like 1.68e-18
Family Integrin A (or I) domain 0.021
Further Details:      
 
Domain Number 33 Region: 816-910
Classification Level Classification E-value
Superfamily Immunoglobulin 4.68e-18
Family I set domains 0.032
Further Details:      
 
Domain Number 34 Region: 3707-3802
Classification Level Classification E-value
Superfamily Immunoglobulin 8.28e-18
Family I set domains 0.027
Further Details:      
 
Domain Number 35 Region: 1525-1623
Classification Level Classification E-value
Superfamily Immunoglobulin 3.9e-17
Family I set domains 0.01
Further Details:      
 
Domain Number 36 Region: 1434-1535
Classification Level Classification E-value
Superfamily Immunoglobulin 4.5e-17
Family I set domains 0.0053
Further Details:      
 
Domain Number 37 Region: 1997-2097
Classification Level Classification E-value
Superfamily Immunoglobulin 1.3e-16
Family I set domains 0.034
Further Details:      
 
Domain Number 38 Region: 3441-3531
Classification Level Classification E-value
Superfamily Immunoglobulin 1.46e-16
Family I set domains 0.038
Further Details:      
 
Domain Number 39 Region: 3338-3431
Classification Level Classification E-value
Superfamily Immunoglobulin 0.0000000000000102
Family I set domains 0.015
Further Details:      
 
Domain Number 40 Region: 4490-4571,4598-4654,4706-4733
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.000000000000272
Family Growth factor receptor domain 0.015
Further Details:      
 
Domain Number 41 Region: 4706-4815
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.000000000282
Family Growth factor receptor domain 0.018
Further Details:      
 
Domain Number 42 Region: 4572-4626
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000142
Family EGF-type module 0.021
Further Details:      
 
Domain Number 43 Region: 1667-1732
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000202
Family I set domains 0.031
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSHAP00000017121   Gene: ENSSHAG00000014551   Transcript: ENSSHAT00000017263
Sequence length 4924
Comment pep:novel scaffold:DEVIL7.0:GL834652.1:1515673:1694274:1 gene:ENSSHAG00000014551 transcript:ENSSHAT00000017263 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MTPGAPLSLPGPELLLLVVAAAAAGPPLGEPVLPSSGDATLAFVFDVTGSMWDDLKQVVE
GASRILERSLSGRSKAIANYALVPFHDPDIGPVTLTSDPAVFQRELRELYVQGGGDCPEM
SVGAIKAAVEISNPGSFIYVFSDARAKDYHRKPELLRLLQLKQSQVVFVLTGDCGDRTHP
GYLAYEEIAATSSGQVFHLDKQQVNEVHLVLRNEGKGDILGNGSNIKQNMLIFLRDSYQP
KNYYFGQHDLVEVWVSSIVTCSAVSVKKKSPPPSPPFPPPPPPPPPPPSSSPCKLSPKTT
RVSLSLSLSLLSFSLILSLCPSPFLCFCLLSLCLCFCLSLLSLSAVQIFCSDPSSSLTLC
SPTDPPPQLAPPQNVTVSPGETAVLSCHVLDKAPYNLTWIRDWRALPASPGRVAQLANLS
LEINGIIPSDGGRYQCLASNANGMAKASVWLFVREPPQISINTVSQHFSQGVEVRISCTA
SGHPPPNISWKHKAQTIVKEGRFFVDDQGTLIIQSVAPEDAGNYSCQATNEVGTDEQTVT
LFHTDPPSVSALRRVVRAPVGEEAVLECKASGVPPPRVIWYRGGREMILAPEVAHTGILR
IQEVQERDAGNYMCRAVNELGAASADIKLEVGHAPRLVESPRNMAVEMGKNTILACRTEG
SPSMRVTWARADGKPVPAHATEGGRARQQEAGVLFLDNVTPEDQGLYICEAHNAFGKVQA
EVQLTVIGHEPPQIASSASLVRVLEGQPASLPCVILAGKPFPQRHWLKNGQAPLLNGHYS
VRSDGSFHIDRALPEDAGRYSCMVTNAAGSQRQDIELAVQVLPSIQPAASHYVTSEGIPV
SLPCVSRGVPTPTITWRKETNALSSRDSHYQVLKEGTLYIPQPTAQDSGTYVCTAANSLG
ISSQEIQLSVNRIIILSWFWDELETLHVPHHKKFLGLLNFTLKSSCFHKNHPGFIPSHLH
DFFLTLAKPRIITNRPVTITAMAGKELTLLCEAQGSPTPLVTWTKDSVRQQGHRNVGLAH
PAWLVTSALVNWKLLVGGKKIANAFSPQVLKDTLFHVDVTSLLMNEILGSSCSLARPLKE
TRQKGLQWSCQPSRFLDHLNICLSGTSAFKGLSNALSLSTYTPENSWGSACQLTCGGGTT
FARSILFLPPSPSTSGIAQSEVDGCAIDEITLQKGKLRQFQVPRIILKQQVGIVCLGSDL
GFENRSMSSYMGSFDWGRKLVFERQMCHFSPGNSGQGKNPCKIRPQSHTAWSLIELPVCL
PETMLGCDCAEELRSQALIQKHCNCWATPLPASQFSYTVSRHSLLPSGSLKLAETSVEDS
GLYTCTASNTAGTASQSYVLRVQAPPTIWGSNETGEVAVMEDHVVRLQCDARGVPTPIIT
WFKDGDPLLAGPQVAFAKGGRHLQLGKAGVSDSGLYTCQASNAAGIAEKAVRLDVYVPPT
IEGAEGGPLVVKAVAGRPLTLGCLASGHPPPTLTWHQDGNPLTENNEMWLQEGGRVLRLE
RVAEAASGYYSCLANSPAGETVLHYLVEVQVPPQLLVGEGSGQVTTVKGHSLELPCQATG
SPTPTVQWLRSGRPAGELAGVDVSVDGAMLRIDRVEPDHSGLFACQATNEAGTAGAEVEV
VVHGEHLKMFLEVPQRQGHWGTSMSSHPFPPACDNPVCGSWDAHTKPQLVEEWVHTPWLL
YNTPAWIPTAGFLFRIEKVDLRDEGIYTCTATNLAGEAKRDVALKVLVPPNIEPGLVNKA
VLENTSASLECLASGVPTPRISWFRGRQLISPKPGLMVSADGRVLRIKRAQLSDAGSYRC
VASNVAGSSELKFGLRVNVPPRITLAPSLPGPILLNEPVRLMCNATGAPKPTLMWLKDGN
PVSATGISGLQIFPGGHVLTLASSRATDSGTYSCVAVNAVGEDRRDVTLQVHLPPSILGE
EQNVSVVVNQSVTLECQSQAVPPPVLTWRKDGHPLYTRPGVHLSPDGALLKVERAEVQDV
GRYTCEALNKAGRSEKHYNLNVWVPPEFSLWESRTLAVIEGHAISLSCECRGIPFPKITW
KKDGMLLPMDRGSTEPISAVGRLLYLGKAQPAQEGNYTCECSNIAGNSSQEQQLEVYASV
APKIPGSDDLLKELSVIQSGKVTLECEATGKPPPMVTWEKDGQPVAGDHGLLLQRQGRAL
QVERARAGHAGHYTCIAENEAGRAERRFDLSVLDLVPPELTGDTDPLTNVTVVLHSTLTL
LCEASGSPSPVLRWFRGEEPISPGEDIYFLAGGRILKLTQVQEEDAGLYLCLASNMVGEA
RKNFSVEVLVPPKIENENPEEEIKIPEGQSVSLTCNATGHPQPTVTWFKDGHSLSGGDPY
HLSPDGSVLEILQTNLSSSGHYSCIASNSVSEKTKHYKLTVLVVPTILGVTEDSPDEEVT
VTINNPISLICEALAFPSPNITWMKDGAPFQASGNTQLLPGTHGLQILNAQEQDAGRYTC
VVTNEVGEAVKNYHVEVLIPPSISKDDPLDEFSVKEVKAKVNSTLSLECESWAIPPPTIT
WYKDGQLVSADDHLHLLAEGRLLQISPTRSWDSGRYLCVATNVAGEDDKDFHVLIQVPPI
FQKIAGPNEVSGTGYQEEELRGGTMEYREIVENNPAYLYCDTSAVPPPQLTWYKDGQPLS
STEGVSVLQGGRVLQIPMVQAEDAGKYTCKASNEVGEDWLHYELLVLTPPVIQGDPEELV
EEVTVNANSTVSLQCQALGTPPPAILWLRNGLPLTPSSKHQALEDGQVLQVSVADVTDSA
SYMCVAENSAGSAEKLFTLKVQGIAPPRITGLNPERITAIVNSSVALPCDVHSHPSPEVT
WYKDGWALPFSEEVFLLPGTHTLQLPRPQPSDTGTYTCEALNVAGRDQKLVLLSVFAPPT
IKQTSSGQQDTIVVRVGDTAVLQCESDTLPEPVITWYKNGQQITLDQQVEMLLDGQKLEI
VNVQVADKGLYSCKVSNIVGEAVRTFALTVQVPPIFENPETETLSQVAGKSLVLVCDVVG
VPAPTVTWLKDRMPVESSVERGVVSRGGRLQLSRLQPSQEGTYTCVAENPEAEARKDFVV
MVLVAPRILSSGVPQEHNVLEDQEVRLECEAEGQPQPDILWLKDGRPLGIHISPHLRFYT
DGSSLVLKGLKASDSGAYTCLAQNSAGEDTKLHTLSVLVPPTIDKGANGSGTLISVPGEL
VTLACPARGSPPIQINWLKDGLPMPLSQRTHLHSSGRTLRISQIQVADAGTFTCVASSPA
GVAERTFSLQIHVPPVLEPSESKDAMAVVRGSDVTLPCEATGTPLPAVSWLKDGASLMVQ
SLGLGTGTSLQLEAVQADDAGTYSCVAVNEAGEAIRHFQLAVMEPPRIKDSGQAAEMLLL
PGAPLELICNALGNPMPNITWQKDGQAVARIGSITKNGRVLQVDDAGLYTCLAENPAGED
GKNFLVRVQAPPNIVGSRETRTVIGLAHGQLVLECPVEADPLPKIEWHREGILLQADAHT
LLLENGRFLQLQALDISDSGKYSCVASNAAGSTSLPFDVEIHMAPTIHPGPSVVNASVNQ
TALLPCKAKGIPESLVSWRKDGIPLVPGSRRLEFLPDGSLRIQPVYPEDSGYYLCQASNS
AGSDRQGRELRVFEPPTIAPGPSNLTLTVHNQSILPCEARGSPKPHVIWKKNGQTLSLDR
PQGAYRLLPSSSLVLTDPDLQDTAQFECLVSNDAGEAHRLYWVTVYVPPTIADDRTDFTV
TKMAPVVLTCHTTGVPAPVVSWSKGGAQLGKRGSGYRVSPTGALEIGQALPIHTGRYTCT
ARNDVGVAHKHVILTVQASPVIKPLPGVVHVMALADVVLPCEASGIPRPTITWQKEGLSI
PTGIGAQILPNGQLRISQASAEDAGNYLCIAKNPSGTALGKTRLVVQVPPVIKGGQSDLS
AAEGSQALLPCMAQGIPEPHITWKKDGFIVSSMEGKYVIQPSGELLVKNSEWRDAGTYTC
TAENAAGSTSRRVHLSILSLPTFTTLPGDLSLNQGEKLWLRCTARGSPTPHISWMLNNRL
ITEGVSEQDGGSTLQRAAVTREDSGTYTCWAENIVGKVQTVSFVHVKEAPALQGETSSHL
VELLGDSARLDCAARGDPAPVIRWIKDGLPVLSSYHRRQLHNGSLAIHRTVMEDAGHYLC
LAENEVGVVEKEVILILRSAPIFVVEPQDVVVRAGGTVVLLCQAAGEPNPTVEWTQAGRP
IRVSQRLQTLPNGSLQLKGVEMEDMGEYECVAHNLLGTAITQAFVAVKGEPRGSRGSMVG
VINGQEFGVASLNTSVLQEIEDGATTIQSSINNIPPDVGPLMRILVVAIAPIYWVLAGQS
GEALNGYSLTRGNFRQESQVEFATGELLHLTQVARGLDRDGLLLLDVVVNGFIPEVLPTA
HLQVQDFRERYVQIGPGELYVGSTQTFLKDGAPTDFHCNHTIKYDSALGPQPQLVQHLRA
TEVSSAFDPGAEALRFQLTTALQTEDNEAGCPEGFVLDSLQGFCTDKDECSSTKSPCSHI
CHNFLGRFSCSCPDGYALAWDNRNCRDVDECAWETSVCHDGQRCVNLLGSYQCLPHCKTG
FQATADGTGCEDINECLGHMDECRYNQICENTVGGHRCTCPPGYRSQGFGRPCLDINECL
QLPRVCAYQCQNLQGSYRCLCPPGQALLQDGKGCARLEKMEGNITTFSHQGSFPAWLRPR
ARAPGGSYHAWISFRPIGRPLSSISRTWCPPGFTRRNGACTDLDECQVRTLCQHACRNTE
GSYQCLCPAGYRLLSSGKNCQDINECIEEGIKCGPSQMCFNTRGSYQCVDTPCPALYRRG
SSPGMCFRRCALDCSSGGPFTLQYKLLTLPFGIRADHDVVRLTAFSDGGVLPNRTVLTVL
EPDPSSPFALREPQGVQGTIYTRRPLTDAGIYRLKVQAVTYGEHRVLRYQNIFVILISVS
PYPY
Download sequence
Identical sequences G3WNW6
ENSSHAP00000017121 ENSSHAP00000017121

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]