SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSSSCP00000018030 from Sus scrofa 69_10.2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSSSCP00000018030
Domain Number 1 Region: 992-1330
Classification Level Classification E-value
Superfamily NHL repeat 3.4e-24
Family NHL repeat 0.0011
 
Domain Number 2 Region: 687-762
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 8.44e-08
Family Pre-dockerin domain 0.06
 
Domain Number 3 Region: 1300-1476
Classification Level Classification E-value
Superfamily NHL repeat 3.92e-05
Family NHL repeat 0.012
 
Weak hits

Sequence:  ENSSSCP00000018030
Domain Number - Region: 440-465
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00055
Family EGF-type module 0.04
 
Domain Number - Region: 537-562
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00156
Family EGF-type module 0.024
 
Domain Number - Region: 507-531
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00215
Family EGF-type module 0.096
 
Domain Number - Region: 373-400
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00311
Family Integrin beta EGF-like domains 0.051
 
Domain Number - Region: 408-432
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00419
Family EGF-type module 0.054
 
Domain Number - Region: 563-603
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00754
Family EGF-type module 0.084
 
Domain Number - Region: 343-367
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0526
Family EGF-type module 0.08
 
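As an illustrative aside, the strong and weak hits listed above can be held as plain tuples and sorted by start position to recover the order of domains along the sequence. This is a minimal Python sketch and not part of SUPERFAMILY: the names HITS, in_sequence_order and overlapping_pairs are hypothetical, and the regions and E-values are transcribed by hand from the assignment details above.

```python
# Hits transcribed from the assignment details above (an assumption of this
# sketch, not an export from SUPERFAMILY):
# (start, end, superfamily, superfamily E-value, strength)
HITS = [
    (992, 1330, "NHL repeat", 3.4e-24, "strong"),
    (687, 762, "Carboxypeptidase regulatory domain-like", 8.44e-8, "strong"),
    (1300, 1476, "NHL repeat", 3.92e-5, "strong"),
    (440, 465, "EGF/Laminin", 5.5e-4, "weak"),
    (537, 562, "EGF/Laminin", 1.56e-3, "weak"),
    (507, 531, "EGF/Laminin", 2.15e-3, "weak"),
    (373, 400, "EGF/Laminin", 3.11e-3, "weak"),
    (408, 432, "EGF/Laminin", 4.19e-3, "weak"),
    (563, 603, "EGF/Laminin", 7.54e-3, "weak"),
    (343, 367, "EGF/Laminin", 5.26e-2, "weak"),
]


def in_sequence_order(hits):
    """Return the hits sorted by start residue (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[0])


def overlapping_pairs(hits):
    """Return pairs of hits whose inclusive residue ranges share a residue."""
    ordered = in_sequence_order(hits)
    pairs = []
    for i, first in enumerate(ordered):
        for second in ordered[i + 1:]:
            # 'second' starts at or after 'first', so they overlap
            # exactly when 'second' starts before 'first' ends.
            if second[0] <= first[1]:
                pairs.append((first, second))
    return pairs


if __name__ == "__main__":
    for start, end, superfamily, evalue, strength in in_sequence_order(HITS):
        print(f"{start:>5}-{end:<5} {superfamily:<40} {evalue:.3g} ({strength})")
    for first, second in overlapping_pairs(HITS):
        print(f"overlap: {first[0]}-{first[1]} and {second[0]}-{second[1]}")
```

Sorted this way, the weak EGF/Laminin hits (343-603) precede the Carboxypeptidase regulatory domain-like hit (687-762) and the two NHL repeat hits. The only overlapping pair is the two NHL repeat regions, 992-1330 and 1300-1476.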

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSSCP00000018030   Gene: ENSSSCG00000017018   Transcript: ENSSSCT00000018530
Sequence length 2534
Comment pep:known_by_projection chromosome:Sscrofa10.2:16:60190379:60526928:-1 gene:ENSSSCG00000017018 transcript:ENSSSCT00000018530 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
SGPPNHHSQSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTP
ESVQLQDSWVLNSNVPLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNT
FSRKAFKLKKPSKYCSWKCAALSAIAAALLLAILLAYFIAMHLLGLNWQLQPADGHTFNN
GIRTGLPGNDDVATMPSGGKVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQP
QFLKFNISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSVVESPRERRSIQTLVQN
EAVFVQYLDVGLWHLAFYNDGKDKEMVSFNTVVLDSVQDCPRNCHGNGECVSGLCHCFPG
FLGADCAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPLNQCIDPSCGGHGSCIDGNC
VCSAGYKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTY
LPDTGLCSCDPNWMGPDCSVEVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIE
HGTCKDGKCECREGWNGEHCTIDGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGCNVAME
TSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDIIQQGQTDSPAVKSFYDR
IKLLAGKDSTHIIPGDNPFNSSLVSLIRGQVVTTDGTPLIGVNVSFVKYPKYGYTITRQD
GTFDLIANGGASLTLHFERAPFMSQERTVWLPWNSFYAMDTLVMKTEENSIPSCDLSGFV
RPDPIIISSPLSTFFSAAPGQNPIVPETQVLHEEIELPGSTVKLRYLSSRTAGYKSLLKI
TMTQSTVPLNLIKVHLMVAVEGHLFQKSFQASPNLAYTFIWDKTDAYGQRVYGLSDAVVS
VGFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGENQFLT
QQPAIITSIMGNGRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLFVGDFNYIRRIFPS
RNVTSILELRNKEFKHSNNPAHKYYLAVDPVSGSLYVSDTNSRRIYRVKSLSGAKDLAGN
SEVVAGTGEQCLPFDEARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDATMIRKVDQNGI
ISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRITENHQ
VSIIAGRPMHCQVPGIDYSLSKLAIHSALESASAIAISHTGVLYITETDEKKIINRLRQV
TTNGEICLLAGAASDCDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGN
IRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYSADND
VTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMTYDG
NTGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDV
TVITNLSSVEASYTVVQDQVRNSYQLCNNGTLRVMYANGMGVSFHSEPHILAGTITPTIG
RCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKIYDDH
RKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSR
MFADGKVWSYSYLDKSMVLLLQSQRQYIFEYDSSERLHAVTMPSVARHSMSTHTSIGYIR
NTYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAITFGYDE
TTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFEYTYHDNSFRIASIK
PVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIKEVQ
YEMFRSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRY
SYDLNGNLHLLNPGNSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGADIFEYNSKG
LLTRAYNKASGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSNSEIT
SLYYDLQGHLFAMESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYDSNPD
FQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEPAPFNLYMFKSNN
PLSNELDLKNYVTVSTLDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASESG
QLITGVQQTTERHNQAFLALEGQVISKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGR
VTTGVSSIASEDSRKVASVLNNAYYLDKMHYSIEGKDTHYFVKIGSADGDLLTLGTTIGR
KVLENGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDSLDEEKARVLDQAR
QRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSS
SNIQFLRQNEMGKR
Identical sequences ENSSSCP00000018030
