SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for NP_001229495.1.31192 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  NP_001229495.1.31192
Domain Number 1 Region: 1467-1660
Classification Level  Classification              E-value
Superfamily           vWA-like                    6.85e-44
Family                Integrin A (or I) domain    0.00096
 
Domain Number 2 Region: 1271-1464
Classification Level  Classification              E-value
Superfamily           vWA-like                    1.14e-42
Family                Integrin A (or I) domain    0.0000000347
 
Domain Number 3 Region: 1652-1862
Classification Level  Classification              E-value
Superfamily           vWA-like                    1.41e-40
Family                Integrin A (or I) domain    0.000000144
 
Domain Number 4 Region: 647-709
Classification Level  Classification              E-value
Superfamily           Serine protease inhibitors  0.000000000000155
Family                BSTI                        0.054
 
Domain Number 5 Region: 294-348
Classification Level  Classification              E-value
Superfamily           Serine protease inhibitors  0.00000000018
Family                BSTI                        0.052
 
Domain Number 6 Region: 773-829
Classification Level  Classification              E-value
Superfamily           Serine protease inhibitors  0.000000213
Family                ATI-like                    0.014
 
Domain Number 7 Region: 2198-2254
Classification Level  Classification              E-value
Superfamily           Serine protease inhibitors  0.000000997
Family                BSTI                        0.065
 
Domain Number 8 Region: 2576-2650
Classification Level  Classification              E-value
Superfamily           FnI-like domain             0.00000146
Family                VWC domain                  0.022
 
Domain Number 9 Region: 1143-1196
Classification Level  Classification              E-value
Superfamily           Serine protease inhibitors  0.0000196
Family                ATI-like                    0.045
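
The strong hits above are listed by superfamily E-value, not by position along the protein. The sketch below reorders them by region start and flags neighbouring domains whose regions overlap. It assumes the regions are 1-based and inclusive, which the page does not state; the `Domain` type and both helper functions are illustrative names, not part of any SUPERFAMILY tool.

```python
from typing import List, NamedTuple, Tuple


class Domain(NamedTuple):
    number: int       # "Domain Number" as listed in the report
    start: int        # first residue of the region (assumed 1-based, inclusive)
    end: int          # last residue of the region (assumed inclusive)
    superfamily: str  # superfamily-level classification


# Values copied from the strong hits listed in this report.
DOMAINS = [
    Domain(1, 1467, 1660, "vWA-like"),
    Domain(2, 1271, 1464, "vWA-like"),
    Domain(3, 1652, 1862, "vWA-like"),
    Domain(4, 647, 709, "Serine protease inhibitors"),
    Domain(5, 294, 348, "Serine protease inhibitors"),
    Domain(6, 773, 829, "Serine protease inhibitors"),
    Domain(7, 2198, 2254, "Serine protease inhibitors"),
    Domain(8, 2576, 2650, "FnI-like domain"),
    Domain(9, 1143, 1196, "Serine protease inhibitors"),
]


def in_sequence_order(domains: List[Domain]) -> List[Domain]:
    """Return the domains sorted from N-terminus to C-terminus."""
    return sorted(domains, key=lambda d: d.start)


def adjacent_overlaps(domains: List[Domain]) -> List[Tuple[int, int, int]]:
    """Return (left number, right number, shared residues) for each pair of
    neighbouring domains whose regions overlap. Only neighbours in sequence
    order are compared."""
    ordered = in_sequence_order(domains)
    found = []
    for left, right in zip(ordered, ordered[1:]):
        shared = left.end - right.start + 1
        if shared > 0:
            found.append((left.number, right.number, shared))
    return found


if __name__ == "__main__":
    for d in in_sequence_order(DOMAINS):
        print(f"{d.start:>5}-{d.end:<5} domain {d.number}: {d.superfamily}")
    print(adjacent_overlaps(DOMAINS))
```

Under the inclusive-coordinate assumption, the only overlap is between domains 1 and 3, whose regions share residues 1652-1660.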
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s): NP_001229495.1.31192
Sequence length: 2813
Comment: von Willebrand factor precursor [Equus caballus]; AA=GCF_000002305.2; RF=representative genome; TAX=9796; STAX=9796; NAME=Equus caballus; breed=thoroughbred; AL=Chromosome; RT=Major
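
The comment value packs a free-text description and several `KEY=value` annotations into one semicolon-separated string. A minimal sketch of splitting it apart follows; `parse_comment` is a hypothetical helper, and the meaning of the individual keys is not documented on this page, so the code treats them as opaque labels.

```python
from typing import Dict, Tuple


def parse_comment(comment: str) -> Tuple[str, Dict[str, str]]:
    """Split a comment string into its leading description and a dict of
    the KEY=value annotations that follow it."""
    parts = [part.strip() for part in comment.split(";")]
    description = parts[0]
    fields: Dict[str, str] = {}
    for part in parts[1:]:
        key, sep, value = part.partition("=")
        if sep:  # skip any fragment that has no "=" in it
            fields[key.strip()] = value.strip()
    return description, fields


# Comment value copied from this record.
COMMENT = (
    "von Willebrand factor precursor [Equus caballus]; AA=GCF_000002305.2; "
    "RF=representative genome; TAX=9796; STAX=9796; NAME=Equus caballus; "
    "breed=thoroughbred; AL=Chromosome; RT=Major"
)

if __name__ == "__main__":
    description, fields = parse_comment(COMMENT)
    print(description)
    for key, value in fields.items():
        print(f"{key}: {value}")
```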
Sequence
MVPARLARVLLALALTLPGTFCAEGTLGRSSMGRCSLFGGDFVNTFDESMYSFAGDCSYL
LAGDCQKRSFSLIGAFQNGKRVSLSVYLGEFFDIHLFVNGTVLQGDQSVSMPYASKGLYL
ETEAGYYKLSSEAYGFVARIDSGGNFQVLLSDRYFNKTCGLCGNFNIFAEDDFITQEGTL
TSDPYDFANSWALSSGEERCKRASPPSSPCNVSSEEMQKDMWEQCQLLKTVSVFARCHPL
VDPEPFVALCEKALCPCAQGLQCSCAALLEYARACAQQGMVLYGWTDHSACRPACPAGME
YKECVSPCTRTCQSLHINEVCQEQCVDGCSCPEGQLLDEGRCVESAECPCVHSGKRYPPG
ASLSQDCNTCICRNSLWICSNEECPGECLVTGQSHFKSFDNRYFTFSGVCQYLLARDCQD
HSFSIVIETVQCADDPDAVCTRSVTIRLPSPHNSLVRLKHGGGVAMDGQDVQIPLLQGDL
RIQHTVMGSVRLSYGEDLQMDWDGRGRLLVKLSPLYAGKTCGLCGNYNGNKGDDFLTPAG
LVEALVEAFGNSWKLRADCEDLQKQHSDPCSLNPRLSRFAEEACALLTSPTFEACHGAVG
PGPYLRNCRYDVCSCSDGRDCLCGAVANYAAACARKGVHIGWREPGFCALSCPQGQVYLQ
CGTPCNRTCHSLSHPDEECDEVCLEGCFCPPGLYLDERGDCVPKAQCPCYYDGEIFQPED
IFSDHHTMCYCEDGFMHCSTSGAPGSLLPDAVLSSPLSHRSKRSLSCRPPMVKLVCPADN
PRAEGLECAKTCQNYDLECMSMGCVSGCLCPPGMVRHENRCVALERCPCFHQGREYAPGE
TVKIDCNTCVCRDRKWNCTDHVCDATCSAIGMAHYLTFDGLKYMFPGECQYILVQDYCGS
NSGTFRILVANEGCVYPSVKCKKRVTILVEEGEIELFNGEVKVKRPMRDDSHFEVVESGR
YIILLVNKALSVVWDHHLGISVVLKQTYQEQVCGLCGNFDGIQNNDFTSSRLQVEENPVD
FGNSWKVSPQCADTRKVPLDSSPASCHNNIMKQTIVDSSCRILTSDLFRDCNKLVDPEPF
LDVCIYDTCSCESIGDCACFCDTIAAYAHVCAQHGKVVTWRTATLCPQNCEERNLRENGY
ECEWRYNSCAPACPITCQHPEPLACPVQCVDGCHAHCPPGKILDELLQTCVDPEDCPACE
VAGQRLAPGRKVTLNPSDPQHCQICHCDGVSLMCEACGKAGVLEVPPTEGPVGPTTAYVE
DTPEPPLHDFYCSKLLDLVFLLDGSSKLSEAEFEVLKAFVVGMMERLHISQKRIRVAVVE
YHDGSHAYIELRDRKRPSELRRIASQVKYAGSEVASTTEVLKYTLYQIFGKIDRPEASRI
ALLLMASQEPPRLARNLVNYVRNLKKKKQVIVIPVGIGPHANVKQIRLIEKQAPENKAFV
LSGVDELEQRRDDIISYLCDLAPEPPAPTQRPRMPQVTKSPEISGISSLAPKRNSMVLDV
VFVLEGSDKIGEANFNRSREFMEEVIQRMDVGQDTIHITVLQYSYRVTMEYTFLETQSKR
DVLQRVREIRYQGGNRTNTGLALQYLSEHSFLASQGDREQAPNLVYMVTGNPASDVIKRM
PGDIHVVPIGVGPHADVQELERIGWPNAPILIQDFETLPREAPDLVLQRCCSGEGLQIPT
LPPAPDCSQPLDVVLLLDGSSSFPASDFDKMKSFAKAFISKANIGPQLTRVSVLQYGSIT
TIDVPWNVPQEKASLLSLVDLMQREGGPSQVGDALAFAVRYVTSEIHGARPGASKVVVIL
VMDVSTDVVDAAADAARANRVGVFPIGIGDRYDEAQLRILAGSGASSNLVKLQRIEDLST
VLLGNSFLRKLCSGFVSVCMDEDGNEKRPGDVWTLPDQCHTVTCLPDGQTLLKSHRVNCD
RGPRPSCPNGQSPMKVEETCGCRWTCPCACMGSSTRHIVTFDGQNFKLTGDCSYVLLQNK
EQDLEVILHNGACGPGARQACMKSIEVKHNGLSVELHSDMEVSVNGRLVPVPYVGGNMEV
SVYGAIMFEIRFNHLGHIFTFTPVNNEFQLQLSPKTFASQMFGLCGICDENGANDFMLRD
GTITTDWKRLIQEWTVQQPGQTCQPVPEEQCPVSSSSHCQVLLSALFAECHKILAPATFY
AICQQDSCHQEQVCEAIASYAHLCRTNGVCVDWRTADFCAMSCPPSLVYNHCEHGCPRHC
EGNTSSCGDHPSEGCFCPQHQVMLEGSCVPEEACTQCVSEDGVRHQFLETWVPAHQPCQI
CTCLSGRRVNCTLQPCPTARAPTCGPCEVTRLRQNAGQCCPEYECVCDLVNCDPPLVPHC
EGGLQPTLTNPGECRPNFACACRKEECTRQSPPACPPHRTPTLRKTQCCDEYECACSCVN
STVSCPLGYLASTVTNDCGCTTTTCLPDKVCVHRGTIYPVGQFWEEGCDVCTCTDMEDAV
MGLRVAQCSQKPCEDSCRSGFTYVLHEGECCGRCLPSACEVVTGSPRGDSQSYWQNVGSH
WASPENPCLINECVRVKEEVFVQQRNVSCPELDVTTCPTGFQLSCRTSECCPSCHCEPVQ
ACMLNGTIIGPGRSLMIDPCTTCRCTVQVGAISGFKLECRKITCEACPVGYEEEKIQGEC
CGRCLPTACTIQLRGGQTMTLKRDEMLQDGCDSHFCKVSERGEYIWEKRIMSCPPFDEHK
CLAEGGKIMSVPGTCCHTCEAPECRDITATLQYVKVGDCQSEEEVDIHYCQGKCASKAVY
SIDTEDVEEQCSCCSPTRKQPMHVPLRCTNGSVIYHEVLNAMQCTCNPRKCST
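
The sequence is wrapped at 60 residues per line, so a domain region has to be cut from the joined string rather than from individual lines. The sketch below shows the coordinate conversion, assuming the reported regions are 1-based and inclusive (an assumption, not something this page states). `join_wrapped` and `region` are illustrative helper names, and only the first two lines of the sequence are used to keep the example short.

```python
from typing import Iterable


def join_wrapped(lines: Iterable[str]) -> str:
    """Join wrapped sequence lines into one string, dropping whitespace."""
    return "".join(line.strip() for line in lines)


def region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    if start < 1 or end > len(sequence) or start > end:
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# First two lines of the sequence shown in this record.
FIRST_LINES = [
    "MVPARLARVLLALALTLPGTFCAEGTLGRSSMGRCSLFGGDFVNTFDESMYSFAGDCSYL",
    "LAGDCQKRSFSLIGAFQNGKRVSLSVYLGEFFDIHLFVNGTVLQGDQSVSMPYASKGLYL",
]

if __name__ == "__main__":
    seq = join_wrapped(FIRST_LINES)
    # Residues 56-65 span the line break between the two wrapped lines.
    print(region(seq, 56, 65))
```

With the full 2813-residue sequence joined the same way, `region(seq, 1467, 1660)` would give the residues assigned to domain 1.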
Identical sequences: A0A061DBP6, NP_001229495.1.31192
