SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSECAP00000018917 from Equus caballus 76_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSECAP00000018917
Domain Number 1 Region: 24-217
Classification Level Classification E-value
Superfamily vWA-like 3.76e-55
Family Integrin A (or I) domain 0.00033
Further Details:      
 
Domain Number 2 Region: 625-813
Classification Level Classification E-value
Superfamily vWA-like 8.76e-52
Family Integrin A (or I) domain 0.00087
Further Details:      
 
Domain Number 3 Region: 1403-1607
Classification Level Classification E-value
Superfamily vWA-like 2.36e-47
Family Integrin A (or I) domain 0.00082
Further Details:      
 
Domain Number 4 Region: 1022-1202
Classification Level Classification E-value
Superfamily vWA-like 1.23e-44
Family Integrin A (or I) domain 0.001
Further Details:      
 
Domain Number 5 Region: 239-419
Classification Level Classification E-value
Superfamily vWA-like 2.37e-43
Family Integrin A (or I) domain 0.00045
Further Details:      
 
Domain Number 6 Region: 1229-1402
Classification Level Classification E-value
Superfamily vWA-like 7.79e-43
Family Integrin A (or I) domain 0.00066
Further Details:      
 
Domain Number 7 Region: 832-1008
Classification Level Classification E-value
Superfamily vWA-like 6.97e-40
Family Integrin A (or I) domain 0.00087
Further Details:      
 
Domain Number 8 Region: 1621-1813
Classification Level Classification E-value
Superfamily vWA-like 1.31e-39
Family Integrin A (or I) domain 0.00074
Further Details:      
 
Domain Number 9 Region: 444-604
Classification Level Classification E-value
Superfamily vWA-like 2.41e-36
Family Integrin A (or I) domain 0.00053
Further Details:      
 
Domain Number 10 Region: 2611-2800
Classification Level Classification E-value
Superfamily vWA-like 2.23e-29
Family Integrin A (or I) domain 0.0016
Further Details:      
 
Domain Number 11 Region: 2391-2582
Classification Level Classification E-value
Superfamily vWA-like 6.4e-28
Family Integrin A (or I) domain 0.013
Further Details:      
 
Domain Number 12 Region: 3094-3150
Classification Level Classification E-value
Superfamily BPTI-like 1.23e-20
Family Small Kunitz-type inhibitors & BPTI-like toxins 0.00022
Further Details:      
 
Domain Number 13 Region: 1830-2025
Classification Level Classification E-value
Superfamily vWA-like 6.12e-19
Family Integrin A (or I) domain 0.016
Further Details:      
 
Domain Number 14 Region: 2979-3070
Classification Level Classification E-value
Superfamily Fibronectin type III 0.0000000328
Family Fibronectin type III 0.0042
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSECAP00000018917   Gene: ENSECAG00000020887   Transcript: ENSECAT00000022862
Sequence length 3162
Comment pep:known_by_projection chromosome:EquCab2:6:23450320:23517034:-1 gene:ENSECAG00000020887 transcript:ENSECAT00000022862 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MRKHRHLPLVAIFYLFLSSFSFTRAQQQADVKNGAAADIIFLVDSSWSIGKEHFQLVREF
LYDVIKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGSNQTGKG
LEYVMQTHLTQAAGSRASDGVPQVIVVLTDGHSEDGLALPTAELKSADVNVFAIGVEDAD
EGALKEIASEPLNMHVFNLENFTSLHDIVGNLVSCVHSSVTPGRAGDTGTLKDITAQDSA
DIIFLIDGSNNTGSVNFAVIRDFLVNLLERLSIGTQQIRVGVVQYSDEPRTMFSLDTYST
KAQVLDAVKALGFTGGELANVGLALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDE
IRDGVVALKQASVFSFGLGAQAASKAELQQIATNDNMVFTVPEFRSFGDLQEQLLLYIVG
VAQRRIVLQPPTIVTQVIEVNKRDIVFVDGSSALGQNNFNARDFIARVIQRLEIGQDLIQ
AVAQYADTVRPEFYFNSYPSKREVVNAVRKMKSLEGPALYTGSALDFVRNNLFTSAAGYR
AAEGVPKILVLITGGKSLDGISQPAQELKRNGIMAFAIGNKAADKAELEEIAFDPTLVFI
PAEFRIAPLQGMLPSLLAPLRTLSGTTEVHVNKRDIIFLLDGSSNVGETNFPYVRDFVMN
IVNSLDVGSDNIRVGLVQFSDTPVTEFSLNTYQTKAELLAHLRQLQLQGGSGLNTGSALS
YVHANHFTEAGGSRIRERVPQLLLLLTAGQSEDSYLQAANALARAGILTFCVGARQANKA
ELEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTTYVSGDVEEVPIAQTESKRDILFLFDG
SANLVGQFPVVRDFLYKIIDELNVKPDGTRVAVAQYSDDVRVESRFDEHQNKPEILNLVK
RMKIKTGKALNLGYALDYAQRYIFVESAGSRIEDGVLQFLVLLVAGRSSDRVDTPALNLK
QSGVVPFIFQAKNADPAELELIVPSPAFILTAESLPKIGDLQPQIVNLLKSVQNGAPTPV
SGEKDVVFLIDGSEGVRSGFPLLKEFVQRVVESLDVGPDRVRVAVVQYSDRTRPEFYLNS
YMDQQSVVGAIRRLTLLGGPTPNTGAALNFVLRNILIRSAGSRIEEGVPQLLIVLTAERS
GDDVRGPSVVLKREGAVPIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQVISER
VTQLSREELSRLKPDLLPPTSPGVGSKKDVVFLIDGSQSAAPEFQYIRTLIERLVDYLDV
GFDTTRVAVIQFSEDPRVEFLLNAHSSKDEVQNAVRRLRPKGGRQINIGGALEYVSKNIF
KRPLGSRIEEGVPQFLVLISSGKSNDEVDDSAAELKQFGVAPFTIARNADPEELVKISLS
PEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQRILASTPYPPPAVESDAADIVFLIDSS
DSVRPDGIAHIRDFISKIVQRLNIGPNKVRIGVVQFSNEVFPEFYLKTYKSRTAVLDAIR
RLRFKGGSPLNTGKALEHVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDISRYSQVIS
SSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNIEEKIITSFGPSGVTPAPPG
VDTPSPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFVSEIVDTLYEGGDSIQVGLVQYN
SDPTDEFFLKDFSTKQEIIDAINKVVYKGGRHANTKVGIEHLRLNHFVPEAGSRLDQRVP
QIAFVITGGKSVEDAQEASLLTQRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQEL
SELSEQVLETLHDAMHETLCPGVSDVSRACNLDVILGFDGSSDQNVFVTQKGLEAQVDTI
LNRISQMQRISCSGTQMPTVRVSVVANTPSGPVEAFDFAEYQPELFEKFRNMRHQHPYVL
TADTLKVYQNKFRQSSPDSVKVVIHFTDGVDGDLADLQRASEELRQEGVRALILVGLERV
ANLERLMQLEFGRGFTYNRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGQRGDRGP
IGSIGPKGVPGEDGYRGYPGDEGGPGDRGPPGVNGTQGFQGCPGQRGIKGSRGFPGEKGE
LGEIGLDGLDGEDGDKGLPGSSGEKGNPGRRGDKGPKGDQGERGDVGIRGDPGDSGRDSQ
QRGPKGETGDLGPMGLPGSDGVSGSPGEPGRDGGFGRRGPPGAKGNKGGPGQQGTVGEQG
TRGAQGPPGSTGPPGLIGEQGIPGPRGSGGAVGVPGERGRTGPLGRKGEPGEPGAKGGLG
PRGPRGETGDDGRDGVGSEGQKGKKGERGFPGYPGPKGTRGEPGTDGTLGPKGVRGRRGD
SGPPGAAGQKGDPGYPGPSGLKGNRGDSIDQCALVQSIKDKCPCCYGPLECPVFPTELAF
ALDTSEGVTQDTFSLMRDVLLSVVGDLTIAESNCPRGARVAVVTYNNEVTTEIRFADSKK
KSVLLERIKNLQLSLTSKQPSLETAMSFVARNTFKRVRNGFLMRKVAIFFSDKPTRASPQ
LREAVLKLSDAGITPLFLTRQADPQLVNALQINNTAVGHALVLPARSDLEDFLKNVLTCH
VCLDICNIDPSCGFGSWRPSFRDRRAAGSNADLDVAFILDSSESTTPFQFNEMRKYVGYL
VRQLDVSPDPKASQHFARVALVQHAPYESVGNSSVPPVKVEFSLTDYGSKEKLVNFLSSR
MMQLQGTRALGSAIDYTIENIFESAPNPRDLKIVVLMLTGEVQKEQLEEAQRVILQAKCK
GYFFVILGIGRKVNVKEVYSFASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFVSSENA
FYLSPDIRKQCDWFQGDQPSKNLVKFAHKQVNVPNNVTSSPTSKPVTTTKPVTTAQPVTT
TTKPVPVVNLPASKPAPAKQAPARPAVARPVAAKPEAAKTATVKPAVAVKSAAAKPAAAR
PPTVAKPVATKPEVPRPQGARSAATKPATANLMVKASREVQVSEITENSAKLHWERPEPP
SPYFYDLTVTSALDQSLVLKQNLSVTDRVIGGLLAGQTYHVTVVCYLRSQVRATYQGSFS
TKKTPPPPPQPARSASSSTINLMVSTEPLAGTDTDICKLPKEEGTCRKFMLKWYYDSETK
SCARFWYGGCGGNENRFNSQKECEKVCASVLVNPGVIAAIGT
Download sequence
Identical sequences F6R735
ENSECAP00000018917 ENSECAP00000018917 9796.ENSECAP00000018917

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]