SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGALP00000011455 from Gallus gallus 76_4

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGALP00000011455
Domain Number 1 Region: 87-266
Classification Level  Classification                                    E-value
Superfamily           Concanavalin A-like lectins/glucanases            2.24e-28
Family                Clostridium neurotoxins, the second last domain   0.026

Domain Number 2 Region: 357-481,545-613
Classification Level  Classification                                    E-value
Superfamily           Metalloproteases ("zincins"), catalytic domain    0.0000000000000224
Family                Reprolysin-like                                   0.067

Domain Number 3 Region: 1415-1477
Classification Level  Classification                                    E-value
Superfamily           Complement control module/SCR domain              0.00000000032
Family                Complement control module/SCR domain              0.0044

Domain Number 4 Region: 1345-1420
Classification Level  Classification                                    E-value
Superfamily           Complement control module/SCR domain              0.00000000101
Family                Complement control module/SCR domain              0.0032

Domain Number 5 Region: 1284-1350
Classification Level  Classification                                    E-value
Superfamily           Complement control module/SCR domain              0.0000000118
Family                Complement control module/SCR domain              0.0039

Domain Number 6 Region: 1214-1287
Classification Level  Classification                                    E-value
Superfamily           Complement control module/SCR domain              0.000001
Family                Complement control module/SCR domain              0.005
 
Weak hits

Sequence:  ENSGALP00000011455
Domain Number - Region: 674-696,879-938
Classification Level  Classification                                    E-value
Superfamily           Fibronectin type III                              0.000837
Family                Fibronectin type III                              0.008
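
The Region values above list one or more residue ranges per domain, with commas separating the segments of a discontinuous domain. The following is a minimal sketch of how such strings could be parsed and summed. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly, and the names `parse_region`, `region_length` and `strong_hits` are hypothetical helpers introduced only for illustration.

```python
def parse_region(region):
    """Split a region string such as '357-481,545-613' into (start, end) tuples."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Total residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Strong-hit regions copied from the assignment details for ENSGALP00000011455.
strong_hits = {
    1: "87-266",
    2: "357-481,545-613",
    3: "1415-1477",
    4: "1345-1420",
    5: "1284-1350",
    6: "1214-1287",
}

# Map each domain number to the number of residues its region covers.
lengths = {number: region_length(region) for number, region in strong_hits.items()}
```

Under the same assumption, the two segments of domain 2 are summed rather than treated as one span, so the gap between residues 481 and 545 is not counted.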
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGALP00000011455   Gene: ENSGALG00000007079   Transcript: ENSGALT00000011469
Sequence length 1629
Comment pep:known_by_projection chromosome:Galgal4:17:3040970:3209266:1 gene:ENSGALG00000007079 transcript:ENSGALT00000011469 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MQLWSLLLPLALLCTALASGPECGMDERSRRARRDTRHSRQLLYTAPGTCATRLARGRRS
TAGLEPGHVPRRRQQREVEDGEESLTPSRALYFSGQGDQLRLKADIELPRDAFTLQVWLK
AEGGQRSPAVIAGLYDKCSYTSRDRGWVLGINTVSDQGNRDPRYFFSLKTDRARKVTTIA
AHRSYLPNQWVHLAATYDGHLMKLYVNGAQVATSGEQVGSIFSLLTLKCKVLMVGGNALN
QNYRGYVEHFSLWRTARSQKEILLDMGQAIHRQDMPLPQLVLQDSLLNVKNTWSPMKDGS
SPQSKSSYHHGYLLDTSLEPPLCGQTVCDNTDVIASYNKLPSFRRNKIVRYRVVNLYDDK
HQNPTVSQEQIEFQHQHLNEAFSRYNITWELEVLEVKNSSLRHRLILANCDISKIGDENC
DPECNHTLTGYDGGDCRHVRHTLFNKKKQNGVCDMDCNYERYNFDGGECCNPEITEVTKT
CFDPYSPYRAYLDVNELKNILKLDGSTHLNIFFANSSEEELAGVATWPWDKEALMHLGGI
VLNPSFYGIPGHTHTMIHEIGHSLGLYHVFRGISEILSCSDPCMETEPSFETGDLCRDTN
PAPKHKLCGDPGPGNDTCGFHSFLNTPFSNFMSYADDDCTDSFTPNQVARMHCYLDLVYQ
SWQPAKKPAPVAIAPQIVARTPTSVTLEWFPPIDGHFFEREVGSACDLCMEGRVLVQHAF
SASSPMPCDPSGHWSPREAEGHPDVEQPCKSSVRTWSPNSAVNQHTVPPACPEPQGCYLQ
LEFRYPLTPESLTVWVTFVSPDWDSSGAVNDVKLLTVSGKNISLGPQNVFCDIPLTIKLD
AGQVGEEVYGIQIYTLDEHLEIDAAMLSSVPHSTLCTDCKPIQYKVVRDPPFQSGSPVVI
SNLSRRFIDMELSDSTTYTYQVIVVSGAEESEPSPELVYISGSGYCGDGVIQTDLGEECD
DMNKINGDGCSLFCLQELSFNCIDEPSRCYFHDGDGVCEEFEQMTSIKDCGVYTPKGFLD
QWASNVSVSHHSDQQCPGWVVIGQPAATQTCRTKVIDLNDGVSQYAWYPCTANFQYSHMA
QTFWLKAYFSTPMVAAAVLVHLVTDGTYYLDQKQETIGVQLFDTKEQSHDLGVHVLSCRN
NPLIIPVIHDLSHPFYHTQAVLISFSSQFVAISGVALRSFHNFDPITVSSCQRGQTYSPA
EQSCVHYACEATDCQKLEIDNALLNCTGGGWYNGAQCNVSCRTGYILQVQRDDDLSKSQT
ESSITMTCTDGKWSKLVTCEPVDCGVPDQYHVYPATFNCSEGTTYGKKCSFTCRPPALLK
GNNSNLTCMEDGLWSFPEALCELMCRAPSIVPNADLQTTRCLEDKHKVGSFCKYKCKPGY
HVPGSSRKARNMGRRAFKIQCTQDGTWLPGACVPVTCDPPPSKFHGLYQCSNGFQFNSEC
RIKCEDDDSQSGRGSNVIHCRKDGTWSGSFHLCREMQGQCALPTQLNSHLKLQCSGGYGI
GAECTTSCLDHSHEPILLRVNETVQDIQHWMNPQRVKSVVCTAGLKWYPHPSLIHCVKGC
EPFMGDNYCDSINNRAFCNYDGGDCCASTVKTKKVTPFPMSCDLQGECACRDPNAQEHNQ
KDLRGFSLG
Identical sequences ENSGALP00000011455
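
The Comment line in this section packs several `key:value` tokens into one string. The sketch below shows one way to split it into fields and to read the chromosome token as a genomic location. The interpretation of the location as assembly, sequence region, start, end and strand is an assumption based on the usual Ensembl header layout and is not stated on this page. `parse_comment` and `parse_location` are hypothetical helper names.

```python
# Comment string copied from the protein sequence record above.
COMMENT = (
    "pep:known_by_projection chromosome:Galgal4:17:3040970:3209266:1 "
    "gene:ENSGALG00000007079 transcript:ENSGALT00000011469 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split whitespace-separated tokens on their first colon into a dict."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Assumed layout: assembly:seq_region:start:end:strand."""
    assembly, seq_region, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "seq_region": seq_region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["chromosome"])

# Length of the genomic interval, assuming inclusive coordinates.
span = location["end"] - location["start"] + 1
```

Splitting on the first colon only keeps the five-part location intact as a single value, which can then be parsed separately.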
